Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
163 views
in Technique[技术] by (71.8m points)

python - How to remove non-ascii characters from a list

I have an object type DataFrame with some elements that are text and some are numbers.

when I convert a column to a list, some of the elements have non-ascii characters. Is there a way to get rid of the characters, like .encode('ascii', 'ignore') but for a list?

here is the list that I get:

['Central Parku202c',
 'Top of the Rock',
 'Statue of Libertyu202c',
 'Brooklyn Bridge'
]
question from:https://stackoverflow.com/questions/66061881/how-to-remove-non-ascii-characters-from-a-list

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

If you want to post-process your list, you can apply encode('ascii', 'ignore') over it:

my_list = [
    'Central Parku202c',
    'Top of the Rock',
    'Statue of Libertyu202c',
    'Brooklyn Bridge'
]
my_list = [e.encode('ascii', 'ignore').decode("utf-8") for e in my_list]
print(my_list)

And the output should be:

['Central Park', 'Top of the Rock', 'Statue of Liberty', 'Brooklyn Bridge']

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...