程式扎記: [ Python 常見問題 ] UnicodeDecodeError: 'utf8' codec can't decode byte 0x9c

2017年8月8日星期二

[ Python 常見問題 ] UnicodeDecodeError: 'utf8' codec can't decode byte 0x9c

Source From Here
Question
I have a socket server that is supposed to receive UTF-8 valid characters from clients. The problem is some clients (mainly hackers) are sending all the wrong kind of data over it.

I need to be able to make the string UTF-8 with or without those characters.

How-To
Please refer to "Unicode HowTo"

view plaincopy to clipboardprint?
str = unicode(str, errors='replace')  

view plaincopy to clipboardprint?
str = unicode(str, errors='ignore')  

Note: This solution will strip out (ignore) the characters in question returning the string without them. Only use this if your need is to strip them not convert them.

Alternatively, use the open method from the codecs module to read in the file:

view plaincopy to clipboardprint?
import codecs  
with codecs.open(file_name, "r",encoding='utf-8', errors='ignore') as fdata:  
    ...  

程式扎記

標籤

2017年8月8日星期二

[ Python 常見問題 ] UnicodeDecodeError: 'utf8' codec can't decode byte 0x9c

沒有留言:

張貼留言

[Git 常見問題] error: The following untracked working tree files would be overwritten by merge

檢舉濫用情形

學習筆記

標籤

2017年8月8日 星期二

[ Python 常見問題 ] UnicodeDecodeError: 'utf8' codec can't decode byte 0x9c

沒有留言:

張貼留言

[Git 常見問題] error: The following untracked working tree files would be overwritten by merge

檢舉濫用情形

學習筆記

2017年8月8日星期二