Message225818
What problem is purposed to solve clean_surrogate_escapes()? Could you please provide user scenario or two?
Possible alternative implementation is:
def clean_surrogate_escapes(s):
return s.encode('utf-8', 'surrogatepass').decode('utf-8', 'replace')
It can be faster for some data (for mostly ASCII with rare surrogates it is superfast). For other data 'utf-16' can be better choice. |
|
| Date |
User |
Action |
Args |
| 2014-08-24 13:49:12 | serhiy.storchaka | set | recipients:
+ serhiy.storchaka, ncoghlan, pitrou, vstinner, ezio.melotti, Arfrever, r.david.murray |
| 2014-08-24 13:49:12 | serhiy.storchaka | set | messageid: <[email protected]> |
| 2014-08-24 13:49:12 | serhiy.storchaka | link | issue18814 messages |
| 2014-08-24 13:49:12 | serhiy.storchaka | create | |
|