Python: fixing the "illegal multibyte sequence" error

2021-09-05 06:11:20

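Reading an HTML page fails with "illegal multibyte sequence". Python raises this as a UnicodeDecodeError when the page bytes are decoded with an encoding that does not match them.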

1. Case 1: change the encoding

View the page source and look for charset to find the encoding the page declares:

<meta http-equiv="Content-Type" content="text/html; charset=gb2312" />
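As a minimal sketch of this step, the declared charset can be read out of the raw bytes and then used for decoding. The helper name `detect_charset` and the sample bytes in `raw` are hypothetical and only stand in for a real response body; the regular expression is a simple assumption and does not cover every way a page can declare its encoding.

```python
import re

def detect_charset(raw: bytes, default: str = "utf-8") -> str:
    """Return the charset named in the page's <meta> tag, or a default (hypothetical helper)."""
    match = re.search(rb'charset\s*=\s*["\']?([\w-]+)', raw, re.IGNORECASE)
    return match.group(1).decode("ascii").lower() if match else default

# Hypothetical page bytes standing in for a downloaded response body.
raw = (
    '<meta http-equiv="Content-Type" content="text/html; charset=gb2312" />'
    "<p>中文</p>"
).encode("gb2312")

charset = detect_charset(raw)  # encoding declared by the page
text = raw.decode(charset)     # decode with the declared encoding
```

With a real page, `raw` would be the undecoded body returned by whatever HTTP client is in use.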

2. Case 2: the error persists even after changing the encoding

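Replace gb2312 with gb18030. gb18030 is a superset of gb2312 and GBK, so it can decode characters that a page declaring gb2312 actually contains but that fall outside the gb2312 character set.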
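The following sketch illustrates why the switch helps. The sample string is an assumption chosen for illustration: it contains the character 镕, which is outside gb2312 but covered by gb18030, so the same bytes fail under one codec and decode under the other.

```python
# Bytes as a page might send them: declared gb2312, but containing
# a character (镕) that only the larger GBK/gb18030 sets include.
raw = "朱镕基".encode("gb18030")

try:
    raw.decode("gb2312")
    gb2312_ok = True
except UnicodeDecodeError:
    # This is where "illegal multibyte sequence" is reported.
    gb2312_ok = False

# gb18030 covers gb2312 and GBK, so the same bytes decode cleanly.
text = raw.decode("gb18030")
```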
