爬虫问题
Whisper Lv4

乱码问题

解决1

1
r = requests.get(url, headers=headers)

读取时读取r.content

解决2
查看原网页中head标签中的charset设置

1
bf = bs4.BeautifulSoup(html,'html.parser',from_encoding="xxx")

requests.get返回空内容

没有传headers , 构造headers传入get

base64编码存图片

1
2
3
4
5
6
7
8
9
# coding=utf-8 import os, base64 
img_str = 'xxxxxxxxxxxx'
img_data = base64.b64decode(img_str)

# 如果是"data:image/jpg:base64,",保存以png格式
# 如果是"data:image/png:base64,"保存以jpg格式

with open('test.png', 'wb') as f:
f.write(img_data)