Python问题合集

爬虫

发布日期: 2021-11-16

更新日期: 2021-11-16

文章字数: 434

阅读时长: 2 分

中文乱码

# 通用中文乱码的解决方案
img_name = img_name.encode('iso-8859-1').decode('gbk')
# 手动设置相应数据的编码格式
response = requests.get(url=url, headers=headers)
response.encoding = 'utf-8'
#  text -> content
response.text 改成 response.content

爬虫

解决python爬虫requests.exceptions.SSLError: HTTPSConnectionPool(host='XXX', port=443)问题

安装cryptography、pyOpenSSL、certifi三个模块即可

pip install cryptography
pip install pyOpenSSL
pip install certifi

requests库提示警告：InsecureRequestWarning: Unverified HTTPS request is being made. Adding certificate ver

加入如下代码即可

requests.packages.urllib3.disable_warnings()

提取html文件报错lxml.etree.XMLSyntaxError: Opening and ending tag mismatch: meta line 4 and head

原因在于html标签未加 / 所有的标签最好都用 / 来结束

TypeError: can only concatenate str (not “list”) to str(列表和字符串的报错解决方法)

# 强转即可
str(title)

警告DeprecationWarning: executable_path has been deprecated, please pass in a Service object

# 是使用api过期导致此警告 使用这个（测试案例）即可
from selenium import webdriver
from selenium.webdriver.chrome.service import Service

s = Service("chromedriver.exe")
driver = webdriver.Chrome(service=s)
driver.get('https://www.baidu.com/')
driver.quit()

标签定位不到selenium.common.exceptions.ElementNotInteractableException: Message: element not interactable

# 使用标签的绝对定位
bro.find_element(By.XPATH,''

l类型错误 :TypeError: ‘ItemMeta‘ object does not support item assignment

# 报错原因：未找到具体item，load出错，item后面需要加()进行实例化。
item = MeinvproItem()

scrapy框架写入数据库中存储数据部分代码报错 redis.exceptions.DataError: Invalid input of type: 'dict'. Convert to a byte, string or number first.

# 使用旧版本pip install redis==2.10.6，即可解决

jupyter

每次打开jupyter提示如下行信息

autopep8

pip install autopep8

设置代码提示功能

# 安装nbextensions
pip install jupyter_contrib_nbextensions -i https://pypi.mirrors.ustc.edu.cn/simple
jupyter contrib nbextension install --user
-------------------------------------------------------------------------------------------------------------------
# 安装nbextensions_configurator
pip install --user jupyter_nbextensions_configurator 
jupyter nbextensions_configurator enable --user
# 勾选Hinterland启用代码自动补全

shaoshaossm

https://shaoshaossm.github.io/2021-11-16-python-pa-chong-wen-ti-he-ji.html

本博客所有文章除特別声明外，均采用 CC BY 4.0 许可协议。转载请注明来源 shaoshaossm !

Python

Python爬虫模拟登陆和异步爬虫

2021-11-17 爬虫

Python

爬虫常用正则表达式

2021-11-16 表达式

正则表达式

中文乱码

爬虫

jupyter

你的赏识是我前进的动力