Scrapy: using a proxy that requires authentication
The official approach:
from w3lib.http import basic_auth_header

class CustomProxyMiddleware(object):
    def process_request(self, request, spider):
        request.meta['proxy'] = "https://<PROXY_IP_OR_URL>:<PROXY_PORT>"
        request.headers['Proxy-Authorization'] = basic_auth_header(
            '<PROXY_USERNAME>', '<PROXY_PASSWORD>')
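To see what ends up in the Proxy-Authorization header, the sketch below rebuilds the value using only the standard library. It assumes basic_auth_header follows the usual HTTP Basic scheme (the word "Basic" followed by the base64 of "username:password", encoded as ISO-8859-1 by default). build_basic_auth is a hypothetical name used for illustration, not part of w3lib.

```python
import base64


def build_basic_auth(username, password, encoding="ISO-8859-1"):
    """Hypothetical stand-in for an HTTP Basic credentials header value."""
    credentials = f"{username}:{password}".encode(encoding)
    # The header value is bytes: b"Basic " + base64("username:password").
    return b"Basic " + base64.b64encode(credentials)


if __name__ == "__main__":
    # Placeholder credentials, for illustration only.
    print(build_basic_auth("user", "pass"))
```

Note that base64 is an encoding, not encryption, so the credentials are only as private as the connection to the proxy.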
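# settings.py: the lower number (350) makes the custom middleware's
# process_request run before HttpProxyMiddleware (400).
DOWNLOADER_MIDDLEWARES = {
    '<PROJECT_NAME>.middlewares.CustomProxyMiddleware': 350,
    'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware': 400,
}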
In my testing, the same setup also works fine for a proxy that does not require authentication.
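Since the same setup works with or without credentials, one middleware can serve both kinds of proxy. The following is a minimal sketch, not the official method. It assumes the proxy address and credentials live in project settings named PROXY_URL, PROXY_USERNAME and PROXY_PASSWORD (hypothetical names, not Scrapy built-ins), and that the header is only needed when a username is configured. The class name OptionalAuthProxyMiddleware is also made up for illustration.

```python
import base64


class OptionalAuthProxyMiddleware:
    """Sketch: set the proxy on every request, add credentials only if present."""

    def __init__(self, proxy_url, username=None, password=None):
        self.proxy_url = proxy_url
        self.auth_header = None
        if username:
            # Precompute the HTTP Basic value once instead of per request.
            raw = f"{username}:{password or ''}".encode("latin-1")
            self.auth_header = b"Basic " + base64.b64encode(raw)

    @classmethod
    def from_crawler(cls, crawler):
        # Hypothetical setting names; define them in settings.py.
        settings = crawler.settings
        return cls(
            settings.get("PROXY_URL"),
            settings.get("PROXY_USERNAME"),
            settings.get("PROXY_PASSWORD"),
        )

    def process_request(self, request, spider):
        request.meta["proxy"] = self.proxy_url
        if self.auth_header is not None:
            request.headers["Proxy-Authorization"] = self.auth_header
```

It would be registered in DOWNLOADER_MIDDLEWARES in the same way as CustomProxyMiddleware. Leaving PROXY_USERNAME unset then gives the unauthenticated behaviour.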

