scrapy-splash

官网:https://github.com/scrapy-plugins/scrapy-splash

1、安装:

pip install scrapy-splash

2、运行splash

docker run -p 8050:8050 scrapinghub/splash

3、配置setting文件

DOWNLOADER_MIDDLEWARES = {
    'scrapy_splash.SplashCookiesMiddleware': 723,
    'scrapy_splash.SplashMiddleware': 725,
    'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware': 810,
}

SPIDER_MIDDLEWARES = {
    'scrapy_splash.SplashDeduplicateArgsMiddleware': 100,
}

SPLASH_URL = 'http://10.63.32.49:8050/'

# 过滤
DUPEFILTER_CLASS = 'scrapy_splash.SplashAwareDupeFilter'

# 缓存
HTTPCACHE_STORAGE = 'scrapy_splash.SplashAwareFSCacheStorage'

 

 

上一篇:复习资料目录


下一篇:Python爬虫(主要是scrapy框架)