/Books/>: HTTP status code is not handled or not allowed
2017-11-04 17:21:38 [scrapy.spidermiddlewares.httperror] INFO: Ignoring response <403 http://www.dmoz.org/Computers/Programming/Languages/Python
/Resources/>: HTTP status code is not handled or not allowed
解决:在settings里面添加
HTTPERROR_ALLOWED_CODES = [403]#上面报的是403,就把403加入。