本文介绍SharePoint 2013 设置外网(Internet)爬网源:
下面是步聚:
1. 新建外部爬网源
a. 打开 “SharePoint 2013 Central Administration” -> "General Application Settings" ->"Farm Search Administration" ->"Search Service Application"
b. 点击"Content Sources"
c. 点击“New Content Source”
d. 输入name,选择web site, 输入address 并点击OK
2. 设置搜索代理
a. 打开 “SharePoint 2013 Central Administration” -> "General Application Settings" ->"Farm Search Administration"
b. 点击ProxySever url 的链接,进入Search proxy setting:
注意如果不设置Search Proxy爬外网时会因为超时失败
3. 启动Full Crawl
a. 打开 “SharePoint 2013 Central Administration” -> "General Application Settings" ->"Farm Search Administration" ->"Search Service Application"
b. 点击"Content Sources"
c. 选择extrenal site并启动Full Crawl
等待Crawl完成再进行第四步
4. 查看Crawl log
a. 打开 “SharePoint 2013 Central Administration” -> "General Application Settings" ->"Farm Search Administration" ->"Search Service Application"
b. 点击Crawl log
c. 查看extrenal site的log
注意,如果出现超时错误可以尝试设置“Crawler Impact Rules”:
a. 打开 “SharePoint 2013 Central Administration” -> "General Application Settings" ->"Farm Search Administration" ->"Search Service Application"
b. 点击 “Crawler Impact Rules”
c.点击 "Add Rule"
另附:
SharePoint 2013 爬新浪网站:《Sharepoint2013搜索学习笔记之设置外网内容源(四)》
SharePoint 2013 爬自己网站:《SharePoint 搜索爬网第三方网站配置 》
SharePoint 2013 微软官方资料:《在SharePoint Server 2013 中管理爬网》