chrome 发表于 2020-12-15 21:37:15

谷歌浏览器爬虫插件Webscraper抓取腾讯体育社区列表的模板

使用谷歌浏览器爬虫插件Webscraper抓取腾讯体育社区列表的模板。

1、 数据字段

标题
详情链接
作者
发布日期
回复数
浏览数
最后回复
最后回复时间

2、结果示例截图



3、sitemap json

{"_id":"sports","startUrl":["https://fans.sports.qq.com/#/f/69"],"selectors":[{"id":"element","type":"SelectorElementClick","parentSelectors":["_root"],"selector":"div.tbody div.tr","multiple":true,"delay":"2000","clickElementSelector":"div.c-and-p:nth-of-type(3) a.btn-next","clickType":"clickMore","discardInitialElements":"do-not-discard","clickElementUniquenessType":"uniqueText"},{"id":"title","type":"SelectorLink","parentSelectors":["element"],"selector":".td-title a","multiple":false,"delay":0},{"id":"author","type":"SelectorText","parentSelectors":["element"],"selector":".td-author a","multiple":false,"regex":"","delay":0},{"id":"date","type":"SelectorText","parentSelectors":["element"],"selector":".td-author span","multiple":false,"regex":"","delay":0},{"id":"reply","type":"SelectorText","parentSelectors":["element"],"selector":"span.main","multiple":false,"regex":"","delay":0},{"id":"browers","type":"SelectorText","parentSelectors":["element"],"selector":".td-count span.sub","multiple":false,"regex":"","delay":0},{"id":"lastreply","type":"SelectorText","parentSelectors":["element"],"selector":".td-last a","multiple":false,"regex":"","delay":0},{"id":"lastreplytime","type":"SelectorText","parentSelectors":["element"],"selector":".td-last span","multiple":false,"regex":"","delay":0}]}

作者:iWebscraper
页: [1]
查看完整版本: 谷歌浏览器爬虫插件Webscraper抓取腾讯体育社区列表的模板