chrome 发表于 2020-12-15 21:41:05

谷歌浏览器爬虫插件Webscraper抓取小鸡词典内容

使用谷歌浏览器爬虫插件Webscraper抓取小鸡词典内容的模板,本站有很多类似的Webscraper插件抓取模板。

1、 数据字段

详情链接

发布日期
查看数
解释
标签
点赞数
踩数
评论数

2、结果示例截图



3、sitemap json

{"_id":"jikipedia","startUrl":["https://jikipedia.com/"],"selectors":[{"id":"element","type":"SelectorElementScroll","parentSelectors":["_root"],"selector":".tile:nth-of-type(-n+50)","multiple":true,"delay":"3000"},{"id":"link","type":"SelectorLink","parentSelectors":["element"],"selector":"a.card-content","multiple":false,"delay":0},{"id":"word","type":"SelectorText","parentSelectors":["link"],"selector":".section h1","multiple":false,"regex":"","delay":0},{"id":"date","type":"SelectorText","parentSelectors":["link"],"selector":"div.created","multiple":false,"regex":"","delay":0},{"id":"look","type":"SelectorText","parentSelectors":["link"],"selector":"div.view","multiple":false,"regex":"","delay":0},{"id":"content","type":"SelectorText","parentSelectors":["link"],"selector":".section .brax-node span","multiple":false,"regex":"","delay":0},{"id":"tag","type":"SelectorText","parentSelectors":["link"],"selector":"span.tag-text","multiple":false,"regex":"","delay":0},{"id":"like","type":"SelectorText","parentSelectors":["link"],"selector":"span.like-count","multiple":false,"regex":"","delay":0},{"id":"down","type":"SelectorText","parentSelectors":["link"],"selector":"span.dislike-count","multiple":false,"regex":"","delay":0},{"id":"comment","type":"SelectorText","parentSelectors":["link"],"selector":"span.comment-count","multiple":false,"regex":"","delay":0}]}

作者:iWebscraper
页: [1]
查看完整版本: 谷歌浏览器爬虫插件Webscraper抓取小鸡词典内容