六客 Chrome Extensions

Template for scraping the Pokémon list with the Webscraper Chrome extension

Posted on 2020-12-15 21:47:35
A template for scraping the Pokémon list with Webscraper, the web-scraping extension for Google Chrome.

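1. Data fields

Kanto (regional Pokédex number)
National (National Pokédex number)
Image (listed as a field, but the sitemap in section 3 has no selector for it; it skips the third table column)
Chinese name
Japanese name
English name
Type 1
Type 2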
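The scraped columns are named after the selector ids in the sitemap from section 3 (guandu, quanguo, and so on), not after the field names listed above. The following is a minimal Python sketch for relabeling an exported CSV, assuming the export has one column per selector id. The function name `relabel_rows`, the English labels, and the sample row are illustrative assumptions and not part of the original template.

```python
import csv
import io

# Selector ids from the sitemap, mapped to English labels for the fields listed above.
FIELD_LABELS = {
    "guandu": "Kanto",
    "quanguo": "National",
    "zhongwen": "Chinese",
    "riwen": "Japanese",
    "yingwen": "English",
    "shuxing1": "Type 1",
    "shuxing2": "Type 2",
}


def relabel_rows(csv_text):
    """Return the exported rows as dicts keyed by readable labels.

    Columns not listed in FIELD_LABELS (for example bookkeeping columns
    added by the exporter) are dropped; missing cells become "".
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {label: (row.get(key) or "").strip() for key, label in FIELD_LABELS.items()}
        for row in reader
    ]


# Hypothetical one-row export, for illustration only.
SAMPLE = (
    "guandu,quanguo,zhongwen,riwen,yingwen,shuxing1,shuxing2\n"
    "#001,#001,妙蛙种子,フシギダネ,Bulbasaur,草,毒\n"
)

if __name__ == "__main__":
    for record in relabel_rows(SAMPLE):
        print(record)
```

Keeping the mapping in a single dict means a new column only needs one extra entry if a selector is later added to the sitemap.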

2. Example result screenshot

[Screenshot: w8.jpg]

3. Sitemap JSON



{"_id":"poke","startUrl":["http://wiki.52poke.com/wiki/%E5%AE%9D%E5%8F%AF%E6%A2%A6%E5%88%97%E8%A1%A8%EF%BC%88%E6%8C%89%E5%85%A8%E5%9B%BD%E5%9B%BE%E9%89%B4%E7%BC%96%E5%8F%B7%EF%BC%89"],"selectors":[{"id":"element","type":"SelectorElement","parentSelectors":["_root"],"selector":"table:nth-of-type(2) tr:nth-of-type(n+3)","multiple":true,"delay":0},{"id":"guandu","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(1)","multiple":false,"regex":"","delay":0},{"id":"quanguo","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(2)","multiple":false,"regex":"","delay":0},{"id":"zhongwen","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(4)","multiple":false,"regex":"","delay":0},{"id":"riwen","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(5)","multiple":false,"regex":"","delay":0},{"id":"yingwen","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(6)","multiple":false,"regex":"","delay":0},{"id":"shuxing1","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(7)","multiple":false,"regex":"","delay":0},{"id":"shuxing2","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(8)","multiple":false,"regex":"","delay":0}]}
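The sitemap is compact, so it can help to inspect it before importing. The sketch below is a rough Python illustration, not part of the Webscraper extension: it decodes the percent-encoded start URL and joins each child selector with its parent element selector to show which table cell a field reads from. The helper name `describe_sitemap` is hypothetical, the embedded JSON is an abbreviated copy of the sitemap above (only two text selectors kept), and only one level of nesting is handled.

```python
import json
from urllib.parse import unquote

# Abbreviated copy of the sitemap above: the element selector plus two text selectors.
SITEMAP_JSON = """
{"_id":"poke",
 "startUrl":["http://wiki.52poke.com/wiki/%E5%AE%9D%E5%8F%AF%E6%A2%A6%E5%88%97%E8%A1%A8%EF%BC%88%E6%8C%89%E5%85%A8%E5%9B%BD%E5%9B%BE%E9%89%B4%E7%BC%96%E5%8F%B7%EF%BC%89"],
 "selectors":[
  {"id":"element","type":"SelectorElement","parentSelectors":["_root"],
   "selector":"table:nth-of-type(2) tr:nth-of-type(n+3)","multiple":true,"delay":0},
  {"id":"guandu","type":"SelectorText","parentSelectors":["element"],
   "selector":"td:nth-of-type(1)","multiple":false,"regex":"","delay":0},
  {"id":"zhongwen","type":"SelectorText","parentSelectors":["element"],
   "selector":"td:nth-of-type(4)","multiple":false,"regex":"","delay":0}
 ]}
"""


def describe_sitemap(sitemap_json):
    """Return (decoded start URLs, {selector id: combined CSS path})."""
    sitemap = json.loads(sitemap_json)
    by_id = {sel["id"]: sel for sel in sitemap["selectors"]}
    urls = [unquote(url) for url in sitemap["startUrl"]]

    paths = {}
    for sel in sitemap["selectors"]:
        parent = sel["parentSelectors"][0]
        # "_root" means the selector is applied to the page itself;
        # otherwise prefix it with the parent's CSS selector (one level only).
        prefix = "" if parent == "_root" else by_id[parent]["selector"] + " "
        paths[sel["id"]] = prefix + sel["selector"]
    return urls, paths


if __name__ == "__main__":
    urls, paths = describe_sitemap(SITEMAP_JSON)
    print(urls[0])
    for selector_id, css in paths.items():
        print(f"{selector_id}: {css}")
```

Read this way, the element selector picks rows from the third `tr` onward in the second table, which presumably skips header rows, and each text selector reads one `td` from that row.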

