chrome 发表于 2020-12-15 21:47:35

谷歌浏览器爬虫插件Webscraper抓取宝可梦列表的模板

使用chrome谷歌浏览器爬虫插件Webscraper抓取宝可梦列表的模板内容。

1、 数据字段

关都
全国
图象
中文
日文
英文
属性 1
属性 2

2、结果示例截图



3、sitemap json

{"_id":"poke","startUrl":["http://wiki.52poke.com/wiki/%E5%AE%9D%E5%8F%AF%E6%A2%A6%E5%88%97%E8%A1%A8%EF%BC%88%E6%8C%89%E5%85%A8%E5%9B%BD%E5%9B%BE%E9%89%B4%E7%BC%96%E5%8F%B7%EF%BC%89"],"selectors":[{"id":"element","type":"SelectorElement","parentSelectors":["_root"],"selector":"table:nth-of-type(2) tr:nth-of-type(n+3)","multiple":true,"delay":0},{"id":"guandu","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(1)","multiple":false,"regex":"","delay":0},{"id":"quanguo","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(2)","multiple":false,"regex":"","delay":0},{"id":"zhongwen","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(4)","multiple":false,"regex":"","delay":0},{"id":"riwen","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(5)","multiple":false,"regex":"","delay":0},{"id":"yingwen","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(6)","multiple":false,"regex":"","delay":0},{"id":"shuxing1","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(7)","multiple":false,"regex":"","delay":0},{"id":"shuxing2","type":"SelectorText","parentSelectors":["element"],"selector":"td:nth-of-type(8)","multiple":false,"regex":"","delay":0}]}

页: [1]
查看完整版本: 谷歌浏览器爬虫插件Webscraper抓取宝可梦列表的模板