
Scraping a CSDN author's article list with the Web Scraper Chrome extension

Board: Chrome Tutorials and FAQ | 2020-12-15 21:58 | Views: 1451 | Replies: 0

1. Data fields

Article type
Article title
Article detail link
Article summary
Publish date
View count
Comment count
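
The fields above could be modelled in Python roughly as follows when post-processing an export. This is only a sketch: the column names (`tag`, `title`, `title-href`, `intro`, `date`, `read`, `comment`) are assumed from the selector ids in the sitemap in section 3, `title-href` is assumed to be the column the link selector writes its URL to, and `ArticleRow`, `parse_row`, and the sample row are hypothetical names and made-up values for illustration.

```python
from dataclasses import dataclass


@dataclass
class ArticleRow:
    """One entry of the article list (hypothetical class name)."""
    tag: str      # article type
    title: str    # article title
    url: str      # article detail link
    intro: str    # article summary
    date: str     # publish date, kept as the scraped text
    read: int     # view count
    comment: int  # comment count


def to_int(text):
    """Counts arrive as text; treat empty or non-numeric cells as 0."""
    try:
        return int((text or "").strip())
    except ValueError:
        return 0


def parse_row(row):
    """Convert one exported row (a dict of strings) into an ArticleRow."""
    def clean(key):
        return (row.get(key) or "").strip()

    return ArticleRow(
        tag=clean("tag"),
        title=clean("title"),
        url=clean("title-href"),  # assumed column name for the link URL
        intro=clean("intro"),
        date=clean("date"),
        read=to_int(row.get("read")),
        comment=to_int(row.get("comment")),
    )


# Made-up sample row, not real scraped data.
sample = {
    "tag": " 原创 ",
    "title": "Example title",
    "title-href": "https://example.com/article/1",
    "intro": "Example summary",
    "date": "2020-12-01 10:00:00",
    "read": "123",
    "comment": "",
}
article = parse_row(sample)
print(article)
```

Converting the two counts to integers at load time makes later sorting or filtering by views simpler. The date is left as text because its exact format on the page is not stated here.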

2. Sample result screenshot

[Screenshot: w12.jpg]

3. Sitemap JSON



{"_id":"csdn","startUrl":["https://blog.csdn.net/zll_0405/article/list/[1-5]?"],"selectors":[{"id":"element","type":"SelectorElement","parentSelectors":["_root"],"selector":"div.article-item-box","multiple":true,"delay":0},{"id":"tag","type":"SelectorText","parentSelectors":["element"],"selector":"span.article-type","multiple":false,"regex":"","delay":0},{"id":"title","type":"SelectorLink","parentSelectors":["element"],"selector":"h4 a","multiple":false,"delay":0},{"id":"intro","type":"SelectorText","parentSelectors":["element"],"selector":".content a","multiple":false,"regex":"","delay":0},{"id":"date","type":"SelectorText","parentSelectors":["element"],"selector":"span.date","multiple":false,"regex":"","delay":0},{"id":"read","type":"SelectorText","parentSelectors":["element"],"selector":"p:nth-of-type(3) span.num","multiple":false,"regex":"","delay":0},{"id":"comment","type":"SelectorText","parentSelectors":["element"],"selector":"p:nth-of-type(5) span.num","multiple":false,"regex":"","delay":0}]}
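
To see what the sitemap describes before importing it, the sketch below loads a trimmed copy of it and expands the `[1-5]` page range in `startUrl` into concrete list-page URLs. It only approximates how the extension treats a plain `[start-end]` range; other range forms are not handled, and `expand_start_url` and `children_of` are hypothetical helper names.

```python
import json
import re

# Trimmed copy of the sitemap above: only two selectors kept for brevity.
SITEMAP_JSON = '''{"_id":"csdn","startUrl":["https://blog.csdn.net/zll_0405/article/list/[1-5]?"],"selectors":[{"id":"element","type":"SelectorElement","parentSelectors":["_root"],"selector":"div.article-item-box","multiple":true,"delay":0},{"id":"title","type":"SelectorLink","parentSelectors":["element"],"selector":"h4 a","multiple":false,"delay":0}]}'''

# Matches a plain numeric range such as [1-5].
RANGE = re.compile(r"\[(\d+)-(\d+)\]")


def expand_start_url(url):
    """Expand the first [start-end] range in a start URL into one URL per page."""
    match = RANGE.search(url)
    if match is None:
        return [url]
    first, last = int(match.group(1)), int(match.group(2))
    head, tail = url[:match.start()], url[match.end():]
    return [head + str(n) + tail for n in range(first, last + 1)]


def children_of(sitemap, parent_id):
    """Return the ids of selectors that list parent_id as a parent."""
    return [s["id"] for s in sitemap["selectors"]
            if parent_id in s["parentSelectors"]]


sitemap = json.loads(SITEMAP_JSON)
urls = [u for start in sitemap["startUrl"] for u in expand_start_url(start)]

for u in urls:
    print(u)
print(children_of(sitemap, "_root"))    # selectors applied to each page
print(children_of(sitemap, "element"))  # selectors applied to each article box
```

In the full sitemap, `element` selects every `div.article-item-box` on a list page, and the other six selectors are its children, so each article box yields one row.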


Author: iWebscraper
