视频号爬虫

wangkun e3944b1df2 update 2 years ago
.idea 9619456898 update 2 years ago
chlsfiles 11a41fa4e0 update 2 years ago
logs fd49420ba3 first commit 2 years ago
main e3944b1df2 update 2 years ago
shipinhao bb26f5e33d update 2 years ago
videos e43a143963 update 2 years ago
xinshi 06978e6786 update 2 years ago
.gitignore 40d448b9a7 add xinshi 2 years ago
README.md 2afc41db6d update 2 years ago
shipinhao.sh 6d5c1728ea update 2 years ago

README.md

crawler_shipinhao

  1. git:https://git.yishihui.com/Server/crawler_shipinhao.git
  2. 需求:https://w42nne6hzg.feishu.cn/docx/doxcny2B80x9h5B6iu4a36jsiDc
  3. 爬虫表:https://w42nne6hzg.feishu.cn/sheets/shtcn9rOdZRAGFbRkWpn7hqEHGc

介绍

视频号爬虫项目

软件架构

  1. python==3.10
  2. Appium_Python_Client==2.6.1
  3. loguru==0.6.0
  4. oss2==2.15.0
  5. psutil==5.9.2
  6. requests==2.27.1
  7. selenium==4.4.3
  8. urllib3==1.26.9

使用说明

  1. cd ./crawler_shipinhao
  2. sh shipinhao.sh

需求

2022/12/20

  1. 新增定向脚本

2022/10/27

  1. 新增新视榜单爬虫

2022/10/18

  1. 运行时间调整: 10:00:00 - 16:00:00 (包含)

2022/10/12

  1. 同一账号下的作品最多连续抓取两条视频

2022/10/11

  1. 推荐榜,每日入库条数限制:100

2022/9/27

  1. 新增按照话题抓取

2022/9/14

  1. 修改视频时长>=30s

2022/9/13

  1. 视频时长>=10s
  2. 点赞>=1000
  3. 运行时间: 10:00:00 - 20:00:00
  4. 上传账号: [20631278, 20631279]