一、概述这是一个用于获取facebook个人页信息的爬虫，可获取个人资料、粉丝列表、关注列表、好友列表、发布帖子。

二、依赖详见requirements.txt 终端安装:pip install -r requirements.txt

三、模块介绍 1、crawl模块主要存储爬取配置cookie以及headers信息,可调用函数user_crawl（获取用户资料）、friends_crawl（获取好友）、following_crawl（获取关注）、followers_crawl（获取粉丝）、post_crawl（获取帖子）详细查看:help(Crawl) 2、extract模块主要存储数据解析函数，可调用函数user_extract（解析个人资料）、friends_following_followers_extract（解析关注、粉丝、好友）、post_extract（解析帖子）详细查看:help(Extract) 3、store模块主要存储数据保存函数，包含mysql存储以及excel存储详细查看:help(Store) 4、main模块为项目主要流程模块，包含实际业务逻辑，可自行布置

四、说明 1、crawl中setting设置打开F12工具，定位json数据或是文档数据，复制curl 打开网页https://curlconverter.com/，获取cookie及headers 2、store中修改mysql的密码

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Files

README.md

Latest commit

History

README.md

File metadata and controls