Twitter Scraper

✏️ Setup
💁 More infos and Usage
🧪 Testing
🧑‍🏫 Contributing
⚖️ License
🔄 Changelog
🐛 Bugs & TODO

✏️ Setup

Note: This project is currently under development and is not yet ready for production.

First, install the required packages with the following command:

pip install --upgrade -r requirements.txt

Then set up a Twitter developer account and create a new app to get your API keys. You can find more information here.

Finally, create a new file named .env in the root directory of the project and add the following lines (based on .env.example):

API_KEY =
API_KEY_SECRET =
BEARER_TOKEN =
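
To check that the .env file is filled in before running anything, a small helper like the one below can be handy. This is only a sketch using the standard library: the function names `read_env_file` and `missing_keys` are hypothetical and are not part of this project, which may load its credentials differently.

```python
from pathlib import Path

# Key names taken from the .env template above.
REQUIRED_KEYS = ("API_KEY", "API_KEY_SECRET", "BEARER_TOKEN")


def read_env_file(path):
    """Parse simple `KEY = value` lines; blank lines and # comments are skipped."""
    values = {}
    for raw_line in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values):
    """Return the required keys that are absent or left empty."""
    return [key for key in REQUIRED_KEYS if not values.get(key)]
```

Calling `missing_keys(read_env_file(".env"))` from the project root returns an empty list once all three values are set.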

💁 More infos and Usage

🧪 Testing

Oh god! Please don't... Still, make sure you have pytest installed and run the following command:

pytest .\twitter_scraper\

You can also run the tests from the VS Code UI.

🧑‍🏫 Contributing

If you ever want to contribute, please begin by reading our Contributing Guidelines.

The standard procedure is:
fork -> git branch -> push -> pull request
Note that we won't accept any PR:

that does not follow our Contributing Guidelines

that is not sufficiently commented or isn't well formatted

without any proper test suite

with a failing or incomplete test suite

Happy coding! 🙂

⚖️ License

This project is licensed under the CeCILL-C FREE SOFTWARE LICENSE AGREEMENT. For more information, please refer to the official website.

🔄 Changelog

See changelog.md for more information.

gantt
    title Main Versions
    dateFormat YYYY-MM-DD

    section source Code (v0)
    v0.1 : 2023-01-16, 1d
    v0.2 :             2d
    v0.3 :             2d

    section stable Versions
    v1   : 2023-01-19, 9d

Stable Version 1

v1.0 first stable release

collections.abc instead of typing (deprecated)
lowered the requirements
minimum supported Python version is now 3.10.6

v1.1 more queries and less storage

encoded tweet.content into bytes for storage
added retweet and reply selectors to SearchQuery

🐛 Bugs & TODO

known bugs (final correction patch version) see Issues

tweet.date is always None when scraping (stored as 0)
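
For readers consuming the stored data, the bug above means a stored 0 should be read as "no date" rather than as a real timestamp. The sketch below shows one way to handle that. It assumes dates are kept as integer Unix timestamps, which this README does not confirm, and the helper names are hypothetical.

```python
from datetime import datetime, timezone
from typing import Optional

# Sentinel for a missing date: the bug note above says None is stored as 0.
MISSING_DATE = 0


def date_to_stored(date: Optional[datetime]) -> int:
    """Convert an optional datetime to an integer timestamp (assumed format)."""
    if date is None:
        return MISSING_DATE
    return int(date.timestamp())


def stored_to_date(stored: int) -> Optional[datetime]:
    """Read a stored value back, treating the 0 sentinel as 'no date'."""
    if stored == MISSING_DATE:
        return None
    return datetime.fromtimestamp(stored, tz=timezone.utc)
```

One caveat of a 0 sentinel under this assumption is that it cannot be told apart from the Unix epoch itself, which is harmless for tweets but worth knowing.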

todo (first implementation version)

encode tweet.content into bytes for storage
should add tweet.date back in when scraping
add large search queries
a posteriori tweet inspection
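
The first todo item, encoding tweet.content into bytes for storage, can be illustrated with a short round trip. This is a sketch, not the project's code: the UTF-8 choice, the in-memory SQLite table, and the function names are all assumptions made for the example.

```python
import sqlite3


def encode_content(content: str) -> bytes:
    """Encode tweet text as UTF-8 bytes (assumed encoding)."""
    return content.encode("utf-8")


def decode_content(stored: bytes) -> str:
    """Decode stored bytes back into text."""
    return stored.decode("utf-8")


def roundtrip_through_sqlite(content: str) -> str:
    """Store content as a BLOB in a throwaway in-memory table and read it back."""
    connection = sqlite3.connect(":memory:")
    try:
        # Hypothetical schema, used only for this illustration.
        connection.execute(
            "CREATE TABLE tweets (id INTEGER PRIMARY KEY, content BLOB)"
        )
        connection.execute(
            "INSERT INTO tweets (content) VALUES (?)",
            (encode_content(content),),
        )
        (stored,) = connection.execute("SELECT content FROM tweets").fetchone()
        return decode_content(stored)
    finally:
        connection.close()
```

Storing bytes keeps emoji and accented characters intact regardless of the database's text settings, at the cost of needing an explicit decode step when reading.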
