Skip to content

Commit

Permalink
Docs
Browse files Browse the repository at this point in the history
  • Loading branch information
Yomguithereal committed Feb 2, 2021
1 parent 3b99a02 commit a30693b
Show file tree
Hide file tree
Showing 3 changed files with 37 additions and 4 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,7 @@ In addition, **minet** also exposes its high-level programmatic interface as a p
* Collecting data from [Facebook](https://www.facebook.com/) (comments, likes etc.)
* Parsing [Facebook](https://www.facebook.com/) urls in a CSV file.
* Collecting data from [Twitter](https://twitter.com) (users, followers, followees etc.)
* Scraping data (tweets etc.) from [Twitter](https://twitter.com)'s website public facing search API.
* Collecting data from [YouTube](https://www.youtube.com/) (captions, comments, video metadata etc.)
* Parsing [YouTube](https://www.youtube.com/) urls in a CSV file.
* Dumping a [Hyphe](https://hyphe.medialab.sciences-po.fr/) corpus.
Expand Down
35 changes: 31 additions & 4 deletions docs/cli.md
Original file line number Diff line number Diff line change
Expand Up @@ -41,6 +41,7 @@
* [twitter](#twitter)
* [followers](#followers)
* [friends](#friends)
* [scrape](#twitter-scrape)
* [users](#users)
* [youtube (yt)](#youtube)
* [captions](#captions)
Expand Down Expand Up @@ -549,7 +550,7 @@ usage: minet crowdtangle search [-h] [--rate-limit RATE_LIMIT] [-o OUTPUT]
[--language LANGUAGE] [-l LIMIT]
[--not-in-title] [--offset OFFSET]
[-p PLATFORMS]
[--search-field {include_query_strings,image_text_only,text_fields_only,account_name_only,text_fields_and_image_text}]
[--search-field {text_fields_only,account_name_only,image_text_only,include_query_strings,text_fields_and_image_text}]
[--sort-by {date,interaction_rate,overperforming,total_interactions,underperforming}]
[--start-date START_DATE] [--types TYPES]
terms
Expand Down Expand Up @@ -579,7 +580,7 @@ optional arguments:
--not-in-title Whether to search terms in account titles also.
--offset OFFSET Count offset.
-p PLATFORMS, --platforms PLATFORMS The platforms from which to retrieve links (facebook, instagram, or reddit). This value can be comma-separated.
--search-field {include_query_strings,image_text_only,text_fields_only,account_name_only,text_fields_and_image_text}
--search-field {text_fields_only,account_name_only,image_text_only,include_query_strings,text_fields_and_image_text}
In what to search the query. Defaults to `text_fields_and_image_text`.
--sort-by {date,interaction_rate,overperforming,total_interactions,underperforming}
The order in which to retrieve posts. Defaults to `date`.
Expand All @@ -599,7 +600,7 @@ examples:
usage: minet crowdtangle summary [-h] [--rate-limit RATE_LIMIT] [-o OUTPUT]
[-t TOKEN] [-p PLATFORMS] [--posts POSTS]
[-s SELECT]
[--sort-by {subscriber_count,total_interactions,date}]
[--sort-by {subscriber_count,date,total_interactions}]
[--start-date START_DATE] [--total TOTAL]
column [file]

Expand All @@ -621,7 +622,7 @@ optional arguments:
-p PLATFORMS, --platforms PLATFORMS The platforms from which to retrieve links (facebook, instagram, or reddit). This value can be comma-separated.
--posts POSTS Path to a file containing the retrieved posts.
-s SELECT, --select SELECT Columns to include in report (separated by `,`).
--sort-by {subscriber_count,total_interactions,date}
--sort-by {subscriber_count,date,total_interactions}
How to sort retrieved posts. Defaults to `date`.
--start-date START_DATE The earliest date at which a post could be posted (UTC!). You can pass just a year or a year-month for convenience.
--total TOTAL Total number of HTML documents. Necessary if you want to display a finite progress indicator.
Expand Down Expand Up @@ -908,6 +909,32 @@ examples:
```

<h3 id="twitter-scrape">scrape</h3>

```
usage: minet twitter scrape [-h] [-l LIMIT] [-o OUTPUT] {tweets} query
Minet Twitter Scrape Command
============================
Scrape Twitter's public facing search API to collect tweets etc.
positional arguments:
{tweets} What to scrape. Currently only `tweets` is possible.
query Search query.
optional arguments:
-h, --help show this help message and exit
-l LIMIT, --limit LIMIT Maximum number of tweets to collect.
-o OUTPUT, --output OUTPUT Path to the output file. By default, the result will be printed to stdout.
examples:
. Collecting the latest 500 tweets of a given Twitter user:
`minet tw scrape tweets "from:@jack" --limit 500 > tweets.csv`
```

### users

```
Expand Down
5 changes: 5 additions & 0 deletions docs/cli.template.md
Original file line number Diff line number Diff line change
Expand Up @@ -41,6 +41,7 @@
* [twitter](#twitter)
* [followers](#followers)
* [friends](#friends)
* [scrape](#twitter-scrape)
* [users](#users)
* [youtube (yt)](#youtube)
* [captions](#captions)
Expand Down Expand Up @@ -199,6 +200,10 @@ For more documentation about minet's scraping DSL check this [page](../cookbook/
<% twitter/friends %>
<h3 id="twitter-scrape">scrape</h3>
<% twitter/scrape %>
### users
<% twitter/users %>
Expand Down

0 comments on commit a30693b

Please sign in to comment.