Skip to main content

Archive tweets from the command line

Project description

twarc

twarc is a command line tool and Python library for collecting and archiving Twitter JSON data via the Twitter API. It has separate commands (twarc and twarc2) for working with the older v1.1 API and the newer v2 API and Academic Access (respectively). It also has an ecosystem of plugins for doing things with the collected data.

See the twarc documentation for running commands: twarc2 and twarc1 for using the v1.1 API. If you aren't sure about which one to use you'll want to start with twarc2 since the v1.1 is scheduled to be retired.

Install

If you have python installed, you can install twarc using:

pip3 install twarc

Once installed, you should be able to use the twarc and twarc2 command line utilities, or use it as a Python library - check the examples here for that.

Other Tools

Twarc is purpose build for working with the twitter API for archiving and studying digital trace data. It is not built as a general purpose API library for Twitter. While the primary use is academic, it works just as well with "Standard" v2 API and "Premium" v1.1 APIs.

For a list of general purpose Twitter Libraries in different languages see the Twitter Documentation. For Python, TwitterAPI and tweepy are both up to date and maintained. They also support v2 APIs, and their data format with expansions may differ from twarc. There is also a reference implementation of the v2 Academic Access Search and v1.1 Premium Search from Twitter here. The v2 version of this script is compatible with twarc.

For R there is academictwitteR. Unlike twarc, it focuses solely on querying the Twitter Academic Research Product Track v2 API endpoint. Data gathered in twarc can be imported into R for analysis as a dataframe if you export the data into CSV using twarc-csv.

Getting Help

Check the tutorials to get started, or follow along with this recorded stream introducing twarc. If you run into trouble, feel free to make a post on the Twarc Repository or on the Twitter Developer Forums.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

twarc-2.8.1.tar.gz (52.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

twarc-2.8.1-py3-none-any.whl (54.4 kB view details)

Uploaded Python 3

File details

Details for the file twarc-2.8.1.tar.gz.

File metadata

  • Download URL: twarc-2.8.1.tar.gz
  • Upload date:
  • Size: 52.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.8

File hashes

Hashes for twarc-2.8.1.tar.gz
Algorithm Hash digest
SHA256 459bc8fedfbe7ae56b670c75ae52cbbd594974849d9b62432f00b21e1244e13c
MD5 684c839bbd188b893eb996941eb799ce
BLAKE2b-256 fbd70a32fae3c3c90165b51ee501f0976d0a1381075e344b2596cf7d942a3a0a

See more details on using hashes here.

File details

Details for the file twarc-2.8.1-py3-none-any.whl.

File metadata

  • Download URL: twarc-2.8.1-py3-none-any.whl
  • Upload date:
  • Size: 54.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.8

File hashes

Hashes for twarc-2.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 26efb59eb6a852a58f743b91ed166e65d767539acb4355694f220530446d1d38
MD5 9e91d343856d2c923294d7ba92b64dbd
BLAKE2b-256 788b13d6985b2c4754b344c60fca4a5373419dd6ead8e6688c587a42f60cd8d1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page