AruneshNandan's Categories

AruneshNandan's Authors

Latest Saves

Do you need social media data for your machine learning project?

- Twitter data?
- Reddit data?
- Facebook data?

Where to get it?


Reddit: Pushshift

Pushshift is a big-data storage and analytics project.

Most people know it for its copy of reddit comments and

Reddit: Pushshift API

The
https://t.co/HWZNWEvrxY Reddit API was designed and created by the /r/datasets mod team to help provide enhanced functionality and search capabilities for searching Reddit comments and

Reddit: Pushshift file download

Note: The latest data for manual download is from April 2020

https://t.co/jBM4U71dnm


Reddit: PMAW: Pushshift Multithread API Wrapper

PMAW is an ultra minimalist wrapper for the Pushshift API which uses multithreading to retrieve Reddit comments and submissions.

If you pull data via Pushshift use PMAW, highly recommended!

https://t.co/xSlaX3Di6T