Reddit Scraper - Posts, Comments, Subreddits & Users
Pricing
$3.00 / 1,000 results
Scrape Reddit posts, comments, subreddits and user profiles by keyword or URL. Full comment threads, 40+ fields, search, NSFW and monitoring mode. No login, no API key, no browser.
Developer
Scrape Sage
Maintained by Community
Reddit Scraper — Posts, Comments, Subreddits & User Profiles (No Login)
Scrape Reddit at scale without an API key, without login, and without a browser. Pull posts, full comment threads, subreddit feeds, keyword search results, communities and user profiles into clean, structured JSON — by URL or by keyword — with monitoring mode so scheduled runs only return what's new.
Built for market research, AI / LLM training data, brand & reputation monitoring, sentiment analysis, social selling and trend discovery.
Why this Reddit scraper?
- No login, no API key, no rate-limit headaches. Reddit's public API is gated and throttled; this actor reads the public site directly and rotates IPs automatically.
- Reliable where others fail. Reddit hard-blocks the old `.json` endpoints that most scrapers depend on. This actor uses the fully server-rendered site over residential proxies, so it keeps returning data when cheaper tools return errors.
- Richest record in class, with 40+ fields per post and per comment: score, comment count, exact timestamps, flair, NSFW/spoiler/stickied flags, domain, author, permalinks, full comment depth and parent/child relationships.
- Clean data, no junk. `[deleted]`/`[removed]`/empty comments are filtered out by default, and you are never charged for them. Every record also carries a unified `text` column (post title, comment body, community description or user summary), so the default Overview view is fully populated at a glance, with no half-empty rows.
- Everything in one actor: posts, comments, subreddits, search, communities and users. No need to chain five different scrapers.
- Monitoring mode — remember what you've already pulled and return only new posts/comments on the next run. Perfect with Schedules.
- Pay per result, no start fee. You only pay for the records you actually receive.
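The pay-per-result pricing makes run costs easy to estimate up front. The sketch below assumes the listed price of $3.00 per 1,000 results applies uniformly to every record type; the function name `estimateCostUsd` is hypothetical and not part of the actor.

```javascript
// Listed price: $3.00 per 1,000 results.
const PRICE_PER_1000_RESULTS_USD = 3.0;

// Hypothetical helper: estimate the result cost of a run in USD.
// Assumes every returned record is billed at the same rate.
function estimateCostUsd(resultCount, pricePer1000 = PRICE_PER_1000_RESULTS_USD) {
  if (!Number.isInteger(resultCount) || resultCount < 0) {
    throw new RangeError('resultCount must be a non-negative integer');
  }
  return (resultCount / 1000) * pricePer1000;
}

console.log(estimateCostUsd(500).toFixed(2)); // prints "1.50"
```

Because skipped `[deleted]`/`[removed]`/empty comments are not billed, the count to plug in is the number of records actually returned, which `maxItems` caps.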
Use cases
- AI & LLM training data — harvest high-signal discussions and Q&A threads from any subreddit.
- Brand & product monitoring — track every new mention of your brand, product or competitor across Reddit.
- Market & customer research — mine real opinions, pain points and buying questions in niche communities.
- Sentiment & trend analysis — feed posts and comments into your NLP pipeline.
- Social selling / lead discovery — find people actively asking for recommendations in your space.
- Community & influencer analysis — profile subreddits and prolific users by karma and activity.
How to use
- Tell the actor what to scrape — paste Reddit Start URLs (subreddit, post, user or search pages), and/or list subreddits, search keywords or usernames.
- Set Sort, Time range and Max items.
- (Optional) Turn on Include comments to pull comment threads for every post, and Monitoring mode for new-only runs.
- Run it. Download the dataset as JSON, CSV, Excel, or push it anywhere via API and integrations.
Input
| Field | Description |
|---|---|
| `startUrls` | Any Reddit URLs: subreddit, post/comments, user, search or `/subreddits/search`. Auto-detected and routed. |
| `subreddits` | Subreddit names (e.g. `RealEstate`, `r/AskReddit`) to pull posts from. |
| `searches` | Keyword queries; returns posts (default) or communities. |
| `usernames` | Reddit users; returns their profile (karma, cake day) plus recent posts and comments. |
| `searchScope` | What `searches` returns: `posts` or `communities`. |
| `sort` / `time` | Order (hot/new/top/rising/controversial/relevance) and time window for top. |
| `maxItems` | Global cap across all record types. |
| `includeComments` / `maxCommentsPerPost` / `commentSort` | Pull comment threads for posts; how many and in what order. |
| `expandMoreComments` | Follow "load more comments" links for deeper threads. |
| `includeDeletedComments` | Off by default: skips (and never charges for) `[deleted]`/`[removed]`/empty comments. Turn on to keep them with an `isDeleted` flag. |
| `onlyPostsNewerThan` | Keep only fresh items: ISO date or "7 days", "24 hours", etc. |
| `includeNSFW` / `skipAds` | Include 18+ content; skip promoted posts. |
| `monitorMode` / `monitorStoreName` | Return only records not seen in previous runs. |
| `proxyConfiguration` | Defaults to Apify Residential (strongly recommended for Reddit). |
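The `onlyPostsNewerThan` field accepts either an ISO date or a relative window. If you want to apply the same kind of cutoff client-side to a dataset you have already downloaded, a minimal sketch could look like the following. The helpers `cutoffFrom` and `newerThan` are hypothetical, handle only hours and days, and are not the actor's own parser.

```javascript
// Hypothetical helper: turn an ISO date or a relative window such as
// "7 days" / "24 hours" into a cutoff Date. Illustration only; the actor's
// real parsing rules may differ.
function cutoffFrom(value, now = new Date()) {
  const match = /^(\d+)\s*(hour|day)s?$/i.exec(value.trim());
  if (match) {
    const unitMs = { hour: 3600e3, day: 86400e3 }[match[2].toLowerCase()];
    return new Date(now.getTime() - Number(match[1]) * unitMs);
  }
  const parsed = new Date(value);
  if (Number.isNaN(parsed.getTime())) {
    throw new Error(`Unrecognized date or window: ${value}`);
  }
  return parsed;
}

// Keep only records whose createdAt is at or after the cutoff.
function newerThan(items, value, now = new Date()) {
  const cutoff = cutoffFrom(value, now);
  return items.filter((item) => item.createdAt && new Date(item.createdAt) >= cutoff);
}

const now = new Date('2026-06-23T00:00:00.000Z');
const records = [
  { id: 'a', createdAt: '2026-06-22T20:35:40.000Z' },
  { id: 'b', createdAt: '2026-06-20T08:00:00.000Z' },
];
console.log(newerThan(records, '24 hours', now).map((r) => r.id)); // prints [ 'a' ]
```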
Example input
```json
{
  "subreddits": ["RealEstate", "FirstTimeHomeBuyer"],
  "sort": "top",
  "time": "week",
  "maxItems": 500,
  "includeComments": true,
  "maxCommentsPerPost": 30,
  "commentSort": "top"
}
```
Output
Every row carries a type (post, comment, community or user) and a unified text column holding its main content (post title, comment body, community description, or a user's karma summary). The dataset opens on a single Overview table where every row reads as full regardless of type — then switch to the focused Posts / Comments / Communities / Users views for type-specific columns.
```json
{
  "type": "post",
  "id": "1ucvvkb",
  "fullId": "t3_1ucvvkb",
  "subreddit": "RealEstate",
  "author": "throwaway_homebuyer",
  "score": 41,
  "createdAt": "2026-06-22T20:35:40.000Z",
  "text": "Attorney won't finish house escrow closing, asking for more money (NY)",
  "title": "Attorney won't finish house escrow closing, asking for more money (NY)",
  "subredditPrefixed": "r/RealEstate",
  "permalink": "https://www.reddit.com/r/RealEstate/comments/1ucvvkb/...",
  "url": "https://www.reddit.com/r/RealEstate/comments/1ucvvkb/...",
  "domain": "self.RealEstate",
  "isSelf": true,
  "linkFlair": "Financing",
  "numComments": 117,
  "createdUtc": 1750624540,
  "isNSFW": false,
  "isSpoiler": false,
  "isStickied": false,
  "selfText": "We're closing on a house in NY and our attorney...",
  "scrapedAt": "2026-06-23T00:00:00.000Z"
}
```

```json
{
  "type": "comment",
  "id": "ot74y68",
  "fullId": "t1_ot74y68",
  "subreddit": "RealEstate",
  "author": "real_estate_atty",
  "score": 28,
  "createdAt": "2026-06-22T20:55:40.000Z",
  "text": "It doesn't sound like the addendum was written in a way which...",
  "body": "It doesn't sound like the addendum was written in a way which...",
  "postId": "1ucvvkb",
  "postTitle": "Attorney won't finish house escrow closing, asking for more money (NY)",
  "parentFullId": "t3_1ucvvkb",
  "depth": 0,
  "isSubmitter": false,
  "isDeleted": false,
  "permalink": "https://www.reddit.com/r/RealEstate/comments/1ucvvkb/.../ot74y68/"
}
```
User records carry postKarma, commentKarma, totalKarma and cakeDay; community records carry name, title, publicDescription and url.
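Since every comment record carries `fullId` and `parentFullId`, the flat dataset can be turned back into nested threads. The sketch below is one possible way to do that. The function name `buildCommentTree` and the sample IDs are hypothetical, and it assumes a reply's `parentFullId` holds the `fullId` of the comment it answers (the example record above only shows a top-level comment pointing at its post).

```javascript
// Hypothetical helper: rebuild nested threads from flat dataset records.
function buildCommentTree(items) {
  const comments = items.filter((item) => item.type === 'comment');
  // Index every comment by fullId and give each one a replies array.
  const nodes = new Map(comments.map((c) => [c.fullId, { ...c, replies: [] }]));
  const roots = [];
  for (const node of nodes.values()) {
    const parent = nodes.get(node.parentFullId);
    if (parent) {
      parent.replies.push(node);
    } else {
      // Parent is the post itself, or a comment that was not scraped.
      roots.push(node);
    }
  }
  return roots;
}

// Made-up sample records, reduced to the fields the helper reads.
const sample = [
  { type: 'post', fullId: 't3_1ucvvkb' },
  { type: 'comment', fullId: 't1_aaa', parentFullId: 't3_1ucvvkb', depth: 0 },
  { type: 'comment', fullId: 't1_bbb', parentFullId: 't1_aaa', depth: 1 },
];
const threads = buildCommentTree(sample);
console.log(threads.length, threads[0].replies.length); // prints "1 1"
```

Comments whose parent was filtered out (for example a skipped `[deleted]` comment) end up as roots here rather than being dropped, which keeps every scraped record reachable.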
Automate & schedule
Run this actor on autopilot and pull results into your own stack:
- Apify API — start runs, fetch datasets, and manage schedules over REST.
- apify-client for JavaScript and apify-client for Python: the official API client libraries.
- Schedules — run it hourly/daily with monitoring mode to capture only new posts and comments in a subreddit or for a keyword; perfect for brand monitoring and fresh AI data.
- Webhooks — trigger downstream actions (Slack alert, database insert, AI summary) the moment a run finishes.
```javascript
import { ApifyClient } from 'apify-client';

// Authenticate with your Apify API token.
const client = new ApifyClient({ token: 'MY_APIFY_TOKEN' });

// Start the actor and wait for the run to finish.
const run = await client.actor('scrapesage/reddit-scraper').call({
    searches: ['best CRM for small business'],
    searchScope: 'posts',
    sort: 'new',
    maxItems: 300,
    includeComments: true,
    maxCommentsPerPost: 25,
});

// Read the scraped records from the run's default dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(`Got ${items.length} Reddit records`);
```
Integrate with any app
Connect the dataset to 5,000+ apps — no code required:
- Make — multi-step automation scenarios.
- Zapier — push new mentions straight into Slack, Sheets or your CRM.
- Slack — get pinged when a monitored subreddit or keyword gets new posts.
- Google Drive / Sheets — auto-export every run to a spreadsheet.
- Airbyte — pipe results into your data warehouse.
- GitHub — trigger runs from commits or releases.
Use with AI assistants (MCP)
The output is clean, LLM-ready JSON. Call this actor from Claude, ChatGPT, or any agent framework through the Apify MCP server — ask your assistant to "find the top posts about mortgage rates in r/RealEstate this week with their comments" and let it run this scraper for you.
More scrapers from scrapesage
Build a complete social, content & sentiment-data stack:
- YouTube Scraper — videos, channels, stats and comments.
- Threads Scraper — posts and profiles from Meta's Threads.
- Bluesky Scraper — posts and profiles from Bluesky.
- Telegram Scraper — public channel messages and members.
- Twitch Scraper — streams, channels and clips.
- Substack Scraper — newsletters, posts and authors.
- Google News Scraper — news articles by topic and query.
- Website Contact Scraper — emails, phones and socials from any site.
Tips
- Use Residential proxy. Reddit blocks datacenter IPs — the default Residential setting is what keeps this actor reliable.
- Fresh-only feeds. Combine `monitorMode` + a Schedule + `onlyPostsNewerThan` to capture only new posts/comments each run, without re-paying for old data.
- Comments cost extra requests. Leave `includeComments` off for a fast posts-only pass; turn it on (with a sensible `maxCommentsPerPost`) when you need the discussion.
- No junk by default. `[deleted]`/`[removed]`/empty comments are skipped (and not billed), so `maxCommentsPerPost` fills up with real content. Flip `includeDeletedComments` on only if you need the thread's deleted placeholders for structure/analysis.
- Cover a topic fully. Add several `subreddits` and `searches` in one run; the global `maxItems` cap spreads across them.
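The tips above can be combined into a single input for a scheduled, new-only feed. This is a sketch rather than a verified configuration: the field names come from the Input table, but the value types (a boolean for `monitorMode`, a string for `monitorStoreName`) and the store name itself are assumptions.

```javascript
// Sketch of an input for scheduled, new-only runs.
// Field names are from the Input table; value types are assumed.
const monitoringInput = {
  searches: ['best CRM for small business'],
  searchScope: 'posts',
  sort: 'new',
  maxItems: 300,
  includeComments: false,           // fast posts-only pass
  monitorMode: true,                // return only records not seen before
  monitorStoreName: 'crm-mentions', // hypothetical store name
  onlyPostsNewerThan: '24 hours',   // relative window, as in the Input table
};

console.log(JSON.stringify(monitoringInput, null, 2));
```

An object like this can be passed as the input of a run started through the API or saved as the input of a Schedule.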
FAQ
Do I need a Reddit account or API key? No. This actor reads publicly available pages — no login, no app registration, no API key.
Why residential proxy? Reddit blocks datacenter IPs and requests to its public `.json` endpoints. The actor reads the rendered public site over residential IPs, which is the reliable path, and it rotates IPs automatically on any block.
Can I get full comment threads? Yes. Use a post URL (or includeComments on a subreddit/search) with maxCommentsPerPost, and turn on expandMoreComments to follow "load more comments" for deep threads. Each comment includes its depth and parentFullId so you can rebuild the tree.
How do I only get new posts over time? Turn on monitorMode and run on a Schedule — each run returns only records not seen before.
Can I export to Google Sheets, CSV, or Excel? Yes — one click in the dataset view, or automatically on every run via the Google Drive integration.
A field is null — why? Some values genuinely aren't published on a given page (e.g. a hidden comment score, or a community's subscriber count, which Reddit renders only client-side). Fields are null when the data isn't in the page, not because the scraper skipped them.
Is scraping Reddit legal? This actor collects publicly available data only. You are responsible for using the data in compliance with applicable laws (GDPR/CCPA for personal data) and Reddit's terms.
Need help?
Open an issue on the actor's Issues tab, or visit the Apify help center. Feature requests are welcome — this actor is actively maintained.