Pricing

Pay per usage

👾 Lemmy Scraper - Federated Reddit Posts & Comments

Scrape Lemmy (the federated Reddit alternative) from any instance via the public API — no login needed. Get front-page or per-community posts, comments, keyword search, and community data. Clean JSON with scores, upvotes & comment counts.

Developer

ben

Last modified

2 days ago

👾 Lemmy Scraper — Posts, Comments & Communities (Federated Reddit)

Extract data from Lemmy, the open, federated Reddit alternative, on any instance (lemmy.world, lemmy.ml, sh.itjust.works, beehaw.org and more) through the public API. Pull front-page or per-community posts, comments, keyword search results, or community data as clean, structured JSON with Reddit-style scores, upvotes/downvotes and comment counts, with no login required. Well suited to following communities that left Reddit and to open-social research. Export to JSON/CSV/Excel, run on a schedule, call via API, or connect to Make, Zapier or n8n.

👾 What is the Lemmy Scraper?

It turns any Lemmy instance into a structured dataset. Point it at a server, pick a mode (front-page posts, posts from specific communities, comments, a keyword search, or a list of communities), set a sort order, and it returns the matching records, up to your maxItems cap, straight from Lemmy's public REST API. Query the federated network as seen from that instance or just the instance itself, and reach cross-instance communities like technology@lemmy.world. It reads a clean JSON API instead of driving a headless browser, so it's fast and cheap.
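For orientation, the kind of request described above can be sketched as a URL against Lemmy's public API. The `/api/v3/post/list` path and the `type_`, `sort`, `community_name`, `limit` and `page` parameters come from Lemmy's v3 HTTP API rather than from this listing, and whether the Actor issues exactly these requests is an assumption. `build_post_list_url` is a hypothetical helper name, and the sketch only builds the URL without sending anything.

```python
from urllib.parse import urlencode


def build_post_list_url(instance, sort="Hot", listing_type="All",
                        community=None, limit=50, page=1):
    """Build a Lemmy v3 post-list URL (hypothetical helper, no request is sent)."""
    params = {"type_": listing_type, "sort": sort, "limit": limit, "page": page}
    if community:
        # Cross-instance names such as "technology@lemmy.world" go in the same field.
        params["community_name"] = community
    return f"https://{instance}/api/v3/post/list?{urlencode(params)}"


url = build_post_list_url("lemmy.ml", sort="TopWeek",
                          community="technology@lemmy.world")
print(url)
```

Fetching later pages would mean increasing `page` until a response comes back with no posts, which is one common way to paginate this endpoint.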

What data does it extract?

Post title, body and the link URL it points to
Reddit-style metrics — score, upvotes, downvotes and comment count
Community info — name, title and community URL
Creator (author) name and profile URL
Publish date, NSFW flag and thumbnail image
Comments — content, score, parent post title and author (comments mode)
Community listings — description, subscribers, post/comment totals and monthly active users
Canonical post/comment URLs (ap_id), plus a scraped_at timestamp

⬇️ Input

Choose an instance and a mode, then add communities, a query or a sort as needed:

Field	Description
`mode`	`posts`, `community`, `search`, `comments` or `communities`
`instance`	Lemmy server to query, e.g. `lemmy.world`, `lemmy.ml`, `beehaw.org`
`communities`	Community names, e.g. `technology`, `asklemmy`, or cross-instance `technology@lemmy.world`
`query`	Keyword to search for: matches posts in `search` mode and community names in `communities` mode
`sort`	`Hot`, `Active`, `New`, `TopDay`, `TopWeek`, `TopMonth`, `TopYear`, `TopAll` or `MostComments`
`listingType`	`All` (whole federated network) or `Local` (this instance only)
`maxItems`	Max records to return (1–50000)
`proxyConfiguration`	Optional Apify Proxy for IP rotation on large runs

Example input

{
  "mode": "community",
  "instance": "lemmy.world",
  "communities": ["technology", "asklemmy"],
  "sort": "TopWeek",
  "maxItems": 500
}
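Before starting a run, it can help to check an input object against the field table above. The following is a minimal sketch, not the Actor's own validation: `check_input` is a hypothetical helper, and treating `instance` as required while letting `listingType` and `maxItems` be omitted are assumptions, since the listing does not document defaults.

```python
ALLOWED_MODES = {"posts", "community", "search", "comments", "communities"}
ALLOWED_LISTING_TYPES = {"All", "Local"}


def check_input(run_input):
    """Return a list of problems found in an input dict (empty if none)."""
    problems = []
    if run_input.get("mode") not in ALLOWED_MODES:
        problems.append("mode must be one of: " + ", ".join(sorted(ALLOWED_MODES)))
    if not run_input.get("instance"):
        # Assumption: the listing does not say whether instance has a default.
        problems.append("instance is required")
    if run_input.get("mode") == "community" and not run_input.get("communities"):
        problems.append("community mode needs at least one community")
    if run_input.get("listingType", "All") not in ALLOWED_LISTING_TYPES:
        problems.append("listingType must be All or Local")
    max_items = run_input.get("maxItems", 1)
    if type(max_items) is not int or not 1 <= max_items <= 50000:
        problems.append("maxItems must be an integer from 1 to 50000")
    return problems


example = {
    "mode": "community",
    "instance": "lemmy.world",
    "communities": ["technology", "asklemmy"],
    "sort": "TopWeek",
    "maxItems": 500,
}
print(check_input(example))
```

Returning a list of messages rather than raising on the first error lets a caller report every problem in one pass.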

⬆️ Output

Every post (or comment/community) is one clean row — view it as a table, or export JSON / CSV / Excel:

{
  "type": "post",
  "id": 48685969,
  "title": "Self-hosting is easier than ever in 2026",
  "body": "Here's my setup...",
  "link_url": "https://example.com/article",
  "post_url": "https://lemmy.world/post/48685969",
  "published": "2026-06-26T09:00:00Z",
  "nsfw": false,
  "score": 842,
  "upvotes": 901,
  "downvotes": 59,
  "comments_count": 137,
  "community_name": "technology",
  "community_title": "Technology",
  "community_url": "https://lemmy.world/c/technology",
  "creator_name": "alice",
  "creator_url": "https://lemmy.world/u/alice",
  "thumbnail_url": "https://lemmy.world/pictrs/image/abc.jpg",
  "scraped_at": "2026-06-26T15:30:00.000Z"
}
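As an illustration of working with a row like the one above, the sketch below derives a few extra metrics from the documented fields. `summarize` and `parse_ts` are hypothetical helper names. In the example row, upvotes minus downvotes equals the score (901 - 59 = 842); assuming that holds for every row is my inference, not something the listing states.

```python
from datetime import datetime


def parse_ts(value):
    # Swap the trailing "Z" for an explicit offset so older Python versions parse it.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def summarize(row):
    """Derive net votes, upvote ratio and post age at scrape time from one row."""
    votes = row["upvotes"] + row["downvotes"]
    age = parse_ts(row["scraped_at"]) - parse_ts(row["published"])
    return {
        "net_votes": row["upvotes"] - row["downvotes"],
        "upvote_ratio": row["upvotes"] / votes if votes else None,
        "age_hours": age.total_seconds() / 3600,
    }


row = {
    "upvotes": 901,
    "downvotes": 59,
    "published": "2026-06-26T09:00:00Z",
    "scraped_at": "2026-06-26T15:30:00.000Z",
}
summary = summarize(row)
print(summary)
```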

💡 Use cases

👂 Community & topic monitoring: track discussions about a product, brand or topic across the fediverse.
🔄 Reddit-migration research: follow the communities and audiences that moved off Reddit.
📈 Trend & sentiment analysis: feed posts and comments straight into an LLM.
🔥 Content discovery: surface the top posts by community and time window with one sort setting.

❓ FAQ

How do I scrape Lemmy posts? Set mode: posts for the front page, or mode: community with one or more communities, choose an instance and a sort, and run the Actor. Each returned post includes title, body, link, scores and comment counts.

Do I need an API key or login? No — public posts, comments and communities all work with no login, straight from Lemmy's public REST API.

Does it work on any instance, and is it federated? Yes. Point instance at any Lemmy server. With listingType: All it covers the federated communities that instance knows about; with Local it stays on that one instance. lemmy.world, the largest instance, is a good starting point.

Can I scrape a community on another instance? Yes — use community@instance (e.g. technology@lemmy.world), since Lemmy is federated and resolves it for you.
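When post-processing results, it can be handy to split a `community@instance` reference into its two parts, for example to group rows by home instance. This is a small sketch for downstream use only; the Actor resolves these names itself, and `split_community` is a hypothetical helper.

```python
def split_community(ref, default_instance):
    """Split 'name@instance' into (name, instance); bare names use the default."""
    name, sep, host = ref.strip().partition("@")
    if not name:
        raise ValueError(f"missing community name in {ref!r}")
    return name, (host if sep and host else default_instance)


print(split_community("technology@lemmy.world", "lemmy.ml"))
print(split_community("asklemmy", "lemmy.ml"))
```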

Can I get comments, not just posts? Yes — mode: comments returns comments per community (via communities) or instance-wide, with content, score, the parent post title and the author.

How do I find communities to scrape? Use mode: communities with a query to search community names, or leave the query empty to list the instance's top communities with subscriber and activity counts.

How many records can it return? Up to your maxItems cap (50,000 at most). It paginates automatically and, in community/comments modes, splits the cap across the communities you give it.
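The listing does not say how the cap is divided between communities. One plausible reading is an even split, sketched below purely as an assumption; `split_cap` is a hypothetical helper and may not match what the Actor actually does.

```python
def split_cap(max_items, communities):
    """Divide max_items as evenly as possible; earlier communities absorb the remainder."""
    if not communities:
        return {}
    base, extra = divmod(max_items, len(communities))
    return {
        name: base + (1 if index < extra else 0)
        for index, name in enumerate(communities)
    }


print(split_cap(500, ["technology", "asklemmy"]))
```

Under this reading, the example input earlier (two communities, `maxItems` of 500) would allow up to 250 records from each community.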

Can I run it on a schedule or via API? Yes — schedule recurring runs in Apify, call it via the API/SDK, or connect it to Make, Zapier or n8n.

Is scraping Lemmy legal? It reads publicly available data via Lemmy's own public API. Use it responsibly for research and monitoring, and follow applicable laws and each instance's terms.

🔗 You might also like

Bluesky Scraper — posts, profiles, followers & search
Mastodon Scraper — posts, hashtags & trends from any instance
Reddit Scraper — posts, comments & communities
Hacker News Intelligence — stories, comments & trends

Keywords: Lemmy scraper, Lemmy API, fediverse scraper, federated Reddit, Reddit alternative scraper, Lemmy posts, Lemmy comments, Lemmy communities, ActivityPub, lemmy.world scraper, social media scraper, social listening, sentiment analysis, open social data, Reddit migration.
