Pricing

from $0.30 / 1,000 results

Lemmy Scraper - Federated Reddit Alternative

Scrape posts and comments from any Lemmy instance (the open, federated Reddit alternative). Filter by community, search keyword, or pull instance-wide feeds. No login required. Built for AI training datasets, fediverse research, and community monitoring.

Pricing

from $0.30 / 1,000 results

Rating

0.0

(0)

Developer

NIJ KANANI

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

🐭 Lemmy Scraper

Scrape posts and comments from any Lemmy instance — the federated, open-source Reddit alternative. No login. No rate-limit nightmares. Works with lemmy.world, lemmy.ml, beehaw.org, sh.itjust.works, and any other instance.

🎯 Built for AI/LLM training datasets, fediverse research, brand monitoring on emerging platforms, and Reddit alternatives analysis.

✨ What you can do

🏘️ Community posts — pull all posts from one or many communities
🔍 Search — keyword search across an instance
🌐 Instance feed — top/hot/new across the whole instance
💬 Optional comment trees — flattened with paths for tree reconstruction
🔁 Sort options — Hot, Active, New, Top (multiple ranges), MostComments
🌍 Cross-instance federation aware (asklemmy@lemmy.ml)

🚀 Quick start

{
    "instance": "lemmy.world",
    "mode": "community",
    "communities": ["technology@lemmy.world", "asklemmy@lemmy.ml"],
    "sort": "Top",
    "topRange": "TopWeek",
    "maxItems": 200
}

📥 Input

Field	Description
`instance`	Hostname (e.g. `lemmy.world`)
`mode`	`community` / `search` / `instance`
`communities`	Names like `tech` or `tech@lemmy.world`
`searchQueries`	Keywords
`sort`	`Hot`, `Active`, `New`, `Top`, `MostComments`, `NewComments`
`topRange`	When sort = `Top`: `TopHour` … `TopAll`
`maxItems`	Cap per target
`includeComments`	Fetch comment trees

📤 Output (per post)

{
    "instance": "lemmy.world",
    "community": "technology",
    "title": "Some headline",
    "body": "Body text or empty",
    "creator": "username",
    "creatorActor": "https://lemmy.world/u/username",
    "score": 123,
    "upvotes": 130,
    "downvotes": 7,
    "comments": 42,
    "publishedAt": "2026-04-15T...",
    "url": "https://example.com/article",
    "thumbnailUrl": "https://...",
    "nsfw": false,
    "apId": "https://lemmy.world/post/123456",
    "postUrl": "https://lemmy.world/post/123456",
    "commentsList": [
        {
            "id": 9999,
            "creator": "commenter",
            "content": "Reply text",
            "score": 12,
            "publishedAt": "...",
            "path": "0.123.456"
        }
    ]
}

🎯 Use cases

Who	Why
🤖 AI/LLM teams	Reddit-style training data without Reddit's API gate
📚 Researchers	Federation studies, online community migration patterns
📊 Marketers	Track brand mentions on emerging platforms
📰 Journalists	Source mining on Reddit-alternative communities

⚙️ Tech notes

Uses Lemmy's official /api/v3 REST endpoints — fully open, no key required
Federation-aware: community@instance syntax works for any cross-instance pull
Pagination via page parameter; auto-stops when no new posts returned
Comment trees fetched separately and capped per post for performance

👾 Lemmy Scraper - Federated Reddit Posts & Comments

benthepythondev/lemmy-scraper

Scrape Lemmy (the federated Reddit alternative) from any instance via the public API — no login needed. Get front-page or per-community posts, comments, keyword search, and community data. Clean JSON with scores, upvotes & comment counts.

ben

Lemmy Scraper — Posts, Comments & Community Data

devilscrapes/lemmy-community-scraper

Scrape posts and comments from any public Lemmy community on any Fediverse instance. Fingerprint rotation, retries, and proxy fallback handled for you. Typed dataset rows, ready for SQL, CSV, or JSON.

DevilScrapes

Lemmy Scraper

dami_studio/lemmy-scraper

Scrapes public Lemmy posts from any instance (default lemmy.world) by front-page feed, community, or keyword search. Returns title, link, body, author, community, score, comments, votes, NSFW flag and thumbnail as JSON. Best for brand and product mon

Dami's Studio

5.0

Lemmy Keyword Monitor

kempt_sprinkles/lemmy-keyword-monitor

Track Lemmy posts mentioning any keyword, brand, or competitor. The open Reddit alternative — structured data, scheduled daily.

Nikolas Gevorkyan

Lemmy Posts & Communities Scraper

makework36/lemmy-scraper

Scrape Lemmy instances for posts, comments, communities. Works with any instance. Sort by Hot, New, Top. No login needed.

deusex machine

Lemmy Scraper: Posts, Comments, Communities & Users

perconey/lemmy-scraper

Scrape any Lemmy instance (lemmy.world, lemmy.ml, beehaw.org and other Lemmyverse nodes) via the official /api/v3/* REST API. Posts with upvote/downvote counts, comment trees, communities with subscriber counts, user profiles, full-text search. No auth, no proxies. Pay per result.

Perconey

Lemmy Community Posts Scraper

parseforge/lemmy-community-posts-scraper

Track social activity from Lemmy Community Posts with profile name, follower count, posts, replies and timestamps. Designed for community managers, brand watchers and trend researchers. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

ParseForge

Reddit Search Scraper — Posts, Comments & Users

logiover/reddit-search-scraper

Scrape Reddit subreddit search with no API key or login. Export posts and comments to CSV/JSON — a Reddit API alternative for keyword monitoring.

Logiover

Bluesky & Lemmy Brand Monitor

orbiscribe/open-social-brand-monitor

Monitor public Bluesky and Lemmy posts for brand, competitor, keyword, launch, and support mentions using open APIs.

Orbiscribe Labs

Reddit Scraper — Posts & Comments

signalengine/reddit-scraper

Scrape posts and comments from any subreddit — no Reddit API key, no login, no proxy. A fast, free Reddit API alternative for public data, exported to JSON, CSV or Excel.