Discogs Marketplace Scraper
Pricing
Pay per event
Discogs Marketplace Scraper
Scrape Discogs marketplace listings — asking price, seller, condition, ships-from — plus public marketplace stats for any release ID or search query, export to JSON or CSV. A Discogs API + HTML scraper that handles the rate limits and retries for you. No login.
Pricing
Pay per event
Rating
0.0
(0)
Developer
DevilScrapes
Maintained by CommunityActor stats
0
Bookmarked
5
Total users
0
Monthly active users
7 days ago
Last modified
Categories
Share
Discogs Price Scraper
We do the dirty work so your dataset stays clean. 😈
$5.05 / 1,000 rows — Pay only for results that land. No credit card required to try.
Pull live marketplace listings from Discogs — asking prices, condition grades, seller ratings, ships-from country — for any release ID or free-text search query. Joins the public Discogs release API, the marketplace-stats API, and the Cloudflare-protected marketplace HTML into a single flat dataset: one row per listing, one optional aggregate row per release. The per-listing detail that the official Discogs API does not expose.
🎯 What this scrapes
Two row types share one flat schema. The row_type discriminator selects which fields
are populated — listing rows carry per-seller pricing, condition grades, and seller
ratings; stats rows carry the per-release marketplace aggregate.
| Field | Type | Populated for |
|---|---|---|
row_type | "listing" | "stats" | both |
release_id | integer | both |
release_title, artist, year, country, format_name, format_descriptions, genres, master_id, release_url | from /releases/{id} | both |
listing_id, listing_url | integer / string | listing rows |
asking_price, asking_currency | float / ISO 4217 | listing rows |
shipping_text | string (seller free-form) | listing rows |
condition_media, condition_sleeve | Discogs grade vocab | listing rows |
seller_username, seller_rating_pct, seller_rating_count, seller_country | strings + numerics | listing rows |
stats_lowest_price, stats_lowest_currency, stats_num_for_sale, stats_blocked_from_sale | from /marketplace/stats/{id} | stats rows |
scraped_at | ISO 8601 UTC | both |
🔥 Features
- Per-listing asking prices, condition grades, seller ratings, and ships-from country for every release you supply — the detail the official Discogs API withholds.
- Per-release marketplace stats row (lowest asking price, total active listings, blocked-from-sale flag), opt-in and on by default.
- Input by release ID (deterministic, fast) or by free-text search query — the search endpoint resolves the top N matches automatically.
- Single flat schema with a
row_typediscriminator — easy to join or aggregate downstream byrelease_idormaster_id. - Pydantic v2 input validation — XOR between
releaseIdsandsearchQueryis enforced before any network call. - Conservative rate-limit pacing at one request every 1.5 seconds (~40 req/min) — well under Discogs' API ceiling and the HTML surface's unwritten limit.
- Per-release fail isolation — one bad release ID logs a warning and the run continues.
🛡️ What we handle for you
Discogs marketplace pages sit behind a multi-layer blocking stack. We absorb all of it:
- We rotate browser fingerprints (curl-cffi chrome/firefox impersonation) so the target sees real-browser TLS and HTTP/2, not Python. Verified clean pass as of 2026-05-16.
- We warm up the session with a one-shot homepage hit before any listing pages — the same cold-start sequence a real browser runs.
- We rotate residential proxies via Apify Proxy on every block — fresh session ID, fresh exit IP, no stale fingerprint reuse.
- We retry with exponential backoff on 408 / 429 / 503 and honour
Retry-After. Up to 5 attempts per page before we log and move on. - We back off when the target rate-limits. Partial successes surface with a clear status message — we never silently return an empty dataset.
- We keep the dataset clean — Pydantic-validated rows, ISO-8601 timestamps, stable field names across every run.
- You pay only for results that land. If a release fetch fails entirely, no result-row charges fire for it.
💡 Use cases
- Vinyl / CD reseller benchmarking — pull every active listing for releases in your inventory and benchmark your asking price against the live competing supply (median, min, max, condition mix).
- Music-collectibles arbitrage — monitor cross-country shipping spreads
(
seller_country+asking_price+asking_currency) for the same release; spot regional under-pricing. - Catalog / label market intel — for a label's release IDs, track
num_for_saleandlowest_priceover time to see which titles are appreciating. - Journalism / pricing studies — "the cheapest first pressing of Nevermind on Discogs right now" generators, automated with one scheduled Actor run per article.
- Marketplace health monitoring — count
stats_blocked_from_sale=truerows across a watchlist to flag releases Discogs has quietly removed from sale. - Seller-quality screening — filter listings by
seller_rating_pct >= 99.0andseller_rating_count >= 100for a curated high-trust subset.
⚙️ How to use it
- Open the Actor input form on the Apify Console.
- Provide either
releaseIds(recommended — direct, no ambiguity) orsearchQuery— not both. Find a release ID in any Discogs URL:discogs.com/release/249504-Rick-Astley-...→249504. - If using
searchQuery, setmaxSearchResultsto cap how many top hits to fetch (default 5, max 50). - Set
maxPagesPerRelease(default 4 = 100 listings) andmaxListingsPerRelease(default 100, max 500) to control per-release volume. The lower of the two wins. - Leave
includeStatsRowon (default) to also receive one aggregaterow_type="stats"row per release. Turn off if you only want per-listing rows. - Leave
useProxyon (default) — the proxy is mandatory for the Discogs marketplace HTML surface on Apify datacenter IPs. - Click Start. Results stream into the default dataset in JSON, CSV, Excel, or XML.
Single release, first page
{"releaseIds": [249504],"maxPagesPerRelease": 1,"maxListingsPerRelease": 25,"includeStatsRow": true,"useProxy": true}
Search-driven, top 5 results, two pages each
{"searchQuery": "nirvana nevermind","maxSearchResults": 5,"maxPagesPerRelease": 2,"maxListingsPerRelease": 50,"includeStatsRow": true}
📥 Input
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
releaseIds | integer[] | XOR | — | List of Discogs release IDs (1-100). XOR with searchQuery. |
searchQuery | string | XOR | — | Free-text Discogs search; top N results become release IDs. XOR with releaseIds. |
maxSearchResults | integer | no | 5 | Cap on results from searchQuery (1-50). |
maxPagesPerRelease | integer | no | 4 | Cap on listing pages per release (1-20; 25 listings per page). |
maxListingsPerRelease | integer | no | 100 | Hard cap on listing rows per release (1-500). |
includeStatsRow | boolean | no | true | Emit one extra row_type="stats" row per release. |
useProxy | boolean | no | true | Route through Apify Proxy (BUYPROXIES94952). Required on Apify datacenter IPs. |
Exactly one of releaseIds or searchQuery must be provided. Passing both or neither
raises a validation error before any network call fires.
📤 Output
One row per marketplace listing (and optionally one extra stats row per release), pushed to the default dataset and available as JSON, CSV, Excel, or XML.
{"row_type": "listing","release_id": 249504,"release_title": "Never Gonna Give You Up","artist": "Rick Astley","year": 1987,"country": "UK","format_name": "Vinyl","format_descriptions": ["7\"", "45 RPM", "Single", "Stereo"],"genres": ["Electronic", "Pop"],"master_id": 96559,"release_url": "https://www.discogs.com/release/249504","listing_id": 3761251765,"listing_url": "https://www.discogs.com/sell/item/3761251765","asking_price": 0.5,"asking_currency": "GBP","shipping_text": "+£15.00","condition_media": "Very Good Plus (VG+)","condition_sleeve": "Generic","seller_username": "Ronan266","seller_rating_pct": 100.0,"seller_rating_count": 35,"seller_country": "United Kingdom","stats_lowest_price": null,"stats_lowest_currency": null,"stats_num_for_sale": null,"stats_blocked_from_sale": null,"scraped_at": "2026-05-16T12:00:00.000Z"}
Export formats
- JSON — full fidelity, all 27 fields, newline-delimited
- CSV — flat, one row per listing or stats record
- Excel —
.xlsxvia the Apify dataset converter - XML — structured per-item
All formats are available via the Apify API:
GET /datasets/{id}/items?format=csv&clean=true
💰 Pricing
Pay-Per-Event (PPE) — you pay only for what you use:
| Event | Price (USD) | When |
|---|---|---|
actor-start | $0.05 | Once per run, at boot |
result-row | $0.005 | Per listing OR per stats row written |
Example costs
| Run | Rows | Cost |
|---|---|---|
| 1 release × 25 listings + 1 stats row | 26 | $0.18 |
| 5 releases × 100 listings + 5 stats rows | 505 | $2.58 |
| 10 releases × 100 listings + 10 stats rows | 1,010 | $5.10 |
| 50 releases × 100 listings + 50 stats rows | 5,050 | $25.30 |
At scale the per-row charge dominates: ~$5.05 per 1,000 rows. Pricing reflects the commercial value of hand-parsed per-listing data (asking price, condition grade, seller country, seller rating) that the official Discogs API does not surface.
🚧 Limitations
- Asking prices only — no closed-sale history. Discogs hosts sold-price data at
/sell/history/{release_id}but the page is gated behind account login. Without user OAuth credentials it is inaccessible — out of scope for this Actor. What you get instead: per-listing asking prices (the live offer side) plus the publiclowest_priceaggregate (the floor of the ask side). - Public Discogs surfaces only. No authenticated Discogs API calls, no personal token usage, no OAuth flows.
- One snapshot per run. Schedule recurring runs via Apify Schedules for time-series tracking; nothing persists across runs.
- 25 listings per page — a Discogs-imposed pagination size. Combined with
maxPagesPerRelease: 20, the hard ceiling is 500 listings per release per run. - Currency is not normalised. Discogs serves prices in the seller's local currency
for listings and in the request IP's currency for the stats API. Join by the
asking_currencyfield; a single canonical USD is not available. - Throughput ~40 req/min. A 10-release run with 4 pages each takes roughly (10 × (1 + 1 + 4)) × 1.5 s ≈ 90 seconds plus session warm-up.
- 7-day default storage retention on the Apify FREE tier. Export your dataset immediately after the run, or upgrade for longer retention.
❓ FAQ
What is the difference between "discogs price scraper" asking-price data and sold-price history?
Discogs publishes two distinct price signals: (1) active asking prices on the
marketplace listings page — public, and what this Actor delivers — and (2)
completed-sale prices at /sell/history/{release_id} — login-walled, not
accessible without user OAuth. For most reseller and arbitrage workflows the live
asking-price distribution is the more actionable signal: it tells you what the market
is asking right now, not what it cleared months ago.
Does this Actor work as a Discogs marketplace API alternative?
Yes. The official Discogs API only exposes lowest_price and num_for_sale aggregates
for any given release. This Actor adds the full per-listing detail — who is selling,
at what price, in what condition, shipping from where — by joining the REST endpoints
with the marketplace HTML. It is the data layer the official API does not provide.
How does it handle blocks and Cloudflare protection?
We take care of that. The Actor uses browser-fingerprint impersonation (curl-cffi),
a session warm-up sequence, residential proxy rotation, and exponential backoff with
Retry-After support. You run the Actor; we handle the connection layer.
Why does this Actor need a custom User-Agent for Discogs?
Discogs enforces a written API policy (Discogs Developer Terms)
requiring every API request to carry an Application-Name/Version style
User-Agent. The Actor sends DevilScrapes/0.1 (+https://apify.com/DevilScrapes) on
every API call automatically — nothing to configure.
What does stats_blocked_from_sale=true mean?
It means Discogs has flagged the release as un-sellable — a legal, ToS, or licensing
restriction. stats_num_for_sale=0 alongside it confirms zero active listings. To
identify scarce releases, use WHERE stats_num_for_sale < 5.
Can I scrape my own wantlist or private collection?
No. This Actor scrapes only public marketplace data. Private user data (wantlist, collection, messages) requires Discogs OAuth, which is intentionally out of scope.
Can I fetch more than 100 release IDs in one run?
No. The input model caps releaseIds at 100 (and maxListingsPerRelease at 500,
maxPagesPerRelease at 20). A single run emits at most 100 × 500 + 100 = 50,100 rows.
Split larger workloads across multiple runs and concatenate the datasets.
Is this the same as a Discogs scraper for bulk catalog export?
This Actor is optimised for marketplace pricing data — asking prices, condition, seller metadata — not full catalog metadata export. If your goal is release metadata (tracklist, labels, formats) at bulk scale, the official Discogs data dump is a better starting point. If your goal is live pricing intelligence, this is the right tool.
💬 Your feedback
Found a bug, hit a rate limit, or need a new field on the output row (median asking price? seller-country histogram? condition-grade distribution)? Open an issue on the Actor's Apify Store page or reach the Devil Scrapes team at apify.com/DevilScrapes. We ship updates within days of validated reports.