# Medium Scraper (`ivanvs/medium-scraper`) Actor

Scrape data such as authors, titles, applause, responses and publication dates from the Medium blogging platform. Download listings data in JSON, XML, Excel, and other versatile

- **URL**: https://apify.com/ivanvs/medium-scraper.md
- **Developed by:** [Gen First](https://apify.com/ivanvs) (community)
- **Categories:** Social media, News
- **Stats:** 317 total users, 4 monthly users, 100.0% runs succeeded, 19 bookmarks
- **User rating**: 4.00 out of 5 stars

## Pricing

$10.00/month + usage

To use this Actor, you pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period.You also pay for the Apify platform usage, which gets cheaper the higher Apify subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#rental-actors

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Medium Scraper

Scrape data from [Medium](https://medium.com/) easily with Medium Scraper.

### About

Medium is an American online publishing platform developed by Evan Williams and launched in August 2012. It is owned by A Medium Corporation. The platform is an example of social journalism, having a hybrid collection of amateur and professional people and publications, or exclusive blogs or publishers on Medium and is regularly regarded as a blog host.

### How to use?

In order to use Medium Scraper you need to provide tag for which you want to scrape blog posts on Medium and optionaly month or year. If year and month are not supplied, we will scrape the most popular post for given tag and date info.

So if only year is supplied, the most popular posts from given year will be scraped. If year and month is supplied, the most popular posts for given year and month will be scraped.

### How to find valid tags?

Finding Related Tags on Medium.com

1. Go to Medium.com.

2. Use the search bar at the top of the page.

3. Type in the topic you're interested in (e.g., "artificial intelligence," "writing tips," "healthy eating").

4. After the search results appear, click on the "Topics" tab.

5. This tab will show you various tags and related subjects that are connected to your search term. These are the most relevant tags that Medium users are applying to content on that topic.

![Medium.com tags search](https://apify-scrapers.s3.us-east-1.amazonaws.com/medium/medium-tag-search.png)

#### Example

Scrape the most popular posts for tag java of all time order by oldest first

```json
{
  "tag": "java",
  "orderBy": "most-read"
}
````

Scrape the most popular posts for tag java in 2020

```json
{
  "tag": "java",
  "year": 2020,
  "orderBy": "most-read"
}
```

Scrape the most popular posts for tag java in January 2020

```json
{
  "tag": "java",
  "year": 2020,
  "month": 1,
  "orderBy": "most-read"
}
```

*Only dates in past are valid!*

#### Output

Here is example of a scraped article:

```json
{
  "id": "33189ab03f60",
  "title": "I Asked ChatGPT How To Earn $1000 Online. It Was Hilarious.",
  "image": "https://miro.medium.com/v2/resize:fit:700/1*Y-qcUstemdW57V54aNWnnQ.jpeg",
  "subtitle": "Peering in the hive mind can be really helpful, but it can also be so stupid it’s funny",
  "author": {
    "id": "d5700e8fad77",
    "name": "Linda Caroll",
    "url": "https://medium.com/@lindacaroll",
    "username": "lindacaroll",
    "image": "https://miro.medium.com/v2/resize:fill:20:20/2*k9WGjFrkNI3f-kD5wjRxlQ.jpeg",
    "bio": "Everything is storytelling. https://lindac.substack.com/"
  },
  "url": "https://medium.com/the-partnered-pen/i-asked-chatgpt-how-to-earn-1000-online-it-was-hilarious-33189ab03f60",
  "responses": 540,
  "readingTime": 5.403773584905661,
  "publication": {
    "name": "The Partnered Pen",
    "url": "https://medium.com/the-partnered-pen",
    "image": "https://miro.medium.com/v2/resize:fill:20:20/1*N17wOF_ZH_k7MbDnwogDPQ.png",
    "domain": null
  },
  "aplause": 25220,
  "isMemberOnlyStory": true,
  "content": [
    "I Asked ChatGPT How To Earn $1000 Online. It Was Hilarious.",
    "Peering in the hive mind can be really helpful, but it can also be so stupid it’s funny..."
  ],
  "tags": ["ChatGPT", "Make Money Online", "Entrepreneur", "Work From Home", "Internet Marketing"],
  "publishingDate": "3/24/2023 7:51:42 AM",
  "wordCount": 1061,
  "seo": {
    "title": "",
    "description": ""
  }
}
```

### Medium Scraper data output

The output from Medium Scraper is stored in the dataset. After the run is finished, you can download the dataset in various data formats (JSON, CSV, XML, RSS, HTML Table).

### How much does it cost to scrape Medium Scraper?

Running Medium Scraper once will get you 1,000 results for less than $2 in Apify platform credits. For more details about the plans offer, platform credits, and usage, see the platform pricing [page](https://apify.com/pricing/actors).

### ❓FAQ

#### Do I need proxies to scrape Medium.com?

Proxy for this scraper is already preconfigured for optimal performance.

### Support

For more custom/simplify outputs or Bug report please contact the developer (me) or report an issue.

# Actor input Schema

## `tag` (type: `string`):

Tag from Medium web site for which we want to gather blog post information

## `year` (type: `integer`):

Year for which blogs will be scraped

## `month` (type: `integer`):

Month for which blogs will be scraped

## `orderBy` (type: `string`):

Select in which order you want to get articles, default is 10 most read

## `userInfo` (type: `boolean`):

Scrape information regarding author of the article

## `scrapeArticle` (type: `boolean`):

Scrape information regarding the article

## `maxNumberOfResults` (type: `integer`):

Defines how many items do you want to gather by running this actor

## Actor input object example

```json
{
  "tag": "java",
  "orderBy": "most-read",
  "userInfo": false,
  "scrapeArticle": false,
  "maxNumberOfResults": 10
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "tag": "java",
    "maxNumberOfResults": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("ivanvs/medium-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "tag": "java",
    "maxNumberOfResults": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("ivanvs/medium-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "tag": "java",
  "maxNumberOfResults": 10
}' |
apify call ivanvs/medium-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=ivanvs/medium-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Medium Scraper",
        "description": "Scrape data such as authors, titles, applause, responses and publication dates from the Medium blogging platform. Download listings data in JSON, XML, Excel, and other versatile",
        "version": "0.0",
        "x-build-id": "Joq0KjIillfZfdHvj"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/ivanvs~medium-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-ivanvs-medium-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/ivanvs~medium-scraper/runs": {
            "post": {
                "operationId": "runs-sync-ivanvs-medium-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/ivanvs~medium-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-ivanvs-medium-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "tag"
                ],
                "properties": {
                    "tag": {
                        "title": "Tag",
                        "type": "string",
                        "description": "Tag from Medium web site for which we want to gather blog post information"
                    },
                    "year": {
                        "title": "Year",
                        "minimum": 2000,
                        "type": "integer",
                        "description": "Year for which blogs will be scraped"
                    },
                    "month": {
                        "title": "Month",
                        "minimum": 1,
                        "maximum": 12,
                        "type": "integer",
                        "description": "Month for which blogs will be scraped"
                    },
                    "orderBy": {
                        "title": "Order by",
                        "enum": [
                            "most-read",
                            "latest",
                            "oldest"
                        ],
                        "type": "string",
                        "description": "Select in which order you want to get articles, default is 10 most read",
                        "default": "most-read"
                    },
                    "userInfo": {
                        "title": "Scrape author info",
                        "type": "boolean",
                        "description": "Scrape information regarding author of the article",
                        "default": false
                    },
                    "scrapeArticle": {
                        "title": "Scrape article",
                        "type": "boolean",
                        "description": "Scrape information regarding the article",
                        "default": false
                    },
                    "maxNumberOfResults": {
                        "title": "Max number of result",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Defines how many items do you want to gather by running this actor",
                        "default": 100
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
