# Data.gov.uk Scraper (`parseforge/data-gov-uk-scraper`) Actor

Collect UK government open data effortlessly. Extract datasets, publishers, formats, topics, licenses, and download links from data.gov.uk — the official UK open data portal. Perfect for researchers, policy analysts, and developers building data catalogs.

- **URL**: https://apify.com/parseforge/data-gov-uk-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Travel
- **Stats:** 9 total users, 0 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: 5.00 out of 5 stars

## Pricing

Pay per event

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 🇬🇧 Data.gov.uk Scraper

> 🚀 **Collect open dataset metadata from Data.gov.uk in minutes.** Search by keyword, publisher, or topic. Export dataset titles, descriptions, download links, and publisher info. No coding, no account required.

> 🕒 **Last updated:** 2026-04-23 · **📊 20+ fields** per dataset · **🔍 3 search filters** · **📂 Publisher + topic** · **🚫 No auth** required


<table><tr>
<td style="border-left:4px solid #0F766E;padding:12px 16px;font-weight:600">Pull structured records from Data.gov.uk — clean fields ready as CSV, JSON, JSONL, Excel, or XML for downstream pipelines.</td>
</tr></table>

<table>
<tr>
<td colspan="3" style="padding:10px 14px;background:#0F766E;border:none;border-radius:4px 4px 0 0">
<span style="color:#FFFFFF;font-size:14px;font-weight:700;letter-spacing:0.5px">Related Scrapers</span>
</td>
</tr>
<tr>
<td style="padding:10px 14px;border:1px solid #E7E5E4;border-top:none;vertical-align:top;width:33%;background:#CCFBF1">
&nbsp;<a href="https://apify.com/parseforge/data-gov-uk-scraper" style="color:#0F766E;text-decoration:none;font-weight:700;font-size:13px">UK data.gov.uk</a><br>
<span style="color:#0F766E;font-size:11px;font-weight:600">➸ You are here</span>
</td>
<td style="padding:10px 14px;border:1px solid #E7E5E4;border-top:none;vertical-align:top;width:33%">
&nbsp;<a href="https://apify.com/parseforge/data-gov-scraper" style="color:#1C1917;text-decoration:none;font-weight:700;font-size:13px">USA data.gov</a><br>
<span style="color:#78716C;font-size:11px">US federal open data</span>
</td>
<td style="padding:10px 14px;border:1px solid #E7E5E4;border-top:none;vertical-align:top;width:33%">
&nbsp;<a href="https://apify.com/parseforge/data-go-kr-scraper" style="color:#1C1917;text-decoration:none;font-weight:700;font-size:13px">Korea data.go.kr</a><br>
<span style="color:#78716C;font-size:11px">Korean open gov data</span>
</td>
</tr>
</table>

##### Copy to your AI assistant

Copy this block into ChatGPT, Claude, Cursor, or any LLM to start using this actor.

````

parseforge/data-gov-uk-scraper on Apify. Call: ApifyClient("TOKEN").actor("parseforge/data-gov-uk-scraper").call(run\_input={...}), then client.dataset(run\["defaultDatasetId"]).list\_items().items for results. Key inputs: startUrl (string), maxItems (integer, default 10), searchQuery (string, default "economics"), publisher (string), topic (string), format (string). Full actor spec: fetch build via GET https://api.apify.com/v2/acts/parseforge~data-gov-uk-scraper (Bearer TOKEN). Get token: https://console.apify.com/account/integrations

````

The **Data.gov.uk Scraper** collects open data catalog metadata from the UK government's data portal, returning **20+ fields per dataset**: title, description, publisher, topic, format, download URL, update frequency, and license. Runs support up to 1,000,000 datasets on a paid plan.

Data.gov.uk hosts over 50,000 open datasets from UK government departments and public bodies. The Actor supports keyword search with publisher and topic filters.

| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| Data scientists, policy researchers, journalists, civic tech teams, academic researchers, open data advocates | Open data discovery, policy research, data journalism, civic technology, government transparency analysis |

---

### 📋 What the Data.gov.uk Scraper does

Three search filters:

- 🔍 **Keyword search.** Free-text search across dataset titles and descriptions.
- 🏛️ **Publisher filter.** Filter by government department or public body.
- 📂 **Topic filter.** Browse by subject area.
- 🔗 **URL mode.** Paste a direct Data.gov.uk search URL.

Each dataset record includes title, description, publisher, topic, data format, download URL, update frequency, license, and portal URL.

> 💡 **Why it matters:** browsing Data.gov.uk for relevant datasets means scrolling through search results and clicking into each listing. This Actor exports structured catalog metadata at scale, ready for your open data inventories or research pipelines.

---

### 🎬 Full Demo

_🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset._

---

### ⚙️ Input

<table>
<thead>
<tr><th>Input</th><th>Type</th><th>Default</th><th>Behavior</th></tr>
</thead>
<tbody>
<tr><td>startUrl</td><td>string</td><td>""</td><td>Direct Data.gov.uk search URL.</td></tr>
<tr><td>maxItems</td><td>integer</td><td>10</td><td>Max datasets. Free: limited. Paid: up to 1,000,000.</td></tr>
<tr><td>searchQuery</td><td>string</td><td>""</td><td>Keyword search.</td></tr>
<tr><td>publisher</td><td>string</td><td>""</td><td>Government department or public body.</td></tr>
<tr><td>topic</td><td>string</td><td>""</td><td>Subject area filter.</td></tr>
</tbody>
</table>

**Example: health datasets from NHS.**

```json
{
    "searchQuery": "health",
    "publisher": "NHS Digital",
    "maxItems": 50
}
````

**Example: all transport datasets.**

```json
{
    "topic": "transport",
    "maxItems": 100
}
```

> ⚠️ **Good to Know:** Data.gov.uk is the UK government's official open data portal. Datasets link to various formats (CSV, JSON, XML, PDF) hosted by individual departments.

***

### 📊 Output

Each dataset record contains **20+ fields**. Download the catalog as CSV, Excel, JSON, or XML.

#### 🧾 Schema

| Field | Type | Example |
|---|---|---|
| 📝 title | string | `"NHS Hospital Admissions"` |
| 📄 description | string | `"Monthly hospital admission statistics..."` |
| 🏛️ publisher | string | `"NHS Digital"` |
| 📂 topic | string | `"Health"` |
| 📦 format | string | `"CSV"` |
| 🔗 downloadUrl | string | `"https://data.gov.uk/dataset/..."` |
| 📅 lastUpdated | string | `"2026-03-15"` |
| 🔄 updateFrequency | string | `"Monthly"` |
| 📜 license | string | `"Open Government Licence"` |
| 🔗 portalUrl | string | `"https://www.data.gov.uk/dataset/..."` |
| 🕒 scrapedAt | ISO 8601 | `"2026-04-16T00:00:00.000Z"` |

#### 📦 Sample records

<details>
<summary><strong>🏥 NHS health dataset</strong></summary>

```json
{
    "title": "NHS Hospital Admissions",
    "description": "Monthly hospital admission statistics by trust and specialty.",
    "publisher": "NHS Digital",
    "topic": "Health",
    "format": "CSV",
    "downloadUrl": "https://data.gov.uk/dataset/nhs-hospital-admissions",
    "lastUpdated": "2026-03-15",
    "updateFrequency": "Monthly",
    "license": "Open Government Licence",
    "portalUrl": "https://www.data.gov.uk/dataset/nhs-hospital-admissions",
    "scrapedAt": "2026-04-16T00:00:00.000Z"
}
```

</details>

<details>
<summary><strong>🚗 Transport dataset</strong></summary>

```json
{
    "title": "Road Traffic Statistics",
    "description": "Annual average daily traffic flow estimates for UK roads.",
    "publisher": "Department for Transport",
    "topic": "Transport",
    "format": "CSV",
    "downloadUrl": "https://data.gov.uk/dataset/road-traffic-statistics",
    "lastUpdated": "2025-12-01",
    "updateFrequency": "Annual",
    "license": "Open Government Licence",
    "portalUrl": "https://www.data.gov.uk/dataset/road-traffic-statistics",
    "scrapedAt": "2026-04-16T00:00:00.000Z"
}
```

</details>

<details>
<summary><strong>📋 Dataset with sparse metadata</strong></summary>

```json
{
    "title": "Local Authority Planning Applications",
    "description": "Planning application data from local councils.",
    "publisher": "Ministry of Housing",
    "topic": "Towns & Cities",
    "format": "JSON",
    "downloadUrl": null,
    "lastUpdated": null,
    "updateFrequency": null,
    "license": "Open Government Licence",
    "portalUrl": "https://www.data.gov.uk/dataset/planning-applications",
    "scrapedAt": "2026-04-16T00:00:00.000Z"
}
```

</details>

***

### ✨ Why choose this Actor

| | Capability |
|---|---|
| 🇬🇧 | **Full Data.gov.uk catalog.** 50,000+ UK government datasets. |
| 🔍 | **3 search filters.** Keyword, publisher, topic. |
| 📦 | **Format and download links.** Direct URLs to CSV, JSON, XML files. |
| 🏛️ | **Publisher data.** Government department per dataset. |
| 📅 | **Update frequency.** Monthly, quarterly, annual cadence per dataset. |
| ⚡ | **Scalable.** Quick lookups to full catalog sweeps. |
| 🚫 | **No authentication.** Public open data portal. |

> 📊 Data.gov.uk is the UK's official open data portal, hosting datasets from 900+ government organizations. Structured access powers every open data project, policy research, and civic technology workflow.

***

### 📈 How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| **⭐ Data.gov.uk Scraper** *(this Actor)* | $5 free credit, then pay-per-use | Full catalog | **Live per run** | keyword, publisher, topic | ⚡ 2 min |
| Manual Data.gov.uk browsing | Free | One at a time | Manual | UI only | 🕒 Hours |
| CKAN API (direct) | Free | Full | Real-time | Many | ⏳ Hours (API setup) |

Pick this Actor when you want UK open data catalog metadata on demand, with publisher and topic filters, without CKAN API integration.

***

### 🚀 How to use

1. 📝 **Sign up.** [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp) (takes 2 minutes).
2. 🌐 **Open the Actor.** Go to the Data.gov.uk Scraper page on the Apify Store.
3. 🎯 **Set input.** Enter a keyword, pick a publisher or topic.
4. 🚀 **Run it.** Click **Start**.
5. 📥 **Download.** Grab results in the **Dataset** tab.

> ⏱️ Total time: **3-5 minutes.** No coding required.

***

### 💼 Business use cases

<table>
<tr>
<td width="50%" valign="top">

#### 📊 Data Science & Research

- Discover relevant UK government datasets
- Build open data catalogs for research
- Track new dataset publications
- Analyze data coverage by topic

</td>
<td width="50%" valign="top">

#### 🏛️ Policy & Civic Tech

- Monitor government data releases
- Build transparency dashboards
- Track department data publishing rates
- Power civic apps with open data

</td>
</tr>
<tr>
<td width="50%" valign="top">

#### 📰 Data Journalism

- Find datasets for investigative stories
- Track data freshness across departments
- Build story pipelines from government data
- Monitor new dataset publications

</td>
<td width="50%" valign="top">

#### 🏢 Business Intelligence

- Enrich models with UK government data
- Track economic indicators by topic
- Build UK market data pipelines
- Monitor regulatory data releases

</td>
</tr>
</table>

***

***

### 🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

<table>
<tr>
<td width="50%">

#### 🎓 Research and academia

- Empirical datasets for papers, thesis work, and coursework
- Longitudinal studies tracking changes across snapshots
- Reproducible research with cited, versioned data pulls
- Classroom exercises on data analysis and ethical scraping

</td>
<td width="50%">

#### 🎨 Personal and creative

- Side projects, portfolio demos, and indie app launches
- Data visualizations, dashboards, and infographics
- Content research for bloggers, YouTubers, and podcasters
- Hobbyist collections and personal trackers

</td>
</tr>
<tr>
<td width="50%">

#### 🤝 Non-profit and civic

- Transparency reporting and accountability projects
- Advocacy campaigns backed by public-interest data
- Community-run databases for local issues
- Investigative journalism on public records

</td>
<td width="50%">

#### 🧪 Experimentation

- Prototype AI and machine-learning pipelines with real data
- Validate product-market hypotheses before engineering spend
- Train small domain-specific models on niche corpora
- Test dashboard concepts with live input

</td>
</tr>
</table>

### 🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

- 💬 [**ChatGPT**](https://chat.openai.com/?q=How%20do%20I%20use%20the%20Data.gov.uk%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🧠 [**Claude**](https://claude.ai/new?q=How%20do%20I%20use%20the%20Data.gov.uk%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🔍 [**Perplexity**](https://perplexity.ai/search?q=How%20do%20I%20use%20the%20Data.gov.uk%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🅒 [**Copilot**](https://copilot.microsoft.com/?q=How%20do%20I%20use%20the%20Data.gov.uk%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)

### ❓ Frequently Asked Questions

<details>
<summary><b>💳 Do I need a paid Apify plan to run this actor?</b></summary>

No. You can start right now on the free Apify plan, which includes **$5 in free monthly credit**. That is enough to run this actor several times and explore the output before committing to anything. Paid plans unlock higher limits, more concurrent runs, and larger datasets. [Create a free Apify account here](https://console.apify.com/sign-up?fpr=vmoqkp) to get started.

</details>

<details>
<summary><b>🚨 What happens if my run fails or returns no results?</b></summary>

Failed runs are not charged. If the source site changes, proxies get rate-limited, or a specific input matches nothing, re-run the actor or open our [contact form](https://tally.so/r/BzdKgA) and we will investigate. You can also check the run log in the Apify console to see why the run stopped.

</details>

<details>
<summary><b>📏 How many items can I scrape per run?</b></summary>

Free users are limited to **10 items per run** so you can preview the output and confirm the actor works for your use case. Paid users can raise maxItems up to **1,000,000** per run. [Upgrade here](https://console.apify.com/sign-up?fpr=vmoqkp) if you need full scale.

</details>

<details>
<summary><b>🕒 How fresh is the data?</b></summary>

Every run fetches live data at the moment of execution. There is no cache or delay: the records you get reflect what the source returned at that moment. Schedule the actor to maintain a rolling snapshot of the data you need.

</details>

<details>
<summary><b>🧑‍💻 Can I call this actor from my own code?</b></summary>

Yes. Apify exposes every actor as a REST endpoint and ships first-class SDKs for [Node.js](https://docs.apify.com/sdk/js) and [Python](https://docs.apify.com/sdk/python). You can start a run, read the dataset, and handle webhooks from your own app in a few lines. All you need is your Apify API token.

</details>

<details>
<summary><b>📤 How do I export the data?</b></summary>

Every Apify dataset can be downloaded in one click from the console as CSV, JSON, JSONL, Excel, HTML, XML, or RSS. You can also pull results programmatically via the [Apify API](https://docs.apify.com/api/v2) or stream them into BigQuery, S3, and other destinations through built-in integrations.

</details>

<details>
<summary><b>📅 Can I schedule the actor to run automatically?</b></summary>

Yes. Use the Apify scheduler to run the actor on any cadence, from hourly to monthly. Results are saved to your dataset and can be delivered to webhooks, email, Slack, cloud storage, or automation tools such as Zapier and Make.

***

</details>

### 🔌 Automating Data.gov.uk Scraper

- 🟢 **Node.js.** Install the apify-client NPM package.
- 🐍 **Python.** Use the apify-client PyPI package.
- 📚 See the [Apify API documentation](https://docs.apify.com/api/v2) for full details.

The [Apify Schedules feature](https://docs.apify.com/platform/schedules) lets you trigger this Actor on any cron interval. Weekly pulls catch new dataset publications.

### 🔌 Integrate with any app

- [**Make**](https://docs.apify.com/platform/integrations/make) - Automate workflows
- [**Zapier**](https://docs.apify.com/platform/integrations/zapier) - Connect 5,000+ apps
- [**Slack**](https://docs.apify.com/platform/integrations/slack) - Get notifications
- [**Airbyte**](https://docs.apify.com/platform/integrations/airbyte) - Data pipelines
- [**GitHub**](https://docs.apify.com/platform/integrations/github) - Trigger from commits
- [**Google Drive**](https://docs.apify.com/platform/integrations/drive) - Export to Sheets

***

### 🔗 Recommended Actors

- [**📊 USAspending Scraper**](https://apify.com/parseforge/usaspending-scraper) - U.S. federal spending data
- [**📊 FRED Scraper**](https://apify.com/parseforge/fred-scraper) - Economic data from Federal Reserve
- [**📊 Indexmundi Scraper**](https://apify.com/parseforge/indexmundi-scraper) - Global indicators
- [**📋 GSA eLibrary Scraper**](https://apify.com/parseforge/gsa-elibrary-scraper) - U.S. government contracts
- [**🏥 ClinicalTrials.gov Scraper**](https://apify.com/parseforge/clinicaltrials-scraper) - Clinical trial data

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for more government and open data scrapers.

***

**🆘 Need Help?** [**Open our contact form**](https://tally.so/r/BzdKgA) to request a new scraper, propose a custom data project, or report an issue.

***

> **⚠️ Disclaimer:** this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the UK government or Data.gov.uk. All trademarks mentioned are the property of their respective owners. Only publicly available open data catalog metadata is collected.

# Actor input Schema

## `startUrl` (type: `string`):

Direct URL to scrape from data.gov.uk. Use this OR search filters below, not both. Example: https://www.data.gov.uk/search?q=economics

## `maxItems` (type: `integer`):

Free users: Limited to 100. Paid users: Optional, max 1,000,000

## `searchQuery` (type: `string`):

Search term to filter datasets. Use this OR startUrl above, not both. Example: 'economics'

## `publisher` (type: `string`):

Filter by publisher name. Example: 'Office for National Statistics'

## `topic` (type: `string`):

Filter by topic. Available: Business and economy, Crime and justice, Education, Environment, Government, Government spending, Health, Mapping, Society, Towns and cities, Transport

## `format` (type: `string`):

Filter by data format. Available: CSV, ESRI REST, GEOJSON, HTML, JSON, KML, PDF, SHP, WFS, WMS, XLS, XML, ZIP, Other

## `oglOnly` (type: `boolean`):

Only include datasets with Open Government Licence

## `sort` (type: `string`):

Sort results by relevance or date

## Actor input object example

```json
{
  "maxItems": 10,
  "searchQuery": "economics",
  "oglOnly": false,
  "sort": "best"
}
```

# Actor output Schema

## `datasets` (type: `string`):

Complete dataset with all scraped dataset information including titles, publishers, formats, topics, and metadata

## `overview` (type: `string`):

Overview view of datasets with key fields displayed in a table format

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "maxItems": 10,
    "searchQuery": "economics",
    "sort": "best"
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/data-gov-uk-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "maxItems": 10,
    "searchQuery": "economics",
    "sort": "best",
}

# Run the Actor and wait for it to finish
run = client.actor("parseforge/data-gov-uk-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "maxItems": 10,
  "searchQuery": "economics",
  "sort": "best"
}' |
apify call parseforge/data-gov-uk-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/data-gov-uk-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Data.gov.uk Scraper",
        "description": "Collect UK government open data effortlessly. Extract datasets, publishers, formats, topics, licenses, and download links from data.gov.uk — the official UK open data portal. Perfect for researchers, policy analysts, and developers building data catalogs.",
        "version": "1.0",
        "x-build-id": "u85f4ZfW2PCCU5YKG"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~data-gov-uk-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-data-gov-uk-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~data-gov-uk-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-data-gov-uk-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~data-gov-uk-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-data-gov-uk-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "startUrl": {
                        "title": "Start URL",
                        "type": "string",
                        "description": "Direct URL to scrape from data.gov.uk. Use this OR search filters below, not both. Example: https://www.data.gov.uk/search?q=economics"
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Free users: Limited to 100. Paid users: Optional, max 1,000,000"
                    },
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search term to filter datasets. Use this OR startUrl above, not both. Example: 'economics'"
                    },
                    "publisher": {
                        "title": "Publisher",
                        "type": "string",
                        "description": "Filter by publisher name. Example: 'Office for National Statistics'"
                    },
                    "topic": {
                        "title": "Topic",
                        "enum": [
                            "",
                            "Business and economy",
                            "Crime and justice",
                            "Education",
                            "Environment",
                            "Government",
                            "Government spending",
                            "Health",
                            "Mapping",
                            "Society",
                            "Towns and cities",
                            "Transport"
                        ],
                        "type": "string",
                        "description": "Filter by topic. Available: Business and economy, Crime and justice, Education, Environment, Government, Government spending, Health, Mapping, Society, Towns and cities, Transport"
                    },
                    "format": {
                        "title": "Format",
                        "enum": [
                            "",
                            "CSV",
                            "ESRI REST",
                            "GEOJSON",
                            "HTML",
                            "JSON",
                            "KML",
                            "PDF",
                            "SHP",
                            "WFS",
                            "WMS",
                            "XLS",
                            "XML",
                            "ZIP",
                            "Other"
                        ],
                        "type": "string",
                        "description": "Filter by data format. Available: CSV, ESRI REST, GEOJSON, HTML, JSON, KML, PDF, SHP, WFS, WMS, XLS, XML, ZIP, Other"
                    },
                    "oglOnly": {
                        "title": "Open Government Licence (OGL) Only",
                        "type": "boolean",
                        "description": "Only include datasets with Open Government Licence",
                        "default": false
                    },
                    "sort": {
                        "title": "Sort By",
                        "enum": [
                            "best",
                            "recent"
                        ],
                        "type": "string",
                        "description": "Sort results by relevance or date"
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
