# ROR (Research Organization Registry) Scraper (`parseforge/ror-scraper`) Actor

Automate the collection of research organization data with our ROR scraper. Obtain names, aliases, locations, external identifiers (GRID, ISNI, Wikidata), relationships, and contacts from the Research Organizations Registry. Ideal for researchers, academic institutions, and data analysts.

- **URL**: https://apify.com/parseforge/ror-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Automation, Developer tools, Lead generation
- **Stats:** 5 total users, 0 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: 5.00 out of 5 stars

## Pricing

Pay per event

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 🏢 ROR Scraper

> 🚀 **Extract research organization data from the Research Organization Registry in seconds.** Search by name, filter by status, and collect identifiers. No coding, no API keys required.

> 🕒 **Last updated:** 2026-04-23 · **📊 30 fields** · **🌍 100,000+ organizations** · **🔍 Name search and URL-based collection**

The Research Organization Registry (ROR) is the global, community-led registry of research organizations, containing over **100,000 entries** covering universities, research institutes, government agencies, healthcare organizations, and funders worldwide. This scraper collects full organization profiles including names, aliases, acronyms, locations (with GPS coordinates), types, external identifiers (GRID, Wikidata, FundRef, ISNI), relationships, establishment dates, and administrative metadata.

Researchers, librarians, grant administrators, and data analysts use this actor to build organization databases, verify institution identifiers, map research networks, and cross-reference organizations across multiple identifier systems. Instead of querying the ROR API manually, you get clean, structured data for up to **1,000,000 organizations per run** for paid users. Every record includes the full name, all aliases and acronyms, precise geolocation, country information, organization type, all external IDs, related organizations, and creation/modification timestamps. The data exports as JSON, CSV, or Excel.

| 🎯 Target Audience | 💡 Use Cases |
|---|---|
| University librarians | Build institution directories with verified identifiers |
| Grant administrators | Verify organization credentials for funding applications |
| Bibliometric analysts | Map institutional research networks |
| Data scientists | Cross-reference organization IDs across databases |
| Research managers | Track institutional relationships and hierarchies |
| System integrators | Populate CRM and ERP systems with organization data |

---

### 📋 What the ROR Scraper does

- 🔍 **Name search** to find organizations by name, keyword, or partial match
- 🔗 **URL-based collection** from any ROR search results page
- 🏷️ **Status filtering** to show active, inactive, or withdrawn organizations
- 🆔 **Multi-identifier extraction** for GRID, Wikidata, FundRef, and ISNI
- 📍 **Geolocation data** with latitude, longitude, city, state, and country
- 👥 **Relationship mapping** to identify parent, child, and related organizations

The scraper queries the ROR API or website with your search parameters, retrieves matching organization records, and extracts full profiles. Results include the ROR ID, official name, all aliases and acronyms, organization types, website URLs, Wikipedia links, establishment year, GPS coordinates, country and city information, GeoNames ID, all external identifiers (GRID, Wikidata, FundRef, ISNI), related organizations with relationship types, and administrative metadata.

> 💡 **Why it matters:** ROR is becoming the standard identifier for research organizations, but collecting data at scale requires API programming. This scraper automates collection and delivers structured organization profiles ready for institutional databases, grant systems, or bibliometric analysis.

---

### 🎬 Full Demo

_🚧 Coming soon..._

---

### ⚙️ Input

<table>
<tr><th>Field</th><th>Type</th><th>Required</th><th>Description</th></tr>
<tr><td><b>startUrl</b></td><td>string</td><td>No</td><td>Direct ROR search URL to scrape results from</td></tr>
<tr><td><b>query</b></td><td>string</td><td>No</td><td>Search term for organizations (e.g., "university", "research institute")</td></tr>
<tr><td><b>recordStatus</b></td><td>string</td><td>No</td><td>Filter by status: active, inactive, or withdrawn</td></tr>
<tr><td><b>maxItems</b></td><td>integer</td><td>No</td><td>Max organizations to collect. Free: up to 10. Paid: up to 1,000,000</td></tr>
</table>

**Example 1: Search with URL**
```json
{
  "startUrl": "https://ror.org/search?query=university&filter=active&page=1",
  "maxItems": 50
}
````

**Example 2: Search with filters**

```json
{
  "query": "research institute",
  "recordStatus": "active",
  "maxItems": 100
}
```

> ⚠️ **Good to Know:** Use either a Start URL or search filters, not both. If you provide a Start URL, search filters are ignored. The "active" status filter excludes organizations that have been merged, closed, or withdrawn from the registry.

***

### 📊 Output

#### 🧾 Schema

| Emoji | Field | Type | Description |
|---|---|---|---|
| 🆔 | rorId | string | Unique ROR identifier |
| 📝 | name | string | Official organization name |
| 🏷️ | organizationTypes | array | Classification (education, funder, healthcare, etc.) |
| 🌐 | websiteUrl | string | Official website URL |
| 📅 | establishedYear | number | Year the organization was established |
| ✅ | recordStatus | string | Active, inactive, or withdrawn |
| 📍 | countryName | string | Country where the organization is located |
| 🌍 | countryCode | string | ISO country code |
| 📍 | stateOrProvince | string | State, province, or region |
| 🏙️ | city | string | City name |
| 📊 | geonamesId | string | GeoNames geographic identifier |
| 📍 | latitude | number | GPS latitude coordinate |
| 📍 | longitude | number | GPS longitude coordinate |
| 📝 | aliases | array | Alternative names for the organization |
| 🏷️ | acronyms | array | Abbreviations and acronyms |
| 🔗 | gridId | string | GRID identifier |
| 🔗 | wikidataId | string | Wikidata identifier |
| 🔗 | fundrefId | string | Crossref Funder Registry identifier |
| 👤 | isniIds | array | ISNI identifiers |
| 🌐 | wikipediaLink | string | Wikipedia page URL |
| 👥 | relatedOrganizations | array | Related institutions |
| 🏷️ | relationshipType | string | Type of relationship (parent, child, related) |
| 📅 | createdDate | string | When the ROR record was created |
| 📅 | lastModifiedDate | string | When the record was last updated |
| 📄 | schemaVersion | string | ROR schema version |
| 🔐 | recordSource | string | Source of the record data |
| 🌐 | externalLinks | array | Additional external links |
| 🏷️ | labels | array | Names in other languages |
| ⏰ | scrapedAt | string | Collection timestamp |
| ⚠️ | error | string | Error message if processing failed |

#### 📦 Sample records

<details>
<summary>📄 University with full identifiers</summary>

```json
{
  "rorId": "https://ror.org/03vek6s52",
  "name": "Harvard University",
  "organizationTypes": ["Education"],
  "websiteUrl": "https://www.harvard.edu",
  "establishedYear": 1636,
  "recordStatus": "active",
  "countryName": "United States",
  "countryCode": "US",
  "stateOrProvince": "Massachusetts",
  "city": "Cambridge",
  "geonamesId": "4931972",
  "latitude": 42.3770,
  "longitude": -71.1167,
  "aliases": [],
  "acronyms": [],
  "gridId": "grid.38142.3c",
  "wikidataId": "Q13371",
  "fundrefId": "100007229",
  "isniIds": ["0000 0001 2167 3675"],
  "wikipediaLink": "https://en.wikipedia.org/wiki/Harvard_University",
  "relatedOrganizations": [
    { "name": "Harvard Medical School", "type": "child" }
  ],
  "createdDate": "2019-01-15",
  "lastModifiedDate": "2025-09-20",
  "scrapedAt": "2026-04-16T12:00:00.000Z"
}
```

</details>

<details>
<summary>📄 Research institute with multilingual labels</summary>

```json
{
  "rorId": "https://ror.org/02jz4aj89",
  "name": "Max Planck Society",
  "organizationTypes": ["Nonprofit"],
  "websiteUrl": "https://www.mpg.de",
  "establishedYear": 1948,
  "recordStatus": "active",
  "countryName": "Germany",
  "countryCode": "DE",
  "stateOrProvince": "Bavaria",
  "city": "Munich",
  "latitude": 48.1351,
  "longitude": 11.5820,
  "aliases": ["Max-Planck-Gesellschaft"],
  "acronyms": ["MPG"],
  "gridId": "grid.4372.2",
  "wikidataId": "Q158085",
  "labels": [{ "label": "Max-Planck-Gesellschaft", "language": "de" }],
  "relatedOrganizations": [
    { "name": "Max Planck Institute for Physics", "type": "child" }
  ],
  "scrapedAt": "2026-04-16T12:00:00.000Z"
}
```

</details>

<details>
<summary>📄 Funding organization</summary>

```json
{
  "rorId": "https://ror.org/021nxhr62",
  "name": "National Science Foundation",
  "organizationTypes": ["Funder", "Government"],
  "websiteUrl": "https://www.nsf.gov",
  "establishedYear": 1950,
  "recordStatus": "active",
  "countryName": "United States",
  "countryCode": "US",
  "stateOrProvince": "Virginia",
  "city": "Alexandria",
  "latitude": 38.8048,
  "longitude": -77.0469,
  "acronyms": ["NSF"],
  "gridId": "grid.431093.c",
  "wikidataId": "Q304878",
  "fundrefId": "100000001",
  "scrapedAt": "2026-04-16T12:00:00.000Z"
}
```

</details>

***

### ✨ Why choose this Actor

| Feature | Details |
|---|---|
| 🌍 100,000+ organizations | Access the full ROR registry of research institutions |
| 🆔 Multi-identifier support | GRID, Wikidata, FundRef, and ISNI in one record |
| 📍 Geolocation data | GPS coordinates, city, state, and country for every organization |
| 🏷️ Organization types | Education, funder, healthcare, government, nonprofit, and more |
| 👥 Relationship mapping | Parent, child, and related organization networks |
| 🌐 Multilingual labels | Organization names in multiple languages |
| 📦 Flexible export | JSON, CSV, or Excel output |

> 📊 **Collect up to 1,000,000 research organization profiles per run with full identifiers, geolocation, and relationship data.**

***

### 📈 How it compares to alternatives

| Feature | This Actor | Manual API Queries | Generic Scrapers |
|---|---|---|---|
| Multi-identifier extraction | ✅ | Manual | ❌ |
| GPS coordinates | ✅ | ✅ | ❌ |
| Relationship mapping | ✅ | Manual | ❌ |
| Bulk collection (1M+ records) | ✅ | Manual | ❌ |
| Structured JSON/CSV output | ✅ | JSON only | Varies |
| Status filtering | ✅ | ✅ | ❌ |
| Scheduled runs | ✅ | ❌ | ❌ |

Get structured organization data at scale without writing API code or managing pagination.

***

### 🚀 How to use

1. **Create an Apify account** - [Sign up free with $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp)
2. **Open the ROR Scraper** - Navigate to the actor page on Apify
3. **Enter a search query or URL** - Type an organization name or paste a ROR search URL
4. **Set optional filters** - Filter by record status (active, inactive, withdrawn)
5. **Click Start** - The actor collects matching organizations and delivers structured data

> ⏱️ **A typical run with 50 organizations completes in under 1 minute.**

***

### 💼 Business use cases

<table>
<tr>
<td width="50%"><b>📚 Library Services</b>
<ul>
<li>Build institution directories with verified identifiers</li>
<li>Cross-reference organization IDs across catalog systems</li>
<li>Maintain up-to-date organization records</li>
<li>Map institutional hierarchies for consortia</li>
</ul>
</td>
<td width="50%"><b>💰 Grant Administration</b>
<ul>
<li>Verify applicant institution credentials</li>
<li>Cross-reference ROR IDs with FundRef records</li>
<li>Track organizational changes and mergers</li>
<li>Build validated institution databases for grant systems</li>
</ul>
</td>
</tr>
<tr>
<td width="50%"><b>📊 Bibliometric Analysis</b>
<ul>
<li>Map institutional research collaboration networks</li>
<li>Geocode research output by institution location</li>
<li>Track organization types across research fields</li>
<li>Analyze regional distribution of research institutions</li>
</ul>
</td>
<td width="50%"><b>🔧 System Integration</b>
<ul>
<li>Populate CRM systems with research organization data</li>
<li>Sync ROR identifiers with internal databases</li>
<li>Build autocomplete systems for institution selection</li>
<li>Maintain organization master data across platforms</li>
</ul>
</td>
</tr>
</table>

***

***

### 🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

<table>
<tr>
<td width="50%">

#### 🎓 Research and academia

- Empirical datasets for papers, thesis work, and coursework
- Longitudinal studies tracking changes across snapshots
- Reproducible research with cited, versioned data pulls
- Classroom exercises on data analysis and ethical scraping

</td>
<td width="50%">

#### 🎨 Personal and creative

- Side projects, portfolio demos, and indie app launches
- Data visualizations, dashboards, and infographics
- Content research for bloggers, YouTubers, and podcasters
- Hobbyist collections and personal trackers

</td>
</tr>
<tr>
<td width="50%">

#### 🤝 Non-profit and civic

- Transparency reporting and accountability projects
- Advocacy campaigns backed by public-interest data
- Community-run databases for local issues
- Investigative journalism on public records

</td>
<td width="50%">

#### 🧪 Experimentation

- Prototype AI and machine-learning pipelines with real data
- Validate product-market hypotheses before engineering spend
- Train small domain-specific models on niche corpora
- Test dashboard concepts with live input

</td>
</tr>
</table>

### 🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

- 💬 [**ChatGPT**](https://chat.openai.com/?q=How%20do%20I%20use%20the%20ROR%20%28Research%20Organization%20Registry%29%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🧠 [**Claude**](https://claude.ai/new?q=How%20do%20I%20use%20the%20ROR%20%28Research%20Organization%20Registry%29%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🔍 [**Perplexity**](https://perplexity.ai/search?q=How%20do%20I%20use%20the%20ROR%20%28Research%20Organization%20Registry%29%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🅒 [**Copilot**](https://copilot.microsoft.com/?q=How%20do%20I%20use%20the%20ROR%20%28Research%20Organization%20Registry%29%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)

### ❓ Frequently Asked Questions

<details>
<summary><b>💳 Do I need a paid Apify plan to run this actor?</b></summary>

No. You can start right now on the free Apify plan, which includes **$5 in free monthly credit**. That is enough to run this actor several times and explore the output before committing to anything. Paid plans unlock higher limits, more concurrent runs, and larger datasets. [Create a free Apify account here](https://console.apify.com/sign-up?fpr=vmoqkp) to get started.

</details>

<details>
<summary><b>🚨 What happens if my run fails or returns no results?</b></summary>

Failed runs are not charged. If the source site changes, proxies get rate-limited, or a specific input matches nothing, re-run the actor or open our [contact form](https://tally.so/r/BzdKgA) and we will investigate. You can also check the run log in the Apify console to see why the run stopped.

</details>

<details>
<summary><b>📏 How many items can I scrape per run?</b></summary>

Free users are limited to **10 items per run** so you can preview the output and confirm the actor works for your use case. Paid users can raise maxItems up to **1,000,000** per run. [Upgrade here](https://console.apify.com/sign-up?fpr=vmoqkp) if you need full scale.

</details>

<details>
<summary><b>🕒 How fresh is the data?</b></summary>

Every run fetches live data at the moment of execution. There is no cache or delay: the records you get reflect what the source returned at that moment. Schedule the actor to maintain a rolling snapshot of the data you need.

</details>

<details>
<summary><b>🧑‍💻 Can I call this actor from my own code?</b></summary>

Yes. Apify exposes every actor as a REST endpoint and ships first-class SDKs for [Node.js](https://docs.apify.com/sdk/js) and [Python](https://docs.apify.com/sdk/python). You can start a run, read the dataset, and handle webhooks from your own app in a few lines. All you need is your Apify API token.

</details>

<details>
<summary><b>📤 How do I export the data?</b></summary>

Every Apify dataset can be downloaded in one click from the console as CSV, JSON, JSONL, Excel, HTML, XML, or RSS. You can also pull results programmatically via the [Apify API](https://docs.apify.com/api/v2) or stream them into BigQuery, S3, and other destinations through built-in integrations.

</details>

<details>
<summary><b>📅 Can I schedule the actor to run automatically?</b></summary>

Yes. Use the Apify scheduler to run the actor on any cadence, from hourly to monthly. Results are saved to your dataset and can be delivered to webhooks, email, Slack, cloud storage, or automation tools such as Zapier and Make.

***

</details>

### 🔌 Automating ROR Scraper

Integrate the ROR Scraper into your workflow using the Apify API or client libraries.

**Node.js:**

```javascript
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const run = await client.actor("parseforge/ror-scraper").call({
  query: "university",
  recordStatus: "active",
  maxItems: 100
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);
```

**Python:**

```python
from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("parseforge/ror-scraper").call(run_input={
    "query": "university",
    "recordStatus": "active",
    "maxItems": 100
})
items = list(client.dataset(run["defaultDatasetId"]).iterate_items())
print(items)
```

- 📖 [Apify API reference](https://docs.apify.com/api/v2)
- 🐍 [Python client docs](https://docs.apify.com/api/client/python)
- 📦 [Node.js client docs](https://docs.apify.com/api/client/js)

**Schedules:** Set up recurring runs to track new organizations added to ROR, monitor status changes, or keep your institution database synchronized. Configure weekly or monthly schedules from the Apify Console.

### 🔌 Integrate with any app

- 🔗 **Make (Integromat)** - Connect organization data to Google Sheets, Salesforce, or any of 1,500+ apps
- 🔗 **Zapier** - Trigger workflows when new organizations are collected
- 🔗 **Slack** - Get notified when an organization data run completes
- 🔗 **Airbyte** - Stream organization data into your data warehouse
- 🔗 **GitHub** - Store organization datasets in repositories for version control
- 🔗 **Google Drive** - Automatically save CSV exports to shared folders

***

### 🔗 Recommended Actors

| Actor | Description |
|---|---|
| [Crossref Scraper](https://apify.com/parseforge/crossref-scraper) | Extract DOI metadata for 155M+ research publications |
| [PubMed Citation Scraper](https://apify.com/parseforge/pubmed-citation-scraper) | Extract publication metadata from PubMed for biomedical research |
| [OpenCitations Scraper](https://apify.com/parseforge/open-citations-scraper) | Collect citation networks and bibliographic metadata |
| [US Census Bureau Scraper](https://apify.com/parseforge/us-census-bureau-scraper) | Extract demographic and economic data from the Census Bureau |
| [Open Library Scraper](https://apify.com/parseforge/open-library-scraper) | Search and download book data from the Internet Archive |

> 💡 **Pro Tip:** Combine the ROR Scraper with the Crossref Scraper to match organization identifiers with publication metadata and analyze institutional research output.

***

**🆘 Need Help?** [**Open our contact form**](https://tally.so/r/BzdKgA) and we will get back to you within 24 hours. We are happy to help with custom setups, integrations, or feature requests.

***

> **Disclaimer:** This actor is not affiliated with, endorsed by, or connected to the Research Organization Registry (ROR) or its governing organizations. It accesses publicly available data from the ROR registry. Use responsibly and in accordance with applicable terms of service.

# Actor input Schema

## `startUrl` (type: `string`):

Direct URL to scrape from ROR. Use this OR search filters below, not both. Example: https://ror.org/search?query=university\&filter=active\&page=1

## `query` (type: `string`):

Search term to filter organizations. Use this OR startUrl above, not both.

## `recordStatus` (type: `string`):

Filter organizations by record status

## `maxItems` (type: `integer`):

Maximum number of organizations to scrape. Free users: Required, maximum 50. Paid users: Optional, maximum 1,000,000. Leave empty for unlimited (paid users only).

## Actor input object example

```json
{
  "startUrl": "https://ror.org/search?filter=&query=university",
  "maxItems": 10
}
```

# Actor output Schema

## `organizations` (type: `string`):

Complete organization dataset with all fields including ROR ID, name, status, types, locations, external identifiers, and relationships

## `overview` (type: `string`):

Overview view of organizations with key fields displayed in a table format

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrl": "https://ror.org/search?filter=&query=university",
    "maxItems": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/ror-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrl": "https://ror.org/search?filter=&query=university",
    "maxItems": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("parseforge/ror-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "startUrl": "https://ror.org/search?filter=&query=university",
  "maxItems": 10
}' |
apify call parseforge/ror-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/ror-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "ROR (Research Organization Registry) Scraper",
        "description": "Automate the collection of research organization data with our ROR scraper. Obtain names, aliases, locations, external identifiers (GRID, ISNI, Wikidata), relationships, and contacts from the Research Organizations Registry. Ideal for researchers, academic institutions, and data analysts.",
        "version": "1.0",
        "x-build-id": "UlBD2D5xpA1eADD1K"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~ror-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-ror-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~ror-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-ror-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~ror-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-ror-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "startUrl": {
                        "title": "Start URL",
                        "type": "string",
                        "description": "Direct URL to scrape from ROR. Use this OR search filters below, not both. Example: https://ror.org/search?query=university&filter=active&page=1"
                    },
                    "query": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search term to filter organizations. Use this OR startUrl above, not both."
                    },
                    "recordStatus": {
                        "title": "Record Status",
                        "enum": [
                            "active",
                            "inactive",
                            "withdrawn"
                        ],
                        "type": "string",
                        "description": "Filter organizations by record status"
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Maximum number of organizations to scrape. Free users: Required, maximum 50. Paid users: Optional, maximum 1,000,000. Leave empty for unlimited (paid users only)."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
