# Crunchbase Scraper (`parseforge/crunchbase-scraper`) Actor

Extract company data from Crunchbase profiles. Get funding rounds, investor lists, employee details, social links, operating status, and more from any company URL. No Crunchbase subscription needed. Process hundreds of profiles in a single run and export structured data as JSON, CSV, or Excel.

- **URL**: https://apify.com/parseforge/crunchbase-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Developer tools, Automation, Other
- **Stats:** 40 total users, 14 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per event

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.
Because this Actor supports Apify Store discounts, the higher your subscription plan, the lower the price.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 🏢 Crunchbase Scraper

> 🚀 **Pull company and person profiles from Crunchbase in minutes.** 100+ fields per profile. Funding, investors, founders, products, acquisitions. No login.

> 🕒 **Last updated:** 2026-05-09 · **📊 100+ fields** per profile · **🏢 Companies + people** · **🚫 No auth** required


<table><tr>
<td style="border-left:4px solid #0F766E;padding:12px 16px;font-weight:600">Pull structured records from Crunchbase — clean fields ready as CSV, JSON, JSONL, Excel, or XML for downstream pipelines.</td>
</tr></table>

##### Copy to your AI assistant

Copy this block into ChatGPT, Claude, Cursor, or any LLM to start using this Actor.

````

parseforge/crunchbase-scraper on Apify. Call: ApifyClient("TOKEN").actor("parseforge/crunchbase-scraper").call(run_input={...}), then client.dataset(run["defaultDatasetId"]).list_items().items for results. Key inputs: startUrls (array, default [{"url": "https://www.crunchbase.com/organization/openai"}, ...]), maxItems (integer, default 10), searchQuery (string), searchType (string, default "organizations"). Full actor spec: fetch build via GET https://api.apify.com/v2/acts/parseforge~crunchbase-scraper (Bearer TOKEN). Get token: https://console.apify.com/account/integrations

````

Pull live company and person profiles from Crunchbase, the canonical record of startup, VC, and tech-company data. The Actor accepts Crunchbase URLs (companies or people) or a keyword search, walks the result pages, and returns one structured record per profile, ready for VC research, sales prospecting, M&A intelligence, or founder-discovery workflows.

Every run fetches data live so you get the current state of Crunchbase at run time. Records include the company name, logo, founders, employees, funding rounds, total funding raised, investors, location, founding year, products, acquisitions, IPO status, web traffic estimates, patents, and a back-reference URL.

| 👥 Built for | 🎯 Primary use cases |
|---|---|
| Venture capital | Track new companies and funding rounds |
| Sales and BD teams | Build startup prospect lists with funding context |
| M&A advisors | Source target candidates by stage and sector |
| Researchers | Study startup ecosystem dynamics |
| BD and partnerships | Map ecosystem players for partner programs |
| Recruiters | Identify well-funded hiring companies |

---

### 📋 What the Crunchbase Scraper does

- 🏢 **Company or person mode.** Scrape company profiles (organizations) or person profiles.
- 🔗 **Direct URL.** Pass a list of Crunchbase URLs.
- 🔍 **Keyword search.** Search by name, sector, or keyword (mutually exclusive with URLs).
- 💰 **Funding history.** Total funding, last round, last round amount, lead investors.
- 👥 **Team data.** Founders, key people, total employees.
- 📊 **Operational signals.** Web traffic, patents, acquisitions, IPO status.

The scraper walks each Crunchbase profile, extracts 100+ fields, and pushes structured records to the dataset.

> 💡 **Why it matters:** Crunchbase is the canonical record of startup data but its API is paywalled and its UI lacks bulk export for free users. A live, structured pull beats manual lookup for VC research, sales prospecting, and M&A intelligence.

---

### 🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing setup, a live run, and how to pipe results into Salesforce or HubSpot via Apify integrations.

---

### ⚙️ Input

| Field | Type | Name | Description |
|---|---|---|---|
| startUrls | array | Start URLs | Crunchbase organization or person URLs. Mutually exclusive with searchQuery. |
| maxItems | integer | Max Items | Free users: limited to 10 items (preview). Paid users: optional, max 1,000,000. |
| searchQuery | string | Search Query | Free-text search query (mutually exclusive with startUrls). |
| searchType | enum | Search Type | organizations (companies) or people. |
| proxyConfiguration | object | Proxy configuration | Apify Proxy with the **RESIDENTIAL** group. **Required** and used by default. |

> 🛡️ **Proxy is required.** Crunchbase is protected by Cloudflare, which blocks datacenter IPs with a security challenge that never clears. This Actor routes the browser through **Apify Proxy with the RESIDENTIAL group by default**, which is the only configuration confirmed to pass Cloudflare. Residential proxy usage consumes residential proxy credits. You may override the proxy in the input, but datacenter or no proxy will be blocked by Cloudflare and the run will fail fast with a clear "residential proxy required" message.

Example 1. Direct URL lookup of three companies.

```json
{
  "startUrls": [
    { "url": "https://www.crunchbase.com/organization/openai" },
    { "url": "https://www.crunchbase.com/organization/anthropic" },
    { "url": "https://www.crunchbase.com/organization/stripe" }
  ],
  "maxItems": 3
}
```

Example 2. Keyword search for AI companies.

```json
{
  "searchQuery": "generative AI startup",
  "searchType": "organizations",
  "maxItems": 50
}
```

> ⚠️ **Good to Know:** when startUrls is set, searchQuery is ignored. URL mode is more precise; search mode is broader.
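
The input rules above (URL mode or search mode, never both, with the residential proxy as the default) can be checked before a run starts. The following is a minimal sketch rather than part of the Actor: `build_run_input` is a hypothetical helper, and its limits simply mirror the `maxItems` bounds and `searchType` values documented here.

```python
from typing import Any, Optional


def build_run_input(
    start_urls: Optional[list[str]] = None,
    search_query: Optional[str] = None,
    search_type: str = "organizations",
    max_items: int = 10,
) -> dict[str, Any]:
    """Hypothetical helper: build an input object in URL mode or search mode."""
    # Exactly one of the two modes must be chosen.
    if bool(start_urls) == bool(search_query):
        raise ValueError("Provide exactly one of start_urls or search_query.")
    if search_type not in ("organizations", "people"):
        raise ValueError("search_type must be 'organizations' or 'people'.")
    if not 1 <= max_items <= 1_000_000:
        raise ValueError("max_items must be between 1 and 1,000,000.")

    run_input: dict[str, Any] = {
        "maxItems": max_items,
        # Same proxy settings as the documented default.
        "proxyConfiguration": {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        },
    }
    if start_urls:
        run_input["startUrls"] = [{"url": url} for url in start_urls]
    else:
        run_input["searchQuery"] = search_query
        run_input["searchType"] = search_type
    return run_input
```

The returned dictionary can be passed as `run_input` to `client.actor("parseforge/crunchbase-scraper").call(...)`.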

***

### 📊 Output

The dataset returns one structured record per profile. Each record carries identifiers, name, logo, founders, funding, location, products, acquisitions, IPO status, and a back-reference URL. Consume the dataset as JSON, CSV, Excel, XML, or RSS via the Apify console or API.

#### 🧾 Schema

| Field | Type | Example |
|---|---|---|
| 🆔 cbUrl | string (url) | `https://www.crunchbase.com/organization/openai` |
| 🏢 name | string | OpenAI |
| 🖼️ logoUrl | string (url) | `https://res.cloudinary.com/.../openai.jpg` |
| 📝 description | string | `OpenAI develops AI products and services` |
| 🌐 website | string (url) | `https://openai.com` |
| 🗓️ founded | string | `2015` |
| 📍 headquarters | string | `San Francisco, California, USA` |
| 🏷️ industries | array | `["Artificial Intelligence", "Machine Learning", "Software"]` |
| 👥 employeeCount | string | `1001-5000` |
| 💰 totalFundingUsd | number | `13000000000` |
| 📊 lastRoundType | string | `Series E` |
| 💵 lastRoundAmountUsd | number | `6500000000` |
| 🗓️ lastRoundDate | ISO date | `2026-02-01` |
| 👤 founders | array | `[{"name":"Sam Altman","role":"CEO"}]` |
| 💼 leadInvestors | array | `["Microsoft", "Tiger Global"]` |
| 🏢 acquisitions | array | `[]` |
| 📈 ipoStatus | string | Private |
| 🌐 webTrafficRank | number | `45` |
| 🏷️ patentsFiled | number | `120` |
| 📅 scrapedAt | ISO datetime | `2026-05-09T12:00:00.000Z` |
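
Several fields in the schema are arrays (`industries`, `founders`, `leadInvestors`), which do not fit into a single spreadsheet cell as-is. If you post-process the JSON items yourself, one option is to flatten them first. The sketch below is illustrative only: `flatten_record` and `records_to_csv` are hypothetical helpers, and joining values with `"; "` is an arbitrary choice.

```python
import csv
import io
import json
from typing import Any


def flatten_record(record: dict[str, Any]) -> dict[str, Any]:
    """Turn list-valued fields into a single '; '-joined string."""
    flat: dict[str, Any] = {}
    for key, value in record.items():
        if isinstance(value, list):
            parts = []
            for item in value:
                if isinstance(item, dict):
                    # Objects such as founders: keep the name when present.
                    parts.append(str(item.get("name", json.dumps(item))))
                else:
                    parts.append(str(item))
            flat[key] = "; ".join(parts)
        else:
            flat[key] = value
    return flat


def records_to_csv(records: list[dict[str, Any]]) -> str:
    """Serialize records to CSV, using the union of keys as the header."""
    rows = [flatten_record(record) for record in records]
    fieldnames: list[str] = []
    for row in rows:
        for key in row:
            if key not in fieldnames:
                fieldnames.append(key)
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, restval="")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Using the union of keys as the header keeps sparse records in the same file as full ones, with missing fields left empty.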

#### 📦 Sample records

##### 1. High-profile company (full record)

```json
{
  "cbUrl": "https://www.crunchbase.com/organization/openai",
  "name": "OpenAI",
  "logoUrl": "https://res.cloudinary.com/abc/openai.jpg",
  "description": "OpenAI develops AI products and services for businesses and consumers.",
  "website": "https://openai.com",
  "founded": "2015",
  "headquarters": "San Francisco, California, USA",
  "industries": ["Artificial Intelligence", "Machine Learning", "Software"],
  "employeeCount": "1001-5000",
  "totalFundingUsd": 13000000000,
  "lastRoundType": "Series E",
  "lastRoundAmountUsd": 6500000000,
  "lastRoundDate": "2026-02-01",
  "founders": [
    {"name": "Sam Altman", "role": "CEO"},
    {"name": "Greg Brockman", "role": "President"}
  ],
  "leadInvestors": ["Microsoft", "Tiger Global", "Sequoia Capital"],
  "ipoStatus": "Private",
  "scrapedAt": "2026-05-09T12:00:00.000Z"
}
```

##### 2. Early-stage startup (Seed)

```json
{
  "cbUrl": "https://www.crunchbase.com/organization/acme-ai",
  "name": "Acme AI",
  "description": "AI agents for B2B back-office workflows.",
  "website": "https://acme-ai.com",
  "founded": "2024",
  "headquarters": "San Francisco, California, USA",
  "industries": ["Artificial Intelligence", "B2B"],
  "employeeCount": "11-50",
  "totalFundingUsd": 8000000,
  "lastRoundType": "Seed",
  "lastRoundAmountUsd": 8000000,
  "lastRoundDate": "2026-04-15",
  "founders": [
    {"name": "Jane Smith", "role": "CEO"},
    {"name": "John Doe", "role": "CTO"}
  ],
  "leadInvestors": ["Y Combinator"],
  "ipoStatus": "Private",
  "scrapedAt": "2026-05-09T12:00:00.000Z"
}
```

##### 3. Person profile (sparse)

```json
{
  "cbUrl": "https://www.crunchbase.com/person/sam-altman",
  "name": "Sam Altman",
  "description": "CEO at OpenAI",
  "currentRole": "CEO at OpenAI",
  "previousRoles": ["President at Y Combinator"],
  "education": ["Stanford University (dropped out)"],
  "scrapedAt": "2026-05-09T12:00:00.000Z"
}
```
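
As the samples show, organization and person records carry different fields, and sparse records omit keys entirely. Downstream code therefore benefits from branching on the profile type and reading fields defensively. A minimal sketch, assuming the profile type can be inferred from the first path segment of `cbUrl` as in the sample URLs; `profile_kind` and `summarize` are hypothetical names.

```python
from typing import Any
from urllib.parse import urlparse


def profile_kind(cb_url: str) -> str:
    """Return 'organization', 'person', or 'unknown' based on the URL path."""
    parts = [part for part in urlparse(cb_url).path.split("/") if part]
    if len(parts) >= 2 and parts[0] in ("organization", "person"):
        return parts[0]
    return "unknown"


def summarize(record: dict[str, Any]) -> str:
    """Build a one-line summary without assuming optional fields exist."""
    kind = profile_kind(record.get("cbUrl", ""))
    name = record.get("name", "(unnamed)")
    if kind == "organization":
        total = record.get("totalFundingUsd")
        funding = f"${total:,}" if isinstance(total, (int, float)) else "funding unknown"
        return f"{name}: {funding}, last round {record.get('lastRoundType', 'n/a')}"
    if kind == "person":
        return f"{name}: {record.get('currentRole', 'role unknown')}"
    return f"{name}: unrecognized profile type"
```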

***

### ✨ Why choose this Actor

| | Capability |
|---|---|
| 🎯 | **Built for the job.** Scoped specifically to Crunchbase so you skip the parser engineering entirely. |
| 🔖 | **Structured output.** 100+ fields ready for analysis, dashboards, or downstream pipelines. |
| ⚡ | **Fast.** Optimized request patterns return results in seconds, not minutes. |
| 🔁 | **Always fresh.** Every run pulls live data, so the dataset reflects Crunchbase as of run time. |
| 🌐 | **No infra to manage.** Apify handles proxies, retries, scaling, scheduling, and storage. |
| 🛡️ | **Reliable.** Battle-tested across many runs and edge cases, with graceful error handling. |
| 🚫 | **No code required.** Configure in the UI, run from the CLI, schedule via cron, or call from any language through the Apify API. |

> 📊 Production-grade structured startup data without the engineering overhead of building and maintaining your own scraper.

***

### 📈 How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| **⭐ Crunchbase Scraper** *(this Actor)* | $5 free credit, then pay-per-use | Public Crunchbase profiles | **Live per run** | URL or keyword search | ⚡ 2 min |
| Crunchbase Pro / Enterprise API | $$$ monthly per seat | Full | Live | Vendor-defined | ⏳ Hours |
| Build your own scraper | Engineering hours | Full once built | Whenever you maintain it | Custom code | 🐢 Days to weeks |
| Manual searches | Hours per check | Limited | Stale | Manual | 🕒 Variable |

Pick this Actor when you want broad coverage, source-native filtering, and no pipeline maintenance.

***

### 🚀 How to use

1. 📝 **Sign up.** [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp) (takes 2 minutes).
2. 🌐 **Open the Actor.** Go to the Crunchbase Scraper page on the Apify Store.
3. 🎯 **Set inputs.** Either paste Crunchbase URLs or set a search query and search type.
4. 🚀 **Run it.** Click **Start** and let the Actor collect your data.
5. 📥 **Download.** Grab your results in the **Dataset** tab as CSV, Excel, JSON, or XML.

> ⏱️ Total time from signup to downloaded dataset: **3-5 minutes.** No coding required.

***

### 💼 Business use cases

<table>
<tr>
<td width="50%" valign="top">

#### 📊 VC and investor research

- Track new companies and funding rounds
- Build watchlists by industry and stage
- Map founder profiles across portfolios
- Surface hiring signals as growth indicators

</td>
<td width="50%" valign="top">

#### 🏢 Sales and BD

- Build outbound prospect lists of well-funded startups
- Filter by recent round for fresh-money outreach
- Source product partner candidates by industry
- Power CRM enrichment with funding context

</td>
</tr>
<tr>
<td width="50%" valign="top">

#### 🎯 M&A and corporate development

- Source acquisition targets by stage and sector
- Build comp sets for valuation discussions
- Track competitor M&A activity
- Power deal-pipeline workflows

</td>
<td width="50%" valign="top">

#### 🛠️ Engineering and product

- Prototype startup-data products without owning a crawler
- Replace fragile in-house Crunchbase scrapers
- Wire datasets into your apps via the Apify API or webhooks
- Skip the proxy, retry, and parsing maintenance entirely

</td>
</tr>
</table>

***

### 🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

<table>
<tr>
<td width="50%">

#### 🎓 Research and academia

- Empirical datasets for papers, thesis work, and coursework
- Longitudinal studies tracking changes across snapshots
- Reproducible research with cited, versioned data pulls
- Classroom exercises on data analysis and ethical scraping

</td>
<td width="50%">

#### 🎨 Personal and creative

- Side projects, portfolio demos, and indie app launches
- Data visualizations, dashboards, and infographics
- Content research for bloggers, YouTubers, and podcasters
- Hobbyist collections and personal trackers

</td>
</tr>
<tr>
<td width="50%">

#### 🤝 Non-profit and civic

- Transparency reporting and accountability projects
- Advocacy campaigns backed by public-interest data
- Community-run databases for local issues
- Investigative journalism on public records

</td>
<td width="50%">

#### 🧪 Experimentation

- Prototype AI and machine-learning pipelines with real data
- Validate product-market hypotheses before engineering spend
- Train small domain-specific models on niche corpora
- Test dashboard concepts with live input

</td>
</tr>
</table>

***

### 🔌 Automating Crunchbase Scraper

This Actor exposes a REST endpoint, so you can drive it from any language or workflow tool.

- **Node.js** - call it via the [Apify JavaScript client](https://docs.apify.com/api/client/js).
- **Python** - call it via the [Apify Python client](https://docs.apify.com/api/client/python).
- **REST** - hit it directly through the [Apify v2 API](https://docs.apify.com/api/v2).

**Schedules.** Use Apify Scheduler to capture daily snapshots of target companies. Combine with the Apify dataset diff tools to track funding round changes between runs.
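
If you prefer to compare snapshots in your own code, two dataset exports can be diffed by `cbUrl`. The following is a minimal sketch under that assumption: `diff_funding` is a hypothetical helper, and the watched fields are taken from the output schema.

```python
from typing import Any

# Funding-related fields from the output schema.
WATCHED_FIELDS = (
    "totalFundingUsd",
    "lastRoundType",
    "lastRoundAmountUsd",
    "lastRoundDate",
)


def diff_funding(
    previous: list[dict[str, Any]],
    current: list[dict[str, Any]],
) -> list[dict[str, Any]]:
    """Report profiles that are new or whose funding fields changed."""
    previous_by_url = {r["cbUrl"]: r for r in previous if "cbUrl" in r}
    changes: list[dict[str, Any]] = []
    for record in current:
        url = record.get("cbUrl")
        if url is None:
            continue  # Cannot match a record without its identifier.
        old = previous_by_url.get(url)
        if old is None:
            changes.append({"cbUrl": url, "change": "new"})
            continue
        # Map each changed field to an (old value, new value) pair.
        changed = {
            field: (old.get(field), record.get(field))
            for field in WATCHED_FIELDS
            if old.get(field) != record.get(field)
        }
        if changed:
            changes.append({"cbUrl": url, "change": "updated", "fields": changed})
    return changes
```

Each entry in the result names the profile and, for updates, maps every changed field to its old and new value.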

***

### ❓ Frequently Asked Questions

<details>
<summary><b>💳 Do I need a paid Apify plan to run this Actor?</b></summary>

No. You can start right now on the free Apify plan, which includes **$5 in monthly credit**. That is enough to run the scraper several times and explore the output. Paid plans unlock higher item caps, more concurrent runs, and larger datasets. [Create a free Apify account here](https://console.apify.com/sign-up?fpr=vmoqkp).

</details>

<details>
<summary><b>🛡️ Why does this Actor need a residential proxy?</b></summary>

Crunchbase is fronted by Cloudflare, which blocks datacenter IP addresses (including Apify's platform IPs) with a security challenge that never clears. To reach the data the Actor must use **Apify Proxy with the RESIDENTIAL group**, which is the only configuration confirmed to pass Cloudflare. This is the **default** proxy setting, so the Actor works out of the box. Residential proxy traffic consumes residential proxy credits on your Apify plan. You can override the proxy in the **Proxy configuration** input, but datacenter or no proxy will be blocked and the run fails fast with a "residential proxy required" message.

</details>

<details>
<summary><b>🚨 What happens if my run fails or returns no results?</b></summary>

Failed runs are not charged. If Crunchbase changes its DOM, proxies get rate-limited, or your URLs match nothing, re-run the Actor or open our [contact form](https://tally.so/r/BzdKgA) and we will look into it.

</details>

<details>
<summary><b>📏 How many items can I scrape per run?</b></summary>

Free users are limited to **10 items per run** so you can preview the output. Paid users can raise maxItems up to **1,000,000** per run.

</details>

<details>
<summary><b>🕒 How fresh is the data?</b></summary>

Every run fetches live data at the moment of execution. There is no cache or delay: records reflect what Crunchbase returned at run time.

</details>

<details>
<summary><b>🧑‍💻 Can I call this Actor from my own code?</b></summary>

Yes. Apify exposes every Actor as a REST endpoint and ships official API clients for [Node.js](https://docs.apify.com/api/client/js) and [Python](https://docs.apify.com/api/client/python). You can start a run, read the dataset, and handle webhooks from your own app in a few lines.

</details>

<details>
<summary><b>📤 How do I export the data?</b></summary>

Every Apify dataset can be downloaded in one click as CSV, JSON, JSONL, Excel, HTML, XML, or RSS. You can also pull results programmatically via the [Apify API](https://docs.apify.com/api/v2) or stream into BigQuery, S3, and other destinations through built-in integrations.
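
For programmatic export, a download URL can be assembled per format. This is a minimal sketch, assuming the Apify API v2 dataset items endpoint and its `format` query parameter (with `xlsx` as the value for Excel); `dataset_export_url` is a hypothetical helper, and the request still needs your API token, for example in an `Authorization: Bearer` header.

```python
from urllib.parse import quote, urlencode

# Format values assumed to match the export formats listed above.
SUPPORTED_FORMATS = {"json", "jsonl", "csv", "xlsx", "html", "xml", "rss"}


def dataset_export_url(dataset_id: str, fmt: str = "csv") -> str:
    """Build the items URL for a dataset in the requested export format."""
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"Unsupported format: {fmt}")
    query = urlencode({"format": fmt})
    return f"https://api.apify.com/v2/datasets/{quote(dataset_id, safe='')}/items?{query}"
```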

</details>

<details>
<summary><b>📅 Can I schedule the Actor to run automatically?</b></summary>

Yes. Use the Apify scheduler to run the Actor on any cadence, from hourly to monthly. Results are saved to your dataset and can be delivered to webhooks, email, Slack, cloud storage, or automation tools such as Zapier and Make.

</details>

<details>
<summary><b>🏪 Can I use the data commercially?</b></summary>

Yes. The scraped data is yours to use in your own internal pipelines, products, and reports, subject to the terms of service of the source site.

</details>

<details>
<summary><b>💼 Which plan should I pick for production use?</b></summary>

Apify's Starter and Scale plans are designed for production workloads. Pick the plan that matches your dataset size and refresh cadence.

</details>

<details>
<summary><b>🛠️ Can you add Crunchbase Pro fields?</b></summary>

Open the [contact form](https://tally.so/r/BzdKgA) and tell us about your use case. We can extract any field that is exposed on the public profile.

</details>

<details>
<summary><b>⚖️ Is scraping Crunchbase legal?</b></summary>

This Actor only collects data from publicly accessible Crunchbase pages, the same content any visitor can read. Public web scraping is generally legal in most jurisdictions for non-personal data, but laws vary by country and use case. You are responsible for compliance with the source site's Terms of Service and applicable law.

</details>

***

### 🔌 Integrate with any app

Crunchbase Scraper connects to any cloud service via [Apify integrations](https://apify.com/integrations):

- [**Make**](https://docs.apify.com/platform/integrations/make) - Automate multi-step workflows
- [**Zapier**](https://docs.apify.com/platform/integrations/zapier) - Connect with 5,000+ apps
- [**Slack**](https://docs.apify.com/platform/integrations/slack) - Get run notifications in your channels
- [**Airbyte**](https://docs.apify.com/platform/integrations/airbyte) - Pipe results into your warehouse
- [**GitHub**](https://docs.apify.com/platform/integrations/github) - Trigger runs from commits and releases
- [**Google Drive**](https://docs.apify.com/platform/integrations/drive) - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes.
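
On the receiving side, a webhook handler usually needs only the dataset ID of the finished run. The sketch below assumes the default Apify webhook payload, in which `eventType` is `ACTOR.RUN.SUCCEEDED` for a successful run and `resource` holds the run object; `dataset_id_from_webhook` is a hypothetical helper, and the payload shape should be verified against your own webhook configuration.

```python
from typing import Any, Optional


def dataset_id_from_webhook(payload: dict[str, Any]) -> Optional[str]:
    """Return the run's default dataset ID for successful runs, else None."""
    # Assumed event name for a run that finished successfully.
    if payload.get("eventType") != "ACTOR.RUN.SUCCEEDED":
        return None
    resource = payload.get("resource") or {}
    return resource.get("defaultDatasetId")
```

The returned ID can then be passed to `client.dataset(...)` to read the items.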

***

### 🔗 Recommended Actors

- [**🚀 Y Combinator Companies Scraper**](https://apify.com/parseforge/y-combinator-scraper) - YC-funded startup directory
- [**📈 PitchBook Companies Scraper**](https://apify.com/parseforge/pitchbook-companies-scraper) - Private company data with investors
- [**📋 PitchBook Investors Scraper**](https://apify.com/parseforge/pitchbook-investors-scraper) - Investor profiles and portfolios
- [**🏢 Dun & Bradstreet Company Scraper**](https://apify.com/parseforge/dnb-scraper) - 500M+ business directory with DUNS
- [**📋 SEC EDGAR Full-Text Search Scraper**](https://apify.com/parseforge/sec-edgar-full-text-search-scraper) - U.S. SEC filings full-text search

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for more reference-data scrapers.

***

**🆘 Need Help?** [**Open our contact form**](https://tally.so/r/BzdKgA) to request a new scraper, propose a custom project, or report an issue.

***

> ⚠️ **Disclaimer.** This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Crunchbase. All trademarks mentioned are the property of their respective owners. The scraper accesses only publicly available pages and is intended for legitimate research, analytics, and lead-generation use. Users are responsible for compliance with the source site's Terms of Service and applicable law.

# Actor input Schema

## `startUrls` (type: `array`):

Crunchbase company or person profile URLs. Example: https://www.crunchbase.com/organization/openai or https://www.crunchbase.com/person/sam-altman

## `maxItems` (type: `integer`):

Free users: Limited to 100. Paid users: Optional, max 1,000,000

## `searchQuery` (type: `string`):

Search for companies or people by keyword. Use this OR Start URLs, not both.

## `searchType` (type: `string`):

What to search for: companies or people.

## `proxyConfiguration` (type: `object`):

Crunchbase is protected by Cloudflare, which blocks datacenter IPs with a security challenge that never clears. Apify Proxy with the RESIDENTIAL group is REQUIRED for this Actor to work and is the default. Residential proxy usage consumes residential proxy credits. You can override this, but datacenter or no proxy will be blocked by Cloudflare and the run will fail fast.

## Actor input object example

```json
{
  "startUrls": [
    {
      "url": "https://www.crunchbase.com/organization/openai"
    },
    {
      "url": "https://www.crunchbase.com/organization/anthropic"
    },
    {
      "url": "https://www.crunchbase.com/organization/stripe"
    }
  ],
  "maxItems": 10,
  "searchType": "organizations",
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```

# Actor output Schema

## `overview` (type: `string`):

Table view with the most important profile fields.

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        {
            "url": "https://www.crunchbase.com/organization/openai"
        },
        {
            "url": "https://www.crunchbase.com/organization/anthropic"
        },
        {
            "url": "https://www.crunchbase.com/organization/stripe"
        }
    ],
    "maxItems": 10,
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ]
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/crunchbase-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrls": [
        { "url": "https://www.crunchbase.com/organization/openai" },
        { "url": "https://www.crunchbase.com/organization/anthropic" },
        { "url": "https://www.crunchbase.com/organization/stripe" },
    ],
    "maxItems": 10,
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}

# Run the Actor and wait for it to finish
run = client.actor("parseforge/crunchbase-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```
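
If you would rather not install the client library, the `run-sync-get-dataset-items` endpoint listed in the OpenAPI specification below can be called with the standard library alone. This is a minimal sketch, assuming the endpoint responds with the dataset items as a JSON array; `build_sync_request` and `run_sync` are hypothetical helper names.

```python
import json
import urllib.request
from typing import Any
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "parseforge~crunchbase-scraper"


def build_sync_request(token: str, run_input: dict[str, Any]) -> urllib.request.Request:
    """Prepare a POST request that runs the Actor and returns its items."""
    query = urlencode({"token": token})
    url = f"{API_BASE}/acts/{ACTOR_PATH}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_sync(token: str, run_input: dict[str, Any], timeout: float = 300) -> Any:
    """Send the request and decode the JSON response (performs network I/O)."""
    request = build_sync_request(token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Splitting request construction from sending keeps the URL and body easy to inspect or log before any network call is made.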

## CLI example

```bash
echo '{
  "startUrls": [
    {
      "url": "https://www.crunchbase.com/organization/openai"
    },
    {
      "url": "https://www.crunchbase.com/organization/anthropic"
    },
    {
      "url": "https://www.crunchbase.com/organization/stripe"
    }
  ],
  "maxItems": 10,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}' |
apify call parseforge/crunchbase-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/crunchbase-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Crunchbase Scraper",
        "description": "Extract company data from Crunchbase profiles. Get funding rounds, investor lists, employee details, social links, operating status, and more from any company URL. No Crunchbase subscription needed. Process hundreds of profiles in a single run and export structured data as JSON, CSV, or Excel.",
        "version": "2.0",
        "x-build-id": "PtTkL893n4VCGhlC6"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~crunchbase-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-crunchbase-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~crunchbase-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-crunchbase-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~crunchbase-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-crunchbase-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "startUrls": {
                        "title": "Start URLs",
                        "type": "array",
                        "description": "Crunchbase company or person profile URLs. Example: https://www.crunchbase.com/organization/openai or https://www.crunchbase.com/person/sam-altman",
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Free users: Limited to 100. Paid users: Optional, max 1,000,000"
                    },
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search for companies or people by keyword. Use this OR Start URLs, not both."
                    },
                    "searchType": {
                        "title": "Search Type",
                        "enum": [
                            "organizations",
                            "people"
                        ],
                        "type": "string",
                        "description": "What to search for: companies or people.",
                        "default": "organizations"
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Crunchbase is protected by Cloudflare, which blocks datacenter IPs with a security challenge that never clears. Apify Proxy with the RESIDENTIAL group is REQUIRED for this actor to work and is the default. Residential proxy consumes residential proxy credits. You can override this, but datacenter or no proxy will be blocked by Cloudflare and the run will fail fast.",
                        "default": {
                            "useApifyProxy": true,
                            "apifyProxyGroups": [
                                "RESIDENTIAL"
                            ]
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
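The `inputSchema` and the `run-sync` path in the definition above can be exercised with a small helper. The following is a minimal sketch, not the Actor's official client: the function names `build_input` and `build_run_sync_url` are hypothetical, and the base URL `https://api.apify.com/v2` is an assumption, since the `servers` entry is not shown in this part of the definition. It builds an input object that respects the schema constraints (`startUrls` or `searchQuery` but not both, `maxItems` between 1 and 1,000,000, the `searchType` enum, and the default RESIDENTIAL proxy configuration) and composes the request URL with the `token` query parameter.

```python
import json
from urllib.parse import urlencode

# Assumption: the standard Apify API base URL; the spec's "servers" entry is not shown here.
API_BASE = "https://api.apify.com/v2"
# Path taken from the OpenAPI definition above.
RUN_SYNC_PATH = "/acts/parseforge~crunchbase-scraper/run-sync"

SEARCH_TYPES = ("organizations", "people")


def build_input(start_urls=None, search_query=None,
                search_type="organizations", max_items=None):
    """Assemble an input object that follows the inputSchema constraints (hypothetical helper)."""
    # The schema description says: use searchQuery OR startUrls, not both.
    if bool(start_urls) == bool(search_query):
        raise ValueError("Provide exactly one of start_urls or search_query")
    if search_type not in SEARCH_TYPES:
        raise ValueError(f"search_type must be one of {SEARCH_TYPES}")
    if max_items is not None and not 1 <= max_items <= 1_000_000:
        raise ValueError("max_items must be between 1 and 1,000,000")

    # Default proxy configuration copied from the schema's "default" value.
    payload = {
        "proxyConfiguration": {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        }
    }
    if start_urls:
        # Each start URL is an object with a required "url" property.
        payload["startUrls"] = [{"url": url} for url in start_urls]
    else:
        payload["searchQuery"] = search_query
        payload["searchType"] = search_type
    if max_items is not None:
        payload["maxItems"] = max_items
    return payload


def build_run_sync_url(token):
    """Compose the run-sync URL with the required 'token' query parameter."""
    return f"{API_BASE}{RUN_SYNC_PATH}?{urlencode({'token': token})}"


if __name__ == "__main__":
    example = build_input(
        start_urls=["https://www.crunchbase.com/organization/openai"],
        max_items=10,
    )
    print(json.dumps(example, indent=2))
```

The resulting object would be sent as the JSON body of a `POST` request to the composed URL. Keeping validation separate from the HTTP call makes it possible to check inputs before spending any proxy credits.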
#### 📊 VC and investor research

- Track new companies and funding rounds
- Build watchlists by industry and stage
- Map founder profiles across portfolios
- Surface hiring signals as growth indicators

#### 🏢 Sales and BD

- Build outbound prospect lists of well-funded startups
- Filter by recent round for fresh-money outreach
- Source product partner candidates by industry
- Power CRM enrichment with funding context

#### 🎯 M&A and corporate development

- Source acquisition targets by stage and sector
- Build comp sets for valuation discussions
- Track competitor M&A activity
- Power deal-pipeline workflows

#### 🛠️ Engineering and product

- Prototype startup-data products without owning a crawler
- Replace fragile in-house Crunchbase scrapers
- Wire datasets into your apps via the Apify API or webhooks
- Skip the proxy, retry, and parsing maintenance entirely

#### 🎓 Research and academia

- Empirical datasets for papers, thesis work, and coursework
- Longitudinal studies tracking changes across snapshots
- Reproducible research with cited, versioned data pulls
- Classroom exercises on data analysis and ethical scraping

#### 🎨 Personal and creative

- Side projects, portfolio demos, and indie app launches
- Data visualizations, dashboards, and infographics
- Content research for bloggers, YouTubers, and podcasters
- Hobbyist collections and personal trackers

#### 🤝 Non-profit and civic

- Transparency reporting and accountability projects
- Advocacy campaigns backed by public-interest data
- Community-run databases for local issues
- Investigative journalism on public records

#### 🧪 Experimentation

- Prototype AI and machine-learning pipelines with real data
- Validate product-market hypotheses before engineering spend
- Train small domain-specific models on niche corpora
- Test dashboard concepts with live input