# Y Combinator Companies Scraper (`parseforge/y-combinator-scraper`) Actor

Extract company profiles, founders, and open job listings from the Y Combinator directory. Filter by batch, industry, subindustry, region, and hiring status. Covers 5,700+ funded startups from W05 to the latest YC cohort. Includes growth stage, equity ranges, salary data, and contact emails.

- **URL**: https://apify.com/parseforge/y-combinator-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Social media, Lead generation, Other
- **Stats:** 26 total users, 9 monthly users, 100.0% runs succeeded, 1 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per event

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 🚀 Y Combinator Companies Scraper

> 🚀 **Pull 5,700+ Y Combinator-funded startups in minutes.** Companies, founders, batches, industries, open jobs. No API key, no manual CSV wrangling.

> 🕒 **Last updated:** 2026-05-08 · **📊 30+ fields** per company · **🎓 W05 to current batch** · **💼 Open jobs included** · **🚫 No auth** required

Pull live company data from the Y Combinator directory, the canonical record of every YC-funded startup since the very first batch. The actor walks the YC catalog with your filter combination, paginates through results, fetches each company detail page, and returns one structured record per company ready for investor research, sales prospecting, lead-gen, or talent sourcing.

Every run fetches data live so you get the current state of the YC directory, not a stale dump. Records include logo URL, batch, batchName, growth stage, year founded, team size, location, founder names with bios, social handles, current job listings with salary and equity ranges, and a back-reference URL to the canonical YC profile.

| 👥 Built for | 🎯 Primary use cases |
|---|---|
| Venture capital and angels | Track new YC batches as they launch |
| Sales and BD teams | Build prospect lists of YC-funded startups |
| Recruiters | Source candidates from YC company hiring pages |
| Founders and operators | Map competitor landscape and funding signals |
| Researchers and journalists | Study startup ecosystem trends across batches |
| BizDev and partnerships | Identify integration partners by industry |

---

### 📋 What the Y Combinator Companies Scraper does

- 🎓 **Filter by batch.** Pass batch codes like W25, S25, X25, F25 or full names.
- 🏭 **Industry filters.** B2B, Consumer, Healthcare, Fintech, Industrials, Real Estate, Education, Government.
- 🔬 **Sub-industries.** Drill down into Payments, Drug Discovery, Engineering, Product and Design, etc.
- 🌍 **Region filters.** USA, Europe, Latin America, South Asia, Southeast Asia, Africa, India, UK.
- 📊 **Status and stage.** Active, Public, Acquired, Inactive plus the YC growth stage.
- 💼 **Hiring filter.** Return only companies with open job listings.
- ⭐ **Top Companies.** Filter to YC's curated Top Companies list.

The scraper accepts any combination of these filters, builds the matching YC search URL, and walks the result pages. For each company it fetches the detail page to extract founders, social handles, job listings (with salary and equity ranges), and the full company description.

> 💡 **Why it matters:** the YC directory is the canonical record of YC-funded startups but its UI is paginated, JS-rendered, and lacks bulk export. A live, structured pull beats manual sourcing for VC research, BD outreach, and recruiting at scale.

---

### 🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing setup, a live run, and how to pipe results into Salesforce or Airtable via Apify integrations.

---

### ⚙️ Input

| Field | Type | Name | Description |
|---|---|---|---|
| startUrls | array | Company URLs | Specific YC company URLs (e.g. `https://www.ycombinator.com/companies/airbnb`). When provided, all other filters are ignored. |
| maxItems | integer | Max Companies | Free users: limited to 10 items (preview). Paid users: optional, max 1,000,000. |
| query | string | Search Query | Full-text search across name, description, keywords. |
| batches | array | Batches | Batch codes (W25, S25, X25, F25) or full names. |
| industries | array | Industries | Top-level industry tags. |
| subindustries | array | Subindustries | Drill-down sub-industry tags. |
| regions | array | Regions | Geographic region filters. |
| companyStatus | enum | Company Status | Active, Public, Acquired, Inactive. |
| isHiring | boolean | Hiring Only | Only companies with open job listings. |
| nonprofit | boolean | Nonprofits Only | Only nonprofit YC companies. |
| topCompaniesOnly | boolean | Top Companies Only | Only YC's curated Top Companies list. |

Example 1. Hiring fintech startups from W25, USA only.

```json
{
  "batches": ["W25"],
  "industries": ["Fintech"],
  "regions": ["United States of America"],
  "isHiring": true,
  "maxItems": 50
}
````

Example 2. Direct lookup of two specific YC companies.

```json
{
  "startUrls": [
    "https://www.ycombinator.com/companies/airbnb",
    "https://www.ycombinator.com/companies/stripe"
  ],
  "maxItems": 2
}
```

> ⚠️ **Good to Know:** when startUrls is set, every other filter is ignored. Use it for ad-hoc enrichment of known YC companies.

***

### 📊 Output

The dataset returns one structured record per YC company. Each record carries identifiers, batch metadata, growth stage, location, team size, founders, social handles, open job listings, and a back-reference URL. Consume the dataset as JSON, CSV, Excel, XML, or RSS via the Apify console or API.

#### 🧾 Schema

| Field | Type | Example |
|---|---|---|
| 🖼️ logoUrl | string (url) | `https://bookface-images.s3.amazonaws.com/logos/abc.png` |
| 🆔 id | string | `5234` |
| 🏢 name | string | Airbnb |
| 🏷️ slug | string | airbnb |
| 🔗 url | string (url) | `https://www.ycombinator.com/companies/airbnb` |
| 🌐 website | string (url) | `https://airbnb.com` |
| 📝 oneLiner | string | `Book accommodations around the world` |
| 🎓 batch | string | W09 |
| 🏷️ batchName | string | `Winter 2009` |
| 📊 status | string | Public |
| 📈 stage | string | Public |
| 🗓️ yearFounded | number | `2008` |
| 👥 teamSize | number | `6132` |
| 📍 location | string | `San Francisco, CA, USA` |
| 🏷️ industries | array | `["Travel"]` |
| 🌍 regions | array | `["United States of America"]` |
| 👥 founders | array | `[{"name":"Brian Chesky","title":"CEO","linkedin":"..."}]` |
| 💼 jobs | array | `[{"title":"Senior Engineer","equity":"0.01-0.05%","salary":"$200K-$300K"}]` |
| 🐦 twitter | string | `https://twitter.com/airbnb` |
| 💼 linkedin | string | `https://linkedin.com/company/airbnb` |
| 📞 contactEmail | string | `press@airbnb.com` |
| ⭐ isTopCompany | boolean | true |
| 🤝 isNonprofit | boolean | false |
| 📝 description | string | `Airbnb is an online marketplace for...` |
| 📅 scrapedAt | ISO datetime | `2026-05-08T12:00:00.000Z` |

#### 📦 Sample records

##### 1. Public top company (Airbnb)

```json
{
  "logoUrl": "https://bookface-images.s3.amazonaws.com/logos/airbnb.png",
  "id": "5234",
  "name": "Airbnb",
  "slug": "airbnb",
  "url": "https://www.ycombinator.com/companies/airbnb",
  "website": "https://airbnb.com",
  "oneLiner": "Book accommodations around the world",
  "batch": "W09",
  "batchName": "Winter 2009",
  "status": "Public",
  "stage": "Public",
  "yearFounded": 2008,
  "teamSize": 6132,
  "location": "San Francisco, CA, USA",
  "industries": ["Consumer", "Travel"],
  "regions": ["United States of America"],
  "founders": [
    {"name": "Brian Chesky", "title": "CEO", "linkedin": "https://linkedin.com/in/brianchesky"},
    {"name": "Joe Gebbia", "title": "Co-founder"},
    {"name": "Nathan Blecharczyk", "title": "Co-founder"}
  ],
  "twitter": "https://twitter.com/airbnb",
  "linkedin": "https://linkedin.com/company/airbnb",
  "isTopCompany": true,
  "isNonprofit": false,
  "scrapedAt": "2026-05-08T12:00:00.000Z"
}
```

##### 2. Hiring early-stage company (W25 batch)

```json
{
  "logoUrl": "https://bookface-images.s3.amazonaws.com/logos/acme.png",
  "id": "32145",
  "name": "Acme AI",
  "slug": "acme-ai",
  "website": "https://acme-ai.com",
  "oneLiner": "AI agents for B2B back-office workflows",
  "batch": "W25",
  "batchName": "Winter 2025",
  "status": "Active",
  "stage": "Seed",
  "yearFounded": 2024,
  "teamSize": 5,
  "location": "San Francisco, CA, USA",
  "industries": ["B2B"],
  "regions": ["United States of America"],
  "founders": [
    {"name": "Jane Smith", "title": "CEO"},
    {"name": "John Doe", "title": "CTO"}
  ],
  "jobs": [
    {"title": "Founding Engineer", "equity": "0.5-2.0%", "salary": "$150K-$200K", "location": "SF (in-person)"},
    {"title": "Founding Designer", "equity": "0.3-1.0%", "salary": "$130K-$180K", "location": "SF (in-person)"}
  ],
  "scrapedAt": "2026-05-08T12:00:00.000Z"
}
```

##### 3. Acquired company (sparse fields)

```json
{
  "id": "1234",
  "name": "Old Startup",
  "slug": "old-startup",
  "batch": "S15",
  "batchName": "Summer 2015",
  "status": "Acquired",
  "stage": "Acquired",
  "yearFounded": 2014,
  "isTopCompany": false,
  "scrapedAt": "2026-05-08T12:00:00.000Z"
}
```

***

### ✨ Why choose this Actor

| | Capability |
|---|---|
| 🎯 | **Built for the job.** Scoped specifically to the Y Combinator directory so you skip the parser engineering entirely. |
| 🔖 | **Structured output.** Clean, typed fields ready for analysis, dashboards, or downstream pipelines. |
| ⚡ | **Fast.** Optimized request patterns return results in seconds, not minutes. |
| 🔁 | **Always fresh.** Every run pulls live data, so the dataset reflects YC as of run time. |
| 🌐 | **No infra to manage.** Apify handles proxies, retries, scaling, scheduling, and storage. |
| 🛡️ | **Reliable.** Battle-tested across many runs and edge cases, with graceful error handling. |
| 🚫 | **No code required.** Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK. |

> 📊 Production-grade structured startup data without the engineering overhead of building and maintaining your own scraper.

***

### 📈 How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| **⭐ Y Combinator Companies Scraper** *(this Actor)* | $5 free credit, then pay-per-use | Full YC directory (5,700+) | **Live per run** | Batch, industry, region, stage, hiring | ⚡ 2 min |
| Build your own scraper | Engineering hours | Full once built | Whenever you maintain it | Custom code | 🐢 Days to weeks |
| Paid VC databases | $$$ monthly per seat | Vendor-defined | Periodic | Vendor-defined | ⏳ Hours |
| Manual sourcing | Hours per company | Limited | Stale | Manual filter clicking | 🕒 Variable |

Pick this Actor when you want broad coverage, source-native filtering, and no pipeline maintenance.

***

### 🚀 How to use

1. 📝 **Sign up.** [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp) (takes 2 minutes).
2. 🌐 **Open the Actor.** Go to the Y Combinator Companies Scraper page on the Apify Store.
3. 🎯 **Set filters.** Pick batch, industry, region, and other filters, then set maxItems.
4. 🚀 **Run it.** Click **Start** and let the Actor collect your data.
5. 📥 **Download.** Grab your results in the **Dataset** tab as CSV, Excel, JSON, or XML.

> ⏱️ Total time from signup to downloaded dataset: **3-5 minutes.** No coding required.

***

### 💼 Business use cases

<table>
<tr>
<td width="50%" valign="top">

#### 📊 VC and investor research

- Track new YC batches as they launch
- Build watchlists by industry and stage
- Map founder profiles across YC cohorts
- Surface hiring signals as growth indicators

</td>
<td width="50%" valign="top">

#### 🏢 Sales and BD

- Build outbound prospect lists of YC startups
- Filter by hiring status to find well-funded buyers
- Source product partner candidates by industry
- Power CRM enrichment with batch and stage data

</td>
</tr>
<tr>
<td width="50%" valign="top">

#### 🎯 Recruiting

- Source candidates from YC company hiring pages
- Build talent pipelines by industry vertical
- Map technical leadership across YC alumni
- Track which YC companies are hiring in your stack

</td>
<td width="50%" valign="top">

#### 🛠️ Engineering and product

- Prototype startup-data products without owning a crawler
- Replace fragile in-house YC scrapers
- Wire datasets into your apps via the Apify API or webhooks
- Skip the proxy, retry, and parsing maintenance entirely

</td>
</tr>
</table>

***

### 🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

<table>
<tr>
<td width="50%">

#### 🎓 Research and academia

- Empirical datasets for papers, thesis work, and coursework
- Longitudinal studies tracking changes across snapshots
- Reproducible research with cited, versioned data pulls
- Classroom exercises on data analysis and ethical scraping

</td>
<td width="50%">

#### 🎨 Personal and creative

- Side projects, portfolio demos, and indie app launches
- Data visualizations, dashboards, and infographics
- Content research for bloggers, YouTubers, and podcasters
- Hobbyist collections and personal trackers

</td>
</tr>
<tr>
<td width="50%">

#### 🤝 Non-profit and civic

- Transparency reporting and accountability projects
- Advocacy campaigns backed by public-interest data
- Community-run databases for local issues
- Investigative journalism on public records

</td>
<td width="50%">

#### 🧪 Experimentation

- Prototype AI and machine-learning pipelines with real data
- Validate product-market hypotheses before engineering spend
- Train small domain-specific models on niche corpora
- Test dashboard concepts with live input

</td>
</tr>
</table>

***

### 🔌 Automating Y Combinator Companies Scraper

This Actor exposes a REST endpoint, so you can drive it from any language or workflow tool.

- **Node.js** - call it via the [Apify JS SDK](https://docs.apify.com/sdk/js).
- **Python** - call it via the [Apify Python SDK](https://docs.apify.com/sdk/python).
- **REST** - hit it directly through the [Apify v2 API](https://docs.apify.com/api/v2).

**Schedules.** Use Apify Scheduler to run hourly, daily, or weekly snapshots. Combine with the Apify dataset diff tools to track new YC companies between runs.

***

### ❓ Frequently Asked Questions

<details>
<summary><b>💳 Do I need a paid Apify plan to run this actor?</b></summary>

No. You can start right now on the free Apify plan, which includes **$5 in monthly credit**. That is enough to run the scraper several times and explore the output. Paid plans unlock higher item caps, more concurrent runs, and larger datasets. [Create a free Apify account here](https://console.apify.com/sign-up?fpr=vmoqkp).

</details>

<details>
<summary><b>🚨 What happens if my run fails or returns no results?</b></summary>

Failed runs are not charged. If the YC site changes, proxies get rate-limited, or your filters match nothing, re-run the actor or open our [contact form](https://tally.so/r/BzdKgA) and we will look into it. The run log in the Apify console explains why a run stopped.

</details>

<details>
<summary><b>📏 How many items can I scrape per run?</b></summary>

Free users are limited to **10 items per run** so you can preview the output and confirm the actor works for your use case. Paid users can raise maxItems up to **1,000,000** per run. [Upgrade here](https://console.apify.com/sign-up?fpr=vmoqkp) if you need full scale.

</details>

<details>
<summary><b>🕒 How fresh is the data?</b></summary>

Every run fetches live data at the moment of execution. There is no cache or delay: records reflect what the YC directory returned at run time. Schedule the actor to maintain a rolling snapshot.

</details>

<details>
<summary><b>🧑‍💻 Can I call this actor from my own code?</b></summary>

Yes. Apify exposes every actor as a REST endpoint and ships first-class SDKs for [Node.js](https://docs.apify.com/sdk/js) and [Python](https://docs.apify.com/sdk/python). You can start a run, read the dataset, and handle webhooks from your own app in a few lines.

</details>

<details>
<summary><b>📤 How do I export the data?</b></summary>

Every Apify dataset can be downloaded in one click as CSV, JSON, JSONL, Excel, HTML, XML, or RSS. You can also pull results programmatically via the [Apify API](https://docs.apify.com/api/v2) or stream into BigQuery, S3, and other destinations through built-in integrations.

</details>

<details>
<summary><b>📅 Can I schedule the actor to run automatically?</b></summary>

Yes. Use the Apify scheduler to run the actor on any cadence, from hourly to monthly. Results are saved to your dataset and can be delivered to webhooks, email, Slack, cloud storage, or automation tools such as Zapier and Make.

</details>

<details>
<summary><b>🏪 Can I use the data commercially?</b></summary>

Yes. The scraped data is yours to use in your own internal pipelines, products, and reports, subject to the terms of service of the source site. The Apify dataset itself has no extra licensing on top.

</details>

<details>
<summary><b>💼 Which plan should I pick for production use?</b></summary>

Apify's Starter and Scale plans are designed for production workloads. They give you faster instances, more concurrent runs, and higher proxy quotas. Pick the plan that matches your dataset size and refresh cadence; the in-app pricing calculator will help you size it.

</details>

<details>
<summary><b>🛠️ The data I need is not in the output. Can you add it?</b></summary>

Most likely yes. Open the [contact form](https://tally.so/r/BzdKgA) and tell us which field you need. We add fields all the time when there is a clear use case and the source page exposes the data.

</details>

<details>
<summary><b>⚖️ Is scraping Y Combinator legal?</b></summary>

This Actor only collects data from publicly accessible YC directory pages, the same content any visitor can read. Public web scraping is generally legal in most jurisdictions for non-personal data, but laws vary by country and use case. You are responsible for compliance with the source site's Terms of Service and applicable law.

***

</details>

### 🔌 Integrate with any app

Y Combinator Companies Scraper connects to any cloud service via [Apify integrations](https://apify.com/integrations):

- [**Make**](https://docs.apify.com/platform/integrations/make) - Automate multi-step workflows
- [**Zapier**](https://docs.apify.com/platform/integrations/zapier) - Connect with 5,000+ apps
- [**Slack**](https://docs.apify.com/platform/integrations/slack) - Get run notifications in your channels
- [**Airbyte**](https://docs.apify.com/platform/integrations/airbyte) - Pipe results into your warehouse
- [**GitHub**](https://docs.apify.com/platform/integrations/github) - Trigger runs from commits and releases
- [**Google Drive**](https://docs.apify.com/platform/integrations/drive) - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh data into your product backend or alert your team in Slack.

***

### 🔗 Recommended Actors

- [**🏢 Crunchbase Scraper**](https://apify.com/parseforge/crunchbase-scraper) - Startup company data with funding rounds and investors
- [**📈 PitchBook Companies Scraper**](https://apify.com/parseforge/pitchbook-companies-scraper) - Private company data with investors and funding
- [**💼 Wellfound Jobs Scraper**](https://apify.com/parseforge/wellfound-jobs-scraper) - Startup jobs from Wellfound (formerly AngelList)
- [**📋 PitchBook Investors Scraper**](https://apify.com/parseforge/pitchbook-investors-scraper) - Investor profiles with portfolios
- [**🏢 Dun & Bradstreet Company Scraper**](https://apify.com/parseforge/dnb-scraper) - 500M+ business directory with DUNS

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for more reference-data scrapers.

***

**🆘 Need Help?** [**Open our contact form**](https://tally.so/r/BzdKgA) to request a new scraper, propose a custom project, or report an issue.

***

> ⚠️ **Disclaimer.** This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Y Combinator or any of its subsidiaries. All trademarks mentioned are the property of their respective owners. The scraper accesses only publicly available pages and is intended for legitimate research, analytics, and lead-generation use. Users are responsible for compliance with the source site's Terms of Service and applicable law.

# Actor input Schema

## `startUrls` (type: `array`):

Scrape specific YC companies by URL. Example: https://www.ycombinator.com/companies/airbnb. If provided, all filter fields below are ignored.

## `maxItems` (type: `integer`):

Free users: Limited to 100. Paid users: Optional, max 1,000,000.

## `query` (type: `string`):

Search by company name, description, or keyword. Example: AI assistant

## `batches` (type: `array`):

Filter by YC batch. Use short codes (W25, S25, X25, F25) or full names (Winter 2025, Spring 2025, Fall 2025). Leave empty for all batches.

## `industries` (type: `array`):

Filter by top-level industry. Options: B2B, Consumer, Healthcare, Fintech, Industrials, Real Estate and Construction, Education, Government.

## `subindustries` (type: `array`):

Filter by specific subindustry. Examples: B2B -> Engineering, Product and Design | Fintech -> Payments | Healthcare -> Drug Discovery and Delivery.

## `regions` (type: `array`):

Filter by region. Examples: United States of America, Europe, Latin America, South Asia, Southeast Asia, Africa, India, United Kingdom.

## `companyStatus` (type: `string`):

Filter by company status. Leave empty for all statuses.

## `isHiring` (type: `boolean`):

Only return companies with open job listings.

## `nonprofit` (type: `boolean`):

Only return nonprofit organizations.

## `topCompaniesOnly` (type: `boolean`):

Only return companies flagged as top YC companies (most notable alumni like Airbnb, Stripe, Coinbase).

## `scrapeFounders` (type: `boolean`):

Include founder profiles with name, title, bio, LinkedIn, and Twitter.

## `scrapeJobs` (type: `boolean`):

Include open job listings with salary range, equity range, visa sponsorship, and required skills.

## `scrapeJobDescriptions` (type: `boolean`):

Include full job description text for each open position. Requires scrapeJobs to be enabled. Adds one extra request per job listing.

## Actor input object example

```json
{
  "maxItems": 10,
  "scrapeFounders": true,
  "scrapeJobs": true,
  "scrapeJobDescriptions": false
}
```

# Actor output Schema

## `results` (type: `string`):

Full dataset of YC company profiles with founders and job listings

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "maxItems": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/y-combinator-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "maxItems": 10 }

# Run the Actor and wait for it to finish
run = client.actor("parseforge/y-combinator-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "maxItems": 10
}' |
apify call parseforge/y-combinator-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/y-combinator-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Y Combinator Companies Scraper",
        "description": "Extract company profiles, founders, and open job listings from the Y Combinator directory. Filter by batch, industry, subindustry, region, and hiring status. Covers 5,700+ funded startups from W05 to the latest YC cohort. Includes growth stage, equity ranges, salary data, and contact emails.",
        "version": "1.0",
        "x-build-id": "cWnbTS7Ry6kHq4NEC"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~y-combinator-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-y-combinator-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~y-combinator-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-y-combinator-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~y-combinator-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-y-combinator-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "startUrls": {
                        "title": "🔗 Company URLs",
                        "type": "array",
                        "description": "Scrape specific YC companies by URL. Example: https://www.ycombinator.com/companies/airbnb. If provided, all filter fields below are ignored.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxItems": {
                        "title": "💯 Maximum Companies",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Free users: Limited to 100. Paid users: Optional, max 1,000,000."
                    },
                    "query": {
                        "title": "🔍 Search Query",
                        "type": "string",
                        "description": "Search by company name, description, or keyword. Example: AI assistant"
                    },
                    "batches": {
                        "title": "🎓 Batches",
                        "type": "array",
                        "description": "Filter by YC batch. Use short codes (W25, S25, X25, F25) or full names (Winter 2025, Spring 2025, Fall 2025). Leave empty for all batches.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "industries": {
                        "title": "🏭 Industries",
                        "type": "array",
                        "description": "Filter by top-level industry. Options: B2B, Consumer, Healthcare, Fintech, Industrials, Real Estate and Construction, Education, Government.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "subindustries": {
                        "title": "🔬 Subindustries",
                        "type": "array",
                        "description": "Filter by specific subindustry. Examples: B2B -> Engineering, Product and Design | Fintech -> Payments | Healthcare -> Drug Discovery and Delivery.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "regions": {
                        "title": "🌍 Regions",
                        "type": "array",
                        "description": "Filter by region. Examples: United States of America, Europe, Latin America, South Asia, Southeast Asia, Africa, India, United Kingdom.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "companyStatus": {
                        "title": "📊 Company Status",
                        "enum": [
                            "",
                            "Active",
                            "Public",
                            "Acquired",
                            "Inactive"
                        ],
                        "type": "string",
                        "description": "Filter by company status. Leave empty for all statuses."
                    },
                    "isHiring": {
                        "title": "💼 Hiring Only",
                        "type": "boolean",
                        "description": "Only return companies with open job listings."
                    },
                    "nonprofit": {
                        "title": "🤝 Nonprofits Only",
                        "type": "boolean",
                        "description": "Only return nonprofit organizations."
                    },
                    "topCompaniesOnly": {
                        "title": "⭐ Top Companies Only",
                        "type": "boolean",
                        "description": "Only return companies flagged as top YC companies (most notable alumni like Airbnb, Stripe, Coinbase)."
                    },
                    "scrapeFounders": {
                        "title": "👤 Include Founders",
                        "type": "boolean",
                        "description": "Include founder profiles with name, title, bio, LinkedIn, and Twitter.",
                        "default": true
                    },
                    "scrapeJobs": {
                        "title": "💼 Include Open Jobs",
                        "type": "boolean",
                        "description": "Include open job listings with salary range, equity range, visa sponsorship, and required skills.",
                        "default": true
                    },
                    "scrapeJobDescriptions": {
                        "title": "📄 Include Job Descriptions",
                        "type": "boolean",
                        "description": "Include full job description text for each open position. Requires scrapeJobs to be enabled. Adds one extra request per job listing.",
                        "default": false
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
