# Greenhouse Scraper 💰 $0.95/1K — Career Site Jobs (`blackfalcondata/greenhouse-scraper`) Actor

Scrape any Greenhouse-powered career site for structured job data — $0.95/1K, no monthly fee. Application questions, department & office metadata, and multi-board batch scraping across companies. Incremental mode flags new/changed listings between runs.

- **URL**: https://apify.com/blackfalcondata/greenhouse-scraper.md
- **Developed by:** [Black Falcon Data](https://apify.com/blackfalcondata) (community)
- **Categories:** Jobs, Lead generation, Automation
- **Stats:** 20 total users, 3 monthly users, 100.0% runs succeeded, 4 bookmarks
- **User rating**: 5.00 out of 5 stars

## Pricing

from $0.95 / 1,000 results

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### What does Greenhouse Scraper do?

Greenhouse Scraper extracts structured job data from [greenhouse.io](https://greenhouse.io), including salary data, apply URLs, company metadata, and full descriptions. It supports keyword search, location filters, and controllable result limits, so you can run the same query consistently over time. The Actor also offers detail enrichment (full descriptions and company metadata) where the source provides them.

**New to Apify?** [Sign up free](https://console.apify.com/sign-up?fpr=1h3gvi&fp_sid=ctaplain) and use the included $5 monthly platform credit to test this Actor.

### Key features

<!-- KEY_FEATURES:START -->
- **♻️ Incremental mode** — recurring runs emit only listings that changed since the last run on this ATS — schema-stable diffs ready for downstream pipeline ingestion. Saves 80–95% on monitoring.
- **🔔 Notifications** — Telegram, Slack, Discord, WhatsApp Cloud API, and generic webhook out of the box. Pair with incremental for daily new-listing alerts without pipeline glue.
- **📋 Detail enrichment** — toggle two-stage scraping: first collect listings, then enrich each with full description + detail-page-only fields. Off by default to keep runs fast; flip on when you need the deep payload.
- **🏢 ATS platform signal** — each job carries a `postedVia` field identifying the source ATS (Workday / Greenhouse / Lever / Ashby / SmartRecruiters / etc.) for downstream routing in unified hiring stacks.
- **📦 Compact mode** — schema-stable feed for downstream HRIS / aggregator ingestion. Sized for LLM context windows when piping into resume-matching pipelines.
- **✂️ Description truncation** — cap description length with `descriptionMaxLength` to control LLM prompt cost and dataset size — set 0 for full descriptions, or any char-limit to trim.
- **📌 Change classification** — each record carries a `changeType` of NEW / UPDATED / UNCHANGED / REAPPEARED / EXPIRED. Default emits NEW + UPDATED + REAPPEARED; opt into the others with `emitUnchanged` / `emitExpired`. Repost detection flags previously-expired listings that come back.
- **📤 Export anywhere** — Download the dataset as JSON, CSV, or Excel from the Apify Console, or stream live via the Apify API and integrations (Make, Zapier, Google Sheets, n8n, …).
- **🔗 URL + social-profile extraction** — every record carries `extractedUrls[]` plus a structured `socialProfiles { linkedin, twitter, instagram, facebook, youtube, tiktok, github, xing }` parsed from the description.
- **📧 Email + phone extraction** — best-effort regex extraction of contact emails and phone numbers from descriptions — emitted as `extractedEmails[]` and `extractedPhones[]` on every record.
- **🔌 MCP connectors** — export your results into Notion via Apify's MCP connectors — a clean run-summary page, no glue code. Opt-in via the App connector field; deterministic field-mapping, no AI. Built on Apify's connector framework, so more destinations open up as their catalog grows.
<!-- KEY_FEATURES:END -->

### What data can you extract from greenhouse.io?

Each result includes core listing fields (`jobId`, `greenhouseId`, `internalJobId`, `title`, `location`, `department`, `offices`, `employmentType`, and more), detail fields when enrichment is enabled (`description`, `descriptionHtml`, and `descriptionMarkdown`), apply information (`applyUrl`), and company metadata (`company`). In standard mode, all fields are always present: unavailable data points are returned as `null`, never omitted. In compact mode, only core fields are returned.

Enable detail enrichment in the input to get richer fields such as full descriptions and company metadata where the source provides them.

### Input

The only required input is `boardTokens`, the list of company boards to scrape. Optional keyword, location, and department filters and a result limit narrow the output. Additional filters and options are available in the input schema.

Key parameters:

- **`boardTokens`** — Greenhouse board tokens or URLs. Example: 'airbnb' or 'https://boards.greenhouse.io/airbnb'.
- **`query`** — Filter jobs by keyword (matched against title and description).
- **`location`** — Filter by location (substring match, e.g. 'London' or 'Remote').
- **`department`** — Filter by department name (substring match, e.g. 'Engineering').
- **`maxResults`** — Maximum total results across all boards (0 = unlimited). (default: `0`)
- **`includeDetails`** — Fetch pay transparency and application questions per job (slower — one extra request per job). (default: `false`)
- **`descriptionMaxLength`** — Truncate HTML description to N characters. 0 = no truncation. (default: `0`)
- **`compact`** — Return only core fields (for AI-agent/MCP workflows). (default: `false`)
- **`incrementalMode`** — Only return new or changed jobs since last run. (default: `false`)
- **`stateKey`** — Custom key for incremental state (default: auto-generated from board tokens).
- **`skipReposts`** — Skip jobs detected as reposts of previously seen jobs (cross-run detection via content hash). (default: `false`)
- **`telegramToken`** — Telegram bot token from @BotFather. Required for Telegram notifications.
- ...and 14 more parameters
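
Since `boardTokens` accepts either a bare token or a board URL, it can help to normalize both forms on your side before building the input, for example to deduplicate boards. The sketch below is a client-side helper, not part of the Actor: `board_token` and `build_input` are hypothetical names, and it assumes the token is the first path segment of a board URL.

```python
from urllib.parse import urlparse


def board_token(value: str) -> str:
    """Return the board token for a bare token or a board URL.

    Assumption: the token is the first path segment of the URL.
    """
    value = value.strip()
    if "://" not in value:
        return value  # already a bare token such as "airbnb"
    segments = [part for part in urlparse(value).path.split("/") if part]
    if not segments:
        raise ValueError(f"no board token in URL: {value!r}")
    return segments[0]


def build_input(boards, query=None, location=None, max_results=0):
    """Assemble an input object using the parameter names listed above."""
    run_input = {
        # A set removes duplicates; sorting keeps the order stable.
        "boardTokens": sorted({board_token(b) for b in boards}),
        "maxResults": max_results,
    }
    if query:
        run_input["query"] = query
    if location:
        run_input["location"] = location
    return run_input


print(build_input(["airbnb", "https://boards.greenhouse.io/airbnb", "stripe"], query="engineer"))
```

The Actor accepts URLs directly, so this step is optional.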

### Input examples

**Basic board scrape** — Pull every open role from one or more company boards.

→ Full payload per result — all standard fields populated where the source provides them.

```json
{
  "boardTokens": [
    "airbnb",
    "stripe",
    "discord"
  ],
  "maxResults": 50
}
````

**Keyword + location filter** — Narrow a board scrape to roles matching a keyword query and location.

→ Only listings whose title/content match the query and whose location matches.

```json
{
  "boardTokens": [
    "stripe"
  ],
  "query": "engineer",
  "location": "Remote",
  "maxResults": 50
}
```

**Incremental tracking** — Only emit jobs that changed since the previous run with this `stateKey`.

→ First run builds the baseline state. Subsequent runs emit only records that are new or whose tracked content changed. Set `emitUnchanged: true` to include unchanged records as well.

```json
{
  "boardTokens": [
    "airbnb",
    "stripe",
    "discord"
  ],
  "maxResults": 200,
  "incrementalMode": true,
  "stateKey": "greenhouse-tracker"
}
```

**Compact output for AI agents** — Return only core fields for AI-agent and MCP workflows.

→ Small payload with the most important fields — ideal for piping into LLMs without token overhead.

```json
{
  "boardTokens": [
    "airbnb",
    "stripe",
    "discord"
  ],
  "maxResults": 50,
  "compact": true
}
```

### Output

Each run produces a dataset of structured job records. Results can be downloaded as JSON, CSV, or Excel from the Dataset tab in Apify Console.
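
If you work from the JSON export and want a flat CSV with only a few columns, a short local conversion is enough. This is a minimal sketch using the Python standard library; the column selection and the `to_csv` helper are illustrative choices, and the list-valued `offices` field is joined with `"; "` as an assumption about what suits a spreadsheet.

```python
import csv
import io

# Illustrative subset of the record fields.
COLUMNS = ["jobId", "title", "company", "location", "department", "offices"]


def to_csv(records) -> str:
    """Flatten job records into CSV text with a fixed set of columns."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    for rec in records:
        row = {col: rec.get(col) for col in COLUMNS}
        # Lists do not fit in one cell, so join them; null becomes empty.
        row["offices"] = "; ".join(rec.get("offices") or [])
        writer.writerow(row)  # None values are written as empty cells
    return buffer.getvalue()


sample = [
    {
        "jobId": "gh-7649441",
        "title": "Account Executive (12 Month FTC)",
        "company": "Airbnb",
        "location": "Paris, France",
        "department": "Sales",
        "offices": ["Paris, France"],
    }
]
print(to_csv(sample))
```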

### Example job record

```json
{
  "jobId": "gh-7649441",
  "greenhouseId": 7649441,
  "internalJobId": 3369660,
  "title": "Account Executive (12 Month FTC)",
  "company": "Airbnb",
  "location": "Paris, France",
  "department": "Sales",
  "offices": [
    "Paris, France"
  ],
  "description": "Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every co...",
  "descriptionHtml": "&lt;div class=&quot;content-intro&quot;&gt;&lt;p&gt;&lt;span style=&quot;font-family: helvetica, arial, sans-serif; font-size: 12pt;&quot;&gt;Airbnb was born in 2007 when two hosts welcomed three gues...",
  "descriptionMarkdown": "Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every co...",
  "contentHash": "e438ac968b472977c34b05ae70f5a0cbd9c6a033329841bb6456f9fd67e44d14",
  "workplaceType": "Hybrid",
  "url": "https://careers.airbnb.com/positions/7649441?gh_jid=7649441",
  "applyUrl": "https://careers.airbnb.com/positions/7649441?gh_jid=7649441",
  "requisitionId": "ONE",
  "language": "en",
  "salaryMin": 61000,
  "salaryMax": 72000,
  "salaryCurrency": "EUR",
  "salaryPeriod": "yearly",
  "questions": [
    {
      "label": "First Name",
      "type": "input_text",
      "required": true
    },
    {
      "label": "Last Name",
      "type": "input_text",
      "required": true
    },
    {
      "label": "Email",
      "type": "input_text",
      "required": true
    },
    {
      "label": "Phone",
      "type": "input_text",
      "required": true
    },
    {
      "label": "Resume/CV",
      "type": "input_file",
      "required": true
    },
    {
      "label": "Cover Letter",
      "type": "input_file",
      "required": false
    },
    {
      "label": "LinkedIn Profile",
      "type": "input_text",
      "required": false
    },
    {
      "label": "Why have you chosen to apply to Airbnb?",
      "type": "textarea",
      "required": true
    },
    {
      "label": "Which city are you based?  Are you within easy community distance to Paris?",
      "type": "input_text",
      "required": true
    },
    {
      "label": "Please indicate your language proficiencies and the level of fluency for each.",
      "type": "input_text",
      "required": true
    },
    {
      "label": "Before submitting your application please review the points below:",
      "type": "multi_value_multi_select",
      "required": true
    },
    {
      "label": "Gender",
      "type": "multi_value_single_select",
      "required": true
    },
    {
      "label": "Are you legally authorized to work in the country where the job is located?\n",
      "type": "multi_value_single_select",
      "required": true
    },
    {
      "label": "Will you now or in the future require company sponsorship to retain or extend your work authorization in the country where the job is located?",
      "type": "multi_value_single_select",
      "required": true
    },
    {
      "label": "Are you currently subject to any non-compete or non-solicitation agreement that would impact your ability to work at Airbnb or prevent you from accepting a job offer from Airbnb? ",
      "type": "multi_value_single_select",
      "required": true
    },
    {
      "label": "Are you currently or have you ever worked for Airbnb in any capacity? This could include, but is not limited to, a full-time employee, intern, apprentice, or contingent worker.",
      "type": "multi_value_single_select",
      "required": true
    }
  ],
  "metadata": [
    {
      "name": "Is this job part of ACC?",
      "value": false
    },
    {
      "name": "Workplace Type",
      "value": "Hybrid"
    }
  ],
  "postedDate": "2026-02-24T09:04:33-05:00",
  "updatedAt": "2026-02-24T09:25:19-05:00",
  "scrapedAt": "2026-05-16T15:07:42.615Z",
  "portalUrl": "https://boards.greenhouse.io/airbnb",
  "source": "greenhouse.io",
  "isRepost": false
}
```
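
Because unavailable fields come back as `null` in standard mode, downstream code should treat every value as optional. The sketch below reads a trimmed copy of the record above; `salary_midpoint` and `required_questions` are hypothetical helper names.

```python
# Trimmed copy of the example record above.
record = {
    "title": "Account Executive (12 Month FTC)",
    "salaryMin": 61000,
    "salaryMax": 72000,
    "salaryCurrency": "EUR",
    "salaryPeriod": "yearly",
    "questions": [
        {"label": "First Name", "type": "input_text", "required": True},
        {"label": "Cover Letter", "type": "input_file", "required": False},
        {"label": "Why have you chosen to apply to Airbnb?", "type": "textarea", "required": True},
    ],
}


def salary_midpoint(job):
    """Return the midpoint of the salary range, or None if either bound is missing."""
    low, high = job.get("salaryMin"), job.get("salaryMax")
    if low is None or high is None:
        return None
    return (low + high) / 2


def required_questions(job):
    """Return the labels of application questions marked as required."""
    return [q["label"] for q in job.get("questions") or [] if q.get("required")]


print(salary_midpoint(record), record["salaryCurrency"], record["salaryPeriod"])
print(required_questions(record))
```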

### Incremental fields

When incremental mode is on, each record also carries:

- `changeType` — one of `NEW`, `UPDATED`, `UNCHANGED`, `REAPPEARED`, `EXPIRED`.
- `isRepost`, `repostOfId`, `repostDetectedAt` — populated when a new listing matches the tracked content of a previously expired one. Set `skipReposts: true` to drop detected reposts from the output.
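
On the consuming side, these fields are enough to route records without extra state. The following sketch groups records by `changeType` and selects the ones worth alerting on. The sample records are made up for illustration, and `group_by_change` and `alert_worthy` are hypothetical helpers rather than anything the Actor provides.

```python
from collections import defaultdict

# Change types the README lists as emitted by default.
ALERT_TYPES = {"NEW", "UPDATED", "REAPPEARED"}


def group_by_change(records):
    """Bucket records by their changeType value."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec.get("changeType") or "UNKNOWN"].append(rec)
    return dict(groups)


def alert_worthy(records, skip_reposts=False):
    """Keep NEW / UPDATED / REAPPEARED records, optionally dropping reposts."""
    kept = []
    for rec in records:
        if rec.get("changeType") not in ALERT_TYPES:
            continue
        if skip_reposts and rec.get("isRepost"):
            continue
        kept.append(rec)
    return kept


# Made-up records for illustration only.
records = [
    {"jobId": "gh-1", "changeType": "NEW", "isRepost": False},
    {"jobId": "gh-2", "changeType": "UNCHANGED", "isRepost": False},
    {"jobId": "gh-3", "changeType": "NEW", "isRepost": True},
    {"jobId": "gh-4", "changeType": "EXPIRED", "isRepost": False},
]
print({kind: len(items) for kind, items in group_by_change(records).items()})
print([rec["jobId"] for rec in alert_worthy(records, skip_reposts=True)])
```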

### How to scrape greenhouse.io

1. Go to [Greenhouse Scraper](https://apify.com/blackfalcondata/greenhouse-scraper?fpr=1h3gvi) in Apify Console.
2. Enter one or more board tokens in `boardTokens`, plus an optional search keyword and location filter.
3. Set `maxResults` to control how many results you need.
4. Enable `includeDetails` if you need full descriptions and company data.
5. Click **Start** and wait for the run to finish.
6. Export the dataset as JSON, CSV, or Excel.

### Use cases

- Extract job data from greenhouse.io for market research and competitive analysis.
- Track salary trends across regions and categories over time.
- Monitor new and changed listings on scheduled runs without processing the full dataset every time.
- Auto-apply or feed apply URLs into your ATS / hiring pipeline.
- Research company hiring patterns, employer profiles, and industry distribution.
- Feed structured data into AI agents, MCP tools, and automated pipelines using compact mode.
- Export clean, structured data to dashboards, spreadsheets, or data warehouses.

### How much does it cost to scrape greenhouse.io?

Greenhouse Scraper uses [pay-per-event](https://docs.apify.com/platform/actors/paid-actors/pay-per-event) pricing. You pay a small fee when the run starts and then for each result that is actually produced.

- **Run start:** $0.005 per run
- **Per result:** $0.00095 per job record

Example costs:

- 10 results: **$0.014**
- 25 results: **$0.029**
- 100 results: **$0.10**
- 200 results: **$0.20**
- 500 results: **$0.48**
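
The figures above follow from the two event prices. A minimal sketch of the arithmetic, using `Decimal` to avoid floating-point noise; `run_cost` is a hypothetical helper name, and the listed example costs appear to be rounded.

```python
from decimal import Decimal

RUN_START_USD = Decimal("0.005")     # charged once per run
PER_RESULT_USD = Decimal("0.00095")  # charged per job record


def run_cost(results: int) -> Decimal:
    """Cost of one run: the start fee plus a fee per produced result."""
    return RUN_START_USD + results * PER_RESULT_USD


for count in (10, 25, 100, 200, 500):
    print(f"{count:>4} results: ${run_cost(count)}")
```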

#### Example: recurring monitoring savings

These examples compare full re-scrapes with incremental runs at different churn rates. Churn is the share of listings that are new or whose tracked content changed since the previous run. Actual churn depends on your query breadth, source activity, and polling frequency — the scenarios below are examples, not predictions.

Example setup: 250 results per run, daily polling (30 runs/month). Event-pricing examples scale linearly with result count.

| Churn rate | Full re-scrape run cost | Incremental run cost | Savings vs full re-scrape | Monthly cost after baseline |
|---|---:|---:|---:|---:|
| 5% — stable niche query | $0.24 | $0.02 | $0.23 (93%) | $0.51 |
| 15% — moderate broad query | $0.24 | $0.04 | $0.20 (83%) | $1.22 |
| 30% — high-volume aggregator | $0.24 | $0.08 | $0.17 (69%) | $2.29 |

Full re-scrape monthly cost at daily polling: $7.27. First month with incremental costs $0.73 / $1.42 / $2.45 for the 5% / 15% / 30% scenarios because the first run builds baseline state at full cost before incremental savings apply.
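
To adapt the table to your own volume and churn, the same arithmetic can be scripted. This sketch assumes that an incremental run is billed for the start event plus only the changed share of results, which is how the table reads. The function names are hypothetical, and the values are printed unrounded, whereas the table rounds to cents.

```python
from decimal import Decimal

RUN_START_USD = Decimal("0.005")
PER_RESULT_USD = Decimal("0.00095")


def run_cost(results) -> Decimal:
    """Start fee plus per-result fee; results may be a fractional expected value."""
    return RUN_START_USD + Decimal(results) * PER_RESULT_USD


def monthly_costs(results_per_run: int, runs: int, churn: Decimal) -> dict:
    """Compare full re-scrapes with incremental runs at a given churn rate."""
    full = run_cost(results_per_run)
    # Expected number of emitted records per incremental run.
    incremental = run_cost(Decimal(results_per_run) * churn)
    return {
        "full_run": full,
        "incremental_run": incremental,
        "full_month": runs * full,
        "steady_month": runs * incremental,
        # The first run builds the baseline at full cost.
        "first_month": full + (runs - 1) * incremental,
    }


for churn in ("0.05", "0.15", "0.30"):
    costs = monthly_costs(250, 30, Decimal(churn))
    print(churn, {name: str(value) for name, value in costs.items()})
```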

### FAQ

#### How many results can I get from greenhouse.io?

The number of results depends on the search query and available listings on greenhouse.io. Use the `maxResults` parameter to control how many results are returned per run.

#### Does Greenhouse Scraper support recurring monitoring?

Yes. Enable incremental mode to only receive new or changed listings on subsequent runs. This is ideal for scheduled monitoring where you want to track changes over time without re-processing the full dataset.

#### Can I integrate Greenhouse Scraper with other apps?

Yes. Greenhouse Scraper works with Apify's [integrations](https://apify.com/integrations?fpr=1h3gvi) to connect with tools like Zapier, Make, Google Sheets, Slack, and more. You can also use webhooks to trigger actions when a run completes.

#### Can I use Greenhouse Scraper with the Apify API?

Yes. You can start runs, manage inputs, and retrieve results programmatically through the [Apify API](https://docs.apify.com/api/v2). Client libraries are available for JavaScript, Python, and other languages.

#### Can I use Greenhouse Scraper through an MCP Server?

Yes. Apify provides an [MCP Server](https://apify.com/apify/actors-mcp-server?fpr=1h3gvi) that lets AI assistants and agents call this Actor directly. Use compact mode, `descriptionMaxLength`, a single `descriptionFormat`, and `excludeEmptyFields` to keep payloads manageable for LLM context windows.

#### Is it legal to scrape greenhouse.io?

This Actor extracts publicly available data from greenhouse.io. Web scraping of public information is generally considered legal, but you should always review the target site's terms of service and ensure your use case complies with applicable laws and regulations, including GDPR where relevant.

#### Your feedback

If you have questions, need a feature, or found a bug, please [open an issue](https://apify.com/blackfalcondata/greenhouse-scraper/issues?fpr=1h3gvi) on the Actor's page in Apify Console. Your feedback helps us improve.

### You might also like

- [Actiris Brussels Job Scraper](https://apify.com/blackfalcondata/actiris-scraper?fpr=1h3gvi) — Scrape all active job listings from actiris.brussels — official Brussels public employment service.
- [AMS Austria Job Scraper — Austrian Public Employment Service](https://apify.com/blackfalcondata/ams-austria-job-scraper?fpr=1h3gvi) — Scrape jobs.ams.at — Austria's official AMS public employment portal, branded "alle jobs" ("all.
- [APEC.fr Scraper - French Executive Jobs](https://apify.com/blackfalcondata/apec-scraper?fpr=1h3gvi) — Scrape apec.fr - French executive job listings with salary ranges, company, location, skills,.
- [Arbeitsagentur Jobs Feed — German Federal Employment Agency](https://apify.com/blackfalcondata/arbeitsagentur-jobs-feed?fpr=1h3gvi) — Scrape arbeitsagentur.de — Germany's official public employment portal with over 1 million live job.
- [Arbetsformedlingen Job Scraper](https://apify.com/blackfalcondata/arbetsformedlingen-scraper?fpr=1h3gvi) — Scrape arbetsformedlingen.se (Platsbanken) — Sweden's official employment portal. Returns 84.
- [Bayt.com Scraper — MENA Jobs with Salary & Skills Filter](https://apify.com/blackfalcondata/bayt-scraper?fpr=1h3gvi) — Scrape bayt.com — the leading Middle East job board spanning UAE, Saudi Arabia, Qatar, Egypt.
- [Bumeran Scraper — LATAM Jobs across 7 Countries & 8 Brands](https://apify.com/blackfalcondata/bumeran-scraper?fpr=1h3gvi) — Scrape Bumeran Group's job boards across LATAM — Argentina (bumeran.com.ar + zonajobs), Chile.
- [Cadremploi Scraper — French Executive & Management Jobs](https://apify.com/blackfalcondata/cadremploi-scraper?fpr=1h3gvi) — Scrape cadremploi.fr — France's leading job board for executives and managers (cadres). Salary.

### Getting started with Apify

New to Apify? [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=1h3gvi&fp_sid=ctaplain) — no credit card required.

1. Sign up — $5 platform credit included
2. Open this Actor and configure your input
3. Click **Start** — export results as JSON, CSV, or Excel

Need more later? [See Apify pricing](https://apify.com/pricing?fpr=1h3gvi).

# Actor input Schema

## `boardTokens` (type: `array`):

Greenhouse board tokens or URLs. Example: 'airbnb' or 'https://boards.greenhouse.io/airbnb'.

## `query` (type: `string`):

Filter jobs by keyword (matched against title and description).

## `location` (type: `string`):

Filter by location (substring match, e.g. 'London' or 'Remote').

## `department` (type: `string`):

Filter by department name (substring match, e.g. 'Engineering').

## `maxResults` (type: `integer`):

Maximum total results across all boards (0 = unlimited).

## `includeDetails` (type: `boolean`):

Fetch pay transparency and application questions per job (slower — one extra request per job).

## `descriptionMaxLength` (type: `integer`):

Truncate HTML description to N characters. 0 = no truncation.

## `compact` (type: `boolean`):

Return only core fields (for AI-agent/MCP workflows).

## `incrementalMode` (type: `boolean`):

Only return new or changed jobs since last run.

## `stateKey` (type: `string`):

Custom key for incremental state (default: auto-generated from board tokens).

## `skipReposts` (type: `boolean`):

Skip jobs detected as reposts of previously seen jobs (cross-run detection via content hash).
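
If you leave `skipReposts` off and prefer to separate repeated content yourself, the `contentHash` field on each record can serve as a key. The sketch below is a client-side approximation, not the Actor's detection logic: it assumes that two records with the same hash count as repeats, and `split_reposts` is a hypothetical helper. The set of seen hashes would need to be persisted between runs by your own code.

```python
def split_reposts(records, seen_hashes):
    """Split records into first-seen and repeated content.

    seen_hashes is a set that is updated in place with new hashes.
    """
    fresh, repeats = [], []
    for rec in records:
        digest = rec.get("contentHash")
        if digest is not None and digest in seen_hashes:
            repeats.append(rec)
            continue
        fresh.append(rec)
        if digest is not None:
            seen_hashes.add(digest)
    return fresh, repeats


# Made-up records for illustration only.
seen = {"aaa"}
batch = [
    {"jobId": "gh-1", "contentHash": "aaa"},
    {"jobId": "gh-2", "contentHash": "bbb"},
    {"jobId": "gh-3", "contentHash": "bbb"},
    {"jobId": "gh-4", "contentHash": None},
]
fresh, repeats = split_reposts(batch, seen)
print([r["jobId"] for r in fresh], [r["jobId"] for r in repeats])
```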

## `telegramToken` (type: `string`):

Telegram bot token from @BotFather. Required for Telegram notifications.

## `telegramChatId` (type: `string`):

Telegram chat or channel ID where alerts are sent (e.g. "-100123456789" for a private group, or "@yourchannel").

## `discordWebhookUrl` (type: `string`):

Discord incoming webhook URL. Get one from Server Settings → Integrations → Webhooks.

## `slackWebhookUrl` (type: `string`):

Slack incoming webhook URL. Create at api.slack.com/messaging/webhooks.

## `whatsappPhoneNumberId` (type: `string`):

WhatsApp Business phone number ID from Meta Business Manager (NOT the phone number itself — the numeric ID shown next to your business number). Free service-conversation messages within 24 hours of last user-initiated contact.

## `whatsappAccessToken` (type: `string`):

Meta Cloud API access token with `whatsapp_business_messaging` scope. Get a permanent token from a system user in Meta Business Manager.

## `whatsappTo` (type: `string`):

Recipient phone in E.164 format (e.g. +919876543210). Recipient must have messaged your business number within the last 24 hours — outside that window, free-form text is rejected by Meta.

## `webhookUrl` (type: `string`):

Generic webhook URL that receives a JSON POST with the full job payload + run metadata. Universal escape hatch for n8n / Make / Zapier / your own backend.

## `webhookHeaders` (type: `object`):

Optional headers (e.g. {"Authorization": "Bearer xyz"}) sent with the webhook POST.

## `notificationLimit` (type: `integer`):

Maximum number of jobs included in each notification message (1–20). Excess jobs are still in the dataset; notifications get a summary line.

## `notifyOnlyChanges` (type: `boolean`):

When Incremental Mode is on, only send notifications for NEW / UPDATED / REAPPEARED jobs. Has no effect outside incremental mode.

## `descriptionFormat` (type: `string`):

Pick a single description representation. `all` keeps every variant; `text` / `html` / `markdown` drop the others.

## `excludeEmptyFields` (type: `boolean`):

Drop null, empty-string, and empty-array fields from each record before push. Smaller payloads for AI agents and dashboards.
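
For records that were already exported with this option off, a similar cleanup can be applied afterwards. A minimal sketch, assuming only top-level fields are checked; whether the Actor also prunes nested objects is not stated here, and `drop_empty` is a hypothetical name.

```python
def drop_empty(record: dict) -> dict:
    """Remove top-level fields whose value is None, "" or []."""
    return {
        key: value
        for key, value in record.items()
        # 0 and False are real values and are kept.
        if value is not None and value != "" and value != []
    }


print(drop_empty({"title": "Engineer", "department": None, "offices": [], "isRepost": False}))
```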

## `appConnector` (type: `string`):

Optional. Pick a connected app under Settings → API & Integrations to receive your results. Notion is supported today (a run-summary page); other MCP connectors are best-effort as Apify expands its catalog.

## `mcpIssueTeam` (type: `string`):

Only when the connected app is an issue tracker: the team (name or ID) the summary issue is created under, if that app requires one.

## Actor input object example

```json
{
  "boardTokens": [
    "airbnb",
    "stripe",
    "discord"
  ],
  "maxResults": 50,
  "includeDetails": false,
  "descriptionMaxLength": 0,
  "compact": false,
  "incrementalMode": false,
  "skipReposts": false,
  "notificationLimit": 5,
  "notifyOnlyChanges": false,
  "descriptionFormat": "all",
  "excludeEmptyFields": false
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.
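
Without a client library, the `run-sync-get-dataset-items` endpoint from the OpenAPI specification can be called directly. The sketch below uses only the Python standard library and keeps request building separate from sending, so the request can be inspected first. It assumes the response body is a JSON array of dataset items (the specification only documents a `200 OK`), and the `APIFY_TOKEN` environment variable name is an assumption.

```python
import json
import os
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_ID = "blackfalcondata~greenhouse-scraper"


def build_request(run_input: dict, token: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for a synchronous run."""
    query = urllib.parse.urlencode({"token": token})
    url = f"{API_BASE}/acts/{ACTOR_ID}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_jobs(run_input: dict, token: str, timeout: float = 300.0) -> list:
    """Send the request and decode the body.

    Assumption: the endpoint answers with a JSON array of dataset items.
    """
    request = build_request(run_input, token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    token = os.environ.get("APIFY_TOKEN")  # assumed variable name
    if token:
        jobs = fetch_jobs({"boardTokens": ["airbnb"], "maxResults": 5}, token)
        print(f"fetched {len(jobs)} job records")
    else:
        print("Set APIFY_TOKEN to send the request.")
```

Synchronous runs hold the HTTP connection open until the Actor finishes, so for large boards the asynchronous `/runs` endpoint may be the better fit.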

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "boardTokens": [
        "airbnb",
        "stripe",
        "discord"
    ],
    "maxResults": 50,
    "descriptionFormat": "all",
    "excludeEmptyFields": false
};

// Run the Actor and wait for it to finish
const run = await client.actor("blackfalcondata/greenhouse-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "boardTokens": [
        "airbnb",
        "stripe",
        "discord",
    ],
    "maxResults": 50,
    "descriptionFormat": "all",
    "excludeEmptyFields": False,
}

# Run the Actor and wait for it to finish
run = client.actor("blackfalcondata/greenhouse-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "boardTokens": [
    "airbnb",
    "stripe",
    "discord"
  ],
  "maxResults": 50,
  "descriptionFormat": "all",
  "excludeEmptyFields": false
}' |
apify call blackfalcondata/greenhouse-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=blackfalcondata/greenhouse-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Greenhouse Scraper 💰 $0.95/1K — Career Site Jobs",
        "description": "Scrape any Greenhouse-powered career site for structured job data — $0.95/1K, no monthly fee. Application questions, department & office metadata, and multi-board batch scraping across companies. Incremental mode flags new/changed listings between runs.",
        "version": "0.1",
        "x-build-id": "Z1DfeilLZwbf0MAU0"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/blackfalcondata~greenhouse-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-blackfalcondata-greenhouse-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/blackfalcondata~greenhouse-scraper/runs": {
            "post": {
                "operationId": "runs-sync-blackfalcondata-greenhouse-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/blackfalcondata~greenhouse-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-blackfalcondata-greenhouse-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "boardTokens"
                ],
                "properties": {
                    "boardTokens": {
                        "title": "Company Boards",
                        "type": "array",
                        "description": "Greenhouse board tokens or URLs. Example: 'airbnb' or 'https://boards.greenhouse.io/airbnb'.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "query": {
                        "title": "🔍 Search Term(s)",
                        "type": "string",
                        "description": "Filter jobs by keyword (matched against title and description)."
                    },
                    "location": {
                        "title": "📍 Location",
                        "type": "string",
                        "description": "Filter by location (substring match, e.g. 'London' or 'Remote')."
                    },
                    "department": {
                        "title": "Department Filter",
                        "type": "string",
                        "description": "Filter by department name (substring match, e.g. 'Engineering')."
                    },
                    "maxResults": {
                        "title": "💯 Max Results",
                        "minimum": 0,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Maximum total results across all boards (0 = unlimited).",
                        "default": 0
                    },
                    "includeDetails": {
                        "title": "📋 Include Full Details",
                        "type": "boolean",
                        "description": "Fetch pay transparency and application questions per job (slower — one extra request per job).",
                        "default": false
                    },
                    "descriptionMaxLength": {
                        "title": "✂️ Description Max Length",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Truncate HTML description to N characters. 0 = no truncation.",
                        "default": 0
                    },
                    "compact": {
                        "title": "📦 Compact Output",
                        "type": "boolean",
                        "description": "Return only core fields (for AI-agent/MCP workflows).",
                        "default": false
                    },
                    "incrementalMode": {
                        "title": "♻️ Incremental Mode",
                        "type": "boolean",
                        "description": "Only return new or changed jobs since last run.",
                        "default": false
                    },
                    "stateKey": {
                        "title": "🔑 State Key",
                        "type": "string",
                        "description": "Custom key for incremental state (default: auto-generated from board tokens)."
                    },
                    "skipReposts": {
                        "title": "🚫 Skip Reposts",
                        "type": "boolean",
                        "description": "Skip jobs detected as reposts of previously seen jobs (cross-run detection via content hash).",
                        "default": false
                    },
                    "telegramToken": {
                        "title": "🤖 Telegram Bot Token",
                        "type": "string",
                        "description": "Telegram bot token from @BotFather. Required for Telegram notifications."
                    },
                    "telegramChatId": {
                        "title": "💬 Telegram Chat ID",
                        "type": "string",
                        "description": "Telegram chat or channel ID where alerts are sent (e.g. \"-100123456789\" for a private group, or \"@yourchannel\")."
                    },
                    "discordWebhookUrl": {
                        "title": "🎮 Discord Webhook URL",
                        "type": "string",
                        "description": "Discord incoming webhook URL. Get one from Server Settings → Integrations → Webhooks."
                    },
                    "slackWebhookUrl": {
                        "title": "💼 Slack Webhook URL",
                        "type": "string",
                        "description": "Slack incoming webhook URL. Create at api.slack.com/messaging/webhooks."
                    },
                    "whatsappPhoneNumberId": {
                        "title": "📱 WhatsApp Phone Number ID",
                        "type": "string",
                        "description": "WhatsApp Business phone number ID from Meta Business Manager (NOT the phone number itself — the numeric ID shown next to your business number). Free service-conversation messages within 24 hours of last user-initiated contact."
                    },
                    "whatsappAccessToken": {
                        "title": "🔐 WhatsApp Access Token",
                        "type": "string",
                        "description": "Meta Cloud API access token with `whatsapp_business_messaging` scope. Get a permanent token from a system user in Meta Business Manager."
                    },
                    "whatsappTo": {
                        "title": "📨 WhatsApp Recipient",
                        "type": "string",
                        "description": "Recipient phone in E.164 format (e.g. +919876543210). Recipient must have messaged your business number within the last 24 hours — outside that window, free-form text is rejected by Meta."
                    },
                    "webhookUrl": {
                        "title": "🪝 Generic Webhook URL",
                        "type": "string",
                        "description": "Generic webhook URL that receives a JSON POST with the full job payload + run metadata. Universal escape hatch for n8n / Make / Zapier / your own backend."
                    },
                    "webhookHeaders": {
                        "title": "🔑 Webhook Headers",
                        "type": "object",
                        "description": "Optional headers (e.g. {\"Authorization\": \"Bearer xyz\"}) sent with the webhook POST."
                    },
                    "notificationLimit": {
                        "title": "📊 Max Jobs Per Notification",
                        "minimum": 1,
                        "maximum": 20,
                        "type": "integer",
                        "description": "Maximum number of jobs included in each notification message (1–20). Excess jobs are still in the dataset; notifications get a summary line.",
                        "default": 5
                    },
                    "notifyOnlyChanges": {
                        "title": "🔄 Notify Only New/Updated",
                        "type": "boolean",
                        "description": "When Incremental Mode is on, only send notifications for NEW / UPDATED / REAPPEARED jobs. Has no effect outside incremental mode.",
                        "default": false
                    },
                    "descriptionFormat": {
                        "title": "Description format",
                        "enum": [
                            "all",
                            "text",
                            "html",
                            "markdown"
                        ],
                        "type": "string",
                        "description": "Pick a single description representation. `all` keeps every variant; `text` / `html` / `markdown` drop the others.",
                        "default": "all"
                    },
                    "excludeEmptyFields": {
                        "title": "Exclude empty fields from output",
                        "type": "boolean",
                        "description": "Drop null, empty-string, and empty-array fields from each record before push. Smaller payloads for AI agents and dashboards.",
                        "default": false
                    },
                    "appConnector": {
                        "title": "Send results to Notion (or another connected app)",
                        "type": "string",
                        "description": "Optional. Pick a connected app under Settings → API & Integrations to receive your results. Notion is supported today (a run-summary page); other MCP connectors are best-effort as Apify expands its catalog."
                    },
                    "mcpIssueTeam": {
                        "title": "Issue tracker team",
                        "type": "string",
                        "description": "Only when the connected app is an issue tracker: the team (name or ID) the summary issue is created under, if that app requires one."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
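
The `inputSchema` above puts a few hard constraints on the request body: `boardTokens` is required, `maxResults` must be 0–10000, `notificationLimit` must be 1–20, and `descriptionFormat` must be one of four values. The sketch below checks those constraints client-side before a call to the `run-sync` endpoint. The helper names (`build_input`, `build_run_sync_url`) are hypothetical, the base URL `https://api.apify.com/v2` is an assumption not shown in this part of the definition, and rejecting an empty `boardTokens` list is stricter than the schema itself.

```python
from urllib.parse import urlencode

# Assumption: standard Apify API base URL (not shown in this part of the spec).
API_BASE = "https://api.apify.com/v2"
RUN_SYNC_PATH = "/acts/blackfalcondata~greenhouse-scraper/run-sync"
DESCRIPTION_FORMATS = ("all", "text", "html", "markdown")


def _check_int_range(name, value, low, high):
    """Reject non-integers (including bools) and out-of-range values."""
    if isinstance(value, bool) or not isinstance(value, int):
        raise ValueError(f"{name} must be an integer")
    if not low <= value <= high:
        raise ValueError(f"{name} must be between {low} and {high}")


def build_input(board_tokens, max_results=0, notification_limit=5,
                description_format="all", **other_fields):
    """Return an input dict that respects the constraints in inputSchema."""
    # The schema only marks boardTokens as required; requiring at least one
    # token is an extra assumption made here.
    if not isinstance(board_tokens, list) or not board_tokens:
        raise ValueError("boardTokens must be a non-empty list")
    if not all(isinstance(token, str) for token in board_tokens):
        raise ValueError("every board token must be a string")
    _check_int_range("maxResults", max_results, 0, 10000)
    _check_int_range("notificationLimit", notification_limit, 1, 20)
    if description_format not in DESCRIPTION_FORMATS:
        raise ValueError(f"descriptionFormat must be one of {DESCRIPTION_FORMATS}")

    payload = {
        "boardTokens": board_tokens,
        "maxResults": max_results,
        "notificationLimit": notification_limit,
        "descriptionFormat": description_format,
    }
    # Remaining schema fields (query, location, includeDetails, ...) pass through unchecked.
    payload.update(other_fields)
    return payload


def build_run_sync_url(token):
    """Build the run-sync URL with the token as a query parameter, as the spec requires."""
    return f"{API_BASE}{RUN_SYNC_PATH}?{urlencode({'token': token})}"
```

The resulting dict can be sent as the JSON body of a `POST` to the URL from `build_run_sync_url`, using whichever HTTP client the project already has. Since the token travels in the query string, take care that the full URL does not end up in logs.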

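A response shaped like `runsResponseSchema` nests everything under `data`. As a rough illustration, the following sketch pulls out the fields most callers need (`id`, `status`, `defaultDatasetId`, `usageTotalUsd`) and derives a URL for reading the run's results. The function name `summarize_run` and the sample IDs are hypothetical, and the dataset items path is assumed from the public Apify API rather than taken from this definition.

```python
# Assumption: standard Apify API base URL (not shown in this part of the spec).
API_BASE = "https://api.apify.com/v2"


def summarize_run(response):
    """Extract the commonly used fields from a runsResponseSchema-shaped dict."""
    data = response.get("data") or {}
    dataset_id = data.get("defaultDatasetId")
    return {
        "id": data.get("id"),
        "status": data.get("status"),
        "datasetId": dataset_id,
        "usageTotalUsd": data.get("usageTotalUsd"),
        # Assumed endpoint for reading the items a run stored in its default dataset.
        "itemsUrl": f"{API_BASE}/datasets/{dataset_id}/items" if dataset_id else None,
    }


# Hypothetical response, using the example values from the schema where it gives any.
sample_response = {
    "data": {
        "id": "run-123",
        "status": "READY",
        "defaultDatasetId": "dataset-456",
        "usageTotalUsd": 0.00005,
    }
}

summary = summarize_run(sample_response)
print(summary["status"], summary["itemsUrl"])
```

Missing keys come back as `None` instead of raising, because the schema marks none of the `data` properties as required.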