# Willhaben Scraper 💰 $0.89/1K — Austria’s Largest Job Portal (`blackfalcondata/willhaben-scraper`) Actor

Scrape willhaben.at - Austria's largest job portal. Structured salary fields and Austrian VAT/UID numbers for B2B outreach. Incremental mode with NEW/UPDATED/EXPIRED/REAPPEARED + repost detection.

- **URL**: https://apify.com/blackfalcondata/willhaben-scraper.md
- **Developed by:** [Black Falcon Data](https://apify.com/blackfalcondata) (community)
- **Categories:** Jobs, Lead generation, Automation
- **Stats:** 11 total users, 4 monthly users, 100.0% runs succeeded, 1 bookmarks
- **User rating**: No ratings yet

## Pricing

from $0.89 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### What does Willhaben Job Scraper do?

Willhaben Job Scraper extracts structured job data from [willhaben.at](https://willhaben.at) — including salary data, contact details (email, apply URL), company metadata, full descriptions, remote-work indicators, location data, and skill tags. It supports keyword search, location filters, and controllable result limits, so you can run the same query consistently over time. The actor also offers detail enrichment (full descriptions, company metadata, and contact information) where the source provides them.

### How to use this actor

- 👉 **Register for a free Apify account** — no credit card required.
- 🎉 Just click **[Sign up free on Apify →](https://console.apify.com/sign-up?fpr=1h3gvi&fp_sid=ctarich)** and complete a quick signup.
- 💰 A free Apify account includes $5 in monthly credits — enough to test this actor.
- ⏳ Scrape during the free trial, with no commitment or upfront payment required.

### Key features

<!-- KEY_FEATURES:START -->
- **♻️ Incremental mode** — recurring runs emit only NEW / UPDATED / REAPPEARED records — UNCHANGED and EXPIRED are opt-in. First run builds the baseline; subsequent runs emit and charge only for the diff. Pair with notifications for daily "new jobs" alerts to your hiring team. Saves 80–95% on daily monitoring.
- **🔔 Notifications** — Telegram, Slack, Discord, WhatsApp Cloud API, generic webhook — out of the box. Pair with incremental + `notifyOnlyChanges` for daily "new Willhaben jobs" pings to your hiring channel.
- **🔗 Paste-mode** — paste any willhaben.at URL straight from your browser — single-job pages, search-results URLs, or category SEO URLs. Build the search you want in the UI, copy the URL, paste it here.
- **📋 Detail enrichment** — two-stage mode: list, then enrich each job with the full description + detail-page fields (apply counts, education, etc.). One toggle, no extra orchestration.
- **📧 Email + phone extraction** — every record carries `extractedEmails[]` and `extractedPhones[]` regex-pulled from the description — direct-outreach lists with no extra processing step.
- **🔗 URL + social-profile extraction** — every record carries `extractedUrls[]` and structured `socialProfiles { linkedin, twitter, github, … }` parsed from the description — useful when employers drop their careers page or recruiter LinkedIn in-line.
- **📌 Change classification** — each record carries a `changeType` of NEW / UPDATED / UNCHANGED / REAPPEARED / EXPIRED. Default emits NEW + UPDATED + REAPPEARED; opt into the others with `emitUnchanged` / `emitExpired`. Repost detection flags previously-expired listings that come back.
- **📦 Compact mode** — AI-agent and MCP-friendly compact payloads with core fields only — pipe straight into your ATS, salary-benchmarking tool, or LLM context without parsing extras.
- **✂️ Description truncation** — cap description length with `descriptionMaxLength` to control LLM prompt cost and dataset size — set 0 for full descriptions, or any char-limit to trim.
- **📤 Export anywhere** — Download the dataset as JSON, CSV, or Excel from the Apify Console, or stream live via the Apify API and integrations (Make, Zapier, Google Sheets, n8n, …).
- **🔌 MCP connectors** — export your results into Notion via Apify's MCP connectors — a clean run-summary page, no glue code. Opt-in via the App connector field; deterministic field-mapping, no AI. Built on Apify's connector framework, so more destinations open up as their catalog grows.
<!-- KEY_FEATURES:END -->

### What data can you extract from willhaben.at?

Each result includes Core listing fields (`jobId`, `title`, `location`, `federalState`, `country`, `countryCode`, `locations`, and `salary`, and more), detail fields when enrichment is enabled (`description` and `descriptionMarkdown`), contact and apply information (`extractedEmails`, `extractedPhones`, `contactName`, and `applyUrl`), and company metadata (`company`, `companyId`, `companyLogoUrl`, and `companyType`). In standard mode, all fields are always present — unavailable data points are returned as `null`, never omitted. In compact mode, only core fields are returned.

Enable detail enrichment in the input to get richer fields such as full descriptions, company metadata, and contact information where the source provides them.

### Input

The main inputs are a search keyword, an optional location filter, and a result limit. Additional filters and options are available in the input schema.

Key parameters:

- **`query`** — Job search keywords (e.g. "developer", "Tischler"). Leave empty to browse all jobs.
- **`location`** — City or district name.
- **`region`** — Austrian federal state. Use name (e.g. "Wien") or code.
- **`operationArea`** — Filter by professional field (Berufsfeld).
- **`employmentMode`** — Filter by employment type.
- **`position`** — Filter by position level.
- **`companyType`** — Filter by company type.
- **`timeLimit`** — Filter by posting recency.
- **`startUrls`** — Paste Willhaben job-search URLs directly (e.g. https://www.willhaben.at/jobs/suche?keyword=developer&region=14486). Filters in the URL override the explicit input fields where present. Each unique listing is returned only once — duplicates across pages or overlapping search URLs are removed by listing ID.
- **`sortBy`** — Sort search results.
- **`salaryMinFilter`** — Drop jobs whose maximum salary is below this value (in EUR). Jobs without salary data are dropped when set.
- **`salaryMaxFilter`** — Drop jobs whose minimum salary exceeds this value (in EUR).
- ...and 27 more parameters

### Input examples

**Basic search** — Keyword-driven search with a result cap.

→ Full payload per result — all standard fields populated where the source provides them.

```json
{
  "query": "developer",
  "maxResults": 50
}
````

**Incremental tracking** — Only emit jobs that changed since the previous run with this `stateKey`.

→ First run builds the baseline state. Subsequent runs emit only records that are new or whose tracked content changed. Set `emitUnchanged: true` to include unchanged records as well.

```json
{
  "query": "developer",
  "maxResults": 200,
  "incrementalMode": true,
  "stateKey": "developer-tracker"
}
```

**Compact output for AI agents** — Return only core fields for AI-agent and MCP workflows.

→ Small payload with the most important fields — ideal for piping into LLMs without token overhead.

```json
{
  "query": "developer",
  "maxResults": 50,
  "compact": true
}
```

### Output

Each run produces a dataset of structured job records. Results can be downloaded as JSON, CSV, or Excel from the Dataset tab in Apify Console.

### Example job record

```json
{
  "jobId": "37f5c80b9f52544bdab675f75c5fabb94987f88d472ef4f757b2d646810eeb7d",
  "title": "Bautechniker/ Bauleiter (m/w/d) für Holzbau",
  "company": "PRONATURHAUS Obritzberger GesmbH",
  "companyId": 19122749,
  "companyLogoUrl": "https://www.willhaben.at/jobs/api/v1/images/public/481516580?resolution=480",
  "companyType": "Firma",
  "companyIndustry": "Handwerk / Gewerbe",
  "companyUrl": "http://www.pronaturhaus.at",
  "companyAddress": "Wagramer Straße 25, 3484, Grafenwörth, Österreich",
  "companyEmployeeCount": "11-50",
  "companyFoundingYear": 1975,
  "companyVatId": "ATU20275002",
  "companyActiveAdverts": 2,
  "location": "Grafenwörth",
  "federalState": "Niederösterreich",
  "country": "Österreich",
  "countryCode": "AT",
  "locations": [
    {
      "name": "Grafenwörth",
      "federalState": "Niederösterreich",
      "country": "Österreich"
    }
  ],
  "salary": 3400,
  "salaryTimeFrame": "monatlich",
  "salaryText": "€3 400 per month",
  "salaryMin": 3400,
  "salaryMax": null,
  "salaryCurrency": "EUR",
  "salaryPeriod": "month",
  "overpay": true,
  "employmentModes": [
    "Vollzeit"
  ],
  "positionLevel": "Mitarbeiter:in",
  "employmentTime": "ab sofort",
  "isFeatured": true,
  "isFreshlyPosted": false,
  "internalApplicationOnly": false,
  "requiresExternalApplication": true,
  "requiresProfessionalExperience": true,
  "remoteWorkPossible": null,
  "summary": "Bautechniker/ Bauleiter (m/w/d) für Holzbau Gerne auch Zimmerer oder Poliere die ins Büro wechseln wollen PRONATURHAUS Obritzberger GesmbH Grafenwörth Du bist Bautechniker/ Bauleiter (m/w/d) und suchs...",
  "description": "Bautechniker/ Bauleiter (m/w/d) für Holzbau \nGerne auch Zimmerer oder Poliere die ins Büro wechseln wollen \nPRONATURHAUS Obritzberger GesmbH \nGrafenwörth \nDu bist Bautechniker/ Bauleiter (m/w/d) und s...",
  "descriptionMarkdown": "Bautechniker/ Bauleiter (m/w/d) für Holzbau\nGerne auch Zimmerer oder Poliere die ins Büro wechseln wollen\nPRONATURHAUS Obritzberger GesmbH\nGrafenwörth\nDu bist Bautechniker/ Bauleiter (m/w/d) und suchs...",
  "contentHash": "8d7467c864937c20437ee73eafd5f5df359deeee615669c996f4ae999eefd0ad",
  "extractedEmails": [],
  "extractedPhones": [],
  "extractedUrls": [
    "http://www.pronaturhaus.at"
  ],
  "socialProfiles": {
    "linkedin": null,
    "twitter": null,
    "instagram": null,
    "facebook": null,
    "youtube": null,
    "tiktok": null,
    "github": null,
    "xing": null,
    "bluesky": null,
    "threads": null,
    "mastodon": null
  },
  "languageSkills": [],
  "contactName": "Josef Schedelmayer",
  "contactTitle": null,
  "contactEmail": null,
  "contactPhone": null,
  "contactSex": "MALE",
  "applyUrl": "https://www.willhaben.at/jobs/job/bautechniker-bauleiter-m-w-d-fuer-holzbau/13217338",
  "externalApplyUrl": "https://pronaturhaus.at/unternehmen/jobs/Bautechniker__Bauleiter_m_w_d_fuer_Holzbau",
  "portalUrl": "https://www.willhaben.at/jobs/job/bautechniker-bauleiter-m-w-d-fuer-holzbau/13217338",
  "searchQuery": null,
  "searchUrl": "https://www.willhaben.at/jobs/suche?rows=90",
  "createdAt": "2026-05-28T19:44:21.656354+02:00",
  "postedDate": "2026-05-28T19:44:20.416+02:00",
  "postedAt": "2026-05-28T19:44:20.416+02:00",
  "lastModifiedDate": "2026-05-29T10:06:50.665852+02:00",
  "lastReorderedAt": "2026-05-28T19:44:20.416+02:00",
  "expiryDate": "2026-07-27",
  "scrapedAt": "2026-06-03T20:08:35.262Z",
  "source": "willhaben.at",
  "changeType": null,
  "firstSeenAt": null,
  "lastSeenAt": null,
  "previousSeenAt": null,
  "expiredAt": null,
  "isRepost": false,
  "repostOfId": null,
  "repostDetectedAt": null
}
```

### Incremental fields

When incremental mode is on, each record also carries:

- `changeType` — one of `NEW`, `UPDATED`, `UNCHANGED`, `REAPPEARED`, `EXPIRED`. Default output covers `NEW` / `UPDATED` / `REAPPEARED`; set `emitUnchanged: true` or `emitExpired: true` to opt into the others.
- `firstSeenAt`, `lastSeenAt` — ISO-8601 timestamps tracking the listing across runs.
- `isRepost`, `repostOfId`, `repostDetectedAt` — populated when a new listing matches the tracked content of a previously expired one. Set `skipReposts: true` to drop detected reposts from the output.

### How to scrape willhaben.at

1. Go to [Willhaben Job Scraper](https://apify.com/blackfalcondata/willhaben-scraper?fpr=1h3gvi) in Apify Console.
2. Enter a search keyword and optional location filter.
3. Set `maxResults` to control how many results you need.
4. Enable `includeDetails` if you need full descriptions, contact info, company data.
5. Click **Start** and wait for the run to finish.
6. Export the dataset as JSON, CSV, or Excel.

### Use cases

- Extract job data from willhaben.at for market research and competitive analysis.
- Track salary trends across regions and categories over time.
- Monitor new and changed listings on scheduled runs without processing the full dataset every time.
- Build outreach lists using contact details and apply URLs from listings.
- Research company hiring patterns, employer profiles, and industry distribution.
- Use structured location data for regional analysis, mapping, and geo-targeting.
- Feed structured data into AI agents, MCP tools, and automated pipelines using compact mode.
- Export clean, structured data to dashboards, spreadsheets, or data warehouses.
- Analyze skill demand across listings using structured skill tags.

### How much does it cost to scrape willhaben.at?

Willhaben Job Scraper uses [pay-per-event](https://docs.apify.com/platform/actors/paid-actors/pay-per-event) pricing. You pay a small fee when the run starts and then for each result that is actually produced.

- **Run start:** $0.01 per run
- **Per result:** $0.00089 per job record

Example costs:

- 10 results: **$0.019**
- 25 results: **$0.032**
- 100 results: **$0.099**
- 200 results: **$0.19**
- 500 results: **$0.45**

#### Example: recurring monitoring savings

These examples compare full re-scrapes with incremental runs at different churn rates. Churn is the share of listings that are new or whose tracked content changed since the previous run. Actual churn depends on your query breadth, source activity, and polling frequency — the scenarios below are examples, not predictions.

Example setup: 100 results per run, daily polling (30 runs/month). Event-pricing examples scale linearly with result count.

| Churn rate | Full re-scrape run cost | Incremental run cost | Savings vs full re-scrape | Monthly cost after baseline |
|---|---:|---:|---:|---:|
| 5% — stable niche query | $0.10 | $0.01 | $0.08 (85%) | $0.43 |
| 15% — moderate broad query | $0.10 | $0.02 | $0.08 (76%) | $0.70 |
| 30% — high-volume aggregator | $0.10 | $0.04 | $0.06 (63%) | $1.10 |

Full re-scrape monthly cost at daily polling: $2.97. First month with incremental costs $0.52 / $0.78 / $1.16 for the 5% / 15% / 30% scenarios because the first run builds baseline state at full cost before incremental savings apply.

### FAQ

#### How many results can I get from willhaben.at?

The number of results depends on the search query and available listings on willhaben.at. Use the `maxResults` parameter to control how many results are returned per run.

#### Does Willhaben Job Scraper support recurring monitoring?

Yes. Enable incremental mode to only receive new or changed listings on subsequent runs. This is ideal for scheduled monitoring where you want to track changes over time without re-processing the full dataset.

#### Can I integrate Willhaben Job Scraper with other apps?

Yes. Willhaben Job Scraper works with Apify's [integrations](https://apify.com/integrations?fpr=1h3gvi) to connect with tools like Zapier, Make, Google Sheets, Slack, and more. You can also use webhooks to trigger actions when a run completes.

#### Can I use Willhaben Job Scraper with the Apify API?

Yes. You can start runs, manage inputs, and retrieve results programmatically through the [Apify API](https://docs.apify.com/api/v2). Client libraries are available for JavaScript, Python, and other languages.

#### Can I use Willhaben Job Scraper through an MCP Server?

Yes. Apify provides an [MCP Server](https://apify.com/apify/actors-mcp-server?fpr=1h3gvi) that lets AI assistants and agents call this actor directly. Use compact mode, `descriptionMaxLength`, a single `descriptionFormat`, and `excludeEmptyFields` to keep payloads manageable for LLM context windows.

#### Is it legal to scrape willhaben.at?

This actor extracts publicly available data from willhaben.at. Web scraping of public information is generally considered legal, but you should always review the target site's terms of service and ensure your use case complies with applicable laws and regulations, including GDPR where relevant.

#### Your feedback

If you have questions, need a feature, or found a bug, please [open an issue](https://apify.com/blackfalcondata/willhaben-scraper/issues?fpr=1h3gvi) on the actor's page in Apify Console. Your feedback helps us improve.

### You might also like

- [Actiris Brussels Job Scraper](https://apify.com/blackfalcondata/actiris-scraper?fpr=1h3gvi) — Scrape all active job listings from actiris.brussels — official Brussels public employment service..
- [AMS Austria Job Scraper — Austrian Public Employment Service](https://apify.com/blackfalcondata/ams-austria-job-scraper?fpr=1h3gvi) — Scrape jobs.ams.at — Austria's official AMS public employment portal, branded "alle jobs" ("all.
- [APEC.fr Scraper - French Executive Jobs](https://apify.com/blackfalcondata/apec-scraper?fpr=1h3gvi) — Scrape apec.fr - French executive job listings with salary ranges, company, location, skills,.
- [Arbeitsagentur Jobs Feed — German Federal Employment Agency](https://apify.com/blackfalcondata/arbeitsagentur-jobs-feed?fpr=1h3gvi) — Scrape arbeitsagentur.de — Germany's official public employment portal with over 1 million live job.
- [Arbetsformedlingen Job Scraper](https://apify.com/blackfalcondata/arbetsformedlingen-scraper?fpr=1h3gvi) — Scrape arbetsformedlingen.se (Platsbanken) — Sweden's official employment portal. Returns 84.
- [Bayt.com Scraper — MENA Jobs with Salary & Skills Filter](https://apify.com/blackfalcondata/bayt-scraper?fpr=1h3gvi) — Scrape bayt.com — the leading Middle East job board spanning UAE, Saudi Arabia, Qatar, Egypt.
- [Bumeran Scraper — LATAM Jobs across 7 Countries & 8 Brands](https://apify.com/blackfalcondata/bumeran-scraper?fpr=1h3gvi) — Scrape Bumeran Group's job boards across LATAM — Argentina (bumeran.com.ar + zonajobs), Chile.
- [Cadremploi Scraper — French Executive & Management Jobs](https://apify.com/blackfalcondata/cadremploi-scraper?fpr=1h3gvi) — Scrape cadremploi.fr — France's leading job board for executives and managers (cadres). Salary.

### Getting started with Apify

New to Apify? [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=1h3gvi\&fp_sid=ctarich) — no credit card required.

1. Sign up — $5 platform credit included
2. Open this actor and configure your input
3. Click **Start** — export results as JSON, CSV, or Excel

Need more later? [See Apify pricing](https://apify.com/pricing?fpr=1h3gvi).

# Actor input Schema

## `query` (type: `string`):

Job search keywords (e.g. "developer", "Tischler"). Leave empty to browse all jobs.

## `location` (type: `string`):

City or district name.

## `region` (type: `string`):

Austrian federal state. Use name (e.g. "Wien") or code.

## `operationArea` (type: `string`):

Filter by professional field (Berufsfeld).

## `employmentMode` (type: `string`):

Filter by employment type.

## `position` (type: `string`):

Filter by position level.

## `companyType` (type: `string`):

Filter by company type.

## `timeLimit` (type: `string`):

Filter by posting recency.

## `startUrls` (type: `array`):

Paste Willhaben job-search URLs directly (e.g. https://www.willhaben.at/jobs/suche?keyword=developer\&region=14486). Filters in the URL override the explicit input fields where present. Each unique listing is returned only once — duplicates across pages or overlapping search URLs are removed by listing ID.

## `sortBy` (type: `string`):

Sort search results.

## `salaryMinFilter` (type: `integer`):

Drop jobs whose maximum salary is below this value (in EUR). Jobs without salary data are dropped when set.

## `salaryMaxFilter` (type: `integer`):

Drop jobs whose minimum salary exceeds this value (in EUR).

## `whatAnd` (type: `string`):

Every word listed here must appear in the title or description (AND logic). E.g. "python remote" requires both words.

## `whatExclude` (type: `string`):

Jobs containing any of these words in title or description are excluded (NOT logic). E.g. "senior lead" filters out senior roles.

## `maxResults` (type: `integer`):

Maximum total results across all start URLs combined (not per URL). With multiple start URLs this cap is divided across them — e.g. with 13 URLs and maxResults=25 you receive ~2 jobs per URL. Set to 0 (unlimited) or a high number (e.g. 500) to receive every available job from every URL.

## `includeDetails` (type: `boolean`):

Fetch full job details (contact info, company VAT/UID number, language skills, remote work, expiry date).

## `descriptionMaxLength` (type: `integer`):

Truncate description to N chars. 0 = no truncation.

## `compact` (type: `boolean`):

Core fields only — for AI-agent/MCP workflows. Notifications always receive full records, so this only affects the dataset rows.

## `incrementalMode` (type: `boolean`):

Compare against previous run state — only emit NEW, UPDATED, EXPIRED, or REAPPEARED jobs. stateKey is optional — leave empty and a stable identifier is derived from your search inputs so different searches never share state.

## `stateKey` (type: `string`):

Optional. Stable identifier for the tracked search universe (e.g. "at-software-vienna"). Leave empty to auto-generate from search inputs — this prevents narrower runs from marking broader-run jobs as EXPIRED. Set explicitly only if you want backward compatibility with state under a specific legacy key.

## `emitUnchanged` (type: `boolean`):

Also emit jobs that have not changed since the last run (incrementalMode only).

## `emitExpired` (type: `boolean`):

Also emit jobs that have disappeared since the last run (incrementalMode only).

## `skipReposts` (type: `boolean`):

When incremental, skip jobs whose content matches an expired job from a prior run (cross-run repost detection).

## `telegramToken` (type: `string`):

Telegram bot token (from @BotFather). Required for Telegram notifications.

## `telegramChatId` (type: `string`):

Telegram chat or channel ID (e.g. "-100123456789"). Required when telegramToken is set.

## `discordWebhookUrl` (type: `string`):

Discord incoming webhook URL. Server Settings → Integrations → Webhooks → New Webhook.

## `slackWebhookUrl` (type: `string`):

Slack incoming webhook URL. api.slack.com/messaging/webhooks.

## `notificationLimit` (type: `integer`):

Maximum number of jobs included in each notification message (1–20).

## `notifyOnlyChanges` (type: `boolean`):

When Incremental Mode is on, only send notifications for NEW and UPDATED jobs. Has no effect outside incremental mode.

## `whatsappAccessToken` (type: `string`):

WhatsApp Cloud API permanent access token (System User token from Meta Business). Recipient must have messaged the business number within the last 24h (service-conversation window — free since Nov 2024). For first-contact alerts use approved templates outside this actor.

## `whatsappPhoneNumberId` (type: `string`):

Your WhatsApp Business phone-number ID (numeric, from Meta dashboard). Required when whatsappAccessToken is set.

## `whatsappTo` (type: `string`):

Recipient phone in E.164 format without + (e.g. "436641234567" for an Austrian mobile). Recipient must have messaged your business number within the last 24 hours.

## `phoneExtractionMode` (type: `string`):

How aggressively to detect phone numbers in job descriptions. Strict: only matches with Tel:/Telefon:/Mobil: prefix or +CC international. Lenient: also matches bare 0-prefixed local numbers (more false positives possible).

## `webhookUrl` (type: `string`):

Receives a JSON POST with full {metadata, items} payload after each run. Universal escape hatch for n8n / Make / Zapier / custom backends.

## `webhookHeaders` (type: `object`):

Optional JSON object of custom headers (e.g. {"Authorization":"Bearer ..."}). Sent with the webhook POST in addition to Content-Type: application/json.

## `descriptionFormat` (type: `string`):

Pick a single description representation. `all` keeps every variant; `text` / `html` / `markdown` drop the others.

## `excludeEmptyFields` (type: `boolean`):

Drop null, empty-string, and empty-array fields from each record before push. Smaller payloads for AI agents and dashboards.

## `appConnector` (type: `string`):

Optional. Pick a connected app under Settings → API & Integrations to receive your results. Notion is supported today (a run-summary page); other MCP connectors are best-effort as Apify expands its catalog.

## `mcpIssueTeam` (type: `string`):

Only when the connected app is an issue tracker: the team (name or ID) the summary issue is created under, if that app requires one.

## Actor input object example

```json
{
  "query": "developer",
  "location": "Wien",
  "salaryMinFilter": 50000,
  "salaryMaxFilter": 150000,
  "whatAnd": "kotlin",
  "whatExclude": "intern, praktikum",
  "maxResults": 25,
  "includeDetails": true,
  "descriptionMaxLength": 0,
  "compact": false,
  "incrementalMode": false,
  "stateKey": "at-software-vienna",
  "emitUnchanged": false,
  "emitExpired": false,
  "skipReposts": false,
  "notificationLimit": 5,
  "notifyOnlyChanges": false,
  "phoneExtractionMode": "strict",
  "descriptionFormat": "all",
  "excludeEmptyFields": false
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "query": "developer",
    "maxResults": 25,
    "includeDetails": false,
    "descriptionFormat": "all",
    "excludeEmptyFields": false
};

// Run the Actor and wait for it to finish
const run = await client.actor("blackfalcondata/willhaben-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "query": "developer",
    "maxResults": 25,
    "includeDetails": False,
    "descriptionFormat": "all",
    "excludeEmptyFields": False,
}

# Run the Actor and wait for it to finish
run = client.actor("blackfalcondata/willhaben-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "query": "developer",
  "maxResults": 25,
  "includeDetails": false,
  "descriptionFormat": "all",
  "excludeEmptyFields": false
}' |
apify call blackfalcondata/willhaben-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=blackfalcondata/willhaben-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Willhaben Scraper 💰 $0.89/1K — Austria’s Largest Job Portal",
        "description": "Scrape willhaben.at - Austria's largest job portal. Structured salary fields and Austrian VAT/UID numbers for B2B outreach. Incremental mode with NEW/UPDATED/EXPIRED/REAPPEARED + repost detection.",
        "version": "0.3",
        "x-build-id": "ZToCNPl4YbO2YD4rZ"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/blackfalcondata~willhaben-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-blackfalcondata-willhaben-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/blackfalcondata~willhaben-scraper/runs": {
            "post": {
                "operationId": "runs-sync-blackfalcondata-willhaben-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/blackfalcondata~willhaben-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-blackfalcondata-willhaben-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "query": {
                        "title": "🔍 Search Term(s)",
                        "type": "string",
                        "description": "Job search keywords (e.g. \"developer\", \"Tischler\"). Leave empty to browse all jobs."
                    },
                    "location": {
                        "title": "📍 Location",
                        "type": "string",
                        "description": "City or district name."
                    },
                    "region": {
                        "title": "🌍 Region (Bundesland)",
                        "enum": [
                            "",
                            "Wien",
                            "Niederösterreich",
                            "Oberösterreich",
                            "Steiermark",
                            "Tirol",
                            "Salzburg",
                            "Kärnten",
                            "Vorarlberg",
                            "Burgenland"
                        ],
                        "type": "string",
                        "description": "Austrian federal state. Use name (e.g. \"Wien\") or code."
                    },
                    "operationArea": {
                        "title": "📍 Field / Sector",
                        "enum": [
                            "",
                            "12666",
                            "12681",
                            "12746",
                            "12769",
                            "17007",
                            "12887",
                            "12984",
                            "13094",
                            "13186",
                            "13260",
                            "13278",
                            "13319",
                            "13325",
                            "13351",
                            "13368",
                            "13400",
                            "13496"
                        ],
                        "type": "string",
                        "description": "Filter by professional field (Berufsfeld)."
                    },
                    "employmentMode": {
                        "title": "💼 Employment Type",
                        "enum": [
                            "",
                            "110",
                            "113",
                            "109",
                            "11796"
                        ],
                        "type": "string",
                        "description": "Filter by employment type."
                    },
                    "position": {
                        "title": "📌 Position Level",
                        "enum": [
                            "",
                            "13540",
                            "13541",
                            "13539",
                            "13542",
                            "13543",
                            "13544",
                            "28428"
                        ],
                        "type": "string",
                        "description": "Filter by position level."
                    },
                    "companyType": {
                        "title": "🏢 Company Type",
                        "enum": [
                            "",
                            "agency",
                            "direct_employer"
                        ],
                        "type": "string",
                        "description": "Filter by company type."
                    },
                    "timeLimit": {
                        "title": "⏰ Posted Within",
                        "enum": [
                            "",
                            "last_24_hours",
                            "last_72_hours",
                            "last_week"
                        ],
                        "type": "string",
                        "description": "Filter by posting recency."
                    },
                    "startUrls": {
                        "title": "🔗 Start URLs",
                        "type": "array",
                        "description": "Paste Willhaben job-search URLs directly (e.g. https://www.willhaben.at/jobs/suche?keyword=developer&region=14486). Filters in the URL override the explicit input fields where present. Each unique listing is returned only once — duplicates across pages or overlapping search URLs are removed by listing ID.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "sortBy": {
                        "title": "🔀 Sort By",
                        "enum": [
                            "",
                            "publish_date_desc",
                            "relevance"
                        ],
                        "type": "string",
                        "description": "Sort search results."
                    },
                    "salaryMinFilter": {
                        "title": "💰 Minimum Salary",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Drop jobs whose maximum salary is below this value (in EUR). Jobs without salary data are dropped when set."
                    },
                    "salaryMaxFilter": {
                        "title": "💰 Maximum Salary",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Drop jobs whose minimum salary exceeds this value (in EUR)."
                    },
                    "whatAnd": {
                        "title": "🔍 All These Words",
                        "type": "string",
                        "description": "Every word listed here must appear in the title or description (AND logic). E.g. \"python remote\" requires both words."
                    },
                    "whatExclude": {
                        "title": "🚫 Exclude These Words",
                        "type": "string",
                        "description": "Jobs containing any of these words in title or description are excluded (NOT logic). E.g. \"senior lead\" filters out senior roles."
                    },
                    "maxResults": {
                        "title": "💯 Max Results",
                        "minimum": 0,
                        "maximum": 5000,
                        "type": "integer",
                        "description": "Maximum total results across all start URLs combined (not per URL). With multiple start URLs this cap is divided across them — e.g. with 13 URLs and maxResults=25 you receive ~2 jobs per URL. Set to 0 (unlimited) or a high number (e.g. 500) to receive every available job from every URL.",
                        "default": 25
                    },
                    "includeDetails": {
                        "title": "📋 Include Full Details",
                        "type": "boolean",
                        "description": "Fetch full job details (contact info, company VAT/UID number, language skills, remote work, expiry date).",
                        "default": true
                    },
                    "descriptionMaxLength": {
                        "title": "✂️ Description Max Length",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Truncate description to N chars. 0 = no truncation.",
                        "default": 0
                    },
                    "compact": {
                        "title": "📦 Compact Output",
                        "type": "boolean",
                        "description": "Core fields only — for AI-agent/MCP workflows. Notifications always receive full records, so this only affects the dataset rows.",
                        "default": false
                    },
                    "incrementalMode": {
                        "title": "♻️ Incremental Mode",
                        "type": "boolean",
                        "description": "Compare against previous run state — only emit NEW, UPDATED, EXPIRED, or REAPPEARED jobs. stateKey is optional — leave empty and a stable identifier is derived from your search inputs so different searches never share state.",
                        "default": false
                    },
                    "stateKey": {
                        "title": "🔑 State Key",
                        "type": "string",
                        "description": "Optional. Stable identifier for the tracked search universe (e.g. \"at-software-vienna\"). Leave empty to auto-generate from search inputs — this prevents narrower runs from marking broader-run jobs as EXPIRED. Set explicitly only if you want backward compatibility with state under a specific legacy key."
                    },
                    "emitUnchanged": {
                        "title": "♻️ Emit Unchanged",
                        "type": "boolean",
                        "description": "Also emit jobs that have not changed since the last run (incrementalMode only).",
                        "default": false
                    },
                    "emitExpired": {
                        "title": "⚰️ Emit Expired",
                        "type": "boolean",
                        "description": "Also emit jobs that have disappeared since the last run (incrementalMode only).",
                        "default": false
                    },
                    "skipReposts": {
                        "title": "🚫 Skip Reposts",
                        "type": "boolean",
                        "description": "When incremental, skip jobs whose content matches an expired job from a prior run (cross-run repost detection).",
                        "default": false
                    },
                    "telegramToken": {
                        "title": "🔑 Telegram Bot Token",
                        "type": "string",
                        "description": "Telegram bot token (from @BotFather). Required for Telegram notifications."
                    },
                    "telegramChatId": {
                        "title": "💬 Telegram Chat ID",
                        "type": "string",
                        "description": "Telegram chat or channel ID (e.g. \"-100123456789\"). Required when telegramToken is set."
                    },
                    "discordWebhookUrl": {
                        "title": "🎮 Discord Webhook URL",
                        "type": "string",
                        "description": "Discord incoming webhook URL. Server Settings → Integrations → Webhooks → New Webhook."
                    },
                    "slackWebhookUrl": {
                        "title": "💼 Slack Webhook URL",
                        "type": "string",
                        "description": "Slack incoming webhook URL. api.slack.com/messaging/webhooks."
                    },
                    "notificationLimit": {
                        "title": "📊 Max Jobs Per Notification",
                        "minimum": 1,
                        "maximum": 20,
                        "type": "integer",
                        "description": "Maximum number of jobs included in each notification message (1–20).",
                        "default": 5
                    },
                    "notifyOnlyChanges": {
                        "title": "🔄 Notify Only New/Updated",
                        "type": "boolean",
                        "description": "When Incremental Mode is on, only send notifications for NEW and UPDATED jobs. Has no effect outside incremental mode.",
                        "default": false
                    },
                    "whatsappAccessToken": {
                        "title": "📱 WhatsApp Access Token",
                        "type": "string",
                        "description": "WhatsApp Cloud API permanent access token (System User token from Meta Business). Recipient must have messaged the business number within the last 24h (service-conversation window — free since Nov 2024). For first-contact alerts use approved templates outside this actor."
                    },
                    "whatsappPhoneNumberId": {
                        "title": "📞 WhatsApp Phone Number ID",
                        "type": "string",
                        "description": "Your WhatsApp Business phone-number ID (numeric, from Meta dashboard). Required when whatsappAccessToken is set."
                    },
                    "whatsappTo": {
                        "title": "📲 WhatsApp Recipient",
                        "type": "string",
                        "description": "Recipient phone in E.164 format without + (e.g. \"436641234567\" for an Austrian mobile). Recipient must have messaged your business number within the last 24 hours."
                    },
                    "phoneExtractionMode": {
                        "title": "📞 Phone Extraction Mode",
                        "enum": [
                            "strict",
                            "lenient"
                        ],
                        "type": "string",
                        "description": "How aggressively to detect phone numbers in job descriptions. Strict: only matches with Tel:/Telefon:/Mobil: prefix or +CC international. Lenient: also matches bare 0-prefixed local numbers (more false positives possible).",
                        "default": "strict"
                    },
                    "webhookUrl": {
                        "title": "🪝 Generic Webhook URL",
                        "type": "string",
                        "description": "Receives a JSON POST with full {metadata, items} payload after each run. Universal escape hatch for n8n / Make / Zapier / custom backends."
                    },
                    "webhookHeaders": {
                        "title": "📋 Webhook Headers",
                        "type": "object",
                        "description": "Optional JSON object of custom headers (e.g. {\"Authorization\":\"Bearer ...\"}). Sent with the webhook POST in addition to Content-Type: application/json."
                    },
                    "descriptionFormat": {
                        "title": "Description format",
                        "enum": [
                            "all",
                            "text",
                            "html",
                            "markdown"
                        ],
                        "type": "string",
                        "description": "Pick a single description representation. `all` keeps every variant; `text` / `html` / `markdown` drop the others.",
                        "default": "all"
                    },
                    "excludeEmptyFields": {
                        "title": "Exclude empty fields from output",
                        "type": "boolean",
                        "description": "Drop null, empty-string, and empty-array fields from each record before push. Smaller payloads for AI agents and dashboards.",
                        "default": false
                    },
                    "appConnector": {
                        "title": "Send results to Notion (or another connected app)",
                        "type": "string",
                        "description": "Optional. Pick a connected app under Settings → API & Integrations to receive your results. Notion is supported today (a run-summary page); other MCP connectors are best-effort as Apify expands its catalog."
                    },
                    "mcpIssueTeam": {
                        "title": "Issue tracker team",
                        "type": "string",
                        "description": "Only when the connected app is an issue tracker: the team (name or ID) the summary issue is created under, if that app requires one."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
