# Herold.at Scraper - Austrian Business Leads (`plowdata/herold-at`) Actor

Extract Austrian business leads and company intelligence from Herold.at. Collect emails, phone numbers, addresses, opening hours, services, reviews, and payment methods. Export to CSV, Excel, JSON, or integrate into automation / CRM workflows.

- **URL**: https://apify.com/plowdata/herold-at.md
- **Developed by:** [Frederic](https://apify.com/plowdata) (community)
- **Categories:** Lead generation, Automation, Developer tools
- **Stats:** 14 total users, 2 monthly users, 100.0% runs succeeded, 2 bookmarks
- **User rating**: No ratings yet

## Pricing

$10.00/month + usage

To use this Actor, you pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period.You also pay for the Apify platform usage, which gets cheaper the higher Apify subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#rental-actors

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Herold.at Scraper

Turn **Herold.at** into a clean, ready-to-use **Austrian business leads and company intelligence dataset**.

This actor is built for agencies, analysts, and automation workflows that need **high-quality Austrian business data** — including **emails, phone numbers, addresses, opening hours, services, reviews, and trust signals** - extracted reliably in one streamlined process.

> **Prefer usage-based pricing for smaller or variable workloads, or per-client billing?**\
> For pay-per-result pricing without a monthly commitment, see the **[Herold.at Scraper (Pay per Result)](https://apify.com/plowdata/herold-at-ppr)** version.

---

### 🚀 Built for agencies & recurring workflows

The Herold.at Scraper focuses on **depth, reliability, and predictable operation**, making it a strong fit for agency use cases, CRM enrichment, and automated pipelines.

* **One company = one row**
  Each Herold.at listing is emitted once and deduplicated automatically, ensuring clean datasets even across large or repeated runs.

* **Rich contact information**
  Emails, phone numbers, websites, addresses, and opening hours are extracted wherever available — ideal for lead generation and enrichment.

* **Deep company profiles**
  Extracts services, industries, reviews, payment methods, awards, images, and detailed company metadata when present.

* **No hard page limits**
  The scraper automatically paginates through all available results and stops when the directory is exhausted.

* **Built for unattended runs**
  Error-resilient by design: malformed listings are logged and skipped without interrupting the run.

* **Clean exports by default**
  Works out of the box with CSV, Excel, JSON, XML, and Apify’s table view.

---

### 🎯 Typical use cases

* **Austrian Lead Generation** – Build business lead lists by category and location
* **Sales & Marketing** – Extract verified contact data for outreach campaigns
* **Market & Competitor Research** – Analyze services, reviews, and positioning
* **CRM & Data Enrichment** – Augment existing records with authoritative Herold.at data
* **Automation Agencies** – Integrate Herold.at data into recurring pipelines

---

### ⚙️ Input

| Parameter   | Type    | Description                                                             |
| ----------- | ------- | ----------------------------------------------------------------------- |
| `query`     | string  | Search term you’d normally enter on Herold.at (required).               |
| `location`  | string  | Optional location filter (e.g., `Wien`, `Innsbruck`).                   |
| `openedNow` | boolean | Include only businesses currently open.                                 |
| `maxPages`  | number  | Maximum number of pages to scrape. If not set, all pages are processed. |

Example:

```json
{
  "query": "Restaurant",
  "location": "Wien",
  "openedNow": true
}
````

***

### 📦 Output: rich, structured business data

Each dataset row represents **one Herold.at company listing** with strictly typed, schema-validated fields. The output works both as a **flat table** for spreadsheets and CRMs, and as a **rich nested structure** for integrations and automations.

#### ⭐ Commonly used fields

- `name`
- `email`
- `phone`
- `website`
- `address`
- `rating`, `reviews.total`
- `industries`, `services`
- `openingHours`

***

### 📘 Detailed output schema (high-level)

Below is a structured overview of the available data groups. Many fields are optional because availability varies by listing type.

#### 🔹 Identification & trust

- Stable listing ID and profile URL
- Verification status and awards
- Aggregated ratings and review counts

#### 🔹 Location & contact

- Structured address data
- Primary and additional contact channels
- Opening hours and operational notes

#### 🔹 Reviews & reputation

- Full review texts and metadata
- Per-category ratings
- Owner responses (when available)

#### 🔹 Business details

- Industries and services
- Payment methods
- Company metadata (founding year, employee count, register ID)

#### 🔹 Media & enrichment

- Images and logos
- Social accounts and external links

#### 🔹 Multi-location companies

- Additional branch locations with individual addresses

***

### 🛡️ Reliability & schema guarantees

- Strongly typed, schema-validated output
- Automatic recovery from malformed or partial listings
- Safe to stop and restart without duplicating results
- Continues scraping even when individual fields are missing

If Herold.at changes its structure or introduces new fields, the scraper continues to run and logs the change for review.

***

### 👥 Target audience

This scraper is designed for:

- **Agencies & Sales Teams** – Generate Austrian business leads with rich contact data
- **Business Analysts & Researchers** – Collect structured market data
- **Competitor Intelligence** – Track services, reviews, and positioning at scale
- **Automation Builders** – Integrate Herold.at data into recurring workflows

***

### 💬 Support

Questions, edge cases, or feature requests?

Open a support ticket or attach a failing run log — we’ll take care of it.

# Actor input Schema

## `query` (type: `string`):

Search query for the Herold.at search.

## `location` (type: `string`):

Location for the Herold.at search, optional.

## `openedNow` (type: `boolean`):

Filter results to only include businesses that are currently open.

## `maxPages` (type: `integer`):

Maximum number of pages to scrape, optional - when omitted/left empty, all pages are scraped.

## Actor input object example

```json
{
  "query": "Cafe",
  "maxPages": 10
}
```

# Actor output Schema

## `companies` (type: `string`):

Extracted companies

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "query": "Cafe",
    "maxPages": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("plowdata/herold-at").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "query": "Cafe",
    "maxPages": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("plowdata/herold-at").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "query": "Cafe",
  "maxPages": 10
}' |
apify call plowdata/herold-at --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=plowdata/herold-at",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Herold.at Scraper - Austrian Business Leads",
        "description": "Extract Austrian business leads and company intelligence from Herold.at. Collect emails, phone numbers, addresses, opening hours, services, reviews, and payment methods. Export to CSV, Excel, JSON, or integrate into automation / CRM workflows.",
        "version": "1.0",
        "x-build-id": "giHBC2ZqWZrf2N44S"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/plowdata~herold-at/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-plowdata-herold-at",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/plowdata~herold-at/runs": {
            "post": {
                "operationId": "runs-sync-plowdata-herold-at",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/plowdata~herold-at/run-sync": {
            "post": {
                "operationId": "run-sync-plowdata-herold-at",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "query"
                ],
                "properties": {
                    "query": {
                        "title": "Query",
                        "type": "string",
                        "description": "Search query for the Herold.at search."
                    },
                    "location": {
                        "title": "Location",
                        "type": "string",
                        "description": "Location for the Herold.at search, optional."
                    },
                    "openedNow": {
                        "title": "Opened Now",
                        "type": "boolean",
                        "description": "Filter results to only include businesses that are currently open."
                    },
                    "maxPages": {
                        "title": "Max Pages",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of pages to scrape, optional - when omitted/left empty, all pages are scraped."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
