# Zalando Scraper (Pay Per Result) (`123webdata/zalando-scraper`) Actor

A scraper for Zalando to collect product information, prices, and availability from both category and product detail pages. Perfect for AI training, pricing analytics, and e-commerce integrations. Pricing: $0.005/result. 10 results on free trial.

- **URL**: https://apify.com/123webdata/zalando-scraper.md
- **Developed by:** [Milán Vásárhelyi](https://apify.com/123webdata) (community)
- **Categories:** E-commerce, AI
- **Stats:** 18 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

$5.00 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Zalando.com Scraper

This actor scrapes product data from [Zalando.com](https://www.zalando.com/), a popular European online fashion retailer.

### Issues and feature requests
If you encounter any issue, or you'd like us to change or add anything, please open an issue in the actor page or write to us directly at hello@123webdata.com

### Output Example (simplified)

The actor provides a convenient overview mode that displays the most important product information in a tabular format. Here's an example of how the data looks in the overview mode:

| Image | Product URL | Product Name | Current Price | Regular Price | Currency | Description | Scraped At | 
|-------|-------|-------|-------|-------|-------|-------|-------|
| <img src="https://img01.ztat.net/article/spp-media-p1/ea5f7901252f4ddda6db30fe83dc373f/521ce5eb256448b7b085b3366e8bdd1f.jpg?imwidth=762&filter=packshot" style="max-width: 100px; max-height: 100px; object-fit: contain;"> | https://www.zalando.co.uk/anna-field-handbag-black-an651h12h-q11.html | Handbag - black | 23.99 | 23.99 | GBP | Material & care  Outer mate... | 2025-07-21 | 



### Features

- Scrapes detailed product information from Zalando.com product pages
- Extracts data from both individual product URLs and category pages
- Follows pagination links on category pages (optional)
- Supports various product categories with specific fields like brand, price, and category

### Input

The actor accepts the following input parameters:

| Field | Type | Description |
|-------|------|-------------|
| `productUrls` | array | List of Zalando.com product URLs to scrape |
| `categoryUrls` | array | List of Zalando.com category URLs to scrape |
| `maxResultsPerScrape` | integer | Maximum number of results to extract (default: 1000) |
| `usePagination` | boolean | Whether to follow pagination links on category pages (default: true) |


### Output Example (full)

The actor outputs data in the following structure:

```json
{
	"url": "https://www.zalando.co.uk/anna-field-handbag-black-an651h12h-q11.html",
	"name": "Handbag - black",
	"price": 23.99,
	"regular_price": 23.99,
	"currency": "GBP",
	"in_stock": null,
	"sku": "AN651H12H-Q11",
	"mpn": null,
	"gtin": null,
	"brand": "Anna Field by Zalando",
	"breadcrumbs": [],
	"main_image": "https://img01.ztat.net/article/spp-media-p1/ea5f7901252f4ddda6db30fe83dc373f/521ce5eb256448b7b085b3366e8bdd1f.jpg?imwidth=762&filter=packshot",
	"images": [
		"https://img01.ztat.net/article/spp-media-p1/ea5f7901252f4ddda6db30fe83dc373f/521ce5eb256448b7b085b3366e8bdd1f.jpg?imwidth=762&filter=packshot",
		"https://img01.ztat.net/article/spp-media-p1/34a45af4a0d840109ca2c767f3736818/1e085e1d9633408c8d34c9ab720dd679.jpg?imwidth=156",
		"https://img01.ztat.net/article/spp-media-p1/c0354e9167ba4518bcb57dad31cd4cf1/0230c6d658fd4859baf344da79fd6c6b.jpg?imwidth=156",
		"https://img01.ztat.net/article/spp-media-p1/678496c36de549738d73581b1cf05a04/4e8e2a90b42349bdb4f29cdf9dc08168.jpg?imwidth=156",
		"https://img01.ztat.net/article/spp-media-p1/ea5f7901252f4ddda6db30fe83dc373f/521ce5eb256448b7b085b3366e8bdd1f.jpg?imwidth=156&filter=packshot"
	],
	"description": "Material & care\n\nOuter material:\nPolyurethane\nLining:\nPolyester\nFabric:\nFaux leather\n\nDetails\n\nFastening:\n\nZip\n\nPattern:\n\nPlain\n\nArticle number:\n\nAN651H12H-Q11\n\nView manufacturing details\n\nSize & fit\n\nHeight:\n11.0 \" (Size One Size)\nLength:\n13.0 \" (Size One Size)\nWidth:\n4.5 \" (Size One Size)\nCarrying handle:\n10.0 \" (Size One Size)\n\nProduct standard\n\nAnna Field by Zalando\n\nFollow",
	"color": "black",
	"size": "One Size",
	"material": "Polyurethane, Polyester, Faux leather",
	"style": "Plain",
	"features": [],
	"attributes": {
		"current selected color colour": "black",
		"fastening": "Zip",
		"pattern": "Plain",
		"article number": "AN651H12H-Q11",
		"outer material": "Polyurethane",
		"lining": "Polyester",
		"fabric": "Faux leather",
		"height": "11.0 \" (Size One Size)",
		"length": "13.0 \" (Size One Size)",
		"width": "4.5 \" (Size One Size)",
		"carrying handle": "10.0 \" (Size One Size)"
	},
	"rating_value": 4.3,
	"review_count": 51,
	"canonical_url": null,
	"scraped_at": "2025-07-21T12:11:23.424Z"
}
````

### Use Cases

- Monitoring fashion product prices across European markets
- Analyzing product availability and trends
- Tracking price changes over time
- Market research and competitive analysis in the fashion industry

# Actor input Schema

## `productUrls` (type: `array`):

List of Zalando.com product URLs to scrape.

## `categoryUrls` (type: `array`):

List of Zalando.com category URLs to scrape. The scraper will extract all product URLs from these pages.

## `maxResultsPerScrape` (type: `integer`):

Maximum number of results to extract. Default is 20.

## `usePagination` (type: `boolean`):

Whether to follow pagination links on category pages.

## Actor input object example

```json
{
  "productUrls": [
    "https://www.zalando.co.uk/anna-field-handbag-black-an651h12h-q11.html"
  ],
  "categoryUrls": [
    "https://www.zalando.co.uk/handbags/"
  ],
  "maxResultsPerScrape": 20,
  "usePagination": true
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "productUrls": [
        "https://www.zalando.co.uk/anna-field-handbag-black-an651h12h-q11.html"
    ],
    "categoryUrls": [
        "https://www.zalando.co.uk/handbags/"
    ]
};

// Run the Actor and wait for it to finish
const run = await client.actor("123webdata/zalando-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "productUrls": ["https://www.zalando.co.uk/anna-field-handbag-black-an651h12h-q11.html"],
    "categoryUrls": ["https://www.zalando.co.uk/handbags/"],
}

# Run the Actor and wait for it to finish
run = client.actor("123webdata/zalando-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "productUrls": [
    "https://www.zalando.co.uk/anna-field-handbag-black-an651h12h-q11.html"
  ],
  "categoryUrls": [
    "https://www.zalando.co.uk/handbags/"
  ]
}' |
apify call 123webdata/zalando-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=123webdata/zalando-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Zalando Scraper (Pay Per Result)",
        "description": "A scraper for Zalando to collect product information, prices, and availability from both category and product detail pages. Perfect for AI training, pricing analytics, and e-commerce integrations. Pricing: $0.005/result. 10 results on free trial.",
        "version": "0.1",
        "x-build-id": "UsAsP4CVh0gcby7Th"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/123webdata~zalando-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-123webdata-zalando-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/123webdata~zalando-scraper/runs": {
            "post": {
                "operationId": "runs-sync-123webdata-zalando-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/123webdata~zalando-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-123webdata-zalando-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "productUrls": {
                        "title": "Product URLs",
                        "type": "array",
                        "description": "List of Zalando.com product URLs to scrape.",
                        "default": [],
                        "items": {
                            "type": "string"
                        }
                    },
                    "categoryUrls": {
                        "title": "Category URLs",
                        "type": "array",
                        "description": "List of Zalando.com category URLs to scrape. The scraper will extract all product URLs from these pages.",
                        "default": [],
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxResultsPerScrape": {
                        "title": "Maximum Results",
                        "minimum": 1,
                        "maximum": 100000,
                        "type": "integer",
                        "description": "Maximum number of results to extract. Default is 20.",
                        "default": 20
                    },
                    "usePagination": {
                        "title": "Use Pagination",
                        "type": "boolean",
                        "description": "Whether to follow pagination links on category pages.",
                        "default": true
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
