# Pull&Bear (`datasaurus/pullandbear`) Actor

Scrape products from pullandbear.com websites. All countries and languages. Scrape the full website or scrape by product category or individual product. Fast and efficient. n8n node: n8n-nodes-pullandbear

- **URL**: https://apify.com/datasaurus/pullandbear.md
- **Developed by:** [datasaurus](https://apify.com/datasaurus) (community)
- **Categories:** E-commerce
- **Stats:** 21 total users, 1 monthly users, 100.0% runs succeeded, 7 bookmarks
- **User rating**: 5.00 out of 5 stars

## Pricing

$9.99/month + usage

To use this Actor, you pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period. You also pay for Apify platform usage, which gets cheaper on higher Apify subscription plans.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#rental-actors

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Pull&Bear Products Scraper

This tool scrapes product data from Pull&Bear. It works with all website countries and languages.

Check out our other scrapers: [Zara](https://apify.com/datasaurus/zara) | [Stradivarius](https://apify.com/datasaurus/stradivarius) | [Bershka](https://apify.com/datasaurus/bershka) | [MassimoDutti](https://apify.com/datasaurus/massimodutti) | [Oysho](https://apify.com/datasaurus/oysho) | [ZaraHome](https://apify.com/datasaurus/zarahome)

n8n node: [n8n-nodes-pullandbear](https://www.npmjs.com/package/n8n-nodes-pullandbear)

### Features

- **Detailed Scrape**:
    - Scrapes individual product pages for detailed information
    - **Fields include**: name, description, price, colors, sizes, SKUs, category, images, first available date, promotions, product page URL, product composition, sustainability details, certified materials, traceability.
    - Approximate run cost: 1,000 products in about 5 minutes, at an Actor usage cost of about 24 cents (including proxy).

- **Granularity**: Scrape:
    - The entire website, e.g. [https://www.pullandbear.com/gb/](https://www.pullandbear.com/gb/)
    - Product category pages, e.g. [https://www.pullandbear.com/gb/woman/sale/clothing/t-shirts-and-tops-n7097](https://www.pullandbear.com/gb/woman/sale/clothing/t-shirts-and-tops-n7097)
    - Product pages, e.g. [https://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890](https://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890)

- **Multiple URLs**: Scrape data from multiple start URLs within a single scrape, including from different country websites.

- **Set Limits**: Define the maximum products and categories to scrape per URL.

- **Deduplication**: Unique products are returned, even across multiple start URLs. This is useful when the same product appears in multiple subcategories. The scraper will do its best to deduplicate results, but because pages are scraped concurrently it may occasionally miss a duplicate product.
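
Since the built-in deduplication may occasionally miss a duplicate, a second pass over the downloaded items can catch the rest. The following is a minimal sketch, assuming that the top-level `id` field (as seen in the example output) identifies a product; the function name `deduplicate_by_id` is hypothetical and not part of the Actor.

```python
def deduplicate_by_id(items):
    """Keep the first occurrence of each product `id`, preserving order.

    Assumption: `id` identifies a product. Items without an `id`
    are kept as-is because they cannot be compared.
    """
    seen = set()
    unique = []
    for item in items:
        product_id = item.get("id")
        if product_id is None:
            unique.append(item)
            continue
        if product_id not in seen:
            seen.add(product_id)
            unique.append(item)
    return unique


# Illustrative data only; real items carry many more fields.
sample_items = [
    {"id": 677318356, "name": "High neck jumper"},
    {"id": 123, "name": "Another product"},
    {"id": 677318356, "name": "High neck jumper"},
]
unique_items = deduplicate_by_id(sample_items)
print(len(unique_items))
```

Whether the same product shares an `id` across different country websites is not stated here, so treat cross-country deduplication with this key as untested.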


### Notes

- This tool is configured to use residential proxies, which are necessary to avoid IP address blocking. It keeps data download sizes small, which reduces run time and cost.
- The output is in JSON format due to the nested data structure.
- The fields `colors`, `sizes`, `category` and `mainImage` summarize data from the `colorsSizesImagesJSON` JSON field. This allows the key information to be exported in a flat structure to CSV and read easily in Excel or Google Sheets.
- Detailed `color` and `size` information is contained within the `colorsSizesImagesJSON` JSON field.
- Each line/item in the result is a unique product, within which there can be multiple colors and sizes.
- The field names match the field names returned by the website.
- Occasionally the scraper may return fewer results than requested. This can happen when the website returns dummy products with blank information, which the scraper filters out.
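
If one row per color and size is needed rather than one row per product, the nested `colorsSizesImagesJSON` field can be unpacked after download. This is a sketch under the assumption that items follow the structure shown in the Example Output section; `flatten_variants` and the output column names are hypothetical choices, not something the Actor provides.

```python
def flatten_variants(product):
    """Return one flat dict per (color, size) pair of a product item."""
    rows = []
    for color in product.get("colorsSizesImagesJSON") or []:
        for size in color.get("sizes") or []:
            rows.append({
                "productId": product.get("id"),
                "productName": product.get("name"),
                "color": color.get("name"),
                "size": size.get("name"),
                "sku": size.get("sku"),
                # Size-level prices are strings in the example output.
                "price": size.get("price"),
                "isBuyable": size.get("isBuyable"),
            })
    return rows


# Trimmed-down item based on the example output.
sample_product = {
    "id": 677318356,
    "name": "High neck jumper",
    "colorsSizesImagesJSON": [
        {
            "name": "Ecru/Black",
            "sizes": [
                {"sku": 677318362, "name": "XS", "price": "799", "isBuyable": True},
                {"sku": 677318360, "name": "S", "price": "799", "isBuyable": True},
            ],
        }
    ],
}
rows = flatten_variants(sample_product)
for row in rows:
    print(row)
```

The resulting list of dicts can be written with `csv.DictWriter` or loaded into a data frame.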

### Known Scraping Challenges
- Occasionally the scrape might not start, or might stop, because requests are blocked by the website. A '403 error' will appear in the logs. This is common in web scraping and difficult to avoid, but it can be solved by re-running the scrape.
- Actual results may be lower than the `max_results` requested due to the way products are displayed on the website. On the website individual colors of the same item may appear separately, however the scraper will retrieve the 'product bundle' containing all colours of that item. So requesting 5 products might return 1 result if 5 colour variations of the same product were displayed separately on the webpage.
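
Re-running after a blocked scrape can be automated with a small retry wrapper. The sketch below assumes that a blocked run finishes with a run `status` other than `SUCCEEDED`; a run that stops early might still report success, in which case checking the number of returned items may be a better signal. `run_with_retries` is a hypothetical helper, and `start_run` is any callable that starts a run and returns the run object as a dict.

```python
import time


def run_with_retries(start_run, max_attempts=3, delay_seconds=30.0, sleep=time.sleep):
    """Call `start_run` until it reports SUCCEEDED or attempts run out.

    Returns the last run object, whatever its status.
    """
    last_run = None
    for attempt in range(1, max_attempts + 1):
        last_run = start_run()
        if last_run is not None and last_run.get("status") == "SUCCEEDED":
            return last_run
        if attempt < max_attempts:
            # Wait before re-running so the retry is not immediate.
            sleep(delay_seconds)
    return last_run


# Usage with the Python client (requires a token and network access):
#
#   from apify_client import ApifyClient
#   client = ApifyClient("<YOUR_API_TOKEN>")
#   run = run_with_retries(
#       lambda: client.actor("datasaurus/pullandbear").call(run_input=run_input)
#   )
```

Each retry is a full new run and is billed as such, so a low `max_attempts` is a sensible default.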

### Resources

- **Getting Started with Apify**:
    - [Getting Started with Apify 1/4 - Introduction](https://www.youtube.com/watch?v=b_nep8vMnkc&list=PLObrtcm1Kw6MUrlLNDbK9QRg8VDJg0gOW&index=1)
    - [Getting Started with Apify 4/4 - Integrations](https://www.youtube.com/watch?v=bNACk1_S_6w&list=PLObrtcm1Kw6MUrlLNDbK9QRg8VDJg0gOW&index=4)
    - [Try Apify for Free - 2,000+ Pre-built Web Scraping and Automation Tools](https://www.youtube.com/watch?v=WQNgQVRG9_U)
    - [Integration with Make, GitHub, Zapier, Google Drive, and other apps](https://apify.com/integrations)
    - [How to Train ChatGPT on Your Own Data - Build a Custom AI Chatbot](https://www.youtube.com/watch?v=LcG919C4UeU)
    - [Apify Platform Documentation](https://docs.apify.com/platform)


### Example Input
```json
{
  "start_urls": [
    {"url": "https://www.pullandbear.com/gb/"},
    {"url": "https://www.pullandbear.com/gb/woman/sale/clothing/t-shirts-and-tops-n7097"},
    {"url": "https://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890"}
  ],
  "deduplicate_across_all_start_urls": true,
  "max_subcategories": 10,
  "max_results": 1000
}
````

### Example Output

```json
{
  "id": 677318356,
  "name": "High neck jumper",
  "description": "",
  "longDescription": "Twisted yarn knit jumper with a high neck and long sleeves.",
  "reference": "03558305-I2024",
  "displayReference": "3558/305",
  "productType": "Clothing",
  "mainImage": "https://static.pullandbear.net/assets/public/7606/4625/2adf454bbde5/51bc4484a8a5/03558305070-A8M/03558305070-A8M.jpg?ts=1731510422687",
  "colors": "Ecru/Black, Blue Marl",
  "sizes": "XS, S, M, L",
  "price": 799,
  "oldPrice": 2999.0,
  "keyword": "high-neck-jumper",
  "category": "woman/sale/favourites-n7254",
  "availabilityDate": "2024-11-22 00:00:00.0",
  "colorsSizesImagesJSON": [
    {
      "id": "070",
      "reference": "C03558305070-I2024",
      "displayReference": "3558/305/070",
      "name": "Ecru/Black",
      "productPageSelectedColor": "https://www.pullandbear.com/gb/high-neck-jumper-l03558305?pelement=677318356&colorId=070",
      "xmedia": [
        "https://static.pullandbear.net/assets/public/7606/4625/2adf454bbde5/51bc4484a8a5/03558305070-A8M/03558305070-A8M.jpg?ts=1731510422687",
        "https://static.pullandbear.net/assets/public/de1e/0561/979242e0bbff/8fdb3dfac91d/03558305070-C/03558305070-C.jpg?ts=1731510475872",
        "https://static.pullandbear.net/assets/public/0b75/e233/9b554e5eb61d/3e0127d73f24/03558305070-A6M/03558305070-A6M.jpg?ts=1731510506478",
        "https://static.pullandbear.net/assets/public/2670/9e96/8a8f4cd2bb87/833151563484/03558305070-A4M/03558305070-A4M.jpg?ts=1732006817018",
        "https://static.pullandbear.net/assets/public/7c3f/f9cf/af2046de8815/f64435740b15/03558305070-A2M/03558305070-A2M.jpg?ts=1732006869111",
        "https://static.pullandbear.net/assets/public/5a50/2b36/99c64fd99fc0/a35130813396/03558305070-A1M/03558305070-A1M.jpg?ts=1732006969720",
        "https://static.pullandbear.net/assets/public/1353/3bfd/e660490a9e38/33c293ff09ff/03558305070-A3M/03558305070-A3M.jpg?ts=1732006977840",
        "https://static.pullandbear.net/assets/public/60cb/85f6/842240d583af/c5c689da9b9f/03558305070-M/03558305070-M.jpg?ts=1732006981335",
        "https://static.pullandbear.net/assets/public/ef45/cb32/d55347108805/0ef2cade45a8/03558305070-E/03558305070-E.jpg?ts=1732177236036",
        "https://static.pullandbear.net/assets/public/d225/7fa7/8853459ab59f/4c7de8969aca/03558305070-A7M/03558305070-A7M.jpg?ts=1732177243252",
        "https://static.pullandbear.net/assets/public/2bf2/9f56/70464ca4b966/ceb7cc5c034d/0355830507001-A7M/0355830507001-A7M.jpg?ts=1732179315805",
        "https://static.pullandbear.net/assets/public/5a50/2b36/99c64fd99fc0/a35130813396/03558305070-A1M/03558305070-A1M.jpg?ts=1732006969720",
        "https://static.pullandbear.net/assets/public/7c3f/f9cf/af2046de8815/f64435740b15/03558305070-A2M/03558305070-A2M.jpg?ts=1732006869111",
        "https://static.pullandbear.net/assets/public/1353/3bfd/e660490a9e38/33c293ff09ff/03558305070-A3M/03558305070-A3M.jpg?ts=1732006977840",
        "https://static.pullandbear.net/assets/public/2670/9e96/8a8f4cd2bb87/833151563484/03558305070-A4M/03558305070-A4M.jpg?ts=1732006817018",
        "https://static.pullandbear.net/assets/public/0b75/e233/9b554e5eb61d/3e0127d73f24/03558305070-A6M/03558305070-A6M.jpg?ts=1731510506478",
        "https://static.pullandbear.net/assets/public/7606/4625/2adf454bbde5/51bc4484a8a5/03558305070-A8M/03558305070-A8M.jpg?ts=1731510422687",
        "https://static.pullandbear.net/assets/public/60cb/85f6/842240d583af/c5c689da9b9f/03558305070-M/03558305070-M.jpg?ts=1732006981335",
        "https://static.pullandbear.net/assets/public/9e92/7386/d1d04e51985d/d2a3b276bd2d/0355830507001-E/0355830507001-E.jpg?ts=1732179339795",
        "https://static.pullandbear.net/assets/public/de1e/0561/979242e0bbff/8fdb3dfac91d/03558305070-C/03558305070-C.jpg?ts=1731510475872"
      ],
      "sizes": [
        {
          "sku": 677318362,
          "name": "XS",
          "description": null,
          "partnumber": "0355830507001-I2024",
          "isBuyable": true,
          "backSoon": "0",
          "mastersSizeId": "101",
          "price": "799",
          "oldPrice": "2999",
          "discountsPercentages": {
            "oldPriceDiscount": "73"
          },
          "promotionId": 18085,
          "sizeType": "regular",
          "skuDimensions": [
            {
              "dimensionId": "113",
              "value": 49.0,
              "dimensionName": "1/2 Chest"
            },
            {
              "dimensionId": "120",
              "value": 43.0,
              "dimensionName": "Front Length"
            },
            {
              "dimensionId": "24",
              "value": 50.0,
              "dimensionName": "Sleeve Length"
            }
          ]
        },
        {
          "sku": 677318360,
          "name": "S",
          "description": null,
          "partnumber": "0355830507002-I2024",
          "isBuyable": true,
          "backSoon": "0",
          "mastersSizeId": "102",
          "price": "799",
          "oldPrice": "2999",
          "discountsPercentages": {
            "oldPriceDiscount": "73"
          },
          "promotionId": 18085,
          "sizeType": "regular",
          "skuDimensions": [
            {
              "dimensionId": "113",
              "value": 52.0,
              "dimensionName": "1/2 Chest"
            },
            {
              "dimensionId": "120",
              "value": 45.0,
              "dimensionName": "Front Length"
            },
            {
              "dimensionId": "24",
              "value": 51.0,
              "dimensionName": "Sleeve Length"
            }
          ]
        },
        {
          "sku": 677318361,
          "name": "M",
          "description": null,
          "partnumber": "0355830507003-I2024",
          "isBuyable": true,
          "backSoon": "0",
          "mastersSizeId": "103",
          "price": "799",
          "oldPrice": "2999",
          "discountsPercentages": {
            "oldPriceDiscount": "73"
          },
          "promotionId": 18085,
          "sizeType": "regular",
          "skuDimensions": [
            {
              "dimensionId": "113",
              "value": 55.0,
              "dimensionName": "1/2 Chest"
            },
            {
              "dimensionId": "120",
              "value": 47.0,
              "dimensionName": "Front Length"
            },
            {
              "dimensionId": "24",
              "value": 52.0,
              "dimensionName": "Sleeve Length"
            }
          ]
        },
        {
          "sku": 677318359,
          "name": "L",
          "description": null,
          "partnumber": "0355830507004-I2024",
          "isBuyable": true,
          "backSoon": "0",
          "mastersSizeId": "104",
          "price": "799",
          "oldPrice": "2999",
          "discountsPercentages": {
            "oldPriceDiscount": "73"
          },
          "promotionId": 18085,
          "sizeType": "regular",
          "skuDimensions": [
            {
              "dimensionId": "113",
              "value": 58.0,
              "dimensionName": "1/2 Chest"
            },
            {
              "dimensionId": "120",
              "value": 49.0,
              "dimensionName": "Front Length"
            },
            {
              "dimensionId": "24",
              "value": 53.0,
              "dimensionName": "Sleeve Length"
            }
          ]
        }
      ],
      "extraInfo": {
        "fitSizeMessage": {
          "additionalInfo": {
            "percentages": {}
          }
        }
      }
    },
    {
      "id": "418",
      "reference": "C03558305418-I2024",
      "displayReference": "3558/305/418",
      "name": "Blue marl",
      "productPageSelectedColor": "https://www.pullandbear.com/gb/high-neck-jumper-l03558305?pelement=677318356&colorId=418",
      "xmedia": [
        "https://static.pullandbear.net/assets/public/e789/ccd0/5a044b1aaca6/8079553ade6b/03558305418-A6M/03558305418-A6M.jpg?ts=1732121482023",
        "https://static.pullandbear.net/assets/public/cee9/8f01/27e54629bab3/bc703de1420f/03558305418-M/03558305418-M.jpg?ts=1732184715538",
        "https://static.pullandbear.net/assets/public/376d/a327/882e4f108515/b31ec8acdd31/03558305418-A1M/03558305418-A1M.jpg?ts=1732184723623",
        "https://static.pullandbear.net/assets/public/8cac/34d8/473b4ae49233/072544a32e76/03558305418-A7M/03558305418-A7M.jpg?ts=1732189419636",
        "https://static.pullandbear.net/assets/public/c45f/3588/91db47208261/cf710ba9c494/03558305418-E/03558305418-E.jpg?ts=1732190452773",
        "https://static.pullandbear.net/assets/public/598c/6d1d/be9e4d35a6d8/3522defc6f1e/03558305418-C/03558305418-C.jpg?ts=1732197138231",
        "https://static.pullandbear.net/assets/public/31b4/a915/c34444e8be07/6b7e81b7b531/03558305418-A8M/03558305418-A8M.jpg?ts=1732197267758",
        "https://static.pullandbear.net/assets/public/6204/2b83/4bf044b9ad28/200f47674052/03558305418-A3M/03558305418-A3M.jpg?ts=1732198668299",
        "https://static.pullandbear.net/assets/public/d228/5cfd/1e044fc5813e/d1f7ac68c318/03558305418-A4M/03558305418-A4M.jpg?ts=1732198716804",
        "https://static.pullandbear.net/assets/public/0ed2/7abe/bc8d475d8722/4e0242fd2c34/03558305418-R/03558305418-R.jpg?ts=1732199181631",
        "https://static.pullandbear.net/assets/public/6989/fc44/a6184ea28865/1e91edfd7e8c/03558305418-A2M/03558305418-A2M.jpg?ts=1732209786392",
        "https://static.pullandbear.net/assets/public/9b97/7134/665d4b9ab0ba/6b86f6ab4f3b/0355830541801-A7M/0355830541801-A7M.jpg?ts=1732199425731",
        "https://static.pullandbear.net/assets/public/ad53/6c67/ebe342f982c4/b0279a47a234/0355830541801-A1M/0355830541801-A1M.jpg?ts=1732199011432",
        "https://static.pullandbear.net/assets/public/e789/ccd0/5a044b1aaca6/8079553ade6b/03558305418-A6M/03558305418-A6M.jpg?ts=1732121482023",
        "https://static.pullandbear.net/assets/public/1ec9/42e2/d1764c979688/4ba41221230d/03558305418-A8M/03558305418-A8M.jpg?ts=1732121553237",
        "https://static.pullandbear.net/assets/public/312d/fb4e/66534fcdbfb5/d1807fc9017d/0355830541801-A4M/0355830541801-A4M.jpg?ts=1732198996404",
        "https://static.pullandbear.net/assets/public/7749/df8e/8deb4e5dacb9/a6eb0bfd914a/0355830541801-A3M/0355830541801-A3M.jpg?ts=1732199012037",
        "https://static.pullandbear.net/assets/public/f2ea/b4aa/61114859a327/c33f0babc026/0355830541801-M/0355830541801-M.jpg?ts=1732199025470",
        "https://static.pullandbear.net/assets/public/7fd9/d2b6/ba3c4d67b2d2/68548bfe2467/0355830541801-E/0355830541801-E.jpg?ts=1732199417180",
        "https://static.pullandbear.net/assets/public/95a5/1591/089845ec90cb/104332b84509/03558305418-C/03558305418-C.jpg?ts=1732121478111",
        "https://static.pullandbear.net/assets/public/638f/56ba/6f9c43558046/6e5824b67029/0355830541801-A2M/0355830541801-A2M.jpg?ts=1732209783134"
      ],
      "sizes": [
        {
          "sku": 677318357,
          "name": "XS",
          "description": null,
          "partnumber": "0355830541801-I2024",
          "isBuyable": true,
          "backSoon": "0",
          "mastersSizeId": "101",
          "price": "799",
          "oldPrice": "2999",
          "discountsPercentages": {
            "oldPriceDiscount": "73"
          },
          "promotionId": 18085,
          "sizeType": "regular",
          "skuDimensions": [
            {
              "dimensionId": "113",
              "value": 49.0,
              "dimensionName": "1/2 Chest"
            },
            {
              "dimensionId": "120",
              "value": 43.0,
              "dimensionName": "Front Length"
            },
            {
              "dimensionId": "24",
              "value": 50.0,
              "dimensionName": "Sleeve Length"
            }
          ]
        },
        {
          "sku": 677318358,
          "name": "S",
          "description": null,
          "partnumber": "0355830541802-I2024",
          "isBuyable": true,
          "backSoon": "0",
          "mastersSizeId": "102",
          "price": "799",
          "oldPrice": "2999",
          "discountsPercentages": {
            "oldPriceDiscount": "73"
          },
          "promotionId": 18085,
          "sizeType": "regular",
          "skuDimensions": [
            {
              "dimensionId": "113",
              "value": 52.0,
              "dimensionName": "1/2 Chest"
            },
            {
              "dimensionId": "120",
              "value": 45.0,
              "dimensionName": "Front Length"
            },
            {
              "dimensionId": "24",
              "value": 51.0,
              "dimensionName": "Sleeve Length"
            }
          ]
        },
        {
          "sku": 677318363,
          "name": "M",
          "description": null,
          "partnumber": "0355830541803-I2024",
          "isBuyable": true,
          "backSoon": "0",
          "mastersSizeId": "103",
          "price": "799",
          "oldPrice": "2999",
          "discountsPercentages": {
            "oldPriceDiscount": "73"
          },
          "promotionId": 18085,
          "sizeType": "regular",
          "skuDimensions": [
            {
              "dimensionId": "113",
              "value": 55.0,
              "dimensionName": "1/2 Chest"
            },
            {
              "dimensionId": "120",
              "value": 47.0,
              "dimensionName": "Front Length"
            },
            {
              "dimensionId": "24",
              "value": 52.0,
              "dimensionName": "Sleeve Length"
            }
          ]
        },
        {
          "sku": 677318364,
          "name": "L",
          "description": null,
          "partnumber": "0355830541804-I2024",
          "isBuyable": true,
          "backSoon": "0",
          "mastersSizeId": "104",
          "price": "799",
          "oldPrice": "2999",
          "discountsPercentages": {
            "oldPriceDiscount": "73"
          },
          "promotionId": 18085,
          "sizeType": "regular",
          "skuDimensions": [
            {
              "dimensionId": "113",
              "value": 58.0,
              "dimensionName": "1/2 Chest"
            },
            {
              "dimensionId": "120",
              "value": 49.0,
              "dimensionName": "Front Length"
            },
            {
              "dimensionId": "24",
              "value": 53.0,
              "dimensionName": "Sleeve Length"
            }
          ]
        }
      ],
      "extraInfo": {
        "fitSizeMessage": {
          "additionalInfo": {
            "percentages": {}
          }
        }
      }
    }
  ],
  "isBuyable": true,
  "onSpecial": false,
  "website": "https://www.pullandbear.com/gb/",
  "categoryPage": "https://www.pullandbear.com/gb/woman/sale/favourites-n7254",
  "productPage": "https://www.pullandbear.com/gb/high-neck-jumper-l03558305?pelement=677318356",
  "mainColorid": "070",
  "composition": [
    {
      "part": "1",
      "composition": [
        {
          "id": "173862441980495135",
          "name": "polyester",
          "description": "88",
          "percentage": "88"
        }
      ]
    },
    {
      "part": "1",
      "composition": [
        {
          "id": "173862441980441373",
          "name": "acrylic",
          "description": "11",
          "percentage": "11"
        }
      ]
    },
    {
      "part": "1",
      "composition": [
        {
          "id": "17386244198049342",
          "name": "elastane",
          "description": "1",
          "percentage": "1"
        }
      ]
    }
  ],
  "compositionDetail": {
    "parts": [
      {
        "id": "1",
        "description": "OUTER SHELL",
        "areas": [],
        "components": [
          {
            "material": "polyester",
            "percentage": "88%"
          },
          {
            "material": "acrylic",
            "percentage": "11%"
          },
          {
            "material": "elastane",
            "percentage": "1%"
          }
        ],
        "microcontents": [],
        "reinforcements": []
      }
    ],
    "exceptions": []
  },
  "care": [
    "Hand wash up to 30ºC/86ºF",
    "Do not bleach",
    "Iron up to 110ºC/230ºF",
    "Do not tumble dry",
    "Dry clean tetrachlorethylene"
  ],
  "sustainability": {
    "show": false,
    "syntheticFiberPercentage": {
      "name": ""
    }
  },
  "certifiedMaterials": {
    "show": true,
    "containsAtLeastTitle": "Which Contains At Least",
    "title": "Certified Materials",
    "materials": [
      {
        "moreInformationTitle": "More Information",
        "certificationLink": "https://textileexchange.org/standards/",
        "fiberKey": "materials.recycled_acrylic",
        "fiberType": "Rcs Certified Recycled Acrylic",
        "fiberDescription": "This material is made from recycled acrylic waste from other production processes. Transforming waste into new materials helps limit the extraction of raw materials. It is certified to the Recycled Content Standard (RCS), which verifies the recycled content and tracks it from source to final product.",
        "percentage": "11%",
        "certificationId": "193341",
        "certifiedByText": "Certified By Intertek 193341.",
        "fiberParts": [
          {
            "fiberPartTitle": "Outer Shell",
            "fiberAreas": [
              {
                "fiberAreaTitle": ""
              }
            ]
          }
        ]
      },
      {
        "moreInformationTitle": "More Information",
        "certificationLink": "https://textileexchange.org/standards/",
        "fiberKey": "materials.recycled_polyester",
        "fiberType": "Rcs Certified Recycled Polyester",
        "fiberDescription": "Nowadays, recycled polyester is mainly made from PET plastic waste. This is a type of plastic that is widely used in a variety of items, such as plastic bottles. Using recycled materials helps limit the production of virgin polyester fiber. It is certified to the Recycled Content Standard (RCS), which verifies the recycled content and tracks it from source to final product.",
        "percentage": "88%",
        "certificationId": "193341",
        "certifiedByText": "Certified By Intertek 193341.",
        "fiberParts": [
          {
            "fiberPartTitle": "Outer Shell",
            "fiberAreas": [
              {
                "fiberAreaTitle": ""
              }
            ]
          }
        ]
      }
    ]
  },
  "nameEn": "High neck jumper",
  "traceability": {
    "show": true,
    "weaving": {
      "name": "Weaving",
      "country": [
        "Mainland China"
      ]
    },
    "dyeingPrinting": {
      "name": "",
      "country": []
    },
    "confection": {
      "name": "Production",
      "country": [
        "Mainland China"
      ]
    },
    "assembly": {
      "name": "",
      "country": []
    },
    "pricking": {
      "name": "",
      "country": []
    },
    "finish": {
      "name": "",
      "country": []
    }
  },
  "additionalInfo": ""
}
```

# Actor input Schema

## `start_urls` (type: `array`):

- Website, e.g. https://www.pullandbear.com/gb/ (see the Information tab for a list of country websites)
- Category page, e.g. https://www.pullandbear.com/gb/woman/clothing/jeans-n6581
- Product page, e.g. https://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890 (the full URL is required to scrape product pages)

## `deduplicate_across_all_start_urls` (type: `boolean`):

Activate to deduplicate results across all start URLs. Deactivate to deduplicate results for each start URL independently.

## `max_categories` (type: `integer`):

Select the number of category product pages to scrape per URL. This works with main page URLs only.

## `max_results` (type: `integer`):

Select the maximum number of products to scrape per start URL. Actual results may be lower than this number due to the way products are displayed on the website. On the website individual colors of the same item may appear separately, however the scraper will retrieve the 'product bundle' containing all colours of that item. So requesting 5 products might return 1 result if 5 colour variations of the same product were displayed separately on the webpage.

## Actor input object example

```json
{
  "start_urls": [
    {
      "url": "https://www.pullandbear.com/gb/"
    },
    {
      "url": "https://www.pullandbear.com/gb/woman/clothing/jeans-n6581"
    },
    {
      "url": "https://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890"
    }
  ],
  "deduplicate_across_all_start_urls": true,
  "max_results": 10
}
```
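
When the input is assembled in code, a small builder can catch obvious mistakes before a run is started. This is a sketch only: `build_input` is a hypothetical helper, and the URL prefix check is an assumption based on the example URLs in this document, not a rule enforced by the Actor.

```python
def build_input(urls, max_results=None, deduplicate=True):
    """Build an input object in the shape of the example above."""
    if not urls:
        raise ValueError("at least one start URL is required")
    for url in urls:
        # Assumption: all supported pages live under this prefix.
        if not url.startswith("https://www.pullandbear.com/"):
            raise ValueError(f"unexpected start URL: {url}")
    actor_input = {
        "start_urls": [{"url": url} for url in urls],
        "deduplicate_across_all_start_urls": deduplicate,
    }
    if max_results is not None:
        if max_results < 1:
            raise ValueError("max_results must be a positive integer")
        actor_input["max_results"] = max_results
    return actor_input


example_input = build_input(
    [
        "https://www.pullandbear.com/gb/",
        "https://www.pullandbear.com/gb/woman/clothing/jeans-n6581",
    ],
    max_results=10,
)
print(example_input)
```

The returned dict can be passed as `run_input` to the Python client call shown in the API section.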

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "start_urls": [
        {
            "url": "https://www.pullandbear.com/gb/"
        },
        {
            "url": "https://www.pullandbear.com/gb/woman/clothing/jeans-n6581"
        },
        {
            "url": "https://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890"
        }
    ],
    "max_results": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("datasaurus/pullandbear").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "start_urls": [
        { "url": "https://www.pullandbear.com/gb/" },
        { "url": "https://www.pullandbear.com/gb/woman/clothing/jeans-n6581" },
        { "url": "https://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890" },
    ],
    "max_results": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("datasaurus/pullandbear").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "start_urls": [
    {
      "url": "https://www.pullandbear.com/gb/"
    },
    {
      "url": "https://www.pullandbear.com/gb/woman/clothing/jeans-n6581"
    },
    {
      "url": "https://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890"
    }
  ],
  "max_results": 10
}' |
apify call datasaurus/pullandbear --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=datasaurus/pullandbear",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Pull&Bear",
        "description": "Scrape products from pullandbear.com websites. All countries and languages. Scrape the full website or scrape by product category or individual product. Fast and efficient. n8n node: n8n-nodes-pullandbear",
        "version": "0.1",
        "x-build-id": "AqInmA8YfZzyoRFO0"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/datasaurus~pullandbear/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-datasaurus-pullandbear",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/datasaurus~pullandbear/runs": {
            "post": {
                "operationId": "runs-sync-datasaurus-pullandbear",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/datasaurus~pullandbear/run-sync": {
            "post": {
                "operationId": "run-sync-datasaurus-pullandbear",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "start_urls"
                ],
                "properties": {
                    "start_urls": {
                        "title": "Start URLs - full website, category page or product page",
                        "uniqueItems": true,
                        "type": "array",
                        "description": "Website: e.g.\nhttps://www.pullandbear.com/gb/\n-see Information tab for a list of country websites-\nCategory page: e.g.\nhttps://www.pullandbear.com/gb/woman/clothing/jeans-n6581\nProduct page: e.g.\nhttps://www.pullandbear.com/gb/straight-wideleg-jeans-l03660308?cS=972&pelement=721441890\n-the full url is required to scrape product pages-",
                        "default": [
                            {
                                "url": "https://www.pullandbear.com/gb/"
                            }
                        ],
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    },
                    "deduplicate_across_all_start_urls": {
                        "title": "Deduplicate across all start urls",
                        "type": "boolean",
                        "description": "Activate to deduplicate results across all start urls. Deactivate to deduplicate results for each start url independently.",
                        "default": true
                    },
                    "max_categories": {
                        "title": "Max Categories per start url",
                        "minimum": 1,
                        "maximum": 100000,
                        "type": "integer",
                        "description": "Maximum number of category pages to scrape per start url. This applies to main page (full website) urls only."
                    },
                    "max_results": {
                        "title": "Max Results per start url",
                        "minimum": 1,
                        "maximum": 100000,
                        "type": "integer",
                        "description": "Select the maximum number of products to scrape per start url. Actual results may be lower than this number due to the way products are displayed on the website. On the website, individual colours of the same item may appear separately; however, the scraper will retrieve the 'product bundle' containing all colours of that item. So requesting 5 products might return 1 result if 5 colour variations of the same product were displayed separately on the webpage."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
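The `run-sync-get-dataset-items` path in the specification can also be called without the client library. The following is a minimal sketch using only the Python standard library. The server URL, path, `token` query parameter, and JSON request body come from the specification above; the helper names, the 300-second timeout, and the assumption that the response body is JSON are illustrative, since the specification only declares an `OK` response.

```python
import json
import urllib.parse
import urllib.request

# Server URL and path as declared in the OpenAPI specification above.
API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/datasaurus~pullandbear/run-sync-get-dataset-items"


def build_request(token, run_input):
    """Build a POST request carrying the Actor input as a JSON body."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{API_BASE}{ACTOR_PATH}?{query}",
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_and_fetch_items(token, run_input, timeout=300):
    """Send the request and decode the body, assuming the response is JSON."""
    request = build_request(token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

For example, `run_and_fetch_items("<YOUR_API_TOKEN>", {"start_urls": [{"url": "https://www.pullandbear.com/gb/"}], "max_results": 10})` sends the same input as the examples above. The call blocks until the run finishes, so for long scrapes the asynchronous `/runs` path may be a better fit.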
