# Alibaba Products Search Scraper (`xtracto/alibaba-search-scraper`) Actor

Run keyword searches on alibaba.com and collect every offer card across paginated pages.

- **URL**: https://apify.com/xtracto/alibaba-search-scraper.md
- **Developed by:** [Farhan Febrian Nauval](https://apify.com/xtracto) (community)
- **Categories:** Developer tools, E-commerce, Automation
- **Stats:** 24 total users, 11 monthly users, 98.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $3.00 / 1,000 results

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.
Because this Actor supports Apify Store discounts, the price decreases on higher subscription plans.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event
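
As a rough illustration of the per-result pricing above, the sketch below estimates the event charge for a given number of results. It assumes the listed starting price of $3.00 per 1,000 results applies without any discount, and it ignores Apify platform usage, which is billed separately. The function name `estimate_result_cost` is hypothetical.

```python
from decimal import Decimal

# Starting price quoted in the Pricing section: $3.00 per 1,000 results.
# Assumption: no subscription discount is applied.
PRICE_PER_1000_RESULTS = Decimal("3.00")


def estimate_result_cost(result_count: int) -> Decimal:
    """Estimate the per-result event charge in USD (platform usage excluded)."""
    if result_count < 0:
        raise ValueError("result_count must be non-negative")
    cost = PRICE_PER_1000_RESULTS * result_count / 1000
    return cost.quantize(Decimal("0.01"))


print(estimate_result_cost(2500))  # 7.50
```

Treat the figure as a lower bound for budgeting, since platform usage comes on top of the event charge.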

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Alibaba Search Scraper

Search any keyword on Alibaba.com and export every result card — supplier identity, pricing, minimum order quantity, trust signals, and product images — across as many pages as you need.

### Why use this Actor

- No Alibaba account or login required — public search results only.
- Covers every result card per page: title, price range, MOQ, country of origin, and supplier trust tier in one structured record.
- Multi-keyword support lets you scrape dozens of categories in a single run.
- Stable JSON output ready to load into a spreadsheet, database, or sourcing pipeline.
- Automatic retries keep your run on track even when individual pages take longer to respond.
- Export to JSON, CSV, or Excel straight from the Apify Console.

### How it works

1. You provide one or more search keywords (e.g. `"bluetooth earphones"`, `"stainless steel bottle"`).
2. The Actor opens each search results page on Alibaba.com, page by page, up to the limit you set.
3. Every product card is parsed into a structured record: product ID, title, price range, MOQ, supplier name, country, gold-supplier years, Trade Assurance status, and main image.
4. Records stream into your dataset as each page completes — no waiting for the full run to finish.
5. Download the complete dataset as JSON, CSV, or Excel when the run is done.

You don't need to manage any browsers or scrapers.
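
To make the page-by-page traversal in the steps above concrete, here is a minimal sketch that lists every (keyword, page) pair a run would cover. It is an illustration only: the helper `plan_pages` is hypothetical, the Actor's internal scheduling is not documented here, and the sketch assumes each keyword has at least as many result pages as the limit.

```python
from itertools import product


def plan_pages(queries: list[str], max_pages_per_query: int = 3) -> list[tuple[str, int]]:
    """Return every (query, page) pair to visit, with pages numbered from 1."""
    pages = range(1, max_pages_per_query + 1)
    return list(product(queries, pages))


plan = plan_pages(["bluetooth earphones", "stainless steel bottle"], max_pages_per_query=2)
for query, page in plan:
    print(f"{query!r} -> page {page}")
```

The number of pairs, `len(queries) * max_pages_per_query`, gives an upper bound on how many result pages a run requests before retries.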

### Input

```json
{
    "queries": ["bluetooth earphones", "stainless steel water bottle"],
    "maxPagesPerQuery": 3,
    "maxConcurrency": 3,
    "maxRequestRetries": 6,
    "proxyConfiguration": { "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] }
}
````

| Field | Type | Description |
| --- | --- | --- |
| `queries` | string[] | Search keywords. One keyword per entry. |
| `maxPagesPerQuery` | integer | Number of result pages to scrape per keyword. Default: 3, max: 100. |
| `maxConcurrency` | integer | Number of pages processed in parallel. Default: 3. |
| `maxRequestRetries` | integer | How many times a failed page request is retried before skipping. Default: 6. |
| `proxyConfiguration` | object | Apify Proxy settings or a custom proxy list. RESIDENTIAL group recommended. |
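
The input fields above can be assembled and checked before starting a run. The sketch below is one possible client-side helper, not part of the Actor: `build_input` is a hypothetical name, the defaults and the 1 to 100 page range are taken from the input schema in this document, and rejecting empty keywords is an added assumption.

```python
def build_input(queries, max_pages_per_query=3, max_concurrency=3, max_request_retries=6):
    """Build an Actor input dict using the documented defaults."""
    # Assumption: blank keywords are not useful, so they are dropped here.
    cleaned = [q.strip() for q in queries if q and q.strip()]
    if not cleaned:
        raise ValueError("queries must contain at least one non-empty keyword")
    # The input schema declares minimum 1 and maximum 100 for this field.
    if not 1 <= max_pages_per_query <= 100:
        raise ValueError("maxPagesPerQuery must be between 1 and 100")
    return {
        "queries": cleaned,
        "maxPagesPerQuery": max_pages_per_query,
        "maxConcurrency": max_concurrency,
        "maxRequestRetries": max_request_retries,
        "proxyConfiguration": {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        },
    }


print(build_input(["bluetooth earphones", "stainless steel water bottle"]))
```

The returned dict can be passed as `run_input` to the Python client call shown in the API section.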

### Output

```json
{
    "query": "bluetooth earphones",
    "page": 1,
    "productId": "1601416191034",
    "title": "Wholesale Light Weight Bluetooth Headphones 30H Playtime Over-Ear Earphones with Mic LED Battery Indicator",
    "url": "https://www.alibaba.com/product-detail/..._1601416191034.html",
    "priceFormatted": "$3.50 - $4.20",
    "minOrderQuantity": "5",
    "minOrderUnit": "pieces",
    "companyId": "233189915",
    "companyName": "Foshan Shunde Junye Electronic Co., Ltd.",
    "companyHomeUrl": "https://jun-ye.en.alibaba.com/",
    "countryCode": "CN",
    "goldSupplierYears": "11 yrs",
    "tradeAssurance": false,
    "mainImage": "https://s.alicdn.com/@sc04/kf/Hc94740ec.jpg",
    "scrapedAt": "2026-05-30T04:27:41Z"
}
```

| Field | Type | Description |
| --- | --- | --- |
| `query` | string | The search keyword that produced this result. |
| `page` | integer | Page number the result appeared on. |
| `productId` | string | Alibaba's unique product identifier. |
| `title` | string | Full product listing title. |
| `url` | string | Direct link to the product detail page. |
| `priceFormatted` | string | Price range as shown on the listing (e.g. `"$3.50 - $4.20"`). |
| `minOrderQuantity` | string | Minimum order quantity. |
| `minOrderUnit` | string | Unit for the MOQ (e.g. `"pieces"`, `"sets"`). |
| `companyId` | string | Alibaba's internal supplier company ID. |
| `companyName` | string | Registered name of the supplier company. |
| `companyHomeUrl` | string | Supplier's Alibaba storefront URL. |
| `countryCode` | string | ISO country code of the supplier (e.g. `"CN"`). |
| `goldSupplierYears` | string | How long the supplier has held Gold Supplier status. |
| `tradeAssurance` | boolean | Whether the supplier offers Trade Assurance on this listing. |
| `mainImage` | string | URL of the listing's primary product image. |
| `scrapedAt` | string | ISO 8601 timestamp of when the record was collected. |
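
Several output fields arrive as display strings (`priceFormatted`, `minOrderQuantity`, `goldSupplierYears`), so downstream code usually converts them to numbers. The following is a minimal sketch that handles only the formats shown in the sample record above; other currencies, single prices with extra text, or different year labels may need additional handling. The helper `normalize_record` and its output keys `priceMin` and `priceMax` are hypothetical.

```python
import re
from datetime import datetime, timezone

_DECIMAL = re.compile(r"\d+(?:\.\d+)?")
_INTEGER = re.compile(r"\d+")


def normalize_record(item: dict) -> dict:
    """Convert the string fields of one result record into typed values."""
    # Collect every number in the price string, e.g. "$3.50 - $4.20" -> [3.5, 4.2].
    price_text = (item.get("priceFormatted") or "").replace(",", "")
    prices = [float(n) for n in _DECIMAL.findall(price_text)]
    moq_text = (item.get("minOrderQuantity") or "").replace(",", "")
    years = _INTEGER.search(item.get("goldSupplierYears") or "")
    return {
        "productId": item["productId"],
        "priceMin": min(prices) if prices else None,
        "priceMax": max(prices) if prices else None,
        "minOrderQuantity": int(moq_text) if moq_text.isdigit() else None,
        "goldSupplierYears": int(years.group()) if years else None,
        # Assumes the timestamp format of the sample: "2026-05-30T04:27:41Z".
        "scrapedAt": datetime.strptime(
            item["scrapedAt"], "%Y-%m-%dT%H:%M:%SZ"
        ).replace(tzinfo=timezone.utc),
    }


sample = {
    "productId": "1601416191034",
    "priceFormatted": "$3.50 - $4.20",
    "minOrderQuantity": "5",
    "goldSupplierYears": "11 yrs",
    "scrapedAt": "2026-05-30T04:27:41Z",
}
print(normalize_record(sample))
```

Unparseable values become `None` rather than raising, so one unusual listing does not stop a batch conversion.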

### Other Alibaba Scrapers

| Actor | Description |
| --- | --- |
| [Alibaba Search Scraper](https://apify.com/misceres/alibaba-search-scraper) | Search results with supplier, pricing, and MOQ for any keyword. |
| [Alibaba Product Scraper](https://apify.com/misceres/alibaba-product-scraper) | Full product detail: ladder pricing, SKU matrix, inventory, and supplier profile. |
| [Alibaba Company Scraper](https://apify.com/misceres/alibaba-company-scraper) | Complete supplier profile from any Alibaba storefront subdomain. |
| [Alibaba RFQ Scraper](https://apify.com/misceres/alibaba-rfq-scraper) | Live buying requests from Alibaba's sourcing marketplace. |
| [Alibaba Review Scraper](https://apify.com/misceres/alibaba-review-scraper) | Aggregate ratings, per-criterion scores, and supplier SLA metrics. |

# Actor input Schema

## `queries` (type: `array`):

Search keywords (one per array entry).

## `maxPagesPerQuery` (type: `integer`):

How many result pages to traverse per query.

## `maxConcurrency` (type: `integer`):

Maximum number of parallel requests.

## `maxRequestRetries` (type: `integer`):

How many times a request will be retried before giving up.

## `proxyConfiguration` (type: `object`):

Apify Proxy or your own proxy list. RESIDENTIAL group is recommended.

## Actor input object example

```json
{
  "queries": [
    "bluetooth headphones"
  ],
  "maxPagesPerQuery": 3,
  "maxConcurrency": 3,
  "maxRequestRetries": 6,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "queries": [
        "bluetooth headphones"
    ],
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ]
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("xtracto/alibaba-search-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "queries": ["bluetooth headphones"],
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}

# Run the Actor and wait for it to finish
run = client.actor("xtracto/alibaba-search-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "queries": [
    "bluetooth headphones"
  ],
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}' |
apify call xtracto/alibaba-search-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=xtracto/alibaba-search-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Alibaba Products Search Scraper",
        "description": "Run keyword searches on alibaba.com and collect every offer card across paginated pages.",
        "version": "1.0",
        "x-build-id": "s5EkbYfnEGrxwJfqq"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/xtracto~alibaba-search-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-xtracto-alibaba-search-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/xtracto~alibaba-search-scraper/runs": {
            "post": {
                "operationId": "runs-sync-xtracto-alibaba-search-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/xtracto~alibaba-search-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-xtracto-alibaba-search-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "queries"
                ],
                "properties": {
                    "queries": {
                        "title": "Search keywords",
                        "type": "array",
                        "description": "Search keywords (one per array entry).",
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxPagesPerQuery": {
                        "title": "Max pages per query",
                        "minimum": 1,
                        "maximum": 100,
                        "type": "integer",
                        "description": "How many result pages to traverse per query.",
                        "default": 3
                    },
                    "maxConcurrency": {
                        "title": "Max concurrency",
                        "type": "integer",
                        "description": "Maximum number of parallel requests.",
                        "default": 3
                    },
                    "maxRequestRetries": {
                        "title": "Max retries",
                        "type": "integer",
                        "description": "How many times a request will be retried before giving up.",
                        "default": 6
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Apify Proxy or your own proxy list. RESIDENTIAL group is recommended."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
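
For projects that cannot use the client libraries, the `run-sync-get-dataset-items` path from the specification above can be called directly. The sketch below only builds the request with the Python standard library and leaves the network call commented out, since it needs a real token. The helper name `build_run_request` is hypothetical, and the response is assumed to be a JSON array of dataset items.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Server URL and path taken from the OpenAPI specification above.
ENDPOINT = (
    "https://api.apify.com/v2/acts/"
    "xtracto~alibaba-search-scraper/run-sync-get-dataset-items"
)


def build_run_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build a POST request that runs the Actor and returns its dataset items."""
    url = f"{ENDPOINT}?{urlencode({'token': token})}"
    body = json.dumps(actor_input).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


request = build_run_request("<YOUR_API_TOKEN>", {"queries": ["bluetooth headphones"]})
print(request.get_method(), request.full_url.split("?")[0])

# Uncomment to send. The call blocks until the run finishes.
# with urllib.request.urlopen(request) as response:
#     items = json.load(response)  # assumed to be a list of result records
```

Because the synchronous endpoint waits for the run to complete, long multi-page runs may be better served by the `/runs` path and polling the dataset afterwards.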
