# Noon Advanced Scraper (`buseta/noon-advanced-scraper`) Actor

Scrape everything with Noon Advanced Scraper ! Whole category, search results, list of products, ratings, reviews, pricing info, and more... Last Update: Mar 22, 2026

- **URL**: https://apify.com/buseta/noon-advanced-scraper.md
- **Developed by:** [buseta](https://apify.com/buseta) (community)
- **Categories:** E-commerce, Automation
- **Stats:** 205 total users, 4 monthly users, 100.0% runs succeeded, 2 bookmarks
- **User rating**: 3.39 out of 5 stars

## Pricing

$25.00/month + usage

To use this Actor, you pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period.You also pay for the Apify platform usage, which gets cheaper the higher Apify subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#rental-actors

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Noon Advanced Scraper: Extract Detailed Product Data

The **Noon Advanced Scraper** is a powerful tool designed to extract in-depth product information from **Noon**, one of the Middle East's leading e-commerce platforms. Whether you're a market researcher, entrepreneur, or online retailer, this scraper helps you gather essential product data with ease.

---

### 🚀 What Does the Noon Scraper Do?

#### 1. Extract Detailed Product Information

- Scrape key product details, including:
  - **Product Title, Description, Prices (Local/USD), Old Prices, Discounts, Images, Ratings, and Reviews.**
- Retrieve historical pricing data to monitor discounts and trends.

#### 2. Scrape Category Pages

- Provide a category URL, and the scraper will automatically navigate through all product pages, enqueue product URLs, and extract data for each item.

#### 3. Review and Rating Insights

- Gather detailed customer reviews, including:
  - **Ratings, Review Comments, Review Titles, Date of Submission, Verified Purchase Status, and Reviewer Name.**
- Access rating distributions and statistics for customer sentiment analysis.

#### 4. Track Discounts

- Identify discounted products and their original prices, helping you spot the best deals.

#### 5. Data Output in Multiple Formats

- Export your scraped data in the following formats:
  - **JSON, CSV, Excel, or HTML** for further analysis.

---

### 💼 Who Needs the Noon Scraper?

#### 1. Market Researchers

- Conduct **competitor analysis** by gathering pricing, discounts, and customer feedback.
- Analyze **customer sentiment** and preferences based on reviews and ratings.
- Monitor **trending products** and forecast demand.

#### 2. Entrepreneurs & Retailers

- Plan **product launches** by studying popular categories and customer behavior.
- Monitor **competitor pricing** and optimize their own pricing strategy.
- Find the best deals to **source inventory at discounted rates.**

#### 3. E-commerce Businesses

- Scrape product data to create **localized product listings.**
- Track market trends to adjust **stock levels and marketing strategies.**

#### 4. Data Analysts & AI Developers

- Leverage structured data for in-depth **business intelligence and reporting.**
- Identify patterns in product performance and customer feedback.
- Use reviews and other data in training **LLM models**.

---

### ⚙️ How to Use the Noon Scraper?

You need to provide the scrape type with either "category_url", "product_urls", or "search" fields to start with. You also need to provide the base country you want to use Noon in.

- If you chose "category", you must provide "category_url".
- If you chose "product", you must provide "product_urls" which is a list of product urls.
- If you chose "search", you must provide "search" field with the search keyword.

You can sort products by their popularity, price (highest or lowest), newest first, or rating. You can set a price range if you want. You can optionally choose to scrape reviews, translate them to english, and provide maximum of reviews to scrape. Note that if a product has a lot of reviews, then scraping them will take longer. So, if you just want a quick scraping, you can set a maximum reviews count such as 100.

#### Some Input Examples

Example for scraping a category:

```json
{
  "scrape_type": "category",
  "category_url": "https://www.noon.com/uae-en/electronics/",
  "country": "uae",
  "sort_by": "popularity",
  "min_price": 100,
  "max_price": 500,
  "get_reviews": true,
  "max_reviews_per_product": 100,
  "translate_to_english": true
}
````

Example for scraping product URLs:

```json
{
  "scrape_type": "product",
  "product_urls": [
    "https://www.noon.com/uae-en/product-1",
    "https://www.noon.com/uae-en/product-2"
  ],
  "get_reviews": true
}
```

Example for scraping a search result:

```json
{
  "scrape_type": "search",
  "search": "laptop",
  "sort_by": "lowest-price",
  "max_reviews_per_product": 10,
  "max_price": 5000
}
```

***

### ⚙️ Output

You can download the dataset extracted by Noon Advanced Scraper in various formats such as JSON, HTML, CSV, or Excel.

A sample output result for each product will be as follows:

```json
{
  "name": "Gaming Laptop",
  "description": "High-performance gaming laptop with powerful specs.",
  "product_link": "https://www.noon.com/uae-en/gaming-laptop/",
  "price": {
    "local_currency": "AED",
    "price_local": 5000,
    "price_usd": 1361.23,
    "price_old_local": 5500,
    "price_old_usd": 1496.35,
    "discount": "9%"
  },
  "rating_info": {
    "rating": 4.5,
    "number_of_ratings": 100,
    "rating_distribution_percentage": [
      { "5": "70%" },
      { "4": "20%" },
      { "3": "5%" },
      { "2": "0%" },
      { "1": "5%" }
    ]
  },
  "images": [
    "https://noon-cdn.com/laptop1.jpg",
    "https://noon-cdn.com/laptop2.jpg"
  ],
  "number_of_reviews": 50,
  "reviews": [
    {
      "title": "Great Laptop",
      "review": "Excellent performance and battery life.",
      "helpful_count": 20,
      "reviewed_by": "John Doe",
      "rating": 5,
      "verified_purchase": true,
      "review_date": "2024-06-01",
      "review_images": ["https://noon-cdn.com/review1.jpg"]
    }
  ]
}
```

***

### ⚙️ Is it legal to scrape Noon?

Our scrapers are ethical and do not extract any private user data. They only extract publicly available data. However, always ensure that scraping is compliant with Noon’s terms of service and applicable regulations.

For more details, check out Apify's blog on [the legality of web scraping](https://blog.apify.com/is-web-scraping-legal/).

# Actor input Schema

## `scrape_type` (type: `string`):

Type of scraping to perform: Search, Category, Product List

## `product_urls` (type: `array`):

(if Product List): List of product URLs to scrape

## `category_url` (type: `string`):

(if Category): Category URL to scrape

## `search` (type: `string`):

(if Search): Keyword to search for

## `country` (type: `string`):

(If Search): Country that the website will display search results

## `sort_by` (type: `string`):

Products will be sorted by this option

## `min_price` (type: `integer`):

Products with this minimum price will be scraped

## `max_price` (type: `integer`):

Products with this maximum price will be scraped

## `get_reviews` (type: `boolean`):

Also scrape reviews from each product

## `max_reviews_per_product` (type: `integer`):

Maximum reviews to scrape per product

## `translate_to_english` (type: `boolean`):

Translate reviews to English or keep in original language

## `proxy_config` (type: `object`):

Select proxies to be used for scraping

## Actor input object example

```json
{
  "scrape_type": "product",
  "product_urls": [
    "https://www.noon.com/uae-en/iphone-14-pro-max-physical-dual-sim-128gb-silver-5g-without-facetime/N53347546A/p"
  ],
  "sort_by": "popularity",
  "get_reviews": true,
  "max_reviews_per_product": 10,
  "translate_to_english": false,
  "proxy_config": {
    "useApifyProxy": false
  }
}
```

# Actor output Schema

## `results` (type: `string`):

Product data including prices, ratings, reviews, and images from Noon.com (UAE, KSA, Egypt).

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "scrape_type": "product",
    "product_urls": [
        "https://www.noon.com/uae-en/iphone-14-pro-max-physical-dual-sim-128gb-silver-5g-without-facetime/N53347546A/p"
    ],
    "sort_by": "popularity",
    "get_reviews": false,
    "max_reviews_per_product": 10,
    "proxy_config": {
        "useApifyProxy": false
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("buseta/noon-advanced-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "scrape_type": "product",
    "product_urls": ["https://www.noon.com/uae-en/iphone-14-pro-max-physical-dual-sim-128gb-silver-5g-without-facetime/N53347546A/p"],
    "sort_by": "popularity",
    "get_reviews": False,
    "max_reviews_per_product": 10,
    "proxy_config": { "useApifyProxy": False },
}

# Run the Actor and wait for it to finish
run = client.actor("buseta/noon-advanced-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "scrape_type": "product",
  "product_urls": [
    "https://www.noon.com/uae-en/iphone-14-pro-max-physical-dual-sim-128gb-silver-5g-without-facetime/N53347546A/p"
  ],
  "sort_by": "popularity",
  "get_reviews": false,
  "max_reviews_per_product": 10,
  "proxy_config": {
    "useApifyProxy": false
  }
}' |
apify call buseta/noon-advanced-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=buseta/noon-advanced-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Noon Advanced Scraper",
        "description": "Scrape everything with Noon Advanced Scraper ! Whole category, search results, list of products, ratings, reviews, pricing info, and more... Last Update: Mar 22, 2026",
        "version": "1.0",
        "x-build-id": "zlpInmEPgdYhapCT1"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/buseta~noon-advanced-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-buseta-noon-advanced-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/buseta~noon-advanced-scraper/runs": {
            "post": {
                "operationId": "runs-sync-buseta-noon-advanced-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/buseta~noon-advanced-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-buseta-noon-advanced-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "scrape_type"
                ],
                "properties": {
                    "scrape_type": {
                        "title": "Scraping Type",
                        "enum": [
                            "search",
                            "category",
                            "product"
                        ],
                        "type": "string",
                        "description": "Type of scraping to perform: Search, Category, Product List"
                    },
                    "product_urls": {
                        "title": "Product URLs to scrape",
                        "type": "array",
                        "description": "(if Product List): List of product URLs to scrape",
                        "default": [
                            "https://www.noon.com/uae-en/iphone-14-pro-max-physical-dual-sim-128gb-silver-5g-without-facetime/N53347546A/p"
                        ],
                        "items": {
                            "type": "string"
                        }
                    },
                    "category_url": {
                        "title": "Category URL to scrape",
                        "type": "string",
                        "description": "(if Category): Category URL to scrape"
                    },
                    "search": {
                        "title": "Search keyword",
                        "type": "string",
                        "description": "(if Search): Keyword to search for"
                    },
                    "country": {
                        "title": "Country",
                        "enum": [
                            "uae",
                            "ksa",
                            "egypt"
                        ],
                        "type": "string",
                        "description": "(If Search): Country that the website will display search results"
                    },
                    "sort_by": {
                        "title": "Sort Products By",
                        "enum": [
                            "popularity",
                            "highest-price",
                            "lowest-price",
                            "newest",
                            "rating"
                        ],
                        "type": "string",
                        "description": "Products will be sorted by this option",
                        "default": "popularity"
                    },
                    "min_price": {
                        "title": "Minimum Price",
                        "type": "integer",
                        "description": "Products with this minimum price will be scraped"
                    },
                    "max_price": {
                        "title": "Maximum Price",
                        "type": "integer",
                        "description": "Products with this maximum price will be scraped"
                    },
                    "get_reviews": {
                        "title": "Get Reviews",
                        "type": "boolean",
                        "description": "Also scrape reviews from each product",
                        "default": true
                    },
                    "max_reviews_per_product": {
                        "title": "Maximum Reviews Per Product",
                        "type": "integer",
                        "description": "Maximum reviews to scrape per product",
                        "default": 100
                    },
                    "translate_to_english": {
                        "title": "Translate Reviews to English",
                        "type": "boolean",
                        "description": "Translate reviews to English or keep in original language",
                        "default": false
                    },
                    "proxy_config": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Select proxies to be used for scraping"
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
