# Shopify Store Product & Price Scraper (`andok/shopify-intelligence`) Actor

Extract complete product catalogs, variant pricing, and inventory data from any Shopify store to monitor ecommerce competitors.

- **URL**: https://apify.com/andok/shopify-intelligence.md
- **Developed by:** [Andok](https://apify.com/andok) (community)
- **Categories:** Business, Developer tools
- **Stats:** 3 total users, 0 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $10.00 / 1,000 product scrapeds

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Shopify Product Scraper & Price Monitor

Scrape complete product catalogs from any Shopify store for competitive pricing intelligence and market analysis. Just provide store URLs and get back structured product data including titles, variants, prices, SKUs, inventory status, and images -- all extracted from Shopify's public API without headless browsers or expensive proxies.

### Features

- **Full catalog extraction** — pulls every product and variant from a Shopify store's public API
- **Variant-level detail** — captures prices, SKUs, inventory availability, and weights for each variant
- **Multi-store support** — scrape multiple Shopify stores in a single run
- **No browser required** — uses Shopify's `products.json` endpoint directly, keeping runs fast and cheap
- **Configurable limits** — set a maximum number of products per store to control output size and cost
- **Structured output** — returns clean, typed data ready for spreadsheets, databases, or analysis pipelines
- **Charge-limit safety** — respects per-run spending caps so you never exceed your budget

### Input

| Field | Type | Required | Default | Description |
|-------|------|----------|---------|-------------|
| `urls` | `array` | Yes | — | List of Shopify store URLs to scrape (e.g., `https://gymshark.com`) |
| `maxProducts` | `integer` | No | `250` | Maximum number of products to extract per store (1-5,000) |
| `timeoutSeconds` | `integer` | No | `15` | Maximum seconds to wait for each store's API to respond |

#### Input Example

```json
{
  "urls": [
    "https://gymshark.com",
    "https://allbirds.com",
    "https://bombas.com"
  ],
  "maxProducts": 250,
  "timeoutSeconds": 15
}
````

### Output

Each product in the store produces one result object with full variant and image data.

Key fields:

- `storeUrl` (`string`) — the Shopify store URL
- `productId` (`number`) — Shopify product ID
- `title` (`string`) — product title
- `handle` (`string`) — URL-friendly product handle
- `productType` (`string`) — product category/type
- `vendor` (`string`) — brand or vendor name
- `publishedAt` (`string`) — ISO date when the product was published
- `variants` (`array`) — list of variants, each with `id`, `title`, `price`, `sku`, `available`, and `grams`
- `images` (`string[]`) — list of product image URLs

#### Output Example

```json
{
  "storeUrl": "https://gymshark.com",
  "productId": 7865432109876,
  "title": "Adapt Animal Seamless Leggings",
  "handle": "adapt-animal-seamless-leggings",
  "productType": "Leggings",
  "vendor": "Gymshark",
  "publishedAt": "2025-01-15T09:00:00-05:00",
  "variants": [
    {
      "id": 43210987654321,
      "title": "Black / XS",
      "price": "62.00",
      "sku": "GSLL-BK-XS",
      "available": true,
      "grams": 200
    },
    {
      "id": 43210987654322,
      "title": "Black / S",
      "price": "62.00",
      "sku": "GSLL-BK-S",
      "available": true,
      "grams": 210
    }
  ],
  "images": [
    "https://cdn.shopify.com/s/files/1/example/products/leggings-front.jpg",
    "https://cdn.shopify.com/s/files/1/example/products/leggings-back.jpg"
  ]
}
```

### Pricing

| Event | Cost |
|-------|------|
| Product Scraped | Pay-per-event (see actor pricing page) |

You pay for each product extracted. A per-run spending cap ensures you stay within budget.

### Use Cases

- **Competitive pricing** — monitor competitor product prices and track changes over time with scheduled runs
- **Market research** — analyze product catalogs across multiple Shopify stores in your niche
- **Dropshipping sourcing** — evaluate supplier catalogs by extracting full product and variant data
- **Inventory monitoring** — track product availability and stock status across competitor stores
- **Price comparison tools** — feed structured product data into comparison engines or dashboards
- **E-commerce analytics** — build datasets of product types, vendors, and pricing tiers across the Shopify ecosystem

### Related Actors

| Actor | What it adds |
|-------|-------------|
| [Company Enrichment & Research](https://apify.com/andok/company-enrichment) | Enriches Shopify store owners with company metadata, social links, and VAT validation |
| [Website Contact Scraper](https://apify.com/andok/contact-scraper) | Finds contact emails and phones on Shopify store pages for outreach |
| [Tech Stack Analyzer](https://apify.com/andok/tech-stack-analyzer) | Confirms whether a site runs on Shopify and identifies other technologies |

# Actor input Schema

## `urls` (type: `array`):

List of Shopify store URLs to scrape (e.g., https://gymshark.com). The actor accesses each store's public products.json endpoint.

## `maxProducts` (type: `integer`):

Maximum number of products to extract per store. Set lower to reduce cost on large catalogs, or higher (up to 5,000) for full extraction.

## `timeoutSeconds` (type: `integer`):

Maximum seconds to wait for each store's API to respond. Increase for stores with very large catalogs.

## Actor input object example

```json
{
  "urls": [
    "https://gymshark.com"
  ],
  "maxProducts": 250,
  "timeoutSeconds": 15
}
```

# Actor output Schema

## `results` (type: `string`):

No description

## `resultsCsv` (type: `string`):

No description

## `run` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "urls": [
        "https://gymshark.com"
    ]
};

// Run the Actor and wait for it to finish
const run = await client.actor("andok/shopify-intelligence").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "urls": ["https://gymshark.com"] }

# Run the Actor and wait for it to finish
run = client.actor("andok/shopify-intelligence").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "urls": [
    "https://gymshark.com"
  ]
}' |
apify call andok/shopify-intelligence --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=andok/shopify-intelligence",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Shopify Store Product & Price Scraper",
        "description": "Extract complete product catalogs, variant pricing, and inventory data from any Shopify store to monitor ecommerce competitors.",
        "version": "1.0",
        "x-build-id": "Kq1WJtBBkqkuoB7FD"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/andok~shopify-intelligence/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-andok-shopify-intelligence",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/andok~shopify-intelligence/runs": {
            "post": {
                "operationId": "runs-sync-andok-shopify-intelligence",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/andok~shopify-intelligence/run-sync": {
            "post": {
                "operationId": "run-sync-andok-shopify-intelligence",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "urls": {
                        "title": "Shopify Store URLs",
                        "minItems": 1,
                        "type": "array",
                        "description": "List of Shopify store URLs to scrape (e.g., https://gymshark.com). The actor accesses each store's public products.json endpoint.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxProducts": {
                        "title": "Max Products per Store",
                        "minimum": 1,
                        "maximum": 5000,
                        "type": "integer",
                        "description": "Maximum number of products to extract per store. Set lower to reduce cost on large catalogs, or higher (up to 5,000) for full extraction.",
                        "default": 250
                    },
                    "timeoutSeconds": {
                        "title": "Timeout (seconds)",
                        "minimum": 1,
                        "maximum": 120,
                        "type": "integer",
                        "description": "Maximum seconds to wait for each store's API to respond. Increase for stores with very large catalogs.",
                        "default": 15
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
