# Similarweb Scraper (`parseforge/similarweb-scraper`) Actor

Unlock powerful website insights in seconds. Analyze traffic volume, global and country rankings, audience engagement, and top markets to outsmart competitors, qualify leads, and spot growth opportunities. Make smarter business decisions with reliable, actionable data at your fingertips.

- **URL**: https://apify.com/parseforge/similarweb-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Lead generation, SEO tools, Other
- **Stats:** 14 total users, 5 monthly users, 100.0% runs succeeded, 1 bookmark
- **User rating**: 5.00 out of 5 stars

## Pricing

Pay per event

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.
Because this Actor supports Apify Store discounts, the higher your subscription plan, the lower the price.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with a capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 📊 SimilarWeb Scraper

> 🕒 **Last updated:** 2026-05-05


Collect website traffic analytics from SimilarWeb without coding. Download competitor traffic data, monthly visits, bounce rates, engagement metrics, and traffic source breakdowns as CSV, Excel, or JSON. Perfect for competitive intelligence, SEO analysis, market research, and investment due diligence when you need website traffic data or want to monitor multiple sites at once.

> **The SimilarWeb Scraper collects around 20 data fields per domain, including rankings, monthly visits, bounce rate, and traffic sources, with residential proxy support included.**

### ✨ What Does It Do

- 📊 **Global Rank** - understand where a website ranks globally to assess its market position
- 🌍 **Country Rank** - see how a site ranks within its target country to measure regional dominance
- 👥 **Monthly Visits** - track estimated monthly traffic volume to benchmark performance against competitors
- 📈 **Bounce Rate** - monitor visitor engagement quality to identify content or UX issues
- ⏱️ **Avg Visit Duration** - measure how long visitors stay on a site to gauge content relevance
- 📄 **Pages Per Visit** - track user exploration depth to evaluate site navigation effectiveness
- 🔍 **Traffic Sources** - see the breakdown of direct, search, social, referral, and paid traffic to understand marketing channels
- 🌏 **Top Countries** - identify which countries drive the most traffic for international market insights
- 🖼️ **Site Screenshot** - view the website appearance to verify identity and assess design quality
- 📅 **Category Rank** - compare rankings within your industry category for competitive benchmarking

### 🔧 Input

- **Domains** - list of website domains to analyze (e.g., amazon.com, github.com, producthunt.com). URLs are cleaned automatically so you can paste full URLs with http/https.
- **Max Items** - maximum number of domains to process. Free users are limited to 100 domains per run.
- **Proxy Configuration** - residential proxies are required and included by default. No additional setup needed.

Example input:
```json
{
  "domains": ["amazon.com", "google.com", "facebook.com"],
  "maxItems": 10,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
````
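
The Actor's own cleaning rules are not spelled out here, so the following is only a client-side sketch of what "cleaned automatically" could look like. It assumes cleaning means dropping the scheme, a leading `www.`, the port, and the path; `clean_domain` is a hypothetical helper, not part of the Actor or the Apify client.

```python
from urllib.parse import urlsplit


def clean_domain(value: str) -> str:
    """Reduce a pasted URL or domain to a bare lowercase hostname (illustrative helper)."""
    value = value.strip()
    # urlsplit only recognizes the hostname when a "//" separator is present
    if "://" not in value:
        value = "//" + value
    host = urlsplit(value).hostname or ""
    # Dropping a leading "www." is an assumption about what "cleaned" means
    if host.startswith("www."):
        host = host[4:]
    return host


pasted = ["https://www.amazon.com/gp/bestsellers", "GitHub.com", "http://producthunt.com/"]
domains = [clean_domain(u) for u in pasted]
print(domains)
```

Normalizing on your side first can make it easier to de-duplicate a list before passing it as `domains`.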

### 📊 Output

Each domain includes up to 20 data fields. Download as JSON, CSV, or Excel.

| 🌐 Domain | 🏢 Site Name | 🔤 Description |
|---|---|---|
| 📊 Global Rank | 🌍 Country Rank | 🌏 Country Code |
| 🗂️ Category | 🏆 Category Rank | 👥 Monthly Visits |
| 📈 Bounce Rate | ⏱️ Pages Per Visit | ⏰ Avg Visit Duration |
| 🔍 Direct Traffic | 🔎 Search Traffic | 📱 Social Traffic |
| 🔗 Referral Traffic | ✉️ Mail Traffic | 💰 Paid Referral Traffic |
| 🌍 Top Countries | 📅 Estimated Monthly Visits | 🖼️ Site Screenshot |
| 📸 Snapshot Date | ✅ Scraped At | ⚠️ Error |
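
The table above lists each traffic source as its own column, while the output schema describes a single `trafficSources` field. A minimal sketch of expanding that field into one column per source is shown below. It assumes `trafficSources` arrives either as a dict or as a JSON-encoded string keyed by the source names from the output schema; `flatten_traffic_sources` and the sample values are illustrative only.

```python
import json

# Source names as listed in the output schema description
SOURCE_KEYS = ["direct", "search", "social", "referrals", "mail", "paidReferrals"]


def flatten_traffic_sources(item: dict) -> dict:
    """Copy an item and expand `trafficSources` into one column per source."""
    sources = item.get("trafficSources") or {}
    if isinstance(sources, str):
        # Assumption: a string value holds JSON-encoded data
        sources = json.loads(sources)
    flat = {k: v for k, v in item.items() if k != "trafficSources"}
    for key in SOURCE_KEYS:
        # Missing sources become None so every row has the same columns
        flat[f"trafficSources.{key}"] = sources.get(key)
    return flat


# Placeholder values for illustration only, not real Similarweb data
sample = {"domain": "example.com", "trafficSources": {"direct": 0.5, "search": 0.3}}
row = flatten_traffic_sources(sample)
print(row)
```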

### 💎 Why Choose the SimilarWeb Scraper?

| Feature | Our Actor | Similar Scrapers |
|---|---|---|
| Batch domain analysis (10+) | ✔️ | ❌ |
| Full traffic source breakdown | ✔️ | Partial |
| Top countries by traffic share | ✔️ | ❌ |
| Category rank information | ✔️ | ❌ |
| Residential proxies included | ✔️ | ❌ |
| Export to CSV/Excel/JSON | ✔️ | ✔️ |
| Up to 1M domains per run | ✔️ | ❌ |
| Monthly visit estimates | ✔️ | Partial |
| Site screenshots included | ✔️ | ❌ |
| Engagement metrics (bounce rate, duration) | ✔️ | Partial |
| Free tier with 100 domains | ✔️ | Partial |
| Automatic domain cleaning | ✔️ | ❌ |

### 📋 How to Use

No technical skills required. Follow these simple steps:

1. **Sign Up**: [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp)
2. **Find the Tool**: Search for "SimilarWeb Scraper" in the Apify Store and configure your input
3. **Run It**: Click "Start" and watch your results appear

That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.

### 🎯 Business Use Cases

- 📊 **Competitive Analyst** - monitor competitor traffic drops during product launches to adjust your go-to-market timing and positioning
- 💼 **Investment Analyst** - track traffic trends across 50+ fintech startups quarterly to identify growth leaders before funding rounds close
- 🔬 **Market Researcher** - benchmark traffic sources for 100 e-commerce sites in your vertical to identify emerging sales channels

***

### ✨ Why choose this Actor

| | Capability |
|---|---|
| 🎯 | **Built for the job.** Scoped specifically to this data source so you skip the parser engineering entirely. |
| 🔖 | **Structured output.** Clean, typed fields ready for analysis, dashboards, or downstream pipelines. |
| ⚡ | **Fast.** Optimized request patterns return results in seconds, not minutes. |
| 🔁 | **Always fresh.** Every run pulls live data, so the dataset reflects the source as of run time. |
| 🌐 | **No infra to manage.** Apify handles proxies, retries, scaling, scheduling, and storage. |
| 🛡️ | **Reliable.** Battle-tested across many runs and edge cases, with graceful error handling. |
| 🚫 | **No code required.** Configure in the UI, run from CLI, schedule via cron, or call from any language via the Apify API. |

> 📊 Production-grade structured data without the engineering overhead of building and maintaining your own scraper.

***

### 📈 How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| **⭐ SimilarWeb Scraper** *(this Actor)* | $5 free credit, then pay-per-use | Full source coverage | **Live per run** | Source-native filters supported | ⚡ 2 min |
| Build your own scraper | Engineering hours | Full once built | Whenever you maintain it | Custom code | 🐢 Days to weeks |
| Paid managed APIs | $$$ monthly | Vendor-defined | Live | Vendor-defined | ⏳ Hours |
| Third-party data dumps | Varies | Subset, often stale | Periodic | None | 🕒 Variable |

Pick this Actor when you want broad coverage, server-side filtering, and no pipeline maintenance.

***

### 🚀 How to use

1. 📝 **Sign up.** [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp) (takes 2 minutes).
2. 🌐 **Open the Actor.** Go to the SimilarWeb Scraper page on the Apify Store.
3. 🎯 **Set input.** Configure the input fields in the form (or paste a JSON), then set `maxItems`.
4. 🚀 **Run it.** Click **Start** and let the Actor collect your data.
5. 📥 **Download.** Grab your results in the **Dataset** tab as CSV, Excel, JSON, or XML.

> ⏱️ Total time from signup to downloaded dataset: **3-5 minutes.** No coding required.

***

### 💼 Business use cases

<table>
<tr>
<td width="50%" valign="top">

#### 📊 Data & Analytics

- Build trend reports and dashboards from live source data
- Feed BI tools, warehouses, and ML pipelines with structured records
- Run periodic snapshots to track changes over time
- Compare segments, regions, or categories with consistent fields

</td>
<td width="50%" valign="top">

#### 🏢 Operations & Strategy

- Monitor competitor moves, pricing, and inventory shifts
- Build internal directories and lookup tools backed by current data
- Power workflows that depend on fresh source records
- Cut manual data-gathering time from hours to minutes

</td>
</tr>
<tr>
<td width="50%" valign="top">

#### 🎯 Marketing & Growth

- Identify market opportunities and trending topics
- Research target audiences and customer personas at scale
- Power lead-generation pipelines with verified records
- Track sentiment, reviews, or social signals over time

</td>
<td width="50%" valign="top">

#### 🛠️ Engineering & Product

- Prototype features that need real-world data without owning a crawler
- Replace fragile in-house scrapers with a managed Actor
- Wire datasets into your apps via the Apify API or webhooks
- Skip the proxy, retry, and parsing maintenance entirely

</td>
</tr>
</table>

***

### 🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

<table>
<tr>
<td width="50%">

#### 🎓 Research and academia

- Empirical datasets for papers, thesis work, and coursework
- Longitudinal studies tracking changes across snapshots
- Reproducible research with cited, versioned data pulls
- Classroom exercises on data analysis and ethical scraping

</td>
<td width="50%">

#### 🎨 Personal and creative

- Side projects, portfolio demos, and indie app launches
- Data visualizations, dashboards, and infographics
- Content research for bloggers, YouTubers, and podcasters
- Hobbyist collections and personal trackers

</td>
</tr>
<tr>
<td width="50%">

#### 🤝 Non-profit and civic

- Transparency reporting and accountability projects
- Advocacy campaigns backed by public-interest data
- Community-run databases for local issues
- Investigative journalism on public records

</td>
<td width="50%">

#### 🧪 Experimentation

- Prototype AI and machine-learning pipelines with real data
- Validate product-market hypotheses before engineering spend
- Train small domain-specific models on niche corpora
- Test dashboard concepts with live input

</td>
</tr>
</table>

### ❓ FAQ

**🔍 How does it work?**
The Actor connects to SimilarWeb using residential proxies, collects traffic analytics for each domain you provide, and returns the results in your preferred format.

**📊 How accurate is the data?**
SimilarWeb's estimates are based on their panel of millions of users and web traffic data. Most metrics are estimates with margins of error. Global and category rankings are generally more reliable than traffic volume estimates.

**📅 Can I schedule runs automatically?**
Yes. Set up recurring runs using Apify's scheduler or integrate with Make, Zapier, or other automation tools to run on a daily, weekly, or monthly schedule.

**⚖️ Is it legal to collect SimilarWeb data?**
SimilarWeb publishes this data publicly on their website. However, review their terms of service regarding data usage. You are responsible for ensuring your usage complies with applicable laws and SimilarWeb's terms.

**🛡️ Will SimilarWeb block me?**
SimilarWeb actively blocks scrapers, which is why residential proxies are required and included by default. Our Actor uses proper headers and rotation to minimize blocking risk.

**⚡ How long does a run take?**
For 100 domains with residential proxies, expect 2-5 minutes depending on SimilarWeb's response times and proxy availability. Each domain takes roughly 1-3 seconds.
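
The arithmetic behind that estimate can be sketched as follows, assuming domains are processed one after another at the quoted 1-3 seconds each (real runs may differ with proxy availability or concurrency). `estimate_runtime_seconds` is a hypothetical helper.

```python
from typing import Tuple


def estimate_runtime_seconds(domain_count: int, per_domain: Tuple[float, float] = (1.0, 3.0)) -> Tuple[float, float]:
    """Return a (low, high) runtime estimate in seconds, assuming sequential processing."""
    low, high = per_domain
    return domain_count * low, domain_count * high


low, high = estimate_runtime_seconds(100)
print(f"{low / 60:.1f}-{high / 60:.1f} minutes")  # 1.7-5.0 minutes
```

For 100 domains this gives 100-300 seconds, which lines up with the 2-5 minute range above.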

**⚠️ Are there any limits?**
Free users can collect up to 100 domains per run. Paid users can collect up to 1,000,000 domains per run.
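
If your list is longer than the per-run limit on your plan, one option is to split it into batches and start one run per batch. The sketch below only shows the splitting step; `chunk_domains` is a hypothetical helper and the `.example` domains are placeholders.

```python
from typing import List


def chunk_domains(domains: List[str], batch_size: int = 100) -> List[List[str]]:
    """Split a domain list into consecutive batches of at most `batch_size` items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [domains[i:i + batch_size] for i in range(0, len(domains), batch_size)]


batches = chunk_domains([f"site{i}.example" for i in range(250)])
print([len(b) for b in batches])  # [100, 100, 50]
```

Each batch can then be passed as the `domains` input of its own run, with `maxItems` set to the batch length.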

### 🔗 Integrate SimilarWeb Scraper with any app

- [Make](https://docs.apify.com/platform/integrations/make) - Automate workflows
- [Zapier](https://docs.apify.com/platform/integrations/zapier) - Connect 5000+ apps
- [GitHub](https://docs.apify.com/platform/integrations/github) - Version control integration
- [Slack](https://docs.apify.com/platform/integrations/slack) - Get notifications
- [Airbyte](https://docs.apify.com/platform/integrations/airbyte) - Data pipelines
- [Google Drive](https://docs.apify.com/platform/integrations/drive) - Export to spreadsheets

### 🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge Actor in the AI of your choice:

- 💬 [**ChatGPT**](https://chat.openai.com/?q=How%20do%20I%20use%20the%20SimilarWeb%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🧠 [**Claude**](https://claude.ai/new?q=How%20do%20I%20use%20the%20SimilarWeb%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🔍 [**Perplexity**](https://perplexity.ai/search?q=How%20do%20I%20use%20the%20SimilarWeb%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)
- 🅒 [**Copilot**](https://copilot.microsoft.com/?q=How%20do%20I%20use%20the%20SimilarWeb%20Scraper%20by%20ParseForge%20on%20Apify%3F%20Show%20me%20input%20examples%2C%20output%20fields%2C%20common%20use%20cases%2C%20and%20how%20to%20integrate%20it%20into%20a%20workflow.)

***

### 🔌 Integrate with any app

SimilarWeb Scraper connects to any cloud service via [Apify integrations](https://apify.com/integrations):

- [**Make**](https://docs.apify.com/platform/integrations/make) - Automate multi-step workflows
- [**Zapier**](https://docs.apify.com/platform/integrations/zapier) - Connect with 5,000+ apps
- [**Slack**](https://docs.apify.com/platform/integrations/slack) - Get run notifications in your channels
- [**Airbyte**](https://docs.apify.com/platform/integrations/airbyte) - Pipe results into your warehouse
- [**GitHub**](https://docs.apify.com/platform/integrations/github) - Trigger runs from commits and releases
- [**Google Drive**](https://docs.apify.com/platform/integrations/drive) - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh data into your product backend, or alert your team in Slack.
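
As a rough sketch of the receiving side, the function below pulls the dataset ID out of a webhook request body. It assumes the webhook uses Apify's default payload template, where `eventType` names the event and `resource` carries the run object; if you customize the payload template, the field names will differ. `dataset_id_from_webhook` and the IDs in the sample are placeholders.

```python
import json
from typing import Optional


def dataset_id_from_webhook(body: str) -> Optional[str]:
    """Return the run's default dataset ID for a succeeded run, else None."""
    payload = json.loads(body)
    # Only react to successful runs; other event types are ignored here
    if payload.get("eventType") != "ACTOR.RUN.SUCCEEDED":
        return None
    resource = payload.get("resource") or {}
    return resource.get("defaultDatasetId")


# Placeholder payload for illustration only
sample_body = json.dumps({
    "eventType": "ACTOR.RUN.SUCCEEDED",
    "eventData": {"actorId": "<ACTOR_ID>", "actorRunId": "<RUN_ID>"},
    "resource": {"id": "<RUN_ID>", "defaultDatasetId": "<DATASET_ID>"},
})
print(dataset_id_from_webhook(sample_body))
```

The returned ID can be passed to `client.dataset(...)` in the same way as `run["defaultDatasetId"]` in the Python example in the API section.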

***

### 💡 More ParseForge Actors

- [Revzilla Scraper](https://apify.com/parseforge/revzilla-scraper) - Scrape motorcycle gear product data
- [Carparts.com Scraper](https://apify.com/parseforge/carparts-com-scraper) - Collect automotive parts catalogs
- [Houzz Scraper](https://apify.com/parseforge/houzz-scraper) - Extract interior design and home improvement listings
- [NYC Building Permits Scraper](https://apify.com/parseforge/nyc-building-permits-scraper) - Gather building permit records
- [Justia Case Law Scraper](https://apify.com/parseforge/justia-case-law-scraper) - Collect legal case information

Browse our complete collection of [data extraction tools](https://apify.com/parseforge) for more.

### 🚀 Ready to Start?

[Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp) and collect your first 100 domains for free. No coding, no setup.

### 🆘 Need Help?

- Check the FAQ section above for common questions
- Visit the [Apify documentation](https://docs.apify.com) for guides and tutorials
- To request a new scraper, propose a custom project, or report an issue, contact us via the [Tally contact form](https://tally.so/r/BzdKgA)

### ⚠️ Disclaimer

> This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by SimilarWeb or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.

***

### 🔗 Recommended Actors

- [**🔍 Google Search Scraper**](https://apify.com/parseforge/google-search-scraper) - Multi-engine SERP results with country and language targeting
- [**🗺️ Nominatim OSM Scraper**](https://apify.com/parseforge/nominatim-osm-scraper) - Geocode addresses via OpenStreetMap
- [**📊 Indexmundi Scraper**](https://apify.com/parseforge/indexmundi-scraper) - Global demographic and economic indicators
- [**📰 RAG Web Browser**](https://apify.com/parseforge/rag-web-browser) - Crawl and extract clean text from any URL for AI retrieval
- [**🌐 Website Content Crawler**](https://apify.com/parseforge/website-content-crawler) - Crawl entire sites and export structured content

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for more reference-data scrapers.

# Actor input Schema

## `maxItems` (type: `integer`):

Maximum number of domains to process. Free users are limited to 100.

## `domains` (type: `array`):

List of domains to analyze (e.g., amazon.com, google.com). URLs will be cleaned automatically.

## `proxyConfiguration` (type: `object`):

Proxy settings. Residential proxies are required and used by default.

## Actor input object example

```json
{
  "maxItems": 10,
  "domains": [
    "amazon.com",
    "google.com",
    "facebook.com",
    "youtube.com",
    "twitter.com",
    "instagram.com",
    "linkedin.com",
    "wikipedia.org",
    "reddit.com",
    "pinterest.com",
    "netflix.com",
    "microsoft.com",
    "apple.com",
    "ebay.com",
    "cnn.com",
    "bbc.com",
    "nytimes.com",
    "walmart.com",
    "target.com",
    "bestbuy.com",
    "costco.com",
    "homedepot.com",
    "lowes.com",
    "ikea.com",
    "nike.com",
    "adidas.com",
    "uber.com",
    "airbnb.com",
    "booking.com",
    "expedia.com",
    "spotify.com",
    "hulu.com",
    "disneyplus.com",
    "hbomax.com",
    "twitch.tv",
    "paypal.com",
    "stripe.com",
    "shopify.com",
    "wix.com",
    "squarespace.com",
    "wordpress.com",
    "medium.com",
    "github.com",
    "stackoverflow.com",
    "quora.com",
    "tumblr.com",
    "duolingo.com",
    "coursera.org",
    "udemy.com",
    "khanacademy.org"
  ],
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```
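
Before starting a run, it can help to check an input object against the constraints stated in the OpenAPI specification further down this page (`domains` is required and must be an array of strings; `maxItems` is an integer from 1 to 1,000,000). The following is a minimal sketch of such a check; `validate_input` is a hypothetical helper and does not replace the platform's own validation.

```python
from typing import List


def validate_input(run_input: dict) -> List[str]:
    """Return a list of problems found in an Actor input dict (empty if none)."""
    errors = []
    domains = run_input.get("domains")
    if not isinstance(domains, list) or not all(isinstance(d, str) for d in domains):
        errors.append("domains is required and must be a list of strings")
    max_items = run_input.get("maxItems")
    if max_items is not None:
        # bool is a subclass of int in Python, so exclude it explicitly
        is_int = isinstance(max_items, int) and not isinstance(max_items, bool)
        if not is_int or not 1 <= max_items <= 1_000_000:
            errors.append("maxItems must be an integer between 1 and 1000000")
    return errors


print(validate_input({"domains": ["amazon.com"], "maxItems": 10}))  # []
print(validate_input({"maxItems": 0}))
```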

# Actor output Schema

## `imageUrl` (type: `string`):

Website screenshot from SimilarWeb

## `domain` (type: `string`):

The analyzed domain name

## `siteName` (type: `string`):

Website name

## `description` (type: `string`):

Website description

## `globalRank` (type: `string`):

Global traffic rank

## `countryRank` (type: `string`):

Rank within the primary country

## `countryCode` (type: `string`):

Primary country code

## `categoryRank` (type: `string`):

Rank within the website category

## `category` (type: `string`):

Website category classification

## `monthlyVisits` (type: `string`):

Estimated monthly visits

## `bounceRate` (type: `string`):

Share of single-page visits, expressed as a fraction between 0 and 1

## `pagesPerVisit` (type: `string`):

Average pages viewed per visit

## `avgVisitDuration` (type: `string`):

Average time on site in seconds

## `trafficSources` (type: `string`):

Traffic breakdown by source (direct, search, social, referrals, mail, paidReferrals)

## `topCountries` (type: `string`):

Top traffic countries with share percentages

## `estimatedMonthlyVisits` (type: `string`):

Historical monthly visit estimates

## `snapshotDate` (type: `string`):

When the data was captured by SimilarWeb

## `scrapedAt` (type: `string`):

When this data was scraped

## `error` (type: `string`):

Error message if scraping failed
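
Because the schema above types every field as `string`, downstream code may need to convert values before doing arithmetic. The sketch below assumes `bounceRate` and `avgVisitDuration` arrive as numeric strings (or plain numbers) in the units described above; `to_float`, `summarize`, and the sample values are illustrative only.

```python
def to_float(value):
    """Convert a numeric string or number to float; return None if it cannot be parsed."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def summarize(item: dict) -> dict:
    """Build a small human-readable summary from one result item."""
    bounce = to_float(item.get("bounceRate"))          # fraction between 0 and 1
    duration = to_float(item.get("avgVisitDuration"))  # seconds
    return {
        "domain": item.get("domain"),
        "bounceRatePct": None if bounce is None else round(bounce * 100, 1),
        "avgVisitDuration": None if duration is None
        else f"{int(duration // 60)}:{int(duration % 60):02d}",
    }


# Placeholder values for illustration only, not real Similarweb data
sample = {"domain": "example.com", "bounceRate": "0.5", "avgVisitDuration": "125.0"}
print(summarize(sample))
```

Items whose `error` field is set may lack these metrics, which is why unparseable values fall back to `None` instead of raising.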

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "maxItems": 10,
    "domains": [
        "amazon.com",
        "google.com",
        "facebook.com",
        "youtube.com",
        "twitter.com",
        "instagram.com",
        "linkedin.com",
        "wikipedia.org",
        "reddit.com",
        "pinterest.com",
        "netflix.com",
        "microsoft.com",
        "apple.com",
        "ebay.com",
        "cnn.com",
        "bbc.com",
        "nytimes.com",
        "walmart.com",
        "target.com",
        "bestbuy.com",
        "costco.com",
        "homedepot.com",
        "lowes.com",
        "ikea.com",
        "nike.com",
        "adidas.com",
        "uber.com",
        "airbnb.com",
        "booking.com",
        "expedia.com",
        "spotify.com",
        "hulu.com",
        "disneyplus.com",
        "hbomax.com",
        "twitch.tv",
        "paypal.com",
        "stripe.com",
        "shopify.com",
        "wix.com",
        "squarespace.com",
        "wordpress.com",
        "medium.com",
        "github.com",
        "stackoverflow.com",
        "quora.com",
        "tumblr.com",
        "duolingo.com",
        "coursera.org",
        "udemy.com",
        "khanacademy.org"
    ],
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ]
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/similarweb-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "maxItems": 10,
    "domains": [
        "amazon.com",
        "google.com",
        "facebook.com",
        "youtube.com",
        "twitter.com",
        "instagram.com",
        "linkedin.com",
        "wikipedia.org",
        "reddit.com",
        "pinterest.com",
        "netflix.com",
        "microsoft.com",
        "apple.com",
        "ebay.com",
        "cnn.com",
        "bbc.com",
        "nytimes.com",
        "walmart.com",
        "target.com",
        "bestbuy.com",
        "costco.com",
        "homedepot.com",
        "lowes.com",
        "ikea.com",
        "nike.com",
        "adidas.com",
        "uber.com",
        "airbnb.com",
        "booking.com",
        "expedia.com",
        "spotify.com",
        "hulu.com",
        "disneyplus.com",
        "hbomax.com",
        "twitch.tv",
        "paypal.com",
        "stripe.com",
        "shopify.com",
        "wix.com",
        "squarespace.com",
        "wordpress.com",
        "medium.com",
        "github.com",
        "stackoverflow.com",
        "quora.com",
        "tumblr.com",
        "duolingo.com",
        "coursera.org",
        "udemy.com",
        "khanacademy.org",
    ],
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}

# Run the Actor and wait for it to finish
run = client.actor("parseforge/similarweb-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```
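
If you prefer not to install the client library, the same run can be started over plain HTTP. The sketch below builds (but does not send) a request for the `run-sync-get-dataset-items` endpoint, using the server URL, path, and `token` query parameter from the OpenAPI specification later in this section. `build_run_request` is a hypothetical helper, and the response handling in the final comment assumes the endpoint returns the dataset items as JSON.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/parseforge~similarweb-scraper/run-sync-get-dataset-items"


def build_run_request(token: str, run_input: dict) -> urllib.request.Request:
    """Build a POST request that runs the Actor and waits for its dataset items."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{API_BASE}{ACTOR_PATH}?{query}",
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_run_request("<YOUR_API_TOKEN>", {"domains": ["amazon.com"], "maxItems": 1})
print(request.get_method(), request.full_url)

# To actually send it (requires a valid token and network access):
# with urllib.request.urlopen(request) as response:
#     items = json.load(response)
```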

## CLI example

```bash
echo '{
  "maxItems": 10,
  "domains": [
    "amazon.com",
    "google.com",
    "facebook.com",
    "youtube.com",
    "twitter.com",
    "instagram.com",
    "linkedin.com",
    "wikipedia.org",
    "reddit.com",
    "pinterest.com",
    "netflix.com",
    "microsoft.com",
    "apple.com",
    "ebay.com",
    "cnn.com",
    "bbc.com",
    "nytimes.com",
    "walmart.com",
    "target.com",
    "bestbuy.com",
    "costco.com",
    "homedepot.com",
    "lowes.com",
    "ikea.com",
    "nike.com",
    "adidas.com",
    "uber.com",
    "airbnb.com",
    "booking.com",
    "expedia.com",
    "spotify.com",
    "hulu.com",
    "disneyplus.com",
    "hbomax.com",
    "twitch.tv",
    "paypal.com",
    "stripe.com",
    "shopify.com",
    "wix.com",
    "squarespace.com",
    "wordpress.com",
    "medium.com",
    "github.com",
    "stackoverflow.com",
    "quora.com",
    "tumblr.com",
    "duolingo.com",
    "coursera.org",
    "udemy.com",
    "khanacademy.org"
  ],
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}' |
apify call parseforge/similarweb-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/similarweb-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Similarweb Scraper",
        "description": "Unlock powerful website insights in seconds. Analyze traffic volume, global and country rankings, audience engagement, and top markets to outsmart competitors, qualify leads, and spot growth opportunities. Make smarter business decisions with reliable, actionable data at your fingertips",
        "version": "1.0",
        "x-build-id": "UhyPvzd9P6Mcqt3le"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~similarweb-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-similarweb-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~similarweb-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-similarweb-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~similarweb-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-similarweb-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "domains"
                ],
                "properties": {
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Maximum number of domains to process. Free users are limited to 100."
                    },
                    "domains": {
                        "title": "Domains",
                        "type": "array",
                        "description": "List of domains to analyze (e.g., amazon.com, google.com). URLs will be cleaned automatically.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Proxy settings. Residential proxies are required and used by default."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "number",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "number",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
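
The `run-sync` operation and `inputSchema` in the definition above can be exercised without any client library. The following is a minimal sketch using only the Python standard library, not an official integration. It assumes the API base URL is `https://api.apify.com/v2`, which is not shown in this part of the definition. The helper names `build_input`, `build_request`, and `run_sync` are hypothetical and exist only for this illustration.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the public Apify API base URL (not shown in this part of the definition).
API_BASE = "https://api.apify.com/v2"
# Path taken from the OpenAPI definition above.
RUN_SYNC_PATH = "/acts/parseforge~similarweb-scraper/run-sync"


def build_input(domains, max_items=None):
    """Build a request body that follows inputSchema.

    `domains` is required and must be a list of strings.
    `maxItems` is optional and must be an integer from 1 to 1000000.
    """
    if not isinstance(domains, list) or not all(isinstance(d, str) for d in domains):
        raise ValueError("domains must be a list of strings")
    body = {"domains": domains}
    if max_items is not None:
        is_int = isinstance(max_items, int) and not isinstance(max_items, bool)
        if not is_int or not 1 <= max_items <= 1_000_000:
            raise ValueError("maxItems must be an integer between 1 and 1000000")
        body["maxItems"] = max_items
    return body


def build_request(token, body):
    """Prepare the POST request; the token goes in the query string, as the spec requires."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        f"{API_BASE}{RUN_SYNC_PATH}?{query}",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_sync(token, body, timeout=300):
    """Send the request and return the raw response bytes.

    The definition gives no schema for the 200 response of run-sync,
    so the payload is returned undecoded.
    """
    with urllib.request.urlopen(build_request(token, body), timeout=timeout) as resp:
        return resp.read()
```

Calling `run_sync("<YOUR_API_TOKEN>", build_input(["amazon.com"], max_items=1))` would send a real request and may incur charges under the pay-per-event pricing. Validating the body locally first catches schema violations, such as a `maxItems` of 0, before any request leaves the machine.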
