# Vimeo Video Scraper (`runtime/vimeo-video-scraper`) Actor

Vimeo Video Scraper extracts video titles and URLs from Vimeo pages. It automates scrolling, clicks "Load more," and collects data efficiently. Supports proxies and limits results via videoMax, ensuring structured JSON output.

- **URL**: https://apify.com/runtime/vimeo-video-scraper.md
- **Developed by:** [scraping automation](https://apify.com/runtime) (community)
- **Categories:** Videos, Social media, News
- **Stats:** 18 total users, 0 monthly users, 100.0% runs succeeded, 1 bookmarks
- **User rating**: No ratings yet

## Pricing

$25.00/month + usage

To use this Actor, you pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period.You also pay for the Apify platform usage, which gets cheaper the higher Apify subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#rental-actors

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Vimeo Video Scraper

This Apify Actor scrapes video data from Vimeo profile or channel pages using advanced anti-detection techniques.  
It automatically scrolls, clicks on "Load more" buttons, and extracts key information such as:

- **Video Title**
- **Video URL** 
- **Video ID**

### Features

- **Advanced Anti-Detection**: Implements sophisticated browser fingerprinting and stealth techniques to bypass Vimeo's bot detection
- **Aggressive Loading Strategy**: Uses multiple fallback strategies to handle navigation timeouts and ensure successful page loading
- **Comprehensive Video Extraction**: Employs 4 different strategies to find and extract video information
- **Smart Pagination**: Automatically clicks "Load more" buttons to retrieve additional videos up to `videoMax` limit
- **Duplicate Prevention**: Intelligently filters out duplicate videos and navigation links
- **Proxy Support**: Configurable proxy usage via Apify to avoid IP-based blocking
- **Robust Error Handling**: Continues scraping even when individual elements fail to load

### Input Parameters

| Parameter            | Type      | Description                                                      | Default Value                                                      |
|----------------------|-----------|------------------------------------------------------------------|----------------------------------------------------------------------|
| `url`               | string    | Vimeo profile or channel URL to scrape.                          | `"https://vimeo.com/wearepixelartworks"`                           |
| `videoMax`          | integer   | Maximum number of videos to extract.                             | `500`                                                              |
| `proxyConfiguration` | object    | Proxy configuration for scraping (Apify proxies recommended).   | `{ "useApifyProxy": true, "apifyProxyGroups": [] }`               |

### Output

The Actor generates a JSON dataset containing an array of video objects:

```json
[
  { 
    "title": "The Macallan: The Heart of the Spirit Pixel Artworks", 
    "url": "https://vimeo.com/999441939",
    "videoId": "999441939"
  },
  { 
    "title": "Star Wars Hunters Pixel Artworks", 
    "url": "https://vimeo.com/992725997",
    "videoId": "992725997"
  }
]
````

### How It Works

1. **Stealth Mode**: Applies comprehensive anti-detection measures including browser fingerprinting, header spoofing, and automation detection bypass
2. **Aggressive Loading**: Uses multiple fallback strategies (networkidle → domcontentloaded → forced continuation) to handle Vimeo's sophisticated blocking
3. **Multi-Strategy Extraction**: Combines link extraction, container analysis, text-based search, and image-based detection
4. **Intelligent Pagination**: Automatically detects and clicks "Load more" buttons while avoiding duplicates
5. **Content Validation**: Filters out navigation links, profile pages, and non-video content

### Anti-Detection Features

- **Browser Fingerprinting Override**: Masks webdriver properties, plugins, languages, and hardware characteristics
- **Realistic Headers**: Sets comprehensive HTTP headers including Sec-Ch-Ua, Sec-Fetch-\* patterns
- **Mouse Movement Simulation**: Adds random human-like interactions to avoid behavioral detection
- **Function Override**: Hides automation-related function properties and toString methods
- **Viewport Spoofing**: Sets realistic browser dimensions and viewport characteristics

### Performance

- **Default Timeout**: 120 seconds per request with intelligent fallback strategies
- **Load More Loops**: Up to 20 iterations to ensure maximum video collection
- **Duplicate Prevention**: Efficient filtering to avoid processing the same video multiple times
- **Memory Optimization**: Processes videos in batches to maintain performance

### Use Cases

- **Content Research**: Analyze video portfolios and content strategies
- **Competitive Intelligence**: Monitor competitor video content and publishing patterns
- **Data Collection**: Gather video metadata for analysis and research purposes
- **Content Discovery**: Find relevant videos across Vimeo channels and profiles

## Legal Disclaimer

This project is intended for educational and research purposes only. Please ensure you comply with Vimeo's terms of use and applicable web scraping regulations. The Actor implements anti-detection measures to bypass technical barriers but should be used responsibly and in accordance with applicable laws.

### Related Resources

- [YouTube Channel Scraper](https://apify.com/runtime/youtube-channel-scraper)
- [Youtube Shorts Scraper](https://apify.com/runtime/youtube-shorts-scraper)

# Actor input Schema

## `url` (type: `string`):

The URL of the Vimeo page to scrape.

## `videoMax` (type: `integer`):

Maximum number of videos to collect.

## `useApifyProxy` (type: `boolean`):

Whether to use Apify Proxy

## `proxyConfiguration` (type: `object`):

Configure the proxy settings for scraping (Use Apify proxies to avoid blocking).

## Actor input object example

```json
{
  "url": "https://vimeo.com/wearepixelartworks",
  "videoMax": 50,
  "useApifyProxy": true,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": []
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "useApifyProxy": true,
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": []
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("runtime/vimeo-video-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "useApifyProxy": True,
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": [],
    },
}

# Run the Actor and wait for it to finish
run = client.actor("runtime/vimeo-video-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "useApifyProxy": true,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": []
  }
}' |
apify call runtime/vimeo-video-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=runtime/vimeo-video-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Vimeo Video Scraper",
        "description": "Vimeo Video Scraper extracts video titles and URLs from Vimeo pages. It automates scrolling, clicks \"Load more,\" and collects data efficiently. Supports proxies and limits results via videoMax, ensuring structured JSON output.",
        "version": "0.5",
        "x-build-id": "FiYEQtYcc2HQ0gEBz"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/runtime~vimeo-video-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-runtime-vimeo-video-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/runtime~vimeo-video-scraper/runs": {
            "post": {
                "operationId": "runs-sync-runtime-vimeo-video-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/runtime~vimeo-video-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-runtime-vimeo-video-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "url"
                ],
                "properties": {
                    "url": {
                        "title": "Vimeo Page URL",
                        "type": "string",
                        "description": "The URL of the Vimeo page to scrape.",
                        "default": "https://vimeo.com/wearepixelartworks"
                    },
                    "videoMax": {
                        "title": "Maximum Videos",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of videos to collect.",
                        "default": 50
                    },
                    "useApifyProxy": {
                        "title": "Use Apify Proxy",
                        "type": "boolean",
                        "description": "Whether to use Apify Proxy",
                        "default": true
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Configure the proxy settings for scraping (Use Apify proxies to avoid blocking)."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
