# Expedia Review Scraper (`getdataforme/expedia-scraper`) Actor

Extract detailed hotel data from Expedia, including names, ratings, prices, and amenities. Perfect for travel agencies and researchers, this reliable scraper offers scalable performance, proxy anonymity, and JSON output for seamless integration and analysis.

- **URL**: https://apify.com/getdataforme/expedia-scraper.md
- **Developed by:** [GetDataForMe](https://apify.com/getdataforme) (community)
- **Categories:** Travel, Real estate
- **Stats:** 87 total users, 7 monthly users, 96.6% runs succeeded, 1 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Expedia Reviews Spider - Apify Actor

🕷️ **Automated web scraping actor for expedia**

### 📋 Overview

This Apify actor scrapes data from **expedia** using HTTP Request technology. 
It was automatically generated and synchronized from the central repository.

### 🔗 Repository Information

| Property | Value |
|----------|-------|
| **Repository Name** | `expedia_reviews_spider_apify_hrequest` |
| **Spider Name** | `expedia_reviews_spider` |
| **Target Domain** | expedia |
| **Technology Type** | HTTP Request |
| **Template Used** | `apify_hrequest_template` |
| **Source Path** | `central_repo/src/custom/expedia/hrequest` |
| **GitHub Repository** | [View Repository](https://github.com/getdataforme/expedia_reviews_spider_apify_hrequest) |

### 🏠 Parent Repository

This actor is automatically synchronized from the central repository:

- **Central Repository**: [`getdataforme/central_repo`](https://github.com/getdataforme/central_repo)
- **Source Spider File**: `central_repo/src/custom/expedia/hrequest/expedia_reviews_spider.py`
- **Synchronization**: Automatic via GitHub Actions
- **Last Sync**: 2025-08-01 11:04:31 UTC

### 🔄 Development Workflow

#### Making Changes
1. **DO NOT** edit files directly in this repository
2. Navigate to the [central repository](https://github.com/getdataforme/central_repo)
3. Edit the spider file at: `central_repo/src/custom/expedia/hrequest/expedia_reviews_spider.py`
4. Commit and push changes to the central repository
5. The GitHub Action will automatically sync changes to this repository

#### Sync Process
- Changes to spider files trigger automatic synchronization
- The actor configuration and input schema are regenerated
- This README is updated with the latest information

### 📁 Repository Structure

````

├── src/
│   ├── main.py          # Apify actor entry point
│   └── spiders/
│       └── expedia\_reviews\_spider.py    # Spider implementation
├── .actor/
│   ├── actor.json       # Actor configuration
│   └── input\_schema.json # Input parameters schema
├── requirements.txt     # Python dependencies
└── README.md           # This file

````

### 🚀 Usage

#### Local Development
```bash
## Clone the repository
git clone https://github.com/getdataforme/expedia_reviews_spider_apify_hrequest.git
cd expedia_reviews_spider_apify_hrequest

## Install dependencies
pip install -r requirements.txt

## Run locally (if applicable)
python src/main.py
````

#### Input Configuration

The actor accepts input parameters as defined in `.actor/input_schema.json`.
Common parameters may include:

- URLs to scrape
- Output format preferences
- Rate limiting settings
- Custom extraction parameters

### 🔗 Important Links

- 🏠 **[Central Repository](https://github.com/getdataforme/central_repo)** - Main development repository
- 📁 **[This Repository](https://github.com/getdataforme/expedia_reviews_spider_apify_hrequest)** - Generated Apify actor
- 🕷️ **[Source Spider](https://github.com/getdataforme/central_repo/blob/main/central_repo/src/custom/expedia/hrequest/expedia_reviews_spider.py)** - Original spider file
- 📚 **[Apify Documentation](https://docs.apify.com/)** - Apify platform documentation

### ⚠️ Important Notes

- **This repository is auto-generated** - Direct changes will be overwritten
- **All modifications** should be made in the central repository
- **Synchronization is automatic** - No manual intervention required
- **Issues and PRs** should be submitted to the central repository

***

*This README was automatically generated on 2025-08-01 11:04:31 UTC by the central repository sync process.*

### Apify Actor

This actor is published on [Apify](https://apify.com).

#### Actor Details

- **Actor Name**: expedia-reviews-spider-test
- **Organization**: See Apify Console
- **Console URL**: https://console.apify.com/organization/3JrVZU8vEdLyeD4hC/actors/tLNVHf65UtYsrB7uy#/builds/0.0.1

#### Deployment

This actor is automatically deployed to Apify when changes are pushed to the `main` branch via GitHub Actions.

To run this actor:

1. Visit the Apify Console link above
2. Configure the input parameters
3. Click "Start" to run the actor

# Actor input Schema

## `ItemLimit` (type: `integer`):

The item limit for the spider.

## `Sort` (type: `string`):

Choose how to sort the results

## `Urls` (type: `array`):

The urls for the spider.

## `proxyConfiguration` (type: `object`):

Specifies proxy servers that will be used by the scraper in order to hide its origin.

## Actor input object example

```json
{
  "ItemLimit": 10,
  "Sort": "urn:expediagroup:taxonomies:core:#bdb0015d-63cc-4f9f-be86-3f973867ba30",
  "Urls": [
    "https://www.expedia.com/New-York-Hotels-Hotel-Park-Ave-NYC.h17793410.Hotel-Information?chkin=2026-04-03&chkout=2026-04-05&x_pwa=1&rfrr=HSR&pwa_ts=1773392403106&referrerUrl=aHR0cHM6Ly93d3cuZXhwZWRpYS5jb20vSG90ZWwtU2VhcmNo&useRewards=false&rm1=a2&regionId=2621&destination=New+York%2C+New+York%2C+United+States+of+America&destType=MARKET&neighborhoodId=6360639&selected=17793410&latLong=40.712843%2C-74.005966&guestRating=40&sort=RECOMMENDED&top_dp=804&top_cur=USD&userIntent=&selectedRoomType=325620097&selectedRatePlan=408038219&categorySearch=any_option&searchId=d9431362-a964-496b-88f2-9579bafdddd1"
  ],
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "Sort": "urn:expediagroup:taxonomies:core:#bdb0015d-63cc-4f9f-be86-3f973867ba30",
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ]
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("getdataforme/expedia-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "Sort": "urn:expediagroup:taxonomies:core:#bdb0015d-63cc-4f9f-be86-3f973867ba30",
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}

# Run the Actor and wait for it to finish
run = client.actor("getdataforme/expedia-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "Sort": "urn:expediagroup:taxonomies:core:#bdb0015d-63cc-4f9f-be86-3f973867ba30",
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}' |
apify call getdataforme/expedia-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=getdataforme/expedia-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Expedia Review Scraper",
        "description": "Extract detailed hotel data from Expedia, including names, ratings, prices, and amenities. Perfect for travel agencies and researchers, this reliable scraper offers scalable performance, proxy anonymity, and JSON output for seamless integration and analysis.",
        "version": "0.0",
        "x-build-id": "561osQnSbNMcRZjv3"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/getdataforme~expedia-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-getdataforme-expedia-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/getdataforme~expedia-scraper/runs": {
            "post": {
                "operationId": "runs-sync-getdataforme-expedia-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/getdataforme~expedia-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-getdataforme-expedia-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "Urls"
                ],
                "properties": {
                    "ItemLimit": {
                        "title": "Item Limit",
                        "minimum": 1,
                        "type": "integer",
                        "description": "The item limit for the spider.",
                        "default": 10
                    },
                    "Sort": {
                        "title": "Sort By",
                        "enum": [
                            "urn:expediagroup:taxonomies:core:#e9f32feb-5946-4b19-a6f2-8206edc7a130",
                            "urn:expediagroup:taxonomies:core:#bdb0015d-63cc-4f9f-be86-3f973867ba30",
                            "urn:expediagroup:taxonomies:core:#ca5a18bf-0d90-42b9-b81a-6f652de32ed0",
                            "urn:expediagroup:taxonomies:core:#aa534828-5906-4b30-a16f-633958713d55"
                        ],
                        "type": "string",
                        "description": "Choose how to sort the results",
                        "default": "urn:expediagroup:taxonomies:core:#bdb0015d-63cc-4f9f-be86-3f973867ba30"
                    },
                    "Urls": {
                        "title": "Urls",
                        "minItems": 1,
                        "type": "array",
                        "description": "The urls for the spider.",
                        "default": [
                            "https://www.expedia.com/New-York-Hotels-Hotel-Park-Ave-NYC.h17793410.Hotel-Information?chkin=2026-04-03&chkout=2026-04-05&x_pwa=1&rfrr=HSR&pwa_ts=1773392403106&referrerUrl=aHR0cHM6Ly93d3cuZXhwZWRpYS5jb20vSG90ZWwtU2VhcmNo&useRewards=false&rm1=a2&regionId=2621&destination=New+York%2C+New+York%2C+United+States+of+America&destType=MARKET&neighborhoodId=6360639&selected=17793410&latLong=40.712843%2C-74.005966&guestRating=40&sort=RECOMMENDED&top_dp=804&top_cur=USD&userIntent=&selectedRoomType=325620097&selectedRatePlan=408038219&categorySearch=any_option&searchId=d9431362-a964-496b-88f2-9579bafdddd1"
                        ],
                        "items": {
                            "type": "string"
                        }
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Specifies proxy servers that will be used by the scraper in order to hide its origin.",
                        "default": {
                            "useApifyProxy": true,
                            "apifyProxyGroups": [
                                "RESIDENTIAL"
                            ]
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
