# Trip Scraper (`lhotanok/trip-scraper`) Actor

This Trip Scraper will extract data for different types of accommodation from Trip.com website.

- **URL**: https://apify.com/lhotanok/trip-scraper.md
- **Developed by:** [Kristýna Lhoťanová](https://apify.com/lhotanok) (community)
- **Categories:** Travel
- **Stats:** 46 total users, 0 monthly users, 100.0% runs succeeded, 2 bookmarks
- **User rating**: No ratings yet

## Pricing

$30.00/month + usage

To use this Actor, you pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period.You also pay for the Apify platform usage, which gets cheaper the higher Apify subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#rental-actors

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Trip Scraper

Our Trip Scraper will extract data for different types of accommodation from Trip.com website, a popular online travel agency with more than 400 million users. You're free to specify any number of search filters that Trip.com offers in their web interface such as check-in and check-out dates, property type, number of stars or rating, amenities and other filters. Then simply copy the search URL and pass it to the scraper. It will find all properties meeting your search conditions. You can also provide URLs of individual properties for skipping the search step.

Trip Scraper will supply you with data such as:

🏨 Hotel name

📷 Cover image

🌎 Address

📝 Detailed description

🛏 Available rooms

💰 Prices

⭐ Stars & rating

🏊‍♀️ Popular amenities

❓ Questions & answers

You can also adjust the amount of extracted data by switching available rooms and detailed room facilities extraction. If you don't need information on the individual rooms, unchecking `extractRooms` and `extractRoomFacilities` options will save you a lot of space in a dataset.

#### How much will it cost to scrape Trip.com?

Apify gives you $5 free usage credits every month on the Apify Free plan. When you provide direct property URLs as `startUrls`, you can get approximately 600 results within $5 limit. If you want Trip.com scraper to find properties for you based on search filters, it will be a bit more costly - you'll get about 350 results for $5. It is because the scraper has to simulate infinite scroll which adds some extra overhead.

If you need to scale your Trip.com data extraction, you should grab an [Apify subscription](https://apify.com/pricing). We recommend our $49/month Personal plan - you can get up to 6,000 results monthly. Or scrape 60,000 results for $499 with the Team plan, it's up to your needs.

#### Tips for using our Trip.com scraper

1️⃣ If your Apify subscription allows you to do so, increase memory to 8192 MB which will provide the scraper with 2 CPU cores. You'll get your results faster for comparable price.

2️⃣ You may get a different number of results from multiple runs. It is due to Trip.com policy so it can't be resolved by our scraper easily. Even when you browse searched properties manually using the web interface, Trip.com sometimes disables the infinite scroll sooner than you reach the promised number of total results.

3️⃣ If you don't need available rooms data or you're fine without detailed room facilities, switch their extraction off as mentioned above. You can not only save dataset space but also speed up the scraper a little bit.

#### Missing some data that Trip.com offers and our scraper doesn't?

If you'd like to get some extra data that our Trip.com scraper doesn't extract at the moment, let us know by opening a new issue. There's a potential to scrape guest reviews, photos uploaded by both property owners and users, check-in and check-out policy, places of interests nearby and so on. Don't hesitate to comment on what you would like us to add first.

# Actor input Schema

## `startUrls` (type: `array`):

Enter URLs of hotel search or detail pages to extract information on the individual hotels such as their address, facilities, available rooms, photos etc. Note that you can set arbitrary search filters on the Trip.com website such as rating, location, amenities... Search URLs built by Trip.com include those filters so when you provide the URLs to the scraper, your search filters will be respected.
## `maxHotels` (type: `integer`):

Set the maximum number of hotels you want to scrape. If you leave this field unset, the actor will extract all hotels for the provided URLs.
## `extractRoomFacilities` (type: `boolean`):

If checked, detailed facilities of base & sale rooms will be included in the output.
## `extractRooms` (type: `boolean`):

If checked, hotel rooms will be extracted and included in the output. If you don't need information on the individual rooms, uncheck this option as it can speed up the actor significantly.
## `proxyConfiguration` (type: `object`):

Use either automatic Apify proxies or your own.
## `debugMode` (type: `boolean`):

If checked, debug messages will be included in the log.

## Actor input object example

```json
{
  "startUrls": [
    "https://www.trip.com/hotels/list?city=633&cityName=New%20York&provinceId=0&countryId=66&districtId=0&checkin=2023%2F10%2F19&checkout=2023%2F10%2F21&barCurr=USD&searchType=CT&searchWord=New%20York&crn=1&adult=2&children=0&searchBoxArg=t&travelPurpose=0&ctm_ref=ix_sb_dl&domestic=true&listFilters=17%7C1*17*1*2%2C80%7C0%7C1*80*0*2%2C29%7C1*29*1%7C2*2"
  ],
  "maxHotels": 5,
  "extractRoomFacilities": true,
  "extractRooms": true,
  "proxyConfiguration": {
    "useApifyProxy": true
  },
  "debugMode": false
}
````

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        "https://www.trip.com/hotels/list?city=633&cityName=New%20York&provinceId=0&countryId=66&districtId=0&checkin=2023%2F10%2F19&checkout=2023%2F10%2F21&barCurr=USD&searchType=CT&searchWord=New%20York&crn=1&adult=2&children=0&searchBoxArg=t&travelPurpose=0&ctm_ref=ix_sb_dl&domestic=true&listFilters=17%7C1*17*1*2%2C80%7C0%7C1*80*0*2%2C29%7C1*29*1%7C2*2"
    ],
    "maxHotels": 5,
    "proxyConfiguration": {
        "useApifyProxy": true
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("lhotanok/trip-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrls": ["https://www.trip.com/hotels/list?city=633&cityName=New%20York&provinceId=0&countryId=66&districtId=0&checkin=2023%2F10%2F19&checkout=2023%2F10%2F21&barCurr=USD&searchType=CT&searchWord=New%20York&crn=1&adult=2&children=0&searchBoxArg=t&travelPurpose=0&ctm_ref=ix_sb_dl&domestic=true&listFilters=17%7C1*17*1*2%2C80%7C0%7C1*80*0*2%2C29%7C1*29*1%7C2*2"],
    "maxHotels": 5,
    "proxyConfiguration": { "useApifyProxy": True },
}

# Run the Actor and wait for it to finish
run = client.actor("lhotanok/trip-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "startUrls": [
    "https://www.trip.com/hotels/list?city=633&cityName=New%20York&provinceId=0&countryId=66&districtId=0&checkin=2023%2F10%2F19&checkout=2023%2F10%2F21&barCurr=USD&searchType=CT&searchWord=New%20York&crn=1&adult=2&children=0&searchBoxArg=t&travelPurpose=0&ctm_ref=ix_sb_dl&domestic=true&listFilters=17%7C1*17*1*2%2C80%7C0%7C1*80*0*2%2C29%7C1*29*1%7C2*2"
  ],
  "maxHotels": 5,
  "proxyConfiguration": {
    "useApifyProxy": true
  }
}' |
apify call lhotanok/trip-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=lhotanok/trip-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Trip Scraper",
        "description": "This Trip Scraper will extract data for different types of accommodation from Trip.com website.",
        "version": "0.0",
        "x-build-id": "dh4uncKq789OqApfh"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/lhotanok~trip-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-lhotanok-trip-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/lhotanok~trip-scraper/runs": {
            "post": {
                "operationId": "runs-sync-lhotanok-trip-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/lhotanok~trip-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-lhotanok-trip-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "startUrls",
                    "extractRoomFacilities",
                    "extractRooms",
                    "proxyConfiguration",
                    "debugMode"
                ],
                "properties": {
                    "startUrls": {
                        "title": "Start URLs",
                        "uniqueItems": true,
                        "type": "array",
                        "description": "Enter URLs of hotel search or detail pages to extract information on the individual hotels such as their address, facilities, available rooms, photos etc. Note that you can set arbitrary search filters on the Trip.com website such as rating, location, amenities... Search URLs built by Trip.com include those filters so when you provide the URLs to the scraper, your search filters will be respected.",
                        "default": [],
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxHotels": {
                        "title": "Max hotels",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Set the maximum number of hotels you want to scrape. If you leave this field unset, the actor will extract all hotels for the provided URLs."
                    },
                    "extractRoomFacilities": {
                        "title": "Extract detailed room facilities",
                        "type": "boolean",
                        "description": "If checked, detailed facilities of base & sale rooms will be included in the output.",
                        "default": true
                    },
                    "extractRooms": {
                        "title": "Extract available rooms",
                        "type": "boolean",
                        "description": "If checked, hotel rooms will be extracted and included in the output. If you don't need information on the individual rooms, uncheck this option as it can speed up the actor significantly.",
                        "default": true
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Use either automatic Apify proxies or your own.",
                        "default": {
                            "useApifyProxy": true
                        }
                    },
                    "debugMode": {
                        "title": "Debug mode",
                        "type": "boolean",
                        "description": "If checked, debug messages will be included in the log.",
                        "default": false
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
