GetYourGuide Review Scraper avatar

GetYourGuide Review Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
GetYourGuide Review Scraper

GetYourGuide Review Scraper

The GetYourGuide Review Scraper is a powerful and easy-to-use actor designed to collect structured review data from GetYourGuide product pages. Whether you’re analyzing customer sentiment, monitoring competitor feedback, or enriching your travel content.

Pricing

from $3.00 / 1,000 results

Rating

5.0

(3)

Developer

Louvre LLC

Louvre LLC

Maintained by Community

Actor stats

1

Bookmarked

88

Total users

21

Monthly active users

9.7 hours

Issues response

a day ago

Last modified

Share

🧳🌍 GetYourGuide Review Scraper

📌 Introduction

GetYourGuide Review Scraper is a powerful and easy-to-use Apify Actor that collects structured review data from GetYourGuide product pages. It allows you to extract ratings, review text, author information, photos, and other metadata — ideal for sentiment analysis, competitor feedback monitoring, travel content enrichment, and research workflows.


💡 Free vs paid plans

On a free Apify plan, this actor returns a sample of the reviews. Upgrade to any paid Apify plan to get the complete results.


🎯 What is this Review Scraper?

This Actor automatically scrapes customer reviews from a specified GetYourGuide product URL and outputs them as structured JSON. It supports flexible pagination, reliable de-duplication, and produces data that's ready for analysis or integration into your systems.


🔍 Features

  • Comprehensive Review Data — Extracts ratings, review text, author name & location, photos, and helpful votes.
  • 🔁 Flexible Pagination — Supports custom starting page and page size.
  • 🔀 Multiple Sorting Options — Reviews can be processed in different orders (date, rating, etc.).
  • 🚀 High Performance — Efficiently handles scraping of large volumes of reviews.
  • 🧹 Reliable Output — Includes duplicate detection and data validation.
  • 🔌 Easy Integration — Simple input parameters and structured output.

🧰 Input Parameters

ParameterTypeRequiredDescription
product_urlStringThe GetYourGuide product page URL to scrape reviews from.
page_sizeNumberMaximum number of reviews per page (default: 100).
pageNumberStart page number; will skip (page – 1) * page_size reviews from the beginning (default: 1).

📥 Example Input

{
"product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879",
"page_size": 100,
"page": 1
}

This configuration will scrape up to 100 reviews from the first page of the specified GetYourGuide product.


🗃️ Output Data

Each review returned by this Actor includes the following structured fields:

FieldTypeDescription
idStringUnique review identifier.
review_idStringOriginal GetYourGuide review ID.
product_idStringGetYourGuide product identifier.
product_urlStringProduct URL used for scraping this review.
ratingNumberReview rating (1–5 stars).
titleStringReview title.
contentStringFull review text.
author_nameStringReviewer's name.
user_identifier_valueStringAuthor user identifier.
review_dateStringDate when the review was published (ISO 8601 format).
helpful_votesNumberNumber of helpful votes.
photosArrayArray of photo URLs (if any).
owner_responseStringBusiness owner response text (if any).
owner_response_authorStringResponse author name (if any).
owner_response_dateStringResponse publication date (ISO 8601 format, if any).
sourceStringData source identifier (always "gyg").
languageStringReview language code.
overall_star_ratingNumberOverall product star rating (e.g. 4.5 out of 5). -1 if unavailable.
overall_review_countNumberTotal number of reviews for the product. -1 if unavailable.

🧠 How It Works

  1. Input Validation — Validates the provided GetYourGuide product URL.
  2. Review Extraction — Scrapes reviews from the product page using advanced web scraping logic.
  3. Data Processing — Cleans and processes the extracted review data.
  4. Duplicate Detection — Identifies and handles duplicates.
  5. Data Storage — Saves the structured data to the Apify dataset for retrieval.

⚙️ Usage Examples

📦 Basic Usage

{
"product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879"
}

📈 Advanced Usage (Pagination)

{
"product_url": "https://www.getyourguide.com/uss-midway-museum-l4441/the-uss-midway-museum-t12879",
"page_size": 200,
"page": 2
}

💰 Pricing & Limits

This Actor costs $1.00 per 1,000 results with a free trial available.


🚀 Deploying to Apify

To push this Actor to the Apify platform, use the Apify CLI:

Prerequisites

  1. Install Apify CLI (if not already installed):
$npm install -g apify-cli
  1. Login to Apify:
$apify login

Push Command

Navigate to the actor directory and push:

cd gyg-review-scraper
apify push

This will:

  • Build the Actor code
  • Upload it to your Apify account
  • Make it available in the Apify platform

Push and Set as Latest Version

To push and automatically set the version as latest:

$apify push -b latest --force

This command will:

  • Build and push the Actor
  • Set the pushed version as the latest version
  • Make it the default version when running the Actor

Alternative: Push with Specific Actor ID

If you want to push to a specific Actor ID:

$apify push --actor-id <YOUR_ACTOR_ID>

Or push with Actor ID and set as latest:

$apify push --actor-id <YOUR_ACTOR_ID> -b latest --force

Build and Push Separately

You can also build and push separately:

# Build the Actor
npm run build
# Push to Apify
apify push

Modifying the Version

To change the Actor version before pushing:

  1. Edit the version in .actor/actor.json:
{
"version": "1.3"
}

Note: Apify requires version format to be MAJOR.MINOR (e.g., 1.3), where MAJOR and MINOR are numbers from 0 to 99. Three-part versions like 1.3.1 are not supported.

  1. Push with the new version:
$apify push

Or push with a specific version directly:

$apify push --version 1.3

Note: The version in actor.json will be used by default. If you specify --version latest, it will set the pushed version as latest regardless of the version in actor.json.

📡 API / CLI Integration

You can run this Actor using the Apify API or CLI. Example with curl:

curl "https://api.apify.com/v2/acts/louvre~gyg-review-scraper/runs?token=<YOUR_API_TOKEN>" \
-X POST \
-H "Content-Type: application/json" \
-d @input.json

Replace <YOUR_API_TOKEN> with your Apify token and input.json with your JSON input containing the fields above.