# Glassdoor Scraper (`parseforge/glassdoor-scraper`) Actor

Scrape Glassdoor company data: employee reviews with ratings and pros/cons, salary estimates by job title, interview experiences, job listings with pay ranges, office locations, benefits, culture ratings, and company overviews. Supports all 18 international domains. No login required.

- **URL**: https://apify.com/parseforge/glassdoor-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Automation, Jobs, Other
- **Stats:** 14 total users, 4 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per event

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 💼 Glassdoor Scraper

> 🚀 **Pull Glassdoor employee reviews, salaries, and ratings by company.** Export structured records for any Glassdoor employer across **18 international domains** in minutes. No API key, no login, no manual copy-paste.

> 🕒 **Last updated:** 2026-04-21 · **📊 40 fields** per review · **⭐ Reviews & salaries** · **🏢 Company ratings** · **🌍 Multi-locale** · **🚫 No auth** required

The **Glassdoor Scraper** collects employee reviews, salary reports, interview experiences, jobs, benefits, office locations, and culture-and-diversity data from any Glassdoor company page. Point it at a Glassdoor employer URL (for example `https://www.glassdoor.com/Reviews/Google-Reviews-E9079.htm`) or search by company name, and get back clean structured JSON.

The scraper understands every regional Glassdoor domain (`.com`, `.co.uk`, `.fr`, `.de`, `.es`, `.nl`, `.com.br`, `.ca`, `.com.au`, and more), supports date filters and multi-language review pulls, and works across any public employer profile. Output works directly in spreadsheets, BI tools, or your data warehouse.

| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| HR analytics teams, recruiters, talent acquisition, competitive-intelligence analysts, investors, journalists, data scientists | Employer brand tracking, salary benchmarking, sentiment analysis, candidate research, M&A due diligence, competitive hiring intel |


---

### 📋 What the Glassdoor Scraper does

Eight structured collection modes in a single run:

- 📝 **Reviews.** Overall rating, CEO rating, pros, cons, advice, work-life balance, culture, diversity, and employer responses.
- 💰 **Salaries.** Base pay, total pay, bonuses, stock, profit sharing, and tips by job title and currency.
- 🎤 **Interviews.** Questions, outcomes, difficulty, experience, offer status, and source per report.
- 💼 **Jobs.** Open roles with title, location, pay range, easy-apply flag, and optional full job description.
- 🧭 **Overview.** Headquarters, company size, industry, revenue, founded year, and website.
- 📍 **Locations.** Office addresses across cities and countries.
- 🎁 **Benefits.** Benefit categories, ratings, comments, and employee reviews.
- 🌈 **Culture & Diversity.** Demographic distributions, diversity programs, and culture ratings.

Pick a single mode with the `command` field (`reviews`, `salaries`, `interviews`, `jobs`, `overview`, `locations`, `benefits`, `cultureDiversity`) or pass `"all"` to sweep everything in one run. Date filters (`reviewsStartDate`, `interviewsStartDate`, `benefitsStartDate`) restrict output to recent records only.

> 💡 **Why it matters:** Glassdoor hosts millions of firsthand employee reports across tens of thousands of companies. That corpus is one of the most cited sources for compensation benchmarks, employer-brand sentiment, and hiring competitive intelligence. Building your own collector means handling pagination, language variants, and fragile HTML. This Actor returns clean JSON you can load straight into analysis.


---

### 🎬 Full Demo

_🚧 Coming soon: a 3-minute walkthrough from sign-up to a downloaded dataset._


---

### ⚙️ Input

<table>
<thead>
<tr><th>Input</th><th>Type</th><th>Default</th><th>Behavior</th></tr>
</thead>
<tbody>
<tr><td>startUrls</td><td>array</td><td>[Google]</td><td>Glassdoor company URLs. Any international domain supported.</td></tr>
<tr><td>maxItems</td><td>integer</td><td>10</td><td>Records to return per run. Free plan caps at 100, paid plan at 1,000,000.</td></tr>
<tr><td>companyName</td><td>string</td><td>""</td><td>Alternative to startUrls. Search by employer name (e.g. Google).</td></tr>
<tr><td>command</td><td>string</td><td>"all"</td><td>One of reviews, salaries, interviews, jobs, overview, locations, benefits, cultureDiversity, or all.</td></tr>
<tr><td>sortReviewsBy</td><td>string</td><td>"RELEVANCE"</td><td>Review sort order. Use DATE for newest-first.</td></tr>
<tr><td>includeAllReviews</td><td>boolean</td><td>false</td><td>Include reviews in every language instead of the domain default.</td></tr>
<tr><td>includeAllSalaries</td><td>boolean</td><td>false</td><td>Include salaries across every supported language.</td></tr>
<tr><td>includeCompanyReviewStats</td><td>boolean</td><td>false</td><td>Attach company-level review aggregates (overall rating, CEO approval, distribution).</td></tr>
<tr><td>includeCompanyInterviewStats</td><td>boolean</td><td>false</td><td>Attach interview aggregates (difficulty, experience counts).</td></tr>
<tr><td>includeCompanyBenefitsStats</td><td>boolean</td><td>false</td><td>Attach benefits overview and category statistics.</td></tr>
<tr><td>includeJobDetails</td><td>boolean</td><td>false</td><td>Fetch full job description from each posting page. Slower but richer.</td></tr>
<tr><td>reviewsStartDate</td><td>string</td><td>null</td><td>ISO date (YYYY-MM-DD). Only returns reviews on or after this date.</td></tr>
<tr><td>interviewsStartDate</td><td>string</td><td>null</td><td>ISO date floor for interview records.</td></tr>
<tr><td>benefitsStartDate</td><td>string</td><td>null</td><td>ISO date floor for benefit review records.</td></tr>
<tr><td>proxyConfiguration</td><td>object</td><td>RESIDENTIAL</td><td>Residential Apify proxy. Required for Glassdoor.</td></tr>
</tbody>
</table>

**Example: 100 most recent Google reviews, newest first.**

```json
{
    "startUrls": [
        { "url": "https://www.glassdoor.com/Reviews/Google-Reviews-E9079.htm" }
    ],
    "maxItems": 100,
    "command": "reviews",
    "sortReviewsBy": "DATE"
}
````

**Example: everything Glassdoor has on Apple, stats included.**

```json
{
    "startUrls": [
        { "url": "https://www.glassdoor.com/Overview/Working-at-Apple-EI_IE1138.htm" }
    ],
    "maxItems": 500,
    "command": "all",
    "includeCompanyReviewStats": true,
    "includeCompanyInterviewStats": true
}
```

**Example: salary reports for SAP on the German domain.**

```json
{
    "startUrls": [
        { "url": "https://www.glassdoor.de/Gehalt/SAP-Gehalter-E10471.htm" }
    ],
    "maxItems": 200,
    "command": "salaries"
}
```

> ⚠️ **Good to Know:** results reflect what is publicly visible on the matching Glassdoor regional domain. Review counts, salary samples, and translated content vary by country and by the reviewer's chosen language. When you need full coverage, enable `includeAllReviews` and `includeAllSalaries`.

***

### 📊 Output

Each record carries a `dataType` discriminator (`review`, `salary`, `interview`, `job`, `overview`, `location`, `benefit`, `cultureDiversity`) so you can split the dataset downstream. Download as CSV, Excel, JSON, or XML.

#### 🧾 Review schema (40 fields)

| Field | Type | Example |
|---|---|---|
| 🏢 `companyName` | string | `"Google"` |
| 🆔 `companyId` | number | `9079` |
| 🔗 `companyUrl` | string | `"https://www.glassdoor.com/Reviews/Google-Reviews-E9079.htm"` |
| 🏷️ `dataType` | string | `"review"` |
| 🆔 `reviewId` | number | `81234567` |
| 📅 `reviewDateTime` | ISO 8601 | `"2026-04-10T12:34:00.000Z"` |
| ⭐ `ratingOverall` | number | `4` |
| 👔 `ratingCeo` | string | null | `"APPROVE"` |
| 📈 `ratingBusinessOutlook` | string | null | `"POSITIVE"` |
| ⚖️ `ratingWorkLifeBalance` | number | `4` |
| 🎭 `ratingCultureAndValues` | number | `5` |
| 🌈 `ratingDiversityAndInclusion` | number | `4` |
| 🚀 `ratingCareerOpportunities` | number | `4` |
| 🧭 `ratingSeniorLeadership` | number | `3` |
| 💵 `ratingCompensationAndBenefits` | number | `5` |
| 🤝 `ratingRecommendToFriend` | string | null | `"POSITIVE"` |
| 📌 `isCurrentJob` | boolean | `true` |
| ⏳ `lengthOfEmployment` | number | `3` |
| 💼 `employmentStatus` | string | `"REGULAR"` |
| ⭐ `featured` | boolean | `false` |
| 🌐 `languageId` | string | `"eng"` |
| 🌐 `originalLanguageId` | string | `"eng"` |
| 🔁 `translationMethod` | string | null | `null` |
| 👍 `countHelpful` | number | `12` |
| 👎 `countNotHelpful` | number | `1` |
| 💼 `jobTitle` | object | `{ "id": 1, "text": "Software Engineer" }` |
| 📍 `location` | object | null | `{ "id": 11527, "name": "Mountain View, CA", "type": "CITY" }` |
| ✅ `pros` | string | `"Strong engineering culture, benefits..."` |
| ✅ `prosOriginal` | string | `"Strong engineering culture, benefits..."` |
| ❌ `cons` | string | `"Large-org politics at times."` |
| ❌ `consOriginal` | string | `"Large-org politics at times."` |
| 💡 `advice` | string | null | `"Keep protecting 20% time."` |
| 💡 `adviceOriginal` | string | null | `"Keep protecting 20% time."` |
| 💬 `employerResponses` | array | `[]` |
| 📊 `companyReviewStats` | object | null | see sample (when enabled) |
| 🕒 `scrapedAt` | ISO 8601 | `"2026-04-21T08:00:00.000Z"` |
| ❗ `error` | string | null | `null` |

#### 🧾 Salary schema (key fields)

| Field | Type | Example |
|---|---|---|
| 🏢 `companyName` | string | `"Google"` |
| 🏷️ `dataType` | string | `"salary"` |
| 💼 `jobTitle` | object | `{ "id": 99, "text": "Software Engineer" }` |
| 📊 `salaryCount` | number | `8523` |
| 💱 `currency` | object | `{ "code": "USD" }` |
| 💵 `basePayStatistics` | object | `{ "percentiles": [{ "ident": "P50", "value": 185000 }] }` |
| 💰 `totalPayStatistics` | object | includes P10, P25, P50, P75, P90 |
| 🎁 `totalAdditionalPayStatistics` | object | bonus, stock, profit-sharing roll-ups |
| 💵 `cashBonusStatistics` | object | cash bonus percentiles |
| 📈 `stockBonusStatistics` | object | stock percentiles |
| 🕒 `scrapedAt` | ISO 8601 | `"2026-04-21T08:00:00.000Z"` |

Jobs, interviews, benefits, culture, locations, and overview modes expose their own structured payloads with the same shape discipline (see sample records below).

#### 📦 Sample records

<details>
<summary><strong>📝 Employee review: Google software engineer</strong></summary>

```json
{
    "dataType": "review",
    "companyName": "Google",
    "companyId": 9079,
    "companyUrl": "https://www.glassdoor.com/Reviews/Google-Reviews-E9079.htm",
    "reviewId": 81234567,
    "reviewDateTime": "2026-04-10T12:34:00.000Z",
    "ratingOverall": 4,
    "ratingCeo": "APPROVE",
    "ratingBusinessOutlook": "POSITIVE",
    "ratingWorkLifeBalance": 4,
    "ratingCultureAndValues": 5,
    "ratingDiversityAndInclusion": 4,
    "ratingCareerOpportunities": 4,
    "ratingSeniorLeadership": 3,
    "ratingCompensationAndBenefits": 5,
    "ratingRecommendToFriend": "POSITIVE",
    "isCurrentJob": true,
    "lengthOfEmployment": 3,
    "employmentStatus": "REGULAR",
    "countHelpful": 12,
    "countNotHelpful": 1,
    "jobTitle": { "id": 1, "text": "Software Engineer" },
    "location": { "id": 11527, "name": "Mountain View, CA", "type": "CITY" },
    "pros": "Strong engineering culture, excellent benefits, brilliant peers.",
    "cons": "Large-org politics can slow down projects.",
    "advice": "Keep protecting the 20% time.",
    "employerResponses": [],
    "scrapedAt": "2026-04-21T08:00:00.000Z"
}
```

</details>

<details>
<summary><strong>💰 Salary estimate: Google software engineer</strong></summary>

```json
{
    "dataType": "salary",
    "companyName": "Google",
    "companyId": 9079,
    "jobTitle": { "id": 99, "text": "Software Engineer" },
    "salaryCount": 8523,
    "currency": { "code": "USD" },
    "basePayStatistics": {
        "percentiles": [
            { "ident": "P10", "value": 135000 },
            { "ident": "P50", "value": 185000 },
            { "ident": "P90", "value": 260000 }
        ]
    },
    "totalPayStatistics": {
        "percentiles": [
            { "ident": "P10", "value": 165000 },
            { "ident": "P50", "value": 245000 },
            { "ident": "P90", "value": 395000 }
        ]
    },
    "scrapedAt": "2026-04-21T08:00:00.000Z"
}
```

</details>

<details>
<summary><strong>🏢 Company overview: Apple</strong></summary>

```json
{
    "dataType": "overview",
    "companyName": "Apple",
    "companyId": 1138,
    "headquarters": "Cupertino, CA",
    "size": "10000+ Employees",
    "founded": 1976,
    "industryName": "Computer Hardware Development",
    "revenue": "$10+ billion (USD)",
    "website": "www.apple.com",
    "shortDescription": "Apple designs, manufactures, and markets consumer electronics...",
    "rating": 4.1,
    "totalRatings": 36214,
    "scrapedAt": "2026-04-21T08:00:00.000Z"
}
```

</details>

***

### ✨ Why choose this Actor

| | Capability |
|---|---|
| 🌍 | **18 domains supported.** `.com`, `.co.uk`, `.de`, `.fr`, `.es`, `.nl`, `.it`, `.com.br`, `.ca`, `.com.au`, and more. Scrape any international Glassdoor profile. |
| 🔄 | **Eight data modes in one Actor.** Reviews, salaries, interviews, jobs, overview, locations, benefits, culture. Use `command: "all"` to collect everything. |
| 📅 | **Date filters.** Pull only reviews, interviews, or benefits on or after a chosen date for fresh-trend analysis. |
| 🌐 | **Multi-language coverage.** Toggle `includeAllReviews` or `includeAllSalaries` to pull every supported language per profile. |
| 📊 | **Structured JSON.** Every record has a stable schema with typed ratings, employer metadata, and timestamps. |
| 🔎 | **Search by name or URL.** Paste a Glassdoor URL or pass `companyName` and the Actor resolves the employer automatically. |
| 🚫 | **No authentication.** Works with public employer profiles. No login, no API key. |

> 📊 Glassdoor is one of the most quoted sources for employer brand, compensation benchmarks, and candidate sentiment. Clean structured exports enable analytics that a manual review read-through never can.

***

### 📈 How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Structured output | Setup |
|---|---|---|---|---|---|
| **⭐ Glassdoor Scraper** *(this Actor)* | $5 free credit, then pay-per-use | All 18 Glassdoor domains | **Live per run** | 8 data modes, typed fields | ⚡ 2 min |
| Official Glassdoor API | Not available to most use cases | Limited endpoints | Rate-limited | Partial | 🐢 Approval-gated |
| Manual copy-paste | Free | Tiny sample | Whenever you do it | ❌ Unstructured | 🕒 Hours per company |
| Generic web crawlers | Varies | Hit-or-miss | Ad hoc | Raw HTML | ⏳ Custom engineering |

Pick this Actor when you want broad multi-domain coverage, structured fields, and zero scraping engineering on your side.

***

### 🚀 How to use

1. 📝 **Sign up.** [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp) (takes 2 minutes).
2. 🌐 **Open the Actor.** Go to the Glassdoor Scraper page on the Apify Store.
3. 🎯 **Set input.** Paste a company URL (or company name), pick a `command`, set `maxItems`.
4. 🚀 **Run it.** Click **Start** and let the Actor collect your data.
5. 📥 **Download.** Grab your results in the **Dataset** tab as CSV, Excel, JSON, or XML.

> ⏱️ Total time from signup to downloaded dataset: **3-5 minutes.** No coding required.

***

### 💼 Business use cases

<table>
<tr>
<td width="50%" valign="top">

#### 🧑‍💼 Talent & Recruiting

- Employer-brand tracking per competitor
- Candidate sentiment dashboards
- Attrition-signal monitoring (review-volume spikes)
- Interview-difficulty benchmarks per role

</td>
<td width="50%" valign="top">

#### 💰 Compensation Analytics

- Salary benchmarking by title and region
- Total-comp breakdowns (base, bonus, stock)
- Pay-equity studies with currency normalization
- Country-by-country comp ranges

</td>
</tr>
<tr>
<td width="50%" valign="top">

#### 📈 Investors & Analysts

- Employee sentiment as a leading indicator
- CEO approval trend tracking
- Diversity-metric comparisons across peers
- Culture pulse for M\&A due diligence

</td>
<td width="50%" valign="top">

#### 📰 Researchers & Journalists

- Large-N review corpora for NLP studies
- Industry-wide culture comparisons
- Longitudinal tracking of employer scores
- Story research on labor trends

</td>
</tr>
</table>

***

***

### 🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

<table>
<tr>
<td width="50%">

#### 🎓 Research and academia

- Empirical datasets for papers, thesis work, and coursework
- Longitudinal studies tracking changes across snapshots
- Reproducible research with cited, versioned data pulls
- Classroom exercises on data analysis and ethical scraping

</td>
<td width="50%">

#### 🎨 Personal and creative

- Side projects, portfolio demos, and indie app launches
- Data visualizations, dashboards, and infographics
- Content research for bloggers, YouTubers, and podcasters
- Hobbyist collections and personal trackers

</td>
</tr>
<tr>
<td width="50%">

#### 🤝 Non-profit and civic

- Transparency reporting and accountability projects
- Advocacy campaigns backed by public-interest data
- Community-run databases for local issues
- Investigative journalism on public records

</td>
<td width="50%">

#### 🧪 Experimentation

- Prototype AI and machine-learning pipelines with real data
- Validate product-market hypotheses before engineering spend
- Train small domain-specific models on niche corpora
- Test dashboard concepts with live input

</td>
</tr>
</table>

### ❓ Frequently Asked Questions

<details>
<summary><b>💳 Do I need a paid Apify plan to run this actor?</b></summary>

No. You can start right now on the free Apify plan, which includes **$5 in free monthly credit**. That is enough to run this actor several times and explore the output before committing to anything. Paid plans unlock higher limits, more concurrent runs, and larger datasets. [Create a free Apify account here](https://console.apify.com/sign-up?fpr=vmoqkp) to get started.

</details>

<details>
<summary><b>🚨 What happens if my run fails or returns no results?</b></summary>

Failed runs are not charged. If the source site changes, proxies get rate-limited, or a specific input matches nothing, re-run the actor or open our [contact form](https://tally.so/r/BzdKgA) and we will investigate. You can also check the run log in the Apify console to see why the run stopped.

</details>

<details>
<summary><b>📏 How many items can I scrape per run?</b></summary>

Free users are limited to **10 items per run** so you can preview the output and confirm the actor works for your use case. Paid users can raise `maxItems` up to **1,000,000** per run. [Upgrade here](https://console.apify.com/sign-up?fpr=vmoqkp) if you need full scale.

</details>

<details>
<summary><b>🕒 How fresh is the data?</b></summary>

Every run fetches live data at the moment of execution. There is no cache or delay: the records you get reflect what the source returned at that moment. Schedule the actor to maintain a rolling snapshot of the data you need.

</details>

<details>
<summary><b>🧑‍💻 Can I call this actor from my own code?</b></summary>

Yes. Apify exposes every actor as a REST endpoint and ships first-class SDKs for [Node.js](https://docs.apify.com/sdk/js) and [Python](https://docs.apify.com/sdk/python). You can start a run, read the dataset, and handle webhooks from your own app in a few lines. All you need is your Apify API token.

</details>

<details>
<summary><b>📤 How do I export the data?</b></summary>

Every Apify dataset can be downloaded in one click from the console as CSV, JSON, JSONL, Excel, HTML, XML, or RSS. You can also pull results programmatically via the [Apify API](https://docs.apify.com/api/v2) or stream them into BigQuery, S3, and other destinations through built-in integrations.

</details>

<details>
<summary><b>📅 Can I schedule the actor to run automatically?</b></summary>

Yes. Use the Apify scheduler to run the actor on any cadence, from hourly to monthly. Results are saved to your dataset and can be delivered to webhooks, email, Slack, cloud storage, or automation tools such as Zapier and Make.

</details>

***

### 🔌 Automating Glassdoor Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

- 🟢 **Node.js.** Install the `apify-client` NPM package.
- 🐍 **Python.** Use the `apify-client` PyPI package.
- 📚 See the [Apify API documentation](https://docs.apify.com/api/v2) for full details.

The [Apify Schedules feature](https://docs.apify.com/platform/schedules) lets you trigger this Actor on any cron interval. Hourly, daily, or weekly refreshes keep downstream sentiment and comp dashboards in sync automatically.

### 🔌 Integrate with any app

Glassdoor Scraper connects to any cloud service via [Apify integrations](https://apify.com/integrations):

- [**Make**](https://docs.apify.com/platform/integrations/make) - Automate multi-step workflows
- [**Zapier**](https://docs.apify.com/platform/integrations/zapier) - Connect with 5,000+ apps
- [**Slack**](https://docs.apify.com/platform/integrations/slack) - Get run notifications in your channels
- [**Airbyte**](https://docs.apify.com/platform/integrations/airbyte) - Pipe review and salary data into your warehouse
- [**GitHub**](https://docs.apify.com/platform/integrations/github) - Trigger runs from commits and releases
- [**Google Drive**](https://docs.apify.com/platform/integrations/drive) - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh review or salary data into your product backend, or alert your analytics team in Slack.

***

### 🔗 Recommended Actors

- [**💼 Glassdoor Jobs Scraper**](https://apify.com/parseforge/glassdoor-jobs-scraper) - Job postings from Glassdoor with salary, location, and JSON-LD schema
- [**🔍 Indeed Scraper**](https://apify.com/parseforge/indeed-scraper) - Job listings and company pages from Indeed
- [**🏢 LinkedIn Company Profiles Scraper**](https://apify.com/parseforge/linkedin-company-profiles-scraper) - Company size, industry, specialties, and follower counts
- [**💼 ZipRecruiter Scraper**](https://apify.com/parseforge/ziprecruiter-scraper) - Jobs from ZipRecruiter with pay and company metadata
- [**🧑‍💻 CareerBuilder Scraper**](https://apify.com/parseforge/careerbuilder-scraper) - Job postings from CareerBuilder with salary and location

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for more HR and talent-intel scrapers.

***

**🆘 Need Help?** [**Open our contact form**](https://tally.so/r/BzdKgA) to request a new scraper, propose a custom data project, or report an issue.

***

> **⚠️ Disclaimer:** this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Glassdoor or its parent company. All trademarks mentioned are the property of their respective owners. Only publicly available data visible to any signed-out visitor is collected.

# Actor input Schema

## `startUrls` (type: `array`):

Glassdoor company URLs to scrape. Supports all international domains (glassdoor.com, .co.uk, .fr, .de, etc.). Use this OR Company Name below, not both.

## `maxItems` (type: `integer`):

Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000

## `companyName` (type: `string`):

Search for a company by name. Use this OR Start URLs above, not both. Example: Google, Apple, Microsoft

## `command` (type: `string`):

Type of data to scrape. 'reviews' is fastest. Use 'all' to scrape every section in one run (slower, multi-minute).

## `sortReviewsBy` (type: `string`):

Sort order for reviews.

## `includeAllReviews` (type: `boolean`):

Include reviews in all languages instead of only the country's default language.

## `includeAllSalaries` (type: `boolean`):

Include salaries in all supported languages instead of only the country's default language.

## `includeCompanyReviewStats` (type: `boolean`):

Include company-level review statistics (overall ratings, CEO approval, etc.).

## `includeCompanyInterviewStats` (type: `boolean`):

Include company-level interview statistics (difficulty, experience counts, etc.).

## `includeCompanyBenefitsStats` (type: `boolean`):

Include company-level benefits overview and category statistics.

## `includeJobDetails` (type: `boolean`):

Fetch full job descriptions from detail pages. Slower but provides complete job information including description and employment type.

## `reviewsStartDate` (type: `string`):

Only include reviews from this date onwards. Format: YYYY-MM-DD

## `interviewsStartDate` (type: `string`):

Only include interviews from this date onwards. Format: YYYY-MM-DD

## `benefitsStartDate` (type: `string`):

Only include benefit reviews from this date onwards. Format: YYYY-MM-DD

## `proxyConfiguration` (type: `object`):

Residential proxies are required for Glassdoor. Country is auto-selected from the start URL domain when not set; defaults to US.

## Actor input object example

```json
{
  "startUrls": [
    {
      "url": "https://www.glassdoor.com/Reviews/Google-Reviews-E9079.htm"
    }
  ],
  "maxItems": 10,
  "command": "reviews",
  "sortReviewsBy": "RELEVANCE",
  "includeAllReviews": false,
  "includeAllSalaries": false,
  "includeCompanyReviewStats": false,
  "includeCompanyInterviewStats": false,
  "includeCompanyBenefitsStats": false,
  "includeJobDetails": false,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "US"
  }
}
```

# Actor output Schema

## `all` (type: `string`):

Complete dataset with all scraped data

## `reviews` (type: `string`):

Employee reviews with ratings, pros, cons, and advice

## `salaries` (type: `string`):

Salary estimates by job title with base pay, total pay, and bonus statistics

## `overview` (type: `string`):

Overview of all scraped items

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        {
            "url": "https://www.glassdoor.com/Reviews/Google-Reviews-E9079.htm"
        }
    ],
    "maxItems": 10,
    "command": "reviews",
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ],
        "apifyProxyCountry": "US"
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/glassdoor-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrls": [{ "url": "https://www.glassdoor.com/Reviews/Google-Reviews-E9079.htm" }],
    "maxItems": 10,
    "command": "reviews",
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
        "apifyProxyCountry": "US",
    },
}

# Run the Actor and wait for it to finish
run = client.actor("parseforge/glassdoor-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "startUrls": [
    {
      "url": "https://www.glassdoor.com/Reviews/Google-Reviews-E9079.htm"
    }
  ],
  "maxItems": 10,
  "command": "reviews",
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "US"
  }
}' |
apify call parseforge/glassdoor-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/glassdoor-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Glassdoor Scraper",
        "description": "Scrape Glassdoor company data: employee reviews with ratings and pros/cons, salary estimates by job title, interview experiences, job listings with pay ranges, office locations, benefits, culture ratings, and company overviews. Supports all 18 international domains. No login required.",
        "version": "2.0",
        "x-build-id": "TkvpgeA1EBvZXrTSC"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~glassdoor-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-glassdoor-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~glassdoor-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-glassdoor-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~glassdoor-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-glassdoor-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "startUrls": {
                        "title": "Start URLs",
                        "type": "array",
                        "description": "Glassdoor company URLs to scrape. Supports all international domains (glassdoor.com, .co.uk, .fr, .de, etc.). Use this OR Company Name below, not both.",
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000"
                    },
                    "companyName": {
                        "title": "Company Name",
                        "type": "string",
                        "description": "Search for a company by name. Use this OR Start URLs above, not both. Example: Google, Apple, Microsoft"
                    },
                    "command": {
                        "title": "Command",
                        "enum": [
                            "reviews",
                            "salaries",
                            "interviews",
                            "jobs",
                            "overview",
                            "locations",
                            "benefits",
                            "cultureDiversity",
                            "all"
                        ],
                        "type": "string",
                        "description": "Type of data to scrape. 'reviews' is fastest. Use 'all' to scrape every section in one run (slower, multi-minute).",
                        "default": "reviews"
                    },
                    "sortReviewsBy": {
                        "title": "Sort Reviews By",
                        "enum": [
                            "RELEVANCE",
                            "DATE"
                        ],
                        "type": "string",
                        "description": "Sort order for reviews.",
                        "default": "RELEVANCE"
                    },
                    "includeAllReviews": {
                        "title": "Include All Languages",
                        "type": "boolean",
                        "description": "Include reviews in all languages instead of only the country's default language.",
                        "default": false
                    },
                    "includeAllSalaries": {
                        "title": "Include All Salary Languages",
                        "type": "boolean",
                        "description": "Include salaries in all supported languages instead of only the country's default language.",
                        "default": false
                    },
                    "includeCompanyReviewStats": {
                        "title": "Include Review Statistics",
                        "type": "boolean",
                        "description": "Include company-level review statistics (overall ratings, CEO approval, etc.).",
                        "default": false
                    },
                    "includeCompanyInterviewStats": {
                        "title": "Include Interview Statistics",
                        "type": "boolean",
                        "description": "Include company-level interview statistics (difficulty, experience counts, etc.).",
                        "default": false
                    },
                    "includeCompanyBenefitsStats": {
                        "title": "Include Benefits Statistics",
                        "type": "boolean",
                        "description": "Include company-level benefits overview and category statistics.",
                        "default": false
                    },
                    "includeJobDetails": {
                        "title": "Include Job Descriptions",
                        "type": "boolean",
                        "description": "Fetch full job descriptions from detail pages. Slower but provides complete job information including description and employment type.",
                        "default": false
                    },
                    "reviewsStartDate": {
                        "title": "Reviews Start Date",
                        "type": "string",
                        "description": "Only include reviews from this date onwards. Format: YYYY-MM-DD"
                    },
                    "interviewsStartDate": {
                        "title": "Interviews Start Date",
                        "type": "string",
                        "description": "Only include interviews from this date onwards. Format: YYYY-MM-DD"
                    },
                    "benefitsStartDate": {
                        "title": "Benefits Start Date",
                        "type": "string",
                        "description": "Only include benefit reviews from this date onwards. Format: YYYY-MM-DD"
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Residential proxies are required for Glassdoor. Country is auto-selected from the start URL domain when not set; defaults to US."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
