# CV.lv Scraper - Latvian Job Board Extractor (`studio-amba/cv-lv-scraper`) Actor

Scrape job listings from CV.lv, Latvia's largest job portal with thousands of active vacancies. Search by keyword, location, and category. Extract job titles, companies, salaries, locations, descriptions, and requirements. No login or cookies required.

- **URL**: https://apify.com/studio-amba/cv-lv-scraper.md
- **Developed by:** [Studio Amba](https://apify.com/studio-amba) (community)
- **Categories:** Jobs
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### What does CV.lv Scraper do?

**CV.lv Scraper** extracts job listings from [CV.lv](https://cv.lv), Latvia's largest and most established job portal, operated by CV-Online since 1999 with over 70% market share. The scraper searches for jobs by keyword and location, then returns structured data including job titles, companies, salaries, locations, descriptions, and requirements.

Run it on the Apify platform to get **API access, scheduling, proxy rotation, webhook integrations, and monitoring** out of the box. No login or cookies required -- all data is extracted from publicly available job listings.

### Why use CV.lv Scraper?

- **Market research**: Analyze the Latvian job market by sector, salary ranges, and demand for specific skills.
- **Recruitment intelligence**: Monitor competitor hiring patterns across Latvia and the Baltics.
- **Salary benchmarking**: Collect salary data from thousands of Latvian job postings, all listed in EUR.
- **Job aggregation**: Feed Latvian job listings into your own job board or aggregator platform.
- **Academic research**: Study employment trends, skill demand, and labor market dynamics in Latvia.
- **Lead generation**: Identify actively hiring companies in Latvia for B2B outreach.
- **Baltic coverage**: Combine with Estonian and Lithuanian job board scrapers for full Baltic region data.

### How to scrape CV.lv data

1. Go to the CV.lv Scraper page on Apify Store.
2. Click **Try for free** to open the Actor in Apify Console.
3. Enter your search keyword (e.g. "developer", "accountant", or Latvian terms like "programmētājs").
4. Optionally set a location (Riga, Liepaja, Daugavpils, etc.).
5. Set the maximum number of results you want.
6. Click **Start** and wait for the run to finish.
7. Download your data as JSON, CSV, Excel, or connect it to your workflow via API.

That's it -- no coding required. The scraper handles pagination, proxy rotation, and data extraction automatically.

### Input

| Field | Type | Description | Default |
|-------|------|-------------|---------|
| `searchQuery` | String | Job search keyword (English, Latvian, or Russian) | `developer` |
| `location` | String | City or region (Riga, Liepaja, Daugavpils, Jelgava, etc.) | All Latvia |
| `maxResults` | Integer | Maximum number of listings to scrape (1-10,000) | `100` |
| `includeDetails` | Boolean | Visit detail pages for full descriptions (slower) | `false` |
| `proxyConfiguration` | Object | Proxy settings (residential recommended) | Apify Residential |

#### Example input

```json
{
    "searchQuery": "developer",
    "location": "Rīga",
    "maxResults": 50,
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": ["RESIDENTIAL"]
    }
}
````

### Output

The scraper outputs structured JSON data for each job listing. You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

#### Example output

```json
{
    "jobTitle": "Senior Software Developer",
    "company": "Accenture Latvia",
    "location": "Rīga",
    "salary": "€ 3500 – 5000",
    "jobType": "Full-time",
    "category": "IT",
    "postedDate": "Renewed about 2 hours ago",
    "jobDescription": "We are looking for an experienced software developer to join our growing team in Riga...",
    "requirements": "5+ years experience in Java or Python, Bachelor's degree in CS...",
    "url": "https://cv.lv/en/vacancy/1590000/accenture-latvia/senior-software-developer",
    "scrapedAt": "2026-06-09T12:00:00.000Z"
}
```

### Data fields

| Field | Description |
|-------|-------------|
| `jobTitle` | Job position title |
| `company` | Hiring company name |
| `location` | Job location (city or region in Latvia) |
| `salary` | Salary range if displayed (EUR) |
| `jobType` | Employment type (Full-time, Part-time, Remote, Hybrid, etc.) |
| `category` | Job category or industry sector |
| `postedDate` | When the job was posted or last renewed |
| `jobDescription` | Description text (full with `includeDetails` enabled) |
| `requirements` | Job requirements and qualifications (with `includeDetails`) |
| `url` | Direct link to the job listing on CV.lv |
| `scrapedAt` | ISO 8601 timestamp of data collection |

### How much does it cost to scrape CV.lv?

The CV.lv Scraper uses Playwright with residential proxies for reliable data extraction. Typical costs:

- **100 listings** (search results only): ~$0.10-0.20 in platform credits
- **100 listings** (with full details): ~$0.30-0.50 in platform credits
- **1,000 listings** (search results only): ~$0.50-1.00 in platform credits

Costs depend on the number of pages loaded and proxy usage. Disabling `includeDetails` is faster and cheaper when you only need basic listing data.

### Tips and advanced options

- **Use Latvian keywords** for broader results. Many listings are in Latvian only (e.g. "grāmatvedis" instead of "accountant"). CV.lv also supports Russian-language listings.
- **Enable `includeDetails`** only when you need full job descriptions, requirements, and detailed salary info. It's 5-10x slower because it visits each job page individually.
- **Set `maxResults`** to control costs. Start small (10-50) to verify the output, then scale up.
- **Location filtering** supports major Latvian cities: Riga, Liepaja, Daugavpils, Jelgava, Jurmala, Ventspils, Rezekne, Valmiera, and more.
- **Schedule runs** to track new listings daily or weekly using Apify's built-in scheduler.
- **Combine with other scrapers** to build a comprehensive Baltic job market dataset.

### About CV.lv

CV.lv (CV-Online) is the leading online recruitment platform in Latvia, operating since 1999. It holds over 70% of the Latvian online recruitment market share and is part of the Alma Career group, which operates job portals across the Baltic states. Job listings are available in Latvian, English, and Russian.

### Supported locations

Riga (Rīga), Liepaja (Liepāja), Daugavpils, Jelgava, Jurmala (Jūrmala), Ventspils, Rezekne (Rēzekne), Valmiera, Jekabpils (Jēkabpils), Ogre, Tukums, Cesis (Cēsis), Kuldiga (Kuldīga), Sigulda, and all other Latvian cities and regions. Remote and hybrid positions are also supported.

### FAQ and support

#### Is it legal to scrape CV.lv?

This scraper extracts only publicly available job listing data that any visitor can see without logging in. It does not access private data, bypass authentication, or violate GDPR. Always review the website's Terms of Service before large-scale scraping and ensure your use case complies with applicable laws.

#### The scraper returns 0 results

- Make sure your search query matches actual job listings on CV.lv.
- Try broader search terms or remove the location filter.
- Ensure residential proxies are enabled in the proxy configuration.
- The site may temporarily block automated access -- try again after a few minutes.

#### Can I get custom modifications?

Yes. If you need additional fields, different output formats, or integration with your systems, open an issue in the **Issues** tab or contact us for a custom solution.

#### How often should I run this scraper?

CV.lv updates listings continuously. For most use cases, running once daily or weekly is sufficient. Use Apify's scheduler to automate recurring runs.

# Actor input Schema

## `searchQuery` (type: `string`):

Job search keyword (e.g. 'developer', 'accountant', 'programmētājs'). Supports English, Latvian, and Russian. Defaults to 'developer' if empty.

## `location` (type: `string`):

City or region (e.g. 'Rīga', 'Liepāja', 'Daugavpils'). Leave empty for all Latvia.

## `maxResults` (type: `integer`):

Maximum number of job listings to scrape.

## `includeDetails` (type: `boolean`):

Visit each job detail page to extract full description, requirements, and salary details. Slower but provides richer data.

## `proxyConfiguration` (type: `object`):

Proxy settings. Residential proxies recommended to avoid blocking.

## Actor input object example

```json
{
  "searchQuery": "developer",
  "maxResults": 100,
  "includeDetails": false,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "searchQuery": "developer",
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ]
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("studio-amba/cv-lv-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "searchQuery": "developer",
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}

# Run the Actor and wait for it to finish
run = client.actor("studio-amba/cv-lv-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "searchQuery": "developer",
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}' |
apify call studio-amba/cv-lv-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=studio-amba/cv-lv-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "CV.lv Scraper - Latvian Job Board Extractor",
        "description": "Scrape job listings from CV.lv, Latvia's largest job portal with thousands of active vacancies. Search by keyword, location, and category. Extract job titles, companies, salaries, locations, descriptions, and requirements. No login or cookies required.",
        "version": "0.1",
        "x-build-id": "rdVAMnO67WK8B0ZDW"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/studio-amba~cv-lv-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-studio-amba-cv-lv-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/studio-amba~cv-lv-scraper/runs": {
            "post": {
                "operationId": "runs-sync-studio-amba-cv-lv-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/studio-amba~cv-lv-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-studio-amba-cv-lv-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Job search keyword (e.g. 'developer', 'accountant', 'programmētājs'). Supports English, Latvian, and Russian. Defaults to 'developer' if empty.",
                        "default": "developer"
                    },
                    "location": {
                        "title": "Location",
                        "type": "string",
                        "description": "City or region (e.g. 'Rīga', 'Liepāja', 'Daugavpils'). Leave empty for all Latvia."
                    },
                    "maxResults": {
                        "title": "Max Results",
                        "minimum": 1,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Maximum number of job listings to scrape.",
                        "default": 100
                    },
                    "includeDetails": {
                        "title": "Include Full Details",
                        "type": "boolean",
                        "description": "Visit each job detail page to extract full description, requirements, and salary details. Slower but provides richer data.",
                        "default": false
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Proxy settings. Residential proxies recommended to avoid blocking.",
                        "default": {
                            "useApifyProxy": true,
                            "apifyProxyGroups": [
                                "RESIDENTIAL"
                            ]
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
