# Internshala Scraper - Internships & Fresher Jobs in India (`blackfalcondata/internshala-scraper`) Actor

Scrape internships and fresher jobs from Internshala across India. Every listing carries a structured stipend, the required skills and the hiring company's location and details, ready to rank and shortlist in one clean dataset.

- **URL**: https://apify.com/blackfalcondata/internshala-scraper.md
- **Developed by:** [Black Falcon Data](https://apify.com/blackfalcondata) (community)
- **Categories:** Jobs, Lead generation, Automation
- **Stats:** 18 total users, 7 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $1.50 / 1,000 results

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### What does Internshala Scraper do?

Internshala Scraper extracts structured job data from [internshala.com](https://internshala.com) — including salary data, contact details (email, apply URL), company metadata, full descriptions, and skill tags. It supports keyword search, location filters, and controllable result limits, so you can run the same query consistently over time. The actor also offers detail enrichment (full descriptions, company metadata, and contact information) where the source provides them.

**New to Apify?** [Sign up free](https://console.apify.com/sign-up?fpr=1h3gvi&fp_sid=ctaplain) and use the included $5 monthly platform credit to test this actor.

### Key features

<!-- KEY_FEATURES:START -->
- **💰 Structured stipend & job-offer signal** — stipend parsed to min, max, currency and period (handles Indian lakh formatting). Pre-placement job offers are flagged with the offer amount. Filter by minimum stipend or salary.
- **📋 Detail enrichment** — optional detail pages add the full description, skills, duration, start date, openings, apply-by deadline, who-can-apply, perks and company overview.
- **🔔 Notifications** — Telegram, Slack, Discord, WhatsApp Cloud API, generic webhook — out of the box. Pair with incremental + `notifyOnlyChanges` for daily "new Internshala jobs" pings to your hiring channel.
- **🔗 Paste-mode** — paste any internshala.com search URL as a start URL — multiple URLs are merged and deduplicated by listing id in one run.
- **📦 Compact mode** — AI-agent and MCP-friendly compact payloads with core fields only — pipe straight into your ATS, salary-benchmarking tool, or LLM context without parsing extras.
- **♻️ Incremental mode** — recurring runs emit only NEW / UPDATED / REAPPEARED records — UNCHANGED and EXPIRED are opt-in. First run builds the baseline; subsequent runs emit and charge only for the diff. Pair with notifications for daily "new jobs" alerts to your hiring team. Saves 80–95% on daily monitoring.
- **📤 Export anywhere** — Download the dataset as JSON, CSV, or Excel from the Apify Console, or stream live via the Apify API and integrations (Make, Zapier, Google Sheets, n8n, …).
- **🔌 MCP connectors** — export your results into Notion via Apify's MCP connectors — a clean run-summary page, no glue code. Opt-in via the App connector field; deterministic field-mapping, no AI. Built on Apify's connector framework, so more destinations open up as their catalog grows.
<!-- KEY_FEATURES:END -->

### What data can you extract from Internshala?

Each result includes Core listing fields (`jobId`, `jobKey`, `listingType`, `title`, `location`, `isWorkFromHome`, `isPartTime`, and `workMode`, and more), detail fields when enrichment is enabled (`description`, `descriptionHtml`, `descriptionMarkdown`, `descriptionLength`, and `detailFetched`), contact and apply information (`extractedEmails` and `whoCanApply`), and company metadata (`company`, `companyLogo`, `companyUrl`, and `aboutCompany`). In standard mode, all fields are always present — unavailable data points are returned as `null`, never omitted. In compact mode, only core fields are returned.

Enable detail enrichment in the input to get richer fields such as full descriptions, company metadata, and contact information where the source provides them.

### Input

The main inputs are a search keyword, an optional location filter, and a result limit. Additional filters and options are available in the input schema.

Key parameters:

- **`listingType`** — Scrape internships or fresher jobs. (default: `"internship"`)
- **`query`** — Free-text keyword search (e.g. "web development"). Use a JSON array for multiple keywords, e.g. ["data science","marketing"]. Keyword search runs standalone — for category/city/WFH filtering, leave this empty and use the filters below.
- **`category`** — Internshala category slug, e.g. web-development, data-science, marketing, graphic-design, content-writing, human-resources, finance, sales, operations, android-app-development, machine-learning. Use a JSON array for multiple categories.
- **`location`** — City name, e.g. mumbai, delhi, bangalore. Use a JSON array for multiple cities.
- **`workFromHome`** — Only return work-from-home (remote) listings. (default: `false`)
- **`partTime`** — Only return part-time listings. (default: `false`)
- **`durationMonths`** — Internships only. Restrict to specific durations in months (e.g. 1, 2, 3, 4, 6). One search runs per value. Leave empty for any duration.
- **`withJobOffer`** — Only return internships that come with a guaranteed pre-placement job offer. (default: `false`)
- **`fastResponse`** — Internships only. Only return listings where employers are likely to respond quickly. (default: `false`)
- **`earlyApplicant`** — Only return listings where you would be among the first applicants. (default: `false`)
- **`forWomen`** — Only return listings from Internshala's "for women" vertical. (default: `false`)
- **`minStipend`** — Internships: only return listings whose monthly stipend is at least this amount (INR). 0 = no minimum. Undisclosed stipends are kept. (default: `0`)
- ...and 28 more parameters

### Input examples

**Basic search** — Keyword-driven search with a result cap.

→ Full payload per result — all standard fields populated where the source provides them.

```json
{
  "query": "web development",
  "maxResults": 50
}
````

**Incremental tracking** — Only emit jobs that changed since the previous run with this `stateKey`.

→ First run builds the baseline state. Subsequent runs emit only records that are new or whose tracked content changed. Set `emitUnchanged: true` to include unchanged records as well.

```json
{
  "query": "web development",
  "maxResults": 200,
  "incrementalMode": true,
  "stateKey": "web-development-tracker"
}
```

**Compact output for AI agents** — Return only core fields for AI-agent and MCP workflows.

→ Small payload with the most important fields — ideal for piping into LLMs without token overhead.

```json
{
  "query": "web development",
  "maxResults": 50,
  "compact": true
}
```

### Output

Each run produces a dataset of structured job records. Results can be downloaded as JSON, CSV, or Excel from the Dataset tab in Apify Console.

### Example job record

```json
{
  "jobId": "c387864018cf7a89417e1fd90e8cbd093210fbb38622712734ccbafd0031abf1",
  "jobKey": "3186154",
  "listingType": "internship",
  "title": "Business Development (Sales) - Internship",
  "company": "Zytexa Technology Llp",
  "companyLogo": "https://internshala-uploads.internshala.com/logo%2Fe020vr35iht-28054.jpg.webp",
  "companyUrl": "https://internshala.com/company/zytexa-technology-llp-1761735741",
  "location": "Jaipur",
  "isWorkFromHome": false,
  "isPartTime": false,
  "workMode": "onsite",
  "category": "Computer Services , Marketing , Consulting",
  "skills": [
    "Client Relationship Management (CRM)",
    "Lead Generation",
    "English Proficiency (Spoken)",
    "English Proficiency (Written)",
    "Market research"
  ],
  "description": "About the internship:Selected intern's day-to-day responsibilities include: 1. Conduct market research to identify potential clients and business opportunities. 2. Generate leads through LinkedIn, ema...",
  "descriptionHtml": "<br><p>About the internship:</p>Selected intern's day-to-day responsibilities include: <br />\n<br />\n1. Conduct market research to identify potential clients and business opportunities.<br />\n2. Gener...",
  "descriptionMarkdown": "About the internship:\n\nSelected intern's day-to-day responsibilities include:\n\n1. Conduct market research to identify potential clients and business opportunities.\n\n2. Generate leads through LinkedIn,...",
  "descriptionLength": 1756,
  "whoCanApply": "Only those candidates can apply who: 1. are available for full time (in-office) internship 2. can start the internship between 19th Jun'26 and 24th Jul'26 3. are available for duration of 6 months 4....",
  "perks": [
    "Certificate",
    "Letter of recommendation",
    "Informal dress code",
    "Free snacks & beverages"
  ],
  "aboutCompany": "Zytexa Technology LLP is a Jaipur-based IT company specializing in website development, mobile app development, digital marketing, e-commerce solutions, CRM & ERP systems, branding, and business autom...",
  "stipendText": "₹ 3,000 - 5,000 /month",
  "salaryMin": 3000,
  "salaryMax": 5000,
  "salaryCurrency": "INR",
  "salaryPeriod": "MONTH",
  "employmentType": "INTERN, FULL_TIME",
  "duration": "6 Months",
  "durationMonths": 6,
  "startDate": "Immediately",
  "numberOfOpenings": 5,
  "hasJobOffer": false,
  "activelyHiring": true,
  "earlyApplicant": true,
  "postedAt": "2026-06-19",
  "publishedAge": "Posted 1 day ago",
  "applyBy": "19 Jul' 26",
  "validThrough": "2026-07-19 23:59:59",
  "extractedEmails": [
    "complaints@internshala.com"
  ],
  "canonicalUrl": "https://internshala.com/internship/detail/business-development-sales-internship-in-jaipur-at-zytexa-technology-llp1781852989",
  "applyUrl": "https://internshala.com/internship/detail/business-development-sales-internship-in-jaipur-at-zytexa-technology-llp1781852989",
  "sourceUrl": "https://internshala.com/internship/detail/business-development-sales-internship-in-jaipur-at-zytexa-technology-llp1781852989",
  "sourceCountry": "IN",
  "sourceDomain": "internshala.com",
  "searchQuery": "all",
  "searchUrl": "https://internshala.com/internships/",
  "scrapedAt": "2026-06-20T13:59:31.078Z",
  "fetchedAt": "2026-06-20T13:59:31.078Z",
  "detailFetched": true,
  "contentQuality": "full",
  "contentHash": "095c90aff7c418161cc8c94062cd81145ce280aa094e5a2a1e7a449e8375c2ac"
}
```

### Incremental fields

When incremental mode is on, each record also carries:

- `changeType` — one of `NEW`, `UPDATED`, `UNCHANGED`, `REAPPEARED`, `EXPIRED`. Default output covers `NEW` / `UPDATED` / `REAPPEARED`; set `emitUnchanged: true` or `emitExpired: true` to opt into the others.
- `firstSeenAt`, `lastSeenAt` — ISO-8601 timestamps tracking the listing across runs.
- `isRepost`, `repostOfId`, `repostDetectedAt` — populated when a new listing matches the tracked content of a previously expired one. Set `skipReposts: true` to drop detected reposts from the output.

### How to scrape Internshala

1. Go to [Internshala Scraper](https://apify.com/blackfalcondata/internshala-scraper?fpr=1h3gvi) in Apify Console.
2. Enter a search keyword and optional location filter.
3. Set `maxResults` to control how many results you need.
4. Enable `includeDetails` if you need full descriptions, contact info, company data.
5. Click **Start** and wait for the run to finish.
6. Export the dataset as JSON, CSV, or Excel.

### Use cases

- Extract job data from Internshala for market research and competitive analysis.
- Track salary trends across regions and categories over time.
- Monitor new and changed listings on scheduled runs without processing the full dataset every time.
- Build outreach lists using contact details and apply URLs from listings.
- Research company hiring patterns, employer profiles, and industry distribution.
- Feed structured data into AI agents, MCP tools, and automated pipelines using compact mode.
- Export clean, structured data to dashboards, spreadsheets, or data warehouses.
- Analyze skill demand across listings using structured skill tags.

### How much does it cost to scrape Internshala?

Internshala Scraper uses [pay-per-event](https://docs.apify.com/platform/actors/paid-actors/pay-per-event) pricing. You pay a small fee when the run starts and then for each result that is actually produced.

- **Run start:** $0.0005 per run
- **Per result:** $0.0015 per job record

Example costs:

- 10 results: **$0.015**
- 25 results: **$0.038**
- 100 results: **$0.15**
- 200 results: **$0.3**
- 500 results: **$0.75**

#### Example: recurring monitoring savings

These examples compare full re-scrapes with incremental runs at different churn rates. Churn is the share of listings that are new or whose tracked content changed since the previous run. Actual churn depends on your query breadth, source activity, and polling frequency — the scenarios below are examples, not predictions.

Example setup: 200 results per run, daily polling (30 runs/month). Event-pricing examples scale linearly with result count.

| Churn rate | Full re-scrape run cost | Incremental run cost | Savings vs full re-scrape | Monthly cost after baseline |
|---|---:|---:|---:|---:|
| 5% — stable niche query | $0.30 | $0.02 | $0.28 (95%) | $0.46 |
| 15% — moderate broad query | $0.30 | $0.05 | $0.26 (85%) | $1.36 |
| 30% — high-volume aggregator | $0.30 | $0.09 | $0.21 (70%) | $2.71 |

Full re-scrape monthly cost at daily polling: $9.02. First month with incremental costs $0.75 / $1.62 / $2.92 for the 5% / 15% / 30% scenarios because the first run builds baseline state at full cost before incremental savings apply.

Platform usage (compute and proxies) is billed separately by Apify based on actual consumption. Incremental runs consume less on result processing, though fixed per-run overhead stays the same.

### FAQ

#### How many results can I get from Internshala?

The number of results depends on the search query and available listings on Internshala. Use the `maxResults` parameter to control how many results are returned per run.

#### Does Internshala Scraper support recurring monitoring?

Yes. Enable incremental mode to only receive new or changed listings on subsequent runs. This is ideal for scheduled monitoring where you want to track changes over time without re-processing the full dataset.

#### Can I integrate Internshala Scraper with other apps?

Yes. Internshala Scraper works with Apify's [integrations](https://apify.com/integrations?fpr=1h3gvi) to connect with tools like Zapier, Make, Google Sheets, Slack, and more. You can also use webhooks to trigger actions when a run completes.

#### Can I use Internshala Scraper with the Apify API?

Yes. You can start runs, manage inputs, and retrieve results programmatically through the [Apify API](https://docs.apify.com/api/v2). Client libraries are available for JavaScript, Python, and other languages.

#### Can I use Internshala Scraper through an MCP Server?

Yes. Apify provides an [MCP Server](https://apify.com/apify/actors-mcp-server?fpr=1h3gvi) that lets AI assistants and agents call this actor directly. Use compact mode, `descriptionMaxLength`, a single `descriptionFormat`, and `excludeEmptyFields` to keep payloads manageable for LLM context windows.

#### Is it legal to scrape Internshala?

This actor extracts publicly available data from Internshala. Web scraping of public information is generally considered legal, but you should always review the target site's terms of service and ensure your use case complies with applicable laws and regulations, including GDPR where relevant.

#### Your feedback

If you have questions, need a feature, or found a bug, please [open an issue](https://apify.com/blackfalcondata/internshala-scraper/issues?fpr=1h3gvi) on the actor's page in Apify Console. Your feedback helps us improve.

### You might also like

- [Actiris Brussels Job Scraper](https://apify.com/blackfalcondata/actiris-scraper?fpr=1h3gvi) — Scrape all active job listings from actiris.brussels — official Brussels public employment service..
- [AMS Austria Job Scraper — Austrian Public Employment Service](https://apify.com/blackfalcondata/ams-austria-job-scraper?fpr=1h3gvi) — Scrape jobs.ams.at — Austria's official AMS public employment portal, branded "alle jobs" ("all.
- [APEC.fr Scraper - French Executive Jobs](https://apify.com/blackfalcondata/apec-scraper?fpr=1h3gvi) — Scrape apec.fr - French executive job listings with salary ranges, company, location, skills,.
- [Arbeitsagentur Jobs Feed — German Federal Employment Agency](https://apify.com/blackfalcondata/arbeitsagentur-jobs-feed?fpr=1h3gvi) — Scrape arbeitsagentur.de — Germany's official public employment portal with over 1 million live job.
- [Arbetsformedlingen Job Scraper](https://apify.com/blackfalcondata/arbetsformedlingen-scraper?fpr=1h3gvi) — Scrape arbetsformedlingen.se (Platsbanken) — Sweden's official employment portal. Returns 84.
- [Bayt.com Scraper — MENA Jobs with Salary & Skills Filter](https://apify.com/blackfalcondata/bayt-scraper?fpr=1h3gvi) — Scrape bayt.com — the leading Middle East job board spanning UAE, Saudi Arabia, Qatar, Egypt.
- [Bumeran Scraper — LATAM Jobs across 7 Countries & 8 Brands](https://apify.com/blackfalcondata/bumeran-scraper?fpr=1h3gvi) — Scrape Bumeran Group's job boards across LATAM — Argentina (bumeran.com.ar + zonajobs), Chile.
- [Cadremploi Scraper — French Executive & Management Jobs](https://apify.com/blackfalcondata/cadremploi-scraper?fpr=1h3gvi) — Scrape cadremploi.fr — France's leading job board for executives and managers (cadres). Salary.

### Getting started with Apify

New to Apify? [Create a free account with $5 credit](https://console.apify.com/sign-up?fpr=1h3gvi\&fp_sid=ctaplain) — no credit card required.

1. Sign up — $5 platform credit included
2. Open this actor and configure your input
3. Click **Start** — export results as JSON, CSV, or Excel

Need more later? [See Apify pricing](https://apify.com/pricing?fpr=1h3gvi).

# Actor input Schema

## `listingType` (type: `string`):

Scrape internships or fresher jobs.

## `query` (type: `string`):

Free-text keyword search (e.g. "web development"). Use a JSON array for multiple keywords, e.g. \["data science","marketing"]. Keyword search runs standalone — for category/city/WFH filtering, leave this empty and use the filters below.

## `category` (type: `string`):

Internshala category slug, e.g. web-development, data-science, marketing, graphic-design, content-writing, human-resources, finance, sales, operations, android-app-development, machine-learning. Use a JSON array for multiple categories.

## `location` (type: `string`):

City name, e.g. mumbai, delhi, bangalore. Use a JSON array for multiple cities.

## `workFromHome` (type: `boolean`):

Only return work-from-home (remote) listings.

## `partTime` (type: `boolean`):

Only return part-time listings.

## `durationMonths` (type: `array`):

Internships only. Restrict to specific durations in months (e.g. 1, 2, 3, 4, 6). One search runs per value. Leave empty for any duration.

## `withJobOffer` (type: `boolean`):

Only return internships that come with a guaranteed pre-placement job offer.

## `fastResponse` (type: `boolean`):

Internships only. Only return listings where employers are likely to respond quickly.

## `earlyApplicant` (type: `boolean`):

Only return listings where you would be among the first applicants.

## `forWomen` (type: `boolean`):

Only return listings from Internshala's "for women" vertical.

## `minStipend` (type: `integer`):

Internships: only return listings whose monthly stipend is at least this amount (INR). 0 = no minimum. Undisclosed stipends are kept.

## `minSalary` (type: `integer`):

Jobs: only return listings whose annual salary is at least this amount (INR). 0 = no minimum. Undisclosed salaries are kept.

## `startUrls` (type: `array`):

Paste any Internshala listing URL (e.g. a filtered search) to scrape it directly. Overrides/augments the filters above.

## `country` (type: `string`):

Internshala operates in India.

## `maxResults` (type: `integer`):

Maximum total listings to return (0 = unlimited).

## `maxPages` (type: `integer`):

Maximum SERP pages to scrape per search source.

## `includeDetails` (type: `boolean`):

Fetch each listing's detail page for the full description, structured stipend, skills, perks, who-can-apply, and openings. Disable for a faster, listing-only run.

## `descriptionMaxLength` (type: `integer`):

Truncate description to this many characters. 0 = no truncation.

## `compact` (type: `boolean`):

Output only core fields (for AI-agent/MCP workflows).

## `incrementalMode` (type: `boolean`):

Compare against previous run state and label each listing NEW / UPDATED / UNCHANGED / EXPIRED. stateKey is optional — defaults to a stable key derived from your filters so different searches keep separate history.

## `stateKey` (type: `string`):

Optional. Stable identifier for the tracked search universe (e.g. "wfh-web-dev"). Leave empty to auto-generate from filters.

## `emitUnchanged` (type: `boolean`):

When incremental, also emit records that haven't changed.

## `emitExpired` (type: `boolean`):

When incremental, also emit records no longer found.

## `skipReposts` (type: `boolean`):

When incremental, skip listings whose content matches an expired listing from a prior run (cross-run repost detection).

## `telegramToken` (type: `string`):

Telegram bot token (from @BotFather). Required for Telegram notifications.

## `telegramChatId` (type: `string`):

Telegram chat or channel ID (e.g. "-100123456789"). Required when telegramToken is set.

## `discordWebhookUrl` (type: `string`):

Discord incoming webhook URL. Server Settings → Integrations → Webhooks → New Webhook.

## `slackWebhookUrl` (type: `string`):

Slack incoming webhook URL. api.slack.com/messaging/webhooks.

## `notificationLimit` (type: `integer`):

Maximum number of listings included in each notification message (1–20).

## `notifyOnlyChanges` (type: `boolean`):

When Incremental Mode is on, only send notifications for NEW and UPDATED listings. Has no effect outside incremental mode.

## `whatsappAccessToken` (type: `string`):

WhatsApp Cloud API permanent access token (System User token from Meta Business). Recipient must have messaged the business number within the last 24h (service-conversation window).

## `whatsappPhoneNumberId` (type: `string`):

Your WhatsApp Business phone-number ID (numeric, from Meta dashboard). Required when whatsappAccessToken is set.

## `whatsappTo` (type: `string`):

Recipient phone in E.164 format without + (e.g. "919812345678"). Recipient must have messaged your business number within last 24h.

## `webhookUrl` (type: `string`):

Receives a JSON POST with {metadata, items} after each run. Universal escape hatch for n8n / Make / Zapier / custom backends.

## `webhookHeaders` (type: `object`):

Optional JSON object of custom headers (e.g. {"Authorization":"Bearer ..."}).

## `appConnector` (type: `string`):

Optional. Pick a connected app under Settings → API & Integrations to receive your results (including any contact details). Best-effort across MCP connectors as Apify expands its catalog.

## `mcpIssueTeam` (type: `string`):

Only when the connected app is an issue tracker: the team (name or ID) the summary issue is created under, if that app requires one.

## `descriptionFormat` (type: `string`):

Pick a single description representation. `all` keeps every variant; `text` / `html` / `markdown` drop the others.

## `excludeEmptyFields` (type: `boolean`):

Drop null, empty-string, and empty-array fields from each record before push. Smaller payloads for AI agents and dashboards.

## Actor input object example

```json
{
  "listingType": "internship",
  "query": "web development",
  "workFromHome": false,
  "partTime": false,
  "withJobOffer": false,
  "fastResponse": false,
  "earlyApplicant": false,
  "forWomen": false,
  "minStipend": 0,
  "minSalary": 0,
  "country": "IN",
  "maxResults": 10,
  "maxPages": 5,
  "includeDetails": true,
  "descriptionMaxLength": 0,
  "compact": false,
  "incrementalMode": false,
  "emitUnchanged": false,
  "emitExpired": false,
  "skipReposts": false,
  "notificationLimit": 5,
  "notifyOnlyChanges": false,
  "descriptionFormat": "all",
  "excludeEmptyFields": false
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "query": "web development",
    "maxResults": 10,
    "includeDetails": true,
    "descriptionFormat": "all",
    "excludeEmptyFields": false
};

// Run the Actor and wait for it to finish
const run = await client.actor("blackfalcondata/internshala-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "query": "web development",
    "maxResults": 10,
    "includeDetails": True,
    "descriptionFormat": "all",
    "excludeEmptyFields": False,
}

# Run the Actor and wait for it to finish
run = client.actor("blackfalcondata/internshala-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "query": "web development",
  "maxResults": 10,
  "includeDetails": true,
  "descriptionFormat": "all",
  "excludeEmptyFields": false
}' |
apify call blackfalcondata/internshala-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=blackfalcondata/internshala-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Internshala Scraper - Internships & Fresher Jobs in India",
        "description": "Scrape internships and fresher jobs from Internshala across India. Every listing carries a structured stipend, the required skills and the hiring company's location and details, ready to rank and shortlist in one clean dataset.",
        "version": "0.1",
        "x-build-id": "XuG0eqcS7OtWkjunS"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/blackfalcondata~internshala-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-blackfalcondata-internshala-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/blackfalcondata~internshala-scraper/runs": {
            "post": {
                "operationId": "runs-sync-blackfalcondata-internshala-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/blackfalcondata~internshala-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-blackfalcondata-internshala-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "listingType": {
                        "title": "🧭 Listing Type",
                        "enum": [
                            "internship",
                            "job"
                        ],
                        "type": "string",
                        "description": "Scrape internships or fresher jobs.",
                        "default": "internship"
                    },
                    "query": {
                        "title": "🔍 Keyword(s)",
                        "type": "string",
                        "description": "Free-text keyword search (e.g. \"web development\"). Use a JSON array for multiple keywords, e.g. [\"data science\",\"marketing\"]. Keyword search runs standalone — for category/city/WFH filtering, leave this empty and use the filters below."
                    },
                    "category": {
                        "title": "🗂️ Category",
                        "type": "string",
                        "description": "Internshala category slug, e.g. web-development, data-science, marketing, graphic-design, content-writing, human-resources, finance, sales, operations, android-app-development, machine-learning. Use a JSON array for multiple categories."
                    },
                    "location": {
                        "title": "📍 City",
                        "type": "string",
                        "description": "City name, e.g. mumbai, delhi, bangalore. Use a JSON array for multiple cities."
                    },
                    "workFromHome": {
                        "title": "🏠 Work From Home Only",
                        "type": "boolean",
                        "description": "Only return work-from-home (remote) listings.",
                        "default": false
                    },
                    "partTime": {
                        "title": "⏱️ Part-Time Only",
                        "type": "boolean",
                        "description": "Only return part-time listings.",
                        "default": false
                    },
                    "durationMonths": {
                        "title": "📆 Internship Duration (months)",
                        "type": "array",
                        "description": "Internships only. Restrict to specific durations in months (e.g. 1, 2, 3, 4, 6). One search runs per value. Leave empty for any duration.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "withJobOffer": {
                        "title": "🎯 With Job Offer (PPO) Only",
                        "type": "boolean",
                        "description": "Only return internships that come with a guaranteed pre-placement job offer.",
                        "default": false
                    },
                    "fastResponse": {
                        "title": "⚡ Fast Response Only",
                        "type": "boolean",
                        "description": "Internships only. Only return listings where employers are likely to respond quickly.",
                        "default": false
                    },
                    "earlyApplicant": {
                        "title": "🏃 Early Applicant Only",
                        "type": "boolean",
                        "description": "Only return listings where you would be among the first applicants.",
                        "default": false
                    },
                    "forWomen": {
                        "title": "♀️ For Women Only",
                        "type": "boolean",
                        "description": "Only return listings from Internshala's \"for women\" vertical.",
                        "default": false
                    },
                    "minStipend": {
                        "title": "💰 Minimum Stipend (INR/month)",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Internships: only return listings whose monthly stipend is at least this amount (INR). 0 = no minimum. Undisclosed stipends are kept.",
                        "default": 0
                    },
                    "minSalary": {
                        "title": "💵 Minimum Salary (INR/year)",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Jobs: only return listings whose annual salary is at least this amount (INR). 0 = no minimum. Undisclosed salaries are kept.",
                        "default": 0
                    },
                    "startUrls": {
                        "title": "🔗 Start URLs",
                        "type": "array",
                        "description": "Paste any Internshala listing URL (e.g. a filtered search) to scrape it directly. Overrides/augments the filters above.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "country": {
                        "title": "🌍 Country",
                        "enum": [
                            "IN"
                        ],
                        "type": "string",
                        "description": "Internshala operates in India.",
                        "default": "IN"
                    },
                    "maxResults": {
                        "title": "💯 Max Results",
                        "minimum": 0,
                        "maximum": 5000,
                        "type": "integer",
                        "description": "Maximum total listings to return (0 = unlimited).",
                        "default": 50
                    },
                    "maxPages": {
                        "title": "📄 Max Pages",
                        "minimum": 1,
                        "maximum": 200,
                        "type": "integer",
                        "description": "Maximum SERP pages to scrape per search source.",
                        "default": 5
                    },
                    "includeDetails": {
                        "title": "📋 Include Full Details",
                        "type": "boolean",
                        "description": "Fetch each listing's detail page for the full description, structured stipend, skills, perks, who-can-apply, and openings. Disable for a faster, listing-only run.",
                        "default": true
                    },
                    "descriptionMaxLength": {
                        "title": "✂️ Description Max Length",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Truncate description to this many characters. 0 = no truncation.",
                        "default": 0
                    },
                    "compact": {
                        "title": "📦 Compact Output",
                        "type": "boolean",
                        "description": "Output only core fields (for AI-agent/MCP workflows).",
                        "default": false
                    },
                    "incrementalMode": {
                        "title": "♻️ Incremental Mode",
                        "type": "boolean",
                        "description": "Compare against previous run state and label each listing NEW / UPDATED / UNCHANGED / EXPIRED. stateKey is optional — defaults to a stable key derived from your filters so different searches keep separate history.",
                        "default": false
                    },
                    "stateKey": {
                        "title": "🔑 State Key",
                        "type": "string",
                        "description": "Optional. Stable identifier for the tracked search universe (e.g. \"wfh-web-dev\"). Leave empty to auto-generate from filters."
                    },
                    "emitUnchanged": {
                        "title": "♻️ Emit Unchanged Records",
                        "type": "boolean",
                        "description": "When incremental, also emit records that haven't changed.",
                        "default": false
                    },
                    "emitExpired": {
                        "title": "⚰️ Emit Expired Records",
                        "type": "boolean",
                        "description": "When incremental, also emit records no longer found.",
                        "default": false
                    },
                    "skipReposts": {
                        "title": "🚫 Skip Reposts",
                        "type": "boolean",
                        "description": "When incremental, skip listings whose content matches an expired listing from a prior run (cross-run repost detection).",
                        "default": false
                    },
                    "telegramToken": {
                        "title": "🔑 Telegram Bot Token",
                        "type": "string",
                        "description": "Telegram bot token (from @BotFather). Required for Telegram notifications."
                    },
                    "telegramChatId": {
                        "title": "💬 Telegram Chat ID",
                        "type": "string",
                        "description": "Telegram chat or channel ID (e.g. \"-100123456789\"). Required when telegramToken is set."
                    },
                    "discordWebhookUrl": {
                        "title": "🎮 Discord Webhook URL",
                        "type": "string",
                        "description": "Discord incoming webhook URL. Server Settings → Integrations → Webhooks → New Webhook."
                    },
                    "slackWebhookUrl": {
                        "title": "💼 Slack Webhook URL",
                        "type": "string",
                        "description": "Slack incoming webhook URL. api.slack.com/messaging/webhooks."
                    },
                    "notificationLimit": {
                        "title": "📊 Max Jobs Per Notification",
                        "minimum": 1,
                        "maximum": 20,
                        "type": "integer",
                        "description": "Maximum number of listings included in each notification message (1–20).",
                        "default": 5
                    },
                    "notifyOnlyChanges": {
                        "title": "🔄 Notify Only New/Updated",
                        "type": "boolean",
                        "description": "When Incremental Mode is on, only send notifications for NEW and UPDATED listings. Has no effect outside incremental mode.",
                        "default": false
                    },
                    "whatsappAccessToken": {
                        "title": "📱 WhatsApp Access Token",
                        "type": "string",
                        "description": "WhatsApp Cloud API permanent access token (System User token from Meta Business). Recipient must have messaged the business number within the last 24h (service-conversation window)."
                    },
                    "whatsappPhoneNumberId": {
                        "title": "📞 WhatsApp Phone Number ID",
                        "type": "string",
                        "description": "Your WhatsApp Business phone-number ID (numeric, from Meta dashboard). Required when whatsappAccessToken is set."
                    },
                    "whatsappTo": {
                        "title": "📲 WhatsApp Recipient",
                        "type": "string",
                        "description": "Recipient phone in E.164 format without + (e.g. \"919812345678\"). Recipient must have messaged your business number within last 24h."
                    },
                    "webhookUrl": {
                        "title": "🪝 Generic Webhook URL",
                        "type": "string",
                        "description": "Receives a JSON POST with {metadata, items} after each run. Universal escape hatch for n8n / Make / Zapier / custom backends."
                    },
                    "webhookHeaders": {
                        "title": "📋 Webhook Headers",
                        "type": "object",
                        "description": "Optional JSON object of custom headers (e.g. {\"Authorization\":\"Bearer ...\"})."
                    },
                    "appConnector": {
                        "title": "Send results to a connected app",
                        "type": "string",
                        "description": "Optional. Pick a connected app under Settings → API & Integrations to receive your results (including any contact details). Best-effort across MCP connectors as Apify expands its catalog."
                    },
                    "mcpIssueTeam": {
                        "title": "Issue tracker team",
                        "type": "string",
                        "description": "Only when the connected app is an issue tracker: the team (name or ID) the summary issue is created under, if that app requires one."
                    },
                    "descriptionFormat": {
                        "title": "📝 Description Format",
                        "enum": [
                            "all",
                            "text",
                            "html",
                            "markdown"
                        ],
                        "type": "string",
                        "description": "Pick a single description representation. `all` keeps every variant; `text` / `html` / `markdown` drop the others.",
                        "default": "all"
                    },
                    "excludeEmptyFields": {
                        "title": "🧹 Exclude Empty Fields",
                        "type": "boolean",
                        "description": "Drop null, empty-string, and empty-array fields from each record before push. Smaller payloads for AI agents and dashboards.",
                        "default": false
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
