# B-Corporation ESG Data Scraper (`njoylab/b-corporation-data-scraper`) Actor

Scrapes B-Corporation ESG data from bcorporation.net. Retrieves either paginated lists of certified companies or detailed profiles with assessment scores, demographics, and locations

- **URL**: https://apify.com/njoylab/b-corporation-data-scraper.md
- **Developed by:** [njoylab](https://apify.com/njoylab) (community)
- **Categories:** Developer tools, Lead generation, Automation
- **Stats:** 49 total users, 1 monthly users, 100.0% runs succeeded, 3 bookmarks
- **User rating**: 5.00 out of 5 stars

## Pricing

$10.00/month + usage

To use this Actor, you pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period.You also pay for the Apify platform usage, which gets cheaper the higher Apify subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#rental-actors

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## B Corporation Data Scraper

This Apify actor scrapes business data from https://bcorporation.net. You can fetch either a paginated list of certified B Corporations or detailed information for a single company.

### Table of Contents
- [Input Parameters](#input-parameters)
- [Output Structure](#output-structure)
- [Usage](#usage)
- [Example](#example)

### Input Parameters

The input parameters are defined as a JSON object with the following fields:

#### List Mode
```json
{
    "mode": "index",
    "page": 1,
    "pageSize": 10 // max page Size is 250
}
````

#### Single Company Mode

```json
{
    "mode": "company",
    "url": "company-name-slug"  // or full URL to company profile
}
```

### Output Structure

The output structure varies depending on the mode selected:

#### List Mode Output

```typescript
interface IndexResponse {
  companies: CompanyListItem[];
}

interface CompanyListItem {
  id: string;
  name: string;
  description: string;
  slug: string;
  industry: string;
  sector: string;
  size: string;
  isCertified: boolean;
  latestVerifiedScore: string;
  initialCertificationDateTimestamp: number;
  companyLogo?: string;
  
  // Location information
  hqCity: string;
  hqCountry: string;
  hqProvince?: string;
  hqPostalCode?: string;
  cities: string[];
  countries: string[];
  provinces: string[];

  // Additional metadata
  demographicsList: string[];
  websiteKeywords?: string;
}
```

#### Company Mode Output

```typescript
interface CompanyDetail {
  id: string;
  name: string;
  slug: string;
  description: string;
  sector: string;
  industry: string;
  size: string;
  website?: string;
  companyLogo?: string;
  lifestyleLogo?: string;
  
  // Demographics
  demographics: {
    womenOwned: boolean;
    minorityOwned: boolean;
    lgbtqOwned: boolean;
    blackOwned: boolean;
    employeeOwned: boolean;
    familyOwned: boolean;
    veteranOwned: boolean;
    indigenousOwned: boolean;
    personWithDisabilityOwned: boolean;
    immigrantOwned: boolean;
  };

  // Location information
  hqCountry: string;
  hqProvince: string;
  hqCity: string;
  hqPostalCode: string;
  countries: string[];
  provinces: string[];
  cities: string[];

  // B Corp specific data
  latestVerifiedScore: number;
  initialCertificationDateTimestamp: number;
  websiteKeywords?: string;
  
  // Assessment details
  assessments: Array<{
    overallScore: number;
    ratingDate: number;
    fullTimeWorkers: number;
    impactAreas: Array<{
      id: number;
      name: string;
      score: number;
      impactTopics: Array<{
        id: number;
        name: string;
        score: number;
        isIbm: boolean;
      }>;
    }>;
  }>;
}
```

#### Example Company Response

```json
{
  "id": "example-id-123",
  "name": "Example Sustainable Solutions",
  "description": "A technology company dedicated to sustainable solutions...",
  "sector": "Service with Significant Environmental Footprint",
  "industry": "Technology & Software",
  "size": "50-249",
  "website": "www.example.com",
  "latestVerifiedScore": 95.5,
  "demographics": {
    "womenOwned": false,
    "minorityOwned": false,
    "lgbtqOwned": false,
    "employeeOwned": true
  },
  "hqCity": "Austin",
  "hqCountry": "United States",
  "hqProvince": "Texas",
  "assessments": [
    {
      "overallScore": 95.5,
      "fullTimeWorkers": 75,
      "impactAreas": [
        {
          "name": "Governance",
          "score": 18.2,
          "impactTopics": [
            {
              "name": "Mission & Engagement",
              "score": 4.5,
              "isIbm": false
            }
          ]
        }
      ]
    }
  ]
}
```

### Usage

1. **Choose operation mode**: Decide whether you want to scrape the company index or a single company profile
2. **Configure input**: Create a JSON file with your desired input parameters (see Input Parameters section)
3. **Run the actor**: Execute the Apify actor with your JSON input
4. **Process results**: Access the structured data from Apify's storage

### Example

#### List Mode Example

```json
{
  "mode": "index",
  "index": {
    "page": 1,
    "pageSize": 25 // max page Size is 250
  }
}
```

#### Single Company Example

```json
{
  "mode": "company",
  "company": {
    "url": "smith-limited"
  }
}
```

### Disclaimer

This actor is designed for legitimate data collection from B Lab's public B Corporation directory. Please ensure your usage complies with bcorporation.net's terms of service and robots.txt policies.

# Actor input Schema

## `mode` (type: `string`):

Choose the operation mode:

- **Index**: Scrapes the B Corp directory to get a paginated list of certified companies. Use this to discover companies or build a full database.
- **Company**: Scrapes detailed data for a specific company including B Corp score, impact areas, headquarters, and certification date.

## `page` (type: `integer`):

The page number to fetch from the B Corp directory (1-indexed). Use this with 'Page Size' to paginate through the full list of certified companies. For example, page 1 with size 100 returns companies 1-100, page 2 returns 101-200, etc.

## `pageSize` (type: `integer`):

Number of companies to return per page. Higher values fetch more data at once but may be slower. Maximum allowed is 250 companies per request.

## `url` (type: `string`):

The B Corp company to scrape. You can provide either:

- **Full URL**: e.g., `https://www.bcorporation.net/en-us/find-a-b-corp/company/patagonia/`
- **Slug only**: e.g., `patagonia`

The slug is the last part of the company URL on bcorporation.net.

## Actor input object example

```json
{
  "mode": "index",
  "page": 1,
  "pageSize": 10
}
```

# Actor output Schema

## `companies` (type: `string`):

List of scraped B Corp companies with their details including name, sector, industry, B Corp score, headquarters location, and impact assessments

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("njoylab/b-corporation-data-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("njoylab/b-corporation-data-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call njoylab/b-corporation-data-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=njoylab/b-corporation-data-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "B-Corporation ESG Data Scraper",
        "description": "Scrapes B-Corporation ESG data from bcorporation.net. Retrieves either paginated lists of certified companies or detailed profiles with assessment scores, demographics, and locations",
        "version": "0.0",
        "x-build-id": "2PEKuof8Ss24eEzdE"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/njoylab~b-corporation-data-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-njoylab-b-corporation-data-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/njoylab~b-corporation-data-scraper/runs": {
            "post": {
                "operationId": "runs-sync-njoylab-b-corporation-data-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/njoylab~b-corporation-data-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-njoylab-b-corporation-data-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "mode"
                ],
                "properties": {
                    "mode": {
                        "title": "Mode",
                        "enum": [
                            "index",
                            "company"
                        ],
                        "type": "string",
                        "description": "Choose the operation mode:\n\n- **Index**: Scrapes the B Corp directory to get a paginated list of certified companies. Use this to discover companies or build a full database.\n- **Company**: Scrapes detailed data for a specific company including B Corp score, impact areas, headquarters, and certification date.",
                        "default": "index"
                    },
                    "page": {
                        "title": "Page Number",
                        "minimum": 1,
                        "type": "integer",
                        "description": "The page number to fetch from the B Corp directory (1-indexed). Use this with 'Page Size' to paginate through the full list of certified companies. For example, page 1 with size 100 returns companies 1-100, page 2 returns 101-200, etc.",
                        "default": 1
                    },
                    "pageSize": {
                        "title": "Page Size",
                        "minimum": 1,
                        "maximum": 250,
                        "type": "integer",
                        "description": "Number of companies to return per page. Higher values fetch more data at once but may be slower. Maximum allowed is 250 companies per request.",
                        "default": 10
                    },
                    "url": {
                        "title": "Company URL or Slug",
                        "type": "string",
                        "description": "The B Corp company to scrape. You can provide either:\n\n- **Full URL**: e.g., `https://www.bcorporation.net/en-us/find-a-b-corp/company/patagonia/`\n- **Slug only**: e.g., `patagonia`\n\nThe slug is the last part of the company URL on bcorporation.net."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
