# Facebook Groups Scraper (`caprolok/facebook-groups-scraper`) Actor

Efficiently extract data from public Facebook groups: access group and post URLs, post texts, timestamps, likes, comments count, and basic user info and much more. Download in JSON, CSV, Excel for easy analysis and integration in apps, spreadsheets, or reports.

- **URL**: https://apify.com/caprolok/facebook-groups-scraper.md
- **Developed by:** [Caprolok](https://apify.com/caprolok) (community)
- **Categories:** Social media, Lead generation
- **Stats:** 1,088 total users, 9 monthly users, 100.0% runs succeeded, 19 bookmarks
- **User rating**: 3.56 out of 5 stars

## Pricing

$60.00/month + usage

To use this Actor, you pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period.You also pay for the Apify platform usage, which gets cheaper the higher Apify subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#rental-actors

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Facebook Group Scraper

### 🚀 Overview

Facebook Group Scraper is a sophisticated, intuitive tool designed for efficient data extraction from public Facebook groups. This powerful tool simplifies the process of gathering key information, facilitating a wide range of analysis and insights into public sentiments, market trends, and social dynamics. Ideal for researchers, marketers, and social media analysts, it ensures a seamless data collection experience.

### 🛠 Key Features

- **Group Content**: Analyze the content shared within groups to understand popular topics, engagement patterns, and content strategies.
- **Timestamps**: Keep track of when posts and comments are made to identify activity peaks and engagement timelines.
- **Engagement Metrics**: Delve into the intricacies of group interactions with detailed metrics, including:
  - **Likes Count**: Measure the appeal and popularity of group posts.
  - **Comments Count**: Assess member engagement and feedback.
  - **Shares Count**: Evaluate the virality and reach of shared content.
  - **Video View Count**: For groups with video content, track views to gauge content popularity and engagement.
- **Detailed Group Insights**: Extract in-depth information about groups, such as description, category, member count, and more.

- **Versatile Application**: Perfect for a multitude of purposes, such as:

  - Tracking changes in consumer attitudes.
  - Undertaking in-depth market research.
  - Analyzing social media trends and impacts.
  - Detecting misinformation, fake news, and hate speech.

- **Multiple Output Formats**: Supports exporting data in JSON, XML, CSV, Excel, or HTML formats to meet diverse analytical needs.

### 📋 Easy to Start

Getting started with Facebook Group Scraper is straightforward:

1. Open the Facebook Group Scraper tool on Apify Store.
2. Enter the public URL(s) of the Facebook Group(s) you want to analyze.
3. Click "Save & Start" to begin the data extraction process.
4. Download the compiled data in your preferred format for analysis and integration into your projects.

#### Input Example

Provide the input in JSON format to configure your scraping task:

```json
{
  "maxResults": 60,
  "startUrls": [
    {
      "url": "https://www.facebook.com/groups/germtheory.vs.terraintheory"
    },
    {
      "url": "https://www.facebook.com/groups/1867685286844915"
    }
  ]
}
````

### Output Sample

Following successful data extraction, the output is presented in a structured table format, as shown in the image below. This table includes key metrics and insights derived from the specified Facebook Groups, providing a clear overview of the data collected.

![Output Table Demo](https://caprolok-assets.s3.amazonaws.com/Facebook+Group+Scraper/output_table_demo.png)

### Feedback

We are dedicated to providing a high-quality experience with our Facebook Group Scraper and value your input. If you encounter any issues, have suggestions for improvements. Your feedback is crucial for us to continue enhancing our tool and supporting the community.

- **Reporting Issues**: If you find any bugs or experience difficulties, please report these through the issues tab. This allows us to directly engage with your concerns and work on solutions efficiently.

We strive to respond to all issues as promptly as possible.

### FAQ

#### Can I scrape private Facebook groups?

No, the Facebook Group Scraper is designed exclusively for public groups. It cannot access or extract data from private groups due to privacy restrictions and Facebook's terms of service.

#### How do I avoid being blocked by Facebook?

To minimize the risk of being blocked by Facebook, ensure you adhere to the platform's scraping guidelines, use the tool responsibly, and consider implementing delays between requests. It's also beneficial to use the proxy options provided to mimic natural browsing patterns.

#### What formats can I download the scraped data in?

The Facebook Group Scraper supports multiple data formats for download, including JSON, XML, CSV, Excel, and HTML. This flexibility allows you to choose the format that best suits your analysis tools and preferences.

#### Is there a limit on the number of posts I can scrape from a group?

While there is no set limit on the number of posts you can scrape, practical constraints such as API rate limits, resource availability, and adherence to Facebook's policies may affect the volume of data you can reliably collect.

# Actor input Schema

## `startUrls` (type: `array`):

Insert a URL of a valid Facebook group. Only public Facebook groups can be scraped.

## `maxResults` (type: `integer`):

Select the number of posts you want to scrape.If this limit is not set, by default 15 posts will be returned

## `sortBy` (type: `string`):

Select sorting order by which the posts should be scraped.

## Actor input object example

```json
{
  "startUrls": [
    {
      "url": "https://www.facebook.com/groups/germtheory.vs.terraintheory"
    }
  ],
  "maxResults": 15,
  "sortBy": "TOP_POSTS"
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        {
            "url": "https://www.facebook.com/groups/germtheory.vs.terraintheory"
        }
    ]
};

// Run the Actor and wait for it to finish
const run = await client.actor("caprolok/facebook-groups-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "startUrls": [{ "url": "https://www.facebook.com/groups/germtheory.vs.terraintheory" }] }

# Run the Actor and wait for it to finish
run = client.actor("caprolok/facebook-groups-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "startUrls": [
    {
      "url": "https://www.facebook.com/groups/germtheory.vs.terraintheory"
    }
  ]
}' |
apify call caprolok/facebook-groups-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=caprolok/facebook-groups-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Facebook Groups Scraper",
        "description": "Efficiently extract data from public Facebook groups: access group and post URLs, post texts, timestamps, likes, comments count, and basic user info and much more. Download in JSON, CSV, Excel for easy analysis and integration in apps, spreadsheets, or reports.",
        "version": "0.0",
        "x-build-id": "jP70fMVQTNyz5RqSC"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/caprolok~facebook-groups-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-caprolok-facebook-groups-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/caprolok~facebook-groups-scraper/runs": {
            "post": {
                "operationId": "runs-sync-caprolok-facebook-groups-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/caprolok~facebook-groups-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-caprolok-facebook-groups-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "startUrls": {
                        "title": "Facebook Public Group URLs ",
                        "type": "array",
                        "description": "Insert a URL of a valid Facebook group. Only public Facebook groups can be scraped.",
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    },
                    "maxResults": {
                        "title": "Results Amount",
                        "maximum": 1024,
                        "type": "integer",
                        "description": "Select the number of posts you want to scrape.If this limit is not set, by default 15 posts will be returned",
                        "default": 15
                    },
                    "sortBy": {
                        "title": "Sorting Order",
                        "enum": [
                            "CHRONOLOGICAL",
                            "RECENT_ACTIVITY",
                            "TOP_POSTS"
                        ],
                        "type": "string",
                        "description": "Select sorting order by which the posts should be scraped.",
                        "default": "TOP_POSTS"
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
