# Bluesky Scraper - Posts Profiles Feeds (`openclawmara/bluesky-scraper`) Actor

Scrape Bluesky posts, profiles, feeds and search results. Extract text, authors, engagement stats, media. No auth required. Social listening, trend monitoring, LLM training data.

- **URL**: https://apify.com/openclawmara/bluesky-scraper.md
- **Developed by:** [OpenClaw Mara](https://apify.com/openclawmara) (community)
- **Categories:** Social media, AI, Developer tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

$5.00 / 1,000 post scrapeds

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## 🦋 Bluesky Scraper — Posts, Profiles & Social Graph

**Structured data from the fastest-growing decentralized social network. $0.005 per item.**

Scrape Bluesky (AT Protocol) for posts, profiles, search results, followers, and social graphs. No login, no API key, no OAuth — powered by Bluesky's public AT Protocol endpoints.

Perfect for social listening on the post-Twitter open social web, audience research, community mapping, influencer discovery, and backing up your own Bluesky content.

### 🚀 What does this Actor do?

Bluesky is growing into **the** open alternative to X/Twitter, and unlike the old Twitter API era, its data is publicly accessible. This Actor turns that into a structured, scheduled data source across four capabilities:

- **Profile scraping** — Pass a list of handles (`jay.bsky.team`, `pfrazee.com`, any custom-domain handle) and get profile metadata + recent posts with engagement.
- **Search** — Full-text search across Bluesky posts with author info, timestamps, and content.
- **Social graph extraction** — Pull followers and/or following lists (up to 2,000 per account) for community mapping.
- **Bulk mode** — Multiple profiles in one run with per-profile post limits.

All output is JSON, ready to load into Notion, Airtable, a vector DB, or a BI dashboard.

### 💡 Use Cases

#### 1. Brand & keyword listening on open social
Track mentions of your brand, product, or industry keywords on Bluesky — the platform where a lot of the tech, media, and AI conversation has migrated.

```json
{
  "searchQuery": "retrieval augmented generation",
  "maxSearchResults": 200
}
````

#### 2. Influencer & community mapping

Identify who drives conversation in a niche by pulling follower graphs of known accounts and finding the overlap.

```json
{
  "handles": ["dril.bsky.social", "pfrazee.com"],
  "includeFollowers": true,
  "maxFollowers": 2000,
  "maxPosts": 0
}
```

#### 3. Content archiving & compliance

Back up your own Bluesky post history, or archive a journalist's / public figure's posts for research or compliance.

```json
{
  "handles": ["your-handle.bsky.social"],
  "maxPosts": 500
}
```

#### 4. Competitive & trend monitoring

Schedule a weekly run against your competitors' handles and track follower growth, posting cadence, and top-performing posts.

```json
{
  "handles": ["competitor-1.bsky.social", "competitor-2.bsky.social"],
  "maxPosts": 100
}
```

### 📊 Output Example

```json
{
  "handle": "jay.bsky.team",
  "displayName": "Jay Graber",
  "description": "CEO of Bluesky",
  "followersCount": 245893,
  "followsCount": 1247,
  "postsCount": 3421,
  "posts": [
    {
      "uri": "at://did:plc:abc.../app.bsky.feed.post/xyz",
      "text": "Excited to announce our latest feature...",
      "createdAt": "2025-03-15T14:30:00.000Z",
      "likeCount": 842,
      "repostCount": 156,
      "replyCount": 73
    }
  ]
}
```

### ⚙️ Input Parameters

| Parameter | Type | Description |
|-----------|------|-------------|
| `handles` | array | Bluesky handles without `@` — `jay.bsky.team`, `pfrazee.com`. Custom domains supported. |
| `searchQuery` | string | Full-text keyword/phrase search across all public Bluesky posts. |
| `maxPosts` | int | Posts per profile (default 50, up to 500). Set `0` for profile metadata only. |
| `maxSearchResults` | int | 1–500 (default 100) for search mode. |
| `includeFollowers` | bool | Pull the followers list for each handle. |
| `includeFollowing` | bool | Pull the "following" list for each handle. |
| `maxFollowers` | int | Cap follower/following fetch at 1–2000 per account (default 200). |

### 📤 Output Fields

| Field | Description |
|-------|-------------|
| `handle` | User's Bluesky handle |
| `did` | Decentralized identifier (stable ID across handle changes) |
| `displayName`, `description`, `avatar` | Profile metadata |
| `followersCount`, `followsCount`, `postsCount` | Profile counters |
| `posts[]` | Array of recent posts with text, timestamps, `likeCount`, `repostCount`, `replyCount` |
| `followers[]` / `following[]` | Social graph (when enabled) |

### 💰 Pricing & Performance

- **Pay-per-event:** **$0.005 per item** (profile, post, search result, or social-graph entry).
- **Typical monthly cost:** $1–$4 for weekly brand listening or competitor tracking at realistic volumes.
- **Speed:** ~200 posts/minute, ~500 social-graph entries/minute.
- **No auth required** — no OAuth token, no app password, no risk of rate-limit bans tied to your account.

### 🔌 Integrations

- **Zapier / Make / n8n** — schedule and push new search hits to Slack, Discord, or email.
- **Notion / Airtable / Google Sheets** — build a mentions database with auto-updates.
- **LangChain / LlamaIndex / vector DBs** — embed posts for semantic search over Bluesky conversations (RAG over open social).
- **BI tools (Metabase, Superset, Hex)** — track follower growth and post engagement over time.
- **Apify webhooks** — POST new matches straight to your own endpoint.

### ❓ FAQ

**Do I need a Bluesky account or app password?**
No. The Actor uses the AT Protocol's public endpoints — everything extractable is already publicly visible.

**How deep can I go on followers/following?**
Up to 2,000 per account per run. For accounts with 10K+ followers, run multiple times with different starting cursors or sample strategically.

**How fresh is the search data?**
Real-time — search hits the live Bluesky API each run. New posts become searchable within seconds of being published.

**Does this work with custom-domain handles?**
Yes. Bluesky supports handles like `pfrazee.com` or `jay.bsky.team` — pass them the same as any other handle.

**Can I fetch replies and thread context?**
This Actor returns each post's `replyCount` and thread URI. For deeper thread traversal, chain runs using the URIs as the next seed.

**Can I filter by language or post type?**
Bluesky's public search doesn't expose language filters yet. Posts include language hints in the raw `record.langs` field when you keep the full record — filter client-side.

### 🔑 Keywords

Bluesky scraper, AT Protocol scraper, Bluesky posts data, Bluesky profile scraper, Bluesky search API, decentralized social media data, Bluesky followers extractor, open social scraping, Bluesky social listening, Bluesky analytics, AT Protocol data, Bluesky community mapping, influencer research Bluesky, post-Twitter data, fediverse scraping, Bluesky content archive.

### 📝 Changelog

- **v1.0** — Initial release. 4 capabilities (profiles, search, followers, following), up to 500 posts per profile, up to 2000 social-graph entries.

# Actor input Schema

## `handles` (type: `array`):

List of Bluesky handles to scrape (e.g. jay.bsky.team, pfrazee.com). Leave empty if using searchQuery only.

## `searchQuery` (type: `string`):

Search for posts containing this keyword/phrase. Leave empty to skip search.

## `maxPosts` (type: `integer`):

Maximum number of posts to fetch per profile.

## `maxSearchResults` (type: `integer`):

Maximum number of search results to return.

## `includeFollowers` (type: `boolean`):

Fetch the list of followers for each profile.

## `includeFollowing` (type: `boolean`):

Fetch the list of accounts each profile follows.

## `maxFollowers` (type: `integer`):

Maximum number of followers/following to fetch per profile.

## Actor input object example

```json
{
  "maxPosts": 50,
  "maxSearchResults": 100,
  "includeFollowers": false,
  "includeFollowing": false,
  "maxFollowers": 200
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("openclawmara/bluesky-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("openclawmara/bluesky-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call openclawmara/bluesky-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=openclawmara/bluesky-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Bluesky Scraper - Posts Profiles Feeds",
        "description": "Scrape Bluesky posts, profiles, feeds and search results. Extract text, authors, engagement stats, media. No auth required. Social listening, trend monitoring, LLM training data.",
        "version": "1.0",
        "x-build-id": "QsZAIP4nxclGi9BRS"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/openclawmara~bluesky-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-openclawmara-bluesky-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/openclawmara~bluesky-scraper/runs": {
            "post": {
                "operationId": "runs-sync-openclawmara-bluesky-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/openclawmara~bluesky-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-openclawmara-bluesky-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "handles": {
                        "title": "Bluesky Handles",
                        "type": "array",
                        "description": "List of Bluesky handles to scrape (e.g. jay.bsky.team, pfrazee.com). Leave empty if using searchQuery only.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search for posts containing this keyword/phrase. Leave empty to skip search."
                    },
                    "maxPosts": {
                        "title": "Max Posts per Profile",
                        "minimum": 1,
                        "maximum": 500,
                        "type": "integer",
                        "description": "Maximum number of posts to fetch per profile.",
                        "default": 50
                    },
                    "maxSearchResults": {
                        "title": "Max Search Results",
                        "minimum": 1,
                        "maximum": 500,
                        "type": "integer",
                        "description": "Maximum number of search results to return.",
                        "default": 100
                    },
                    "includeFollowers": {
                        "title": "Include Followers",
                        "type": "boolean",
                        "description": "Fetch the list of followers for each profile.",
                        "default": false
                    },
                    "includeFollowing": {
                        "title": "Include Following",
                        "type": "boolean",
                        "description": "Fetch the list of accounts each profile follows.",
                        "default": false
                    },
                    "maxFollowers": {
                        "title": "Max Followers/Following",
                        "minimum": 1,
                        "maximum": 2000,
                        "type": "integer",
                        "description": "Maximum number of followers/following to fetch per profile.",
                        "default": 200
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
