US Business Formation Scraper — New LLC & Company Leads avatar

US Business Formation Scraper — New LLC & Company Leads

Pricing

$4.00 / 1,000 business record scrapeds

Go to Apify Store
US Business Formation Scraper — New LLC & Company Leads

US Business Formation Scraper — New LLC & Company Leads

Scrape newly registered US businesses & LLCs from state open-data registries (CO, OR, CT, PA). Get business name, entity type, formation date, status, registered agent and address — filter by date, type & keyword, with monitoring for only-new filings. Keyless, no browser.

Pricing

$4.00 / 1,000 business record scrapeds

Rating

0.0

(0)

Developer

Scrape Sage

Scrape Sage

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

10 days ago

Last modified

Share

US Business Formation Scraper — New LLC & Company Registration Leads

Pull newly registered US businesses and LLCs straight from official state open-data registries — New York, Colorado, Pennsylvania, Connecticut and Oregon — with the fields that make a fresh lead actionable: business name, entity type, formation/registration date, status, registered agent / owner, county, principal address, and a 0–100 lead score. Every extra column each state publishes is also included under additionalData. Filter by date, entity type and keyword, and use monitoring mode to capture only businesses registered since your last run.

New filings are the highest-intent B2B signal there is — a brand-new company needs banking, insurance, accounting, payroll, a website, software and more. This actor turns the public record into a clean, deduplicated lead list. Keyless, no login, no browser — direct from each state's Socrata data API.

Supported states (the only ones with data): New York, Colorado, Pennsylvania, Connecticut, Oregon. These are the US states that publish a clean, queryable open-data business registry with formation dates. The States input only lets you choose from this list, so a run never comes back empty because of an unsupported state. More states are added as they become available — request one on the Issues tab.

Why this business-formation scraper?

DataTypical scrapersThis actor
Multiple states in one run❌ single state✅ NY + CO + PA + CT + OR
Business name + entity type
Formation / registration datepartial✅ (filter + monitor on it)
Normalized entity category (LLC, Corp…)
Registered agent / owner (person and company) + address✅ (where the state publishes it)
Principal address + ZIP + county (+ geo)partial
Business email + NAICS industry✅ (CT)
Diversity flags (woman/veteran/minority/LGBTQ-owned)✅ (CT)
Every extra published column (additionalData)
Lead score (freshness + completeness)
Monitoring (only new filings)

Use cases

  • Fresh B2B lead generation — reach brand-new businesses before competitors: business insurance, banking, accounting/bookkeeping, payroll, merchant services, web/marketing agencies, registered-agent and compliance services, B2B SaaS.
  • New-mover/territory alerts — schedule daily runs filtered by state + registeredAfter to get a feed of that day's new LLCs and corporations.
  • Market & economic research — measure business-formation trends by state, entity type, and geography over time.
  • Registered-agent & compliance — pull agent-of-record data (Colorado publishes full registered-agent name + address).
  • Supplier diversity — filter Connecticut for woman-, veteran-, minority- and LGBTQ-owned organizations.

How to use

  1. Sign up for Apify — the free plan is enough to try this actor.
  2. Open the US Business Formation Scraper, pick states, set registeredAfter (e.g. last 30 days) and any entity types, then click Start.
  3. Watch businesses stream into the dataset table (newest first).
  4. Export as JSON, CSV, Excel, XML, or RSS — or pull results via the Apify API.

Input

{
"states": ["Colorado", "Oregon", "Connecticut", "Pennsylvania"],
"registeredAfter": "2026-05-01",
"entityTypes": ["LLC", "Corporation"],
"statusActiveOnly": true,
"maxResultsPerState": 500,
"maxResults": 1000,
"monitorMode": false
}
  • states — any of New York, Colorado, Pennsylvania, Connecticut, Oregon (default: all five). More states added on request.
  • registeredAfterYYYY-MM-DD; keep only businesses formed on/after this date (the key to fresh leads). Empty = all, newest first.
  • entityTypesLLC, Corporation, Nonprofit, Partnership, Sole Proprietorship, Trade Name (matched flexibly across each state's wording). Empty = all.
  • keyword — optional full-text filter (matches business name and indexed fields).
  • statusActiveOnly — drop clearly-inactive records (dissolved/revoked/expired…).
  • maxResultsPerState / maxResults — caps per state and for the whole run.
  • monitorMode — emit only filings not seen in previous runs (see below).

Output

One record per business (type: "business"):

{
"type": "business",
"state": "Colorado",
"stateCode": "CO",
"businessName": "Blue Ridge Ventures LLC",
"entityType": "Limited Liability Company",
"entityCategory": "LLC",
"registryNumber": "20261234567",
"status": "Good Standing",
"isActive": true,
"formationDate": "2026-06-10T00:00:00.000",
"ageInDays": 5,
"jurisdiction": "Colorado",
"county": null,
"principalAddress": { "street": "123 Main St", "city": "Denver", "state": "CO", "zip": "80202" },
"city": "Denver",
"postalCode": "80202",
"registeredAgent": { "name": "Jane Smith", "role": "Registered Agent", "isOrganization": false, "street": "123 Main St", "city": "Denver", "state": "CO", "zip": "80202" },
"email": null,
"naicsCode": null,
"womanOwned": null,
"leadScore": 84,
"additionalData": { "principalcountry": "US", "agentprincipaladdress2": "Suite 400" },
"sourceUrl": "https://data.colorado.gov/resource/4ykn-tg5h.json?$where=entityid='20261234567'",
"scrapedAt": "2026-06-15T12:00:00.000Z"
}

Fields vary slightly by state — e.g. New York adds county and the process agent; Colorado includes the registered agent (person or company) with its own address; Connecticut adds business email, NAICS industry and woman/veteran/minority/LGBTQ-owned flags; Pennsylvania adds county and geo coordinates. Everything else a state publishes (entity status dates, annual-report due date, citizenship, business details…) is preserved under additionalData. A field is null only when that state doesn't publish it.

Monitoring mode (only new filings)

Turn on monitorMode and the actor remembers (in a named key-value store) every filing it has returned, so future runs emit only businesses not seen before. Combined with a daily Schedule and a registeredAfter date, you get a clean, de-duplicated feed of each day's brand-new businesses — exactly what a fresh-lead pipeline needs. It works with Schedules, not against them: the schedule triggers the run, monitoring decides which records are new.

Automate & schedule

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'MY_APIFY_TOKEN' });
const run = await client.actor('scrapesage/us-business-formation-scraper').call({
states: ['Colorado', 'Pennsylvania'],
registeredAfter: '2026-06-01',
entityTypes: ['LLC'],
monitorMode: true,
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(`Got ${items.length} new businesses`);

Integrate with any app

  • Make — multi-step automation scenarios.
  • Zapier — push new business leads straight into your CRM.
  • Slack — get a daily new-business digest.
  • Google Drive / Sheets — auto-export every run to a spreadsheet.
  • Airbyte — pipe results into your data warehouse.
  • GitHub — trigger runs from commits or releases.

Use with AI assistants (MCP)

The output is clean, LLM-ready JSON. Call this actor from Claude, ChatGPT, or any agent framework through the Apify MCP server — ask your assistant to "list new Colorado LLCs registered this month with their registered agents" and let it run this scraper.

More scrapers from scrapesage

Build a complete B2B lead-gen & registry stack:

Tips

  • Fresh leads: set registeredAfter to ~30 days ago and turn on monitorMode + a daily Schedule for a rolling new-business feed.
  • Oregon publishes a rolling "newly registered" window — ideal for the fresh-lead use case.
  • Colorado is the richest source — it includes the registered agent's name and full address.
  • Cost control: use maxResultsPerState and entity-type filters to keep runs tight.

FAQ

Which states are supported? New York, Colorado, Pennsylvania, Connecticut and Oregon today — states that publish clean, queryable business registries as open data (including NY, the largest by formation volume). More states are added on request.

How fresh is the data? It mirrors each state's open-data portal, which most states refresh daily or weekly. Use registeredAfter + monitoring to track the newest filings.

Are emails/phones included? State business registries publish names and addresses (and, in Colorado, the registered agent) — not emails or phones. Pair this with the Google Maps Email Extractor to append contact details.

Can I export to Google Sheets, CSV, or Excel? Yes — one click in the dataset view, or automatically via the Google Drive integration.

Is this legal? This actor reads publicly available government open data. You are responsible for using it in compliance with applicable laws (e.g. CAN-SPAM/TCPA for outreach) and each portal's terms.

Need help?

Open an issue on the actor's Issues tab, or visit the Apify help center. Requests for more states or fields are welcome — this actor is actively maintained.