Website Contact Data Extractor

Pricing

from $3.00 / 1,000 website results

Extract public business contact data from websites, including validated emails, phone numbers, contact/about pages, and social profiles. Delivers clean, deduplicated JSON output for CRM enrichment, lead generation, prospecting, research, and automation workflows.

Rating

0.0

(0)

Developer

Techionik

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 14
  • Monthly active users: 0
  • Last modified: a day ago

Extract public contact details and social profiles from company websites in a clean, structured format.

Website Contact Data Extractor is built for lead generation, CRM enrichment, sales research, market research, and automation workflows. Provide one or more business website URLs, and the Actor scans a small set of high-signal pages on each site, such as home, contact, about, company, team, location, legal, imprint, and impressum pages.

Features

  • Extracts public email addresses from visible text, mailto: links, and JSON-LD structured data
  • Extracts public phone numbers from tel: links, page contact sections, footers, and JSON-LD
  • Detects contact and about page URLs
  • Extracts social profile links from Facebook, Instagram, LinkedIn, Twitter/X, YouTube, and TikTok
  • Crawls only same-website pages to keep results relevant
  • Deduplicates and validates emails and phone numbers
  • Returns one clean structured result per website
  • Includes run summary output for quick quality checks
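
As an illustration of the social profile detection listed above, the following Python sketch maps link hostnames to the output field names. It is not the Actor's source code (the Actor itself runs on Crawlee and Cheerio); the `SOCIAL_HOSTS` table and the `classify_social_links` helper are hypothetical names, and keeping only the first link per network is an assumption.

```python
from urllib.parse import urlparse

# Hostnames mapped to the output field they would populate (assumed mapping).
SOCIAL_HOSTS = {
    "facebook.com": "facebook",
    "instagram.com": "instagram",
    "linkedin.com": "linkedin",
    "twitter.com": "twitter",
    "x.com": "twitter",
    "youtube.com": "youtube",
    "tiktok.com": "tiktok",
}


def classify_social_links(hrefs):
    """Return the first profile URL seen for each supported network."""
    profiles = {}
    for href in hrefs:
        host = (urlparse(href).hostname or "").lower()
        if host.startswith("www."):
            host = host[4:]
        field = SOCIAL_HOSTS.get(host)
        # Assumption: keep the first link per network and ignore later ones.
        if field and field not in profiles:
            profiles[field] = href
    return profiles
```

This sketch matches exact hostnames only, so subdomains such as mobile variants would need extra entries.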

Best For

  • Sales prospecting
  • CRM enrichment
  • Lead list cleanup
  • Company research
  • Market research
  • Agency prospecting
  • Directory enrichment
  • Automation pipelines with Apify, Make, n8n, Zapier, Google Sheets, Airtable, or custom APIs

Input

startUrls

Add one or more public company or business website URLs. URLs without https:// are accepted and normalized automatically.
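
The exact normalization rules are not documented here. A minimal Python sketch of what such normalization might look like, assuming a missing scheme defaults to https:// and the domain is the hostname without www (the `normalize_start_url` helper is hypothetical):

```python
from urllib.parse import urlparse


def normalize_start_url(raw):
    """Add a scheme if missing and return a (url, domain) pair, or None if invalid."""
    candidate = raw.strip()
    if not candidate:
        return None
    if "://" not in candidate:
        candidate = "https://" + candidate  # assumed default scheme
    parsed = urlparse(candidate)
    host = (parsed.hostname or "").lower()
    if parsed.scheme not in ("http", "https") or "." not in host:
        return None
    domain = host[4:] if host.startswith("www.") else host
    # This sketch keeps only scheme, host, and path; ports and query strings are dropped.
    path = parsed.path or "/"
    return f"{parsed.scheme}://{host}{path}", domain
```

For example, `normalize_start_url("www.example.com")` yields the pair `("https://www.example.com/", "example.com")`, which lines up with the `websiteUrl` and `domain` fields in the output.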

The Actor automatically scans the homepage plus selected high-signal pages such as contact, about, company, team, location, legal, imprint, and impressum pages.
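
One way to picture the page selection is a keyword filter over same-site links. The sketch below is an assumption about the approach, not the Actor's actual logic; the keyword tuple comes from the page types named above, while `select_high_signal_links` and its `limit` parameter are hypothetical.

```python
from urllib.parse import urljoin, urlparse

# Path keywords taken from the page types listed above.
HIGH_SIGNAL_KEYWORDS = ("contact", "about", "company", "team", "location",
                        "legal", "imprint", "impressum")


def strip_www(host):
    """Drop a leading www. so both host variants compare as equal."""
    return host[4:] if host.startswith("www.") else host


def select_high_signal_links(base_url, hrefs, limit=10):
    """Resolve hrefs against base_url and keep same-site, keyword-matching pages."""
    base_host = strip_www((urlparse(base_url).hostname or "").lower())
    selected = []
    for href in hrefs:
        parsed = urlparse(urljoin(base_url, href))
        if parsed.scheme not in ("http", "https"):
            continue  # skips mailto:, tel:, javascript: and similar links
        if strip_www((parsed.hostname or "").lower()) != base_host:
            continue  # external site
        if not any(word in parsed.path.lower() for word in HIGH_SIGNAL_KEYWORDS):
            continue
        clean = f"{parsed.scheme}://{parsed.netloc}{parsed.path}"  # drop query and fragment
        if clean not in selected:
            selected.append(clean)
    return selected[:limit]
```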

Example Input

{
  "startUrls": [
    { "url": "https://www.apify.com" },
    { "url": "https://stripe.com" },
    { "url": "https://www.shopify.com" }
  ]
}
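
To send this input from code, something like the following Python sketch could work. It assumes the `apify-client` package is installed and that you pass your own API token and this Actor's ID from its Store page (neither is given here); `build_run_input` and `run_extractor` are hypothetical helper names.

```python
def build_run_input(urls):
    """Build the Actor input object from a list of plain URL strings."""
    return {"startUrls": [{"url": url} for url in urls]}


def run_extractor(token, actor_id, urls):
    """Start a run, wait for it to finish, and return its dataset items."""
    # Imported lazily so build_run_input works without the package installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=build_run_input(urls))
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

`actor_id` is deliberately left as a parameter, because the exact identifier is not stated in this README.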

Output

Each dataset item represents one processed website.

{
  "status": "ok",
  "websiteUrl": "https://www.example.com/",
  "domain": "example.com",
  "emails": ["info@example.com", "sales@example.com"],
  "phones": ["+1 800 123 4567"],
  "contactPage": "https://example.com/contact",
  "aboutPage": "https://example.com/about",
  "facebook": "https://facebook.com/example",
  "instagram": "https://instagram.com/example",
  "linkedin": "https://linkedin.com/company/example",
  "twitter": "https://x.com/example",
  "youtube": "https://youtube.com/@example",
  "tiktok": "https://www.tiktok.com/@example"
}

Status Values

  • ok: Contact data, social links, or a contact/about page were found.
  • empty: The website was scanned successfully, but no useful contact data was found.
  • failed: The website could not be loaded after retries.
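
Read as a decision rule, the three values could be derived roughly as follows. This is a sketch of the rule as described above, not the Actor's code; `derive_status` and its `loaded` flag are hypothetical.

```python
# Fields whose presence counts as "something useful was found" (assumed list).
CONTACT_FIELDS = ("emails", "phones", "contactPage", "aboutPage", "facebook",
                  "instagram", "linkedin", "twitter", "youtube", "tiktok")


def derive_status(result, loaded):
    """Map a result dict and a load flag to ok, empty, or failed."""
    if not loaded:
        return "failed"  # the site could not be loaded after retries
    has_data = any(result.get(field) for field in CONTACT_FIELDS)
    return "ok" if has_data else "empty"
```

Downstream workflows can use the same field to skip `empty` and `failed` items before pushing records to a CRM.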

Output Fields

| Field | Description |
| --- | --- |
| status | Processing result: ok, empty, or failed |
| websiteUrl | Normalized form of the input website URL |
| domain | Website hostname without www |
| emails | Deduplicated public email addresses |
| phones | Deduplicated public phone numbers |
| contactPage | Detected contact page URL |
| aboutPage | Detected about/company/team page URL |
| facebook | Facebook page/profile URL |
| instagram | Instagram profile URL |
| linkedin | LinkedIn company/profile URL |
| twitter | Twitter/X profile URL |
| youtube | YouTube channel/profile URL |
| tiktok | TikTok profile URL |
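
For spreadsheet or CRM imports, the list-valued fields usually need flattening. A minimal Python sketch, assuming items shaped like the example output and a `"; "` separator chosen purely for illustration (`items_to_csv` is a hypothetical helper):

```python
import csv
import io

COLUMNS = ["status", "websiteUrl", "domain", "emails", "phones", "contactPage",
           "aboutPage", "facebook", "instagram", "linkedin", "twitter",
           "youtube", "tiktok"]


def items_to_csv(items):
    """Flatten dataset items into CSV text, joining list fields with '; '."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    for item in items:
        # Missing or null fields become empty cells.
        row = {column: item.get(column) or "" for column in COLUMNS}
        row["emails"] = "; ".join(item.get("emails") or [])
        row["phones"] = "; ".join(item.get("phones") or [])
        writer.writerow(row)
    return buffer.getvalue()
```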

How It Works

  1. The Actor normalizes and validates input URLs.
  2. It loads each website with Crawlee and Cheerio for fast, low-cost scraping.
  3. It extracts emails, phone numbers, contact pages, about pages, and social links.
  4. It parses JSON-LD structured data before scripts are removed.
  5. It discovers and scans high-signal same-domain pages.
  6. It deduplicates and validates extracted data.
  7. It saves one result per website to the default dataset and stores run statistics in SUMMARY.
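
The email part of steps 3, 4, and 6 can be sketched in Python as shown below. The Actor itself uses Cheerio, so this regex-based version is only a stand-in to show the order of operations; the patterns, the `jsonld_emails` and `extract_emails` helpers, and the lowercase-then-deduplicate rule are all assumptions.

```python
import json
import re

EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)*\.[A-Za-z]{2,}")
MAILTO_RE = re.compile(r'href=["\']mailto:([^"\'?]+)', re.IGNORECASE)
JSONLD_RE = re.compile(
    r'<script[^>]*type=["\']application/ld\+json["\'][^>]*>(.*?)</script>',
    re.IGNORECASE | re.DOTALL,
)


def jsonld_emails(node):
    """Recursively collect string values stored under an "email" key."""
    found = []
    if isinstance(node, dict):
        for key, value in node.items():
            if key == "email" and isinstance(value, str):
                found.append(value)
            else:
                found.extend(jsonld_emails(value))
    elif isinstance(node, list):
        for item in node:
            found.extend(jsonld_emails(item))
    return found


def extract_emails(html):
    """Collect emails from JSON-LD, mailto: links, and visible text."""
    candidates = []
    # Read JSON-LD first, while the <script> blocks are still present.
    for block in JSONLD_RE.findall(html):
        try:
            candidates.extend(jsonld_emails(json.loads(block)))
        except json.JSONDecodeError:
            continue  # ignore malformed structured data
    candidates.extend(MAILTO_RE.findall(html))
    # Drop scripts, styles, and tags, then scan the remaining visible text.
    text = re.sub(r"<(script|style)\b.*?</\1>", " ", html, flags=re.IGNORECASE | re.DOTALL)
    text = re.sub(r"<[^>]+>", " ", text)
    candidates.extend(EMAIL_RE.findall(text))
    # Validate, lowercase, and deduplicate while preserving first-seen order.
    seen, emails = set(), []
    for candidate in candidates:
        email = candidate.strip().lower()
        if email.startswith("mailto:"):
            email = email[len("mailto:"):]
        if EMAIL_RE.fullmatch(email) and email not in seen:
            seen.add(email)
            emails.append(email)
    return emails
```

Reading JSON-LD before stripping scripts matters because structured data lives inside `<script>` tags and would otherwise be discarded along with ordinary JavaScript.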

Notes And Limitations

  • Only publicly available website data is extracted.
  • Login-only pages, CAPTCHA-protected pages, and contact forms without visible contact details are not supported.
  • Heavily JavaScript-rendered websites may return fewer results because this Actor uses fast HTML scraping instead of a browser.
  • Some websites intentionally do not publish emails or phone numbers.
  • Phone extraction is conservative to avoid collecting dates, IDs, and tracking numbers.
  • Always use extracted contact data responsibly and follow applicable privacy, anti-spam, and data protection laws.
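
To make the conservative phone extraction concrete, here is a Python sketch of one possible filter. The specific rules (8 to 15 digits, rejecting date-shaped strings, rejecting bare digit runs with no separators) are assumptions chosen for illustration, not the Actor's documented rules; `extract_phones` is a hypothetical helper.

```python
import re

# A digit, then at least six digits or separators, ending in a digit.
PHONE_CANDIDATE_RE = re.compile(r"\+?\(?\d[\d\s().-]{6,}\d")
# Strings shaped like 2024-01-15 or 15.01.2024 are treated as dates.
DATE_RE = re.compile(r"^\d{1,4}[./-]\d{1,2}[./-]\d{1,4}$")


def extract_phones(text):
    """Return phone-like strings, skipping dates and unformatted ID-like numbers."""
    phones, seen = [], set()
    for match in PHONE_CANDIDATE_RE.finditer(text):
        raw = match.group()
        digits = re.sub(r"\D", "", raw)
        if not 8 <= len(digits) <= 15:
            continue  # too short or too long for a plausible phone number
        if DATE_RE.match(raw):
            continue  # looks like a date
        if raw.isdigit():
            continue  # bare digit run with no separators: likely an ID or tracking number
        if digits not in seen:  # deduplicate by digits so formatting variants collapse
            seen.add(digits)
            phones.append(raw)
    return phones
```

The trade-off is visible in the last rule: a genuine number written without any separators or leading + would be skipped, which is the price of not collecting order IDs.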

Pricing Recommendation

Recommended marketplace pricing: paid per result.

Suggested starting price: $2.00 to $4.00 per 1,000 website results, with a small free trial allowance if available. This pricing is easy for lead generation users to understand because they pay per processed website, while the Actor keeps compute cost low by using Cheerio instead of browser automation.

Search Keywords

website contact extractor, email scraper, phone number scraper, company contact scraper, lead generation, CRM enrichment, business email finder, website social links, LinkedIn company finder, sales prospecting, contact data, public emails, Apify contact scraper