Website Contact Data Extractor
Pricing
from $3.00 / 1,000 website results
Extract public business contact data from websites, including validated emails, phone numbers, contact/about pages, and social profiles. Delivers clean, deduplicated JSON output for CRM enrichment, lead generation, prospecting, research, and automation workflows.
Developer
Techionik
Maintained by Community
Extract public contact details and social profiles from company websites in a clean, structured format.
Website Contact Data Extractor is built for lead generation, CRM enrichment, sales research, market research, and automation workflows. Provide one or more business website URLs and the Actor scans a small set of high-signal pages such as home, contact, about, company, team, location, legal, imprint, and impressum pages.
Features
- Extracts public email addresses from visible text, mailto: links, and JSON-LD structured data
- Extracts public phone numbers from tel: links, page contact sections, footers, and JSON-LD
- Detects contact and about page URLs
- Extracts social profile links from Facebook, Instagram, LinkedIn, Twitter/X, YouTube, and TikTok
- Crawls only same-website pages to keep results relevant
- Deduplicates and validates emails and phone numbers
- Returns one clean structured result per website
- Includes run summary output for quick quality checks
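The email extraction and deduplication features listed above could work roughly as follows. This is a minimal sketch using only the Python standard library, not the Actor's actual code; `ContactHTMLParser`, `emails_from_json_ld`, and `extract_emails` are hypothetical names, and the email pattern is a deliberately simple assumption.

```python
import json
import re
from html.parser import HTMLParser
from urllib.parse import unquote

# Assumption: a simple pattern, not a full RFC 5322 validator.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class ContactHTMLParser(HTMLParser):
    """Collects mailto: targets, JSON-LD blobs, and visible text from one page."""

    def __init__(self):
        super().__init__()
        self.mailto, self.json_ld, self.text = [], [], []
        self._ld_buffer = None   # list while inside a JSON-LD <script>
        self._skip = False       # True inside any <script> or <style>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        href = attrs.get("href") or ""
        if tag == "a" and href.lower().startswith("mailto:"):
            # Drop the scheme and any ?subject=... query part.
            self.mailto.append(unquote(href[7:].split("?")[0]))
        elif tag == "script":
            self._skip = True
            script_type = (attrs.get("type") or "").strip().lower()
            self._ld_buffer = [] if script_type == "application/ld+json" else None
        elif tag == "style":
            self._skip = True

    def handle_endtag(self, tag):
        if tag == "script" and self._ld_buffer is not None:
            self.json_ld.append("".join(self._ld_buffer))
            self._ld_buffer = None
        if tag in ("script", "style"):
            self._skip = False

    def handle_data(self, data):
        if self._ld_buffer is not None:
            self._ld_buffer.append(data)
        elif not self._skip:
            self.text.append(data)


def emails_from_json_ld(node):
    """Recursively yields string values of any "email" key in parsed JSON-LD."""
    if isinstance(node, dict):
        for key, value in node.items():
            if key == "email" and isinstance(value, str):
                yield value
            else:
                yield from emails_from_json_ld(value)
    elif isinstance(node, list):
        for item in node:
            yield from emails_from_json_ld(item)


def extract_emails(html):
    """Returns validated, lowercased, deduplicated emails in first-seen order."""
    parser = ContactHTMLParser()
    parser.feed(html)
    parser.close()
    candidates = list(parser.mailto)
    candidates += EMAIL_RE.findall(" ".join(parser.text))
    for blob in parser.json_ld:
        try:
            candidates += emails_from_json_ld(json.loads(blob))
        except json.JSONDecodeError:
            continue  # ignore malformed JSON-LD
    seen, result = set(), []
    for raw in candidates:
        email = raw.strip().lower()
        if email.startswith("mailto:"):
            email = email[len("mailto:"):]
        if EMAIL_RE.fullmatch(email) and email not in seen:
            seen.add(email)
            result.append(email)
    return result
```

Lowercasing before deduplication is a design choice in this sketch so that `Sales@Example.com` and `sales@example.com` collapse into one entry; addresses that appear only inside ordinary scripts are ignored.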
Best For
- Sales prospecting
- CRM enrichment
- Lead list cleanup
- Company research
- Market research
- Agency prospecting
- Directory enrichment
- Automation pipelines with Apify, Make, n8n, Zapier, Google Sheets, Airtable, or custom APIs
Input
startUrls
Add one or more public company or business website URLs. URLs without https:// are accepted and normalized automatically.
The Actor automatically scans the homepage plus selected high-signal pages such as contact, about, company, team, location, legal, imprint, and impressum pages.
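To illustrate what normalizing a URL entered without `https://` might involve, here is a small Python sketch. It is an assumption about reasonable behavior, not the Actor's actual logic: `normalize_start_url` is a hypothetical helper, and dropping the query string and fragment is a choice made for this example only.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_start_url(raw):
    """Returns (website_url, domain), or None if the input has no usable host."""
    candidate = raw.strip()
    if not candidate:
        return None
    if "://" not in candidate:
        candidate = "https://" + candidate  # accept bare hostnames
    try:
        parts = urlsplit(candidate)
        port = parts.port  # raises ValueError for a non-numeric port
    except ValueError:
        return None
    host = parts.hostname  # lowercased by urlsplit, None if missing
    if parts.scheme not in ("http", "https") or not host or "." not in host:
        return None
    netloc = host if port is None else f"{host}:{port}"
    # Assumption: query string and fragment are dropped for the website URL.
    website_url = urlunsplit((parts.scheme, netloc, parts.path or "/", "", ""))
    domain = host[4:] if host.startswith("www.") else host
    return website_url, domain
```

For example, `normalize_start_url("www.Example.com")` returns `("https://www.example.com/", "example.com")`, which has the same shape as the `websiteUrl` and `domain` fields in the output.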
Example Input
{
  "startUrls": [
    { "url": "https://www.apify.com" },
    { "url": "https://stripe.com" },
    { "url": "https://www.shopify.com" }
  ]
}
Output
Each dataset item represents one processed website.
{
  "status": "ok",
  "websiteUrl": "https://www.example.com/",
  "domain": "example.com",
  "emails": ["info@example.com", "sales@example.com"],
  "phones": ["+1 800 123 4567"],
  "contactPage": "https://example.com/contact",
  "aboutPage": "https://example.com/about",
  "facebook": "https://facebook.com/example",
  "instagram": "https://instagram.com/example",
  "linkedin": "https://linkedin.com/company/example",
  "twitter": "https://x.com/example",
  "youtube": "https://youtube.com/@example",
  "tiktok": "https://www.tiktok.com/@example"
}
Status Values
- ok: Contact data, social links, or important pages were found.
- empty: The website was scanned successfully, but no useful contact data was found.
- failed: The website could not be loaded after retries.
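Downstream, the three status values lend themselves to simple triage. The Python sketch below groups dataset items (already loaded as dictionaries) by status; `triage_results` is a hypothetical helper, and what to do with each bucket (for example, retrying `failed` sites later) is left to the caller.

```python
def triage_results(items):
    """Groups website URLs by their status value: ok, empty, or failed."""
    buckets = {"ok": [], "empty": [], "failed": []}
    for item in items:
        status = item.get("status")
        if status not in buckets:
            # Fail loudly rather than silently dropping unexpected records.
            raise ValueError(f"unexpected status: {status!r}")
        buckets[status].append(item["websiteUrl"])
    return buckets


items = [
    {"status": "ok", "websiteUrl": "https://www.example.com/"},
    {"status": "empty", "websiteUrl": "https://nothing.example/"},
    {"status": "failed", "websiteUrl": "https://down.example/"},
]
buckets = triage_results(items)
print(buckets["failed"])  # candidates for a later retry
```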
Output Fields
| Field | Description |
|---|---|
| status | Processing result: ok, empty, or failed |
| websiteUrl | Input website URL after normalization |
| domain | Website hostname without www |
| emails | Deduplicated public email addresses |
| phones | Deduplicated public phone numbers |
| contactPage | Detected contact page URL |
| aboutPage | Detected about/company/team page URL |
| facebook | Facebook page/profile URL |
| instagram | Instagram profile URL |
| linkedin | LinkedIn company/profile URL |
| twitter | Twitter/X profile URL |
| youtube | YouTube channel/profile URL |
| tiktok | TikTok profile URL |
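For spreadsheet or CRM imports, the fields above can be flattened into one CSV row per website. The following Python sketch assumes that fields with no value may be missing or null in a dataset item (the README does not say which); `items_to_csv` is a hypothetical helper, and joining list fields with `"; "` is an arbitrary choice.

```python
import csv
import io

FIELDS = (
    "status", "websiteUrl", "domain", "emails", "phones",
    "contactPage", "aboutPage", "facebook", "instagram",
    "linkedin", "twitter", "youtube", "tiktok",
)
LIST_FIELDS = ("emails", "phones")


def items_to_csv(items):
    """Serializes dataset items to CSV text, one row per website."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, extrasaction="ignore")
    writer.writeheader()
    for item in items:
        row = {}
        for field in FIELDS:
            value = item.get(field)
            if field in LIST_FIELDS:
                row[field] = "; ".join(value or [])  # flatten lists into one cell
            else:
                row[field] = value or ""             # missing or null becomes empty
        writer.writerow(row)
    return buffer.getvalue()
```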
How It Works
- The Actor normalizes and validates input URLs.
- It loads each website with Crawlee and Cheerio for fast, low-cost scraping.
- It extracts emails, phone numbers, contact pages, about pages, and social links.
- It parses JSON-LD structured data before scripts are removed.
- It discovers and scans high-signal same-domain pages.
- It deduplicates and validates extracted data.
- It saves one result per website to the default dataset and stores run statistics in SUMMARY.
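The step of discovering high-signal same-domain pages can be pictured with the Python sketch below. It is not the Actor's code: `pick_high_signal_links` and `bare_host` are hypothetical names, the keyword list is taken from the page types named in this README, and treating `www.example.com` and `example.com` as the same website is an assumption.

```python
import re
from urllib.parse import urljoin, urlsplit

# Keywords taken from the page types this README lists as high-signal.
HIGH_SIGNAL = re.compile(
    r"contact|about|company|team|location|legal|imprint|impressum", re.I
)


def bare_host(url):
    """Hostname without a leading www., or an empty string if there is none."""
    host = urlsplit(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def pick_high_signal_links(base_url, hrefs, limit=8):
    """Returns up to `limit` same-website URLs whose path looks high-signal."""
    site = bare_host(base_url)
    picked, seen = [], set()
    for href in hrefs:
        absolute = urljoin(base_url, href).split("#")[0]  # resolve, drop fragment
        parts = urlsplit(absolute)
        if parts.scheme not in ("http", "https"):
            continue  # skips mailto:, tel:, javascript:, ...
        if bare_host(absolute) != site:
            continue  # stay on the same website
        if not HIGH_SIGNAL.search(parts.path):
            continue
        key = absolute.rstrip("/")
        if key in seen:
            continue  # /contact and /contact/ count once
        seen.add(key)
        picked.append(absolute)
        if len(picked) >= limit:
            break
    return picked
```

A substring match on the path is intentionally loose here; a stricter variant could match whole path segments to avoid hits such as `/steam`.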
Notes And Limitations
- Only publicly available website data is extracted.
- Login-only pages, CAPTCHA-protected pages, and contact forms without visible contact details are not supported.
- Heavily JavaScript-rendered websites may return fewer results because this Actor uses fast HTML scraping instead of a browser.
- Some websites intentionally do not publish emails or phone numbers.
- Phone extraction is conservative to avoid collecting dates, IDs, and tracking numbers.
- Always use extracted contact data responsibly and follow applicable privacy, anti-spam, and data protection laws.
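The conservative phone extraction mentioned in the notes above could be approximated as follows. This Python sketch is only one possible set of heuristics, all of them assumptions rather than the Actor's actual rules: candidates need 7 to 15 digits, date-shaped strings are rejected, and bare digit runs without a leading `+` are treated as IDs. `extract_phones` is a hypothetical name.

```python
import re

# Candidate: optional "+", optional "(", then digits mixed with spaces,
# parentheses, dots, or hyphens, starting and ending on a digit.
PHONE_CANDIDATE = re.compile(r"(?<![\w.])\+?\(?\d[\d ().-]{6,18}\d(?!\w)")
# Shapes such as 2019-04-12 or 12.05.2021.
DATE_LIKE = re.compile(r"^\d{1,4}[./-]\d{1,2}[./-]\d{1,4}$")


def extract_phones(text):
    """Returns plausible phone numbers, deduplicated by their digits."""
    phones, seen = [], set()
    for match in PHONE_CANDIDATE.finditer(text):
        raw = " ".join(match.group().split())
        if DATE_LIKE.match(raw):
            continue  # looks like a date
        if raw.isdigit():
            continue  # unformatted digit run without "+": likely an ID
        digits = re.sub(r"\D", "", raw)
        if not 7 <= len(digits) <= 15:
            continue  # outside a plausible phone number length
        if digits in seen:
            continue  # same number written with different formatting
        seen.add(digits)
        phones.append(raw)
    return phones


sample = (
    "Call +1 800 123 4567 or (020) 7946 0958. Founded 2019-04-12. "
    "Order ID 4829104857261. Ref 12.05.2021. Also +1 (800) 123-4567."
)
print(extract_phones(sample))
```

Rules like these trade recall for precision: some valid but unformatted local numbers are dropped, which matches the conservative stance described in the notes.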
Pricing Recommendation
Recommended marketplace pricing: paid per result.
Suggested starting price: $2.00 to $4.00 per 1,000 website results, with a small free trial allowance if available. This pricing is easy for lead generation users to understand because they pay per processed website, while the Actor keeps compute cost low by using Cheerio instead of browser automation.
Search Keywords
website contact extractor, email scraper, phone number scraper, company contact scraper, lead generation, CRM enrichment, business email finder, website social links, LinkedIn company finder, sales prospecting, contact data, public emails, Apify contact scraper