Web Scraper for business phone numbers
New here? Learn about Bountify and follow @bountify to get notified of new bounties! x

Just about every company has a business phone number for general inquiries. I'm trying to compose a list of them.

I understand that "Every business ever" is much too large an ambitious. So here's what I think would work:

1) I feed a .CSV list of company names

2) Scraper finds the company website (I imagine a google search would get there)

3) Scraper finds an phone number (Regex should work, looking at only American companies) (Possible side step of finding the contact information tab, then looking there)

4) Scraper puts the phone number next to the company name in the email address (And why not grab the website URL while we're there)

I'm not fussy on language - anything that works.

However, I'm more inclined to using particular languages. In order:

1) VBA (As a .xlsm file - can self-reference the list I'd give it)

2) Python

3) Any other language

Also acceptable: If you happen to have a nice large list of American companies and their phone numbers already, that would save me a bunch of time and effort.

Hi Selkie, the solution requested is too broad & companies having subsidiary adds to it. Here's a wiki list of all US companies and their website in the Current section.
Scraping can be done using Java Jsoup However scraping itself revolves around copyright issues, so keeping a safe side there's a public list.
SilverHood Apps 3 years ago
what type of phone number you want? sales phone No or customer service No?
Memo40k 3 years ago
Hoping to get sales phone No - but the phone number for one line can usually be used to find the phone for another one. The website list is pretty useful. I'm not too interested in subsidiary phone numbers - just A phone number from the website is good enough
Selkie 3 years ago
Hi Selkie, I've managed to scrape US companies and respective website (1,872 to be precise) via Wikipedia using Java in NetBeans IDE. However Its difficult extracting contact details as every websites have their own format/naming convention & number representation. Let me know what you think Output
SilverHood Apps 3 years ago
Kindly indicate whether I should carry on the task ?
SilverHood Apps 3 years ago
Hi Selkie, I've posted a possible solution with output, do go through in time..
Ps: The posted bounty will expire if not awarded within time
SilverHood Apps 3 years ago

Crowdsource coding tasks.

2 Solutions

Which comes with a $250 price tag. Hoping to make my own list
Selkie 3 years ago

Simple python function to search phone number on google

import requests
from bs4 import BeautifulSoup

def check_number(c_name):
url = "https://www.google.com/search?source=hp&ei=bTDhW_3hBqHOjwSr5qGwDg&q="+c_name
get_html = requests.get(url)
soup = BeautifulSoup(get_html.content,'html.parser')
p = soup.find_all('span',{'class':'A1t5ne'})
for i in p:
if i.text.startswith("1 "):
return phone

View Timeline