This project scrapes the list of largest companies in the United States by revenue from Wikipedia and stores the data in a structured CSV file.
- requests: Fetching the webpage
- BeautifulSoup (bs4): Parsing & extracting HTML data
- pandas: Storing data in a DataFrame
- Sends an HTTP GET request to the Wikipedia page
- Extracts the companies table using BeautifulSoup
- Stores data into a pandas DataFrame
- Exports the DataFrame to a CSV file
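
The four steps above can be sketched roughly as follows. This is a minimal illustration, not necessarily the project's actual script: the function names, the `User-Agent` string, the output filename `companies.csv`, and the assumption that the companies table is the first table with the CSS class `wikitable` are all hypothetical choices made for this example.

```python
import pandas as pd
import requests
from bs4 import BeautifulSoup

# Assumed URL of the Wikipedia page named in this README.
URL = "https://en.wikipedia.org/wiki/List_of_largest_companies_in_the_United_States_by_revenue"


def fetch_html(url: str = URL) -> str:
    """Send a GET request and return the page HTML (raises on HTTP errors)."""
    # Hypothetical User-Agent; some sites reject requests that send none.
    headers = {"User-Agent": "largest-companies-scraper/0.1 (educational example)"}
    response = requests.get(url, headers=headers, timeout=30)
    response.raise_for_status()
    return response.text


def parse_first_wikitable(html: str) -> pd.DataFrame:
    """Extract the first table with class 'wikitable' into a DataFrame."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", class_="wikitable")  # assumption: first wikitable
    if table is None:
        raise ValueError("no table with class 'wikitable' found")

    rows = table.find_all("tr")
    # The first row is assumed to hold the column headers.
    header = [th.get_text(strip=True) for th in rows[0].find_all("th")]

    records = []
    for row in rows[1:]:
        cells = [cell.get_text(strip=True) for cell in row.find_all(["td", "th"])]
        # Skip rows whose cell count does not match the header.
        if len(cells) == len(header):
            records.append(cells)
    return pd.DataFrame(records, columns=header)


def scrape_to_csv(output_path: str = "companies.csv") -> pd.DataFrame:
    """Fetch the page, parse the table, and write it to a CSV file."""
    df = parse_first_wikitable(fetch_html())
    df.to_csv(output_path, index=False)
    return df
```

Calling `scrape_to_csv()` would run the whole pipeline. Parsing is kept separate from fetching so the table extraction can be checked against a small HTML string without a network connection. All values are kept as strings; converting revenue or employee counts to numbers would be a further step.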
Wikipedia - List of largest companies in the United States by revenue