WebScraping-Python

Overview

This is a simple web scraper implemented in Python, designed to extract data from an HTML web page. The scraper utilizes the lxml library for HTML parsing and the requests library for making HTTP requests.

Dependencies

Before using the web scraper, make sure you have the following dependencies installed:

lxml requests

You can install these dependencies using the following command:

pip install lxml requests

How to Use

Clone or download the repository to your local machine.
Install the required dependencies using the command mentioned above.
Open the Python script containing the web scraper code.
Update the url variable with the URL of the web page you want to scrape.
Customize the XPath expression according to the structure of the HTML page you are scraping. In the provided example, the XPath expression is modified to extract names from a specific class of div elements.

Run the Python script.

python3 scraping.py

Check the console output. If the HTTP request is successful (status code 200), the scraper will attempt to extract data based on the provided XPath expression. If successful, it will print the retrieved data; otherwise, it will notify you that no data was found.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.DS_Store		.DS_Store
README.md		README.md
scraping.py		scraping.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WebScraping-Python

Overview

Dependencies

How to Use

About

Uh oh!

Releases

Packages

Languages

AdamLefebvre/WebScraping-Python

Folders and files

Latest commit

History

Repository files navigation

WebScraping-Python

Overview

Dependencies

How to Use

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages