Automated Website Url scraper #804

Gud-will · 2021-10-31T15:35:28Z

Pull Request Template

Automated Website Url scraper

script name -
webUrlscraper.py

Brief about script
This Python script retrieves all the webpage links from a given URL (entered by the user).
These links can also be present inside a button or an action.
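
As a rough illustration of links that live in buttons or actions, the sketch below collects URLs from `<a href>`, `<form action>`, and `formaction` attributes. It is not the code in webUrlscraper.py: it uses the standard library's `html.parser` rather than BeautifulSoup, and the names `URL_ATTRS`, `LinkCollector`, and `collect_links` are hypothetical, chosen only for this example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Attributes that can carry a URL, per tag.
# Assumption: an illustrative subset, not an exhaustive list.
URL_ATTRS = {
    "a": ("href",),
    "form": ("action",),
    "button": ("formaction",),
    "input": ("formaction",),
}


class LinkCollector(HTMLParser):
    """Collects absolute URLs from anchors, forms, and buttons."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        attr_map = dict(attrs)
        for name in URL_ATTRS.get(tag, ()):
            value = attr_map.get(name)
            if value:
                # Resolve relative URLs against the page URL.
                self.links.append(urljoin(self.base_url, value))


def collect_links(html, base_url):
    """Return every URL found in the supported tags, in document order."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.links
```

Links that are only built at runtime by JavaScript (for example inside an `onclick` handler) would not be found this way.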

Issue no.(must) -

Issue no-#677

Self Check(Tick After Making pull Request)

This issue was assigned to me.
One Change in one Pull Request
My file is in proper folder (Name of folder should be in lowercase with no space in between) (E.g. meet_schedular)
I am following clean code and Documentation and my code is well linted with flake8.
I have added README.md and requirements.txt (Include version numbers too e.g. pandas==0.0.1) with my script
I have used REPO README TEMPLATE (Necessary)
Just including required dependencies in requirements.txt (Don't include Python version too)

If the issue was not assigned to you, please don't make a PR. It will be marked as invalid.

I have created a Python script that uses BeautifulSoup to read a page's HTML; the parsed HTML is then processed to extract the href links.
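
The approach described above can be sketched roughly as follows. This is a minimal illustration, not the submitted script: the function name `extract_hrefs` is hypothetical, the choice of the `"html.parser"` backend is an assumption, and the HTML is passed in as a string so the example does not depend on a network call.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup


def extract_hrefs(html, base_url):
    """Return the unique absolute URLs of all <a href> tags, in page order."""
    soup = BeautifulSoup(html, "html.parser")
    seen = set()
    links = []
    # href=True skips anchors that have no href attribute at all.
    for tag in soup.find_all("a", href=True):
        # Resolve relative links such as "/about" against the page URL.
        url = urljoin(base_url, tag["href"])
        if url not in seen:
            seen.add(url)
            links.append(url)
    return links
```

In practice the `html` argument would be the body of an HTTP response for the URL the user entered, and `base_url` would be that same URL.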

github-actions

Congratulations!! 🎉 @Gud-will for making your first PR. We will review the changes soon and merge finally.😊 Do give a star ⭐ meanwhile if you like this project.

pawangeek · 2021-11-09T05:53:54Z

Change the folder name to websiteurl_scraper

Gud-will added 4 commits October 31, 2021 20:45

have created an automatic web Url scraper

4a88cf3

Have created a python script that uses beautifulsoup to read the html file of a page and then this information is further processed to get href links

added readme.md and requirements

d081c73

updated README.d

dc32ca4

updated README.md

22eb859

github-actions bot reviewed Oct 31, 2021

Gud-will added 13 commits October 31, 2021 21:11

Upadted README.md

cdf888d

updated webUrlscraper.py

314306f

Merge branch 'main' of https://github.com/Gud-will/Automation-scripts

5ffeeb0

update webUrlscraper.py

7b5b418

update .py

4ea3dfa

update .py

59af73c

update .py

866d464

update .py

7999c95

update .py

0d7e2f4

update .py

63bea64

.py

87d8e4d

,py

ffae454

.py

c7965b2

pawangeek linked an issue Nov 9, 2021 that may be closed by this pull request

Scrape all URLs from a Website #677

Closed

1 task

changed name of the folder to websiteurl_scraper

2e32d14

pawangeek merged commit 9a61d57 into python-geeks:main Jan 15, 2022
