Google Scholar Paper Scraper

A Python-based scraping tool designed to extract research paper data from Google Scholar, specifically focused on Hyperspectral Image (HSI) Classification using Graph Neural Networks (GNNs).

📌 Overview

This repository contains a Jupyter Notebook (googlescholar_scraper.ipynb) that automates the collection of academic titles and authors. It is pre-configured to target the latest advancements (2020–present) in graph-based remote sensing.

🚀 Features

Architecture-Specific Queries: Specialized search loops for GCN, GAT, and GraphSAGE models.
Temporal Filtering: Automatically restricts results to publications from the year 2020 onwards.
Clean Data Extraction: Parses HTML to isolate clean Paper Titles and Author/Source strings.
Pagination Support: Scrapes multiple result pages (approx. 60 papers per query).
Rate Limit Protection: Implements a time.sleep() delay to minimize the risk of IP blocking.

🛠️ Requirements

The script utilizes the following Python libraries:

requests: For handling HTTP requests to Google Scholar.
beautifulsoup4: For parsing the search result HTML.
time: For managing request intervals.

📖 Usage

Open the notebook in Jupyter or Google Colab.

Define your search string in the query variable:

query = "graph attention hyperspectral image classification"

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
README.md		README.md
googlescholar_scraper.ipynb		googlescholar_scraper.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Google Scholar Paper Scraper

📌 Overview

🚀 Features

🛠️ Requirements

📖 Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Google Scholar Paper Scraper

📌 Overview

🚀 Features

🛠️ Requirements

📖 Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages