Web Scraper with RabbitMQ

This project is a web scraper that fetches property details from a website and uses RabbitMQ for message queuing. The project is structured using the SOLID principles and uses Puppeteer for web scraping.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

Docker and Docker Compose

Installing

Clone the repository:

git clone https://github.com/man0l/imot-scraper.git
cd imot-scraper

Build the Docker images:

docker-compose build

Run the migrations:

docker-compose run web_scraper_consumer npx sequelize-cli db:migrate --migrations-path ./src/migrations/ --models-path ./src/models/ --config ./src/config/db.json

Usage

To start the RabbitMQ server, publisher, and consumer:

docker-compose up

The property_type_publisher.js script will automatically publish property types URLs to RabbitMQ, and the main.js script will consume the URLs and scrape property details.

You can view the logs for each service in the Docker Compose output.

Work with RabbitMQ Management

You can access the RabbitMQ Management interface at http://host.docker.internal:15672. The default username and password are guest. Also, you could connect to the rabbitmq server through the same host and port host.docker.internal:5672

Built With

Docker - Containerization platform
Node.js - JavaScript runtime
Puppeteer - Headless browser for web scraping
RabbitMQ - Open source message broker

Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.github/workflows		.github/workflows
.vscode		.vscode
notebook		notebook
src		src
tests		tests
user-data		user-data
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
README.md		README.md
buildspec.yml		buildspec.yml
docker-compose.analysis.yml		docker-compose.analysis.yml
docker-compose.yml		docker-compose.yml
index.js		index.js
main.js		main.js
package.json		package.json
prompts.txt		prompts.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Scraper with RabbitMQ

Getting Started

Prerequisites

Installing

Usage

Work with RabbitMQ Management

Built With

Contributing

License

About

Releases

Packages

Languages

man0l/imot-scraper

Folders and files

Latest commit

History

Repository files navigation

Web Scraper with RabbitMQ

Getting Started

Prerequisites

Installing

Usage

Work with RabbitMQ Management

Built With

Contributing

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages