🏠 Real Estate Price Intelligence – Oman Edition

🎯 Objective

The goal of this project is to build a mini data pipeline for real estate price prediction in Oman. It starts by scraping property listings from two local websites, cleaning and integrating the data, engineering useful features, and framing a predictive modeling problem based on property prices.

🌐 Data Sources

Two real estate platforms were used:

Dubizzle Oman – scraped using BeautifulSoup
Tibiaan – scraped using Selenium

This project was a hands-on learning experience in web scraping, using two different techniques to deal with static and dynamic content.

📦 Steps in Data Collection & Cleaning

Explore: Analyzed both websites to understand their structure and potential fields to extract.
Plan: Created an Excel sheet listing possible data fields and identified the overlap between both platforms.
Scrape: Fetched data using Python scripts, then cleaned:
- Removed duplicates
- Filled missing values using mean, median, or mode depending on context
- Trimmed extra spaces to improve matching and consistency
Merge: Cleaned the datasets individually, then integrated them for modeling.

🧠 Feature Engineering

Understanding the data: Identified important columns and assessed their value.
New features: Created new columns and converted types where needed.
Scaling: Used Box-Cox transformation on numerical features to normalize data.
Encoding: Applied OneHotEncoder to handle categorical features for modeling.

📂 Technologies Used

Python
Pandas & NumPy
BeautifulSoup
Selenium
Scikit-learn

This project highlights my ability to go from raw web data to a clean, structured dataset ready for modeling — combining web scraping, data preprocessing, and feature engineering in one pipeline.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
cleaning_code		cleaning_code
scraping_code		scraping_code
uncleaned_data_csv's		uncleaned_data_csv's
README.md		README.md
both_data_feature_engineering_and_scaling.ipynb		both_data_feature_engineering_and_scaling.ipynb
combined_data.csv		combined_data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🏠 Real Estate Price Intelligence – Oman Edition

🎯 Objective

🌐 Data Sources

📦 Steps in Data Collection & Cleaning

🧠 Feature Engineering

📂 Technologies Used

About

Uh oh!

Releases

Packages

Languages

Os3m3/DS_Project

Folders and files

Latest commit

History

Repository files navigation

🏠 Real Estate Price Intelligence – Oman Edition

🎯 Objective

🌐 Data Sources

📦 Steps in Data Collection & Cleaning

🧠 Feature Engineering

📂 Technologies Used

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages