Austin house pricing analysis

Authors

Marijose Cavazos - My github
Paola Aleman - My github
Javier Robles - My github
Cesar Cruz - My github

Project overview:

The purpose of this project is to carry out an analysis of the real estate market behavior in Austin, Texas, from 2018 to 2021. Various variables that influence the determination of the sale price will be examined, such as the number of bathrooms, bedrooms, parking spaces, land area, and construction area, among others. This information will be used to predict, through linear regression and neural networks, whether the sale prices were in line with the market. The same machine learning methods will be used to obtain a sale price based on users' input data of their house.

We will create a visualization in HTML coding with the use of Python Flask, HTML/CSS and JavaScript.

Relevant questions:

Did I buy my house above or below the market price?
Did I sell my house above or below the market price?
What is the price at which I could sell my house according to the market?

Housing database

The chosen data has been downloaded from Kaggle. The purpose of the dataset is to collect information about the houses that participated in the real estate market in Austin, Texas, in the latest years.

There are 46 categories called: City, streetAddress, Zipcode, Latitude, Longitude, propertyTaxRate, garageSpaces, hasAssociation, hasCooling, hasGarage, hasHeating, hasSpa, hasView, homeType, parkingSpaces, yearBuilt, latestPrice, numPriceChanges, latest_saledate, latest_salemonth, latest_saleyear, latestPriceSource, numOfPhotos, numOfAccessibilityFeatures, numOfAppliances,numOfParkingFeatures, numOfPatioAndPorchFeatures, numOfSecurityFeatures, numOfWaterfrontFeatures, numOfWindowFeatures, numOfCommunityFeatures, lotSizeSqFt, livingAreaSqFt, numOfPrimarySchools, numOfElementarySchools, numOfMiddleSchools, numOfHighSchools, avgSchoolDistance, avgSchoolRating, avgSchoolSize, MedianStudentsPerTeacher, numOfBathrooms, numOfBedrooms, numOfStories, homeImage.

Finding Data

For this project we fetch and grabbed the data from /www.kaggle.com/ our data set were retrived form https://www.kaggle.com/code/threnjen/austin-housing-eda-nlp-models-visualizations/input

Data Cleanup and Analysis

Exploration and clean up

The first step was to import the file "austin_housing.csv" to Jupyter Notebook and analyze it in order to be able to select the most relevant variables for the project.

The dataset was filtered to 30 variables, and "numOfSchools" was created as a result of summing the different school levels. All this data was exported to the "austin_housing_reduced.csv" file, which contains these columns and their related information: city, streetAddress, zipcode, latitude, longitude, propertyTaxRate, garageSpaces, hasCooling, hasGarage, hasHeating, hasSpa, hasView, homeType, yearBuilt, latestPrice, numPriceChanges, numOfAccessibilityFeatures, numOfAppliances, numOfParkingFeatures, numOfPatioAndPorchFeatures, numOfSecurityFeatures, numOfWaterfrontFeatures, numOfWindowFeatures, numOfCommunityFeatures, lotSizeSqFt, livingAreaSqFt, avgSchoolRating, numOfBathrooms, numOfBedrooms, numOfStories, numOfSchools.

Project Development

The Austin House Pricing Project utilized Flask to develop a web application consisting of app.py, index.html, and JavaScript files. The primary objective of this project was to display a map using Leaflet, showcasing information about houses sold between 2018 and 2021.

To enhance the functionality, machine learning techniques were employed. Both linear regression and neural networks were utilized to predict house prices based on user input.

The Flask application seamlessly integrated the machine learning models, enabling users to input specific parameters related to the house they were interested in. The models would then provide an estimated price based on the given inputs, utilizing the predictive capabilities of the trained models.

Overall, the project combined web development, data visualization, and machine learning techniques to create an interactive platform that empowered users to explore and obtain estimated prices for houses in Austin, Texas.

Visual references:

Leaflet showing information about the selected home.

Map filtering.

Price prediction.

Graphs.

Tools and sources

Javascript
HTML/CSS
JSON
GitHub and GitHub Pages
console.log
Matplotlib.pyplot
Flask
Jupyter Notebook
CORS
LiveServer JS

Conclusions

The model that best fits our data set was the neural networks model based on the MAE (Mean Average Error), which had a lower value.
The linear regression model don´t take into consideration relevant fields like zipcode and year of construction.
Categorized the predicted price values into good, bad and neutral by comparing it to the listed price.
Offered different options for user to display visualizations comparing both models.

Acknowledgments

Austin Housing - EDA, NLP, Models, Visualizations. (2021). Retrived from https://www.kaggle.com/code/threnjen/austin-housing-eda-nlp-models-visualizations/input

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
API		API
Notebooks		Notebooks
Resources		Resources
environment		environment
.DS_Store		.DS_Store
.gitignore		.gitignore
Austin House Pricing Analysis-PPT.pdf		Austin House Pricing Analysis-PPT.pdf
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Austin house pricing analysis

Authors

Project overview:

Relevant questions:

Housing database

Finding Data

Data Cleanup and Analysis

Exploration and clean up

Project Development

Visual references:

Tools and sources

Conclusions

Acknowledgments

Copyright

About

Uh oh!

Releases

Packages

Contributors 4

Uh oh!

Languages

License

minmincg/house_pricing_analysis

Folders and files

Latest commit

History

Repository files navigation

Austin house pricing analysis

Authors

Project overview:

Relevant questions:

Housing database

Finding Data

Data Cleanup and Analysis

Exploration and clean up

Project Development

Visual references:

Tools and sources

Conclusions

Acknowledgments

Copyright

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

Packages