Welcome to our Retail Analysis Project repository! 🎉
This project was developed in PyCharm and uses PySpark for big data processing. A PyTest test suite checks the reliability and integrity of the analysis, and logging through Log4j supports monitoring and debugging.
- Efficient data processing with PySpark
- Exploratory Data Analysis (EDA) for retail datasets
- Customer segmentation and behavior analysis
- Product recommendation systems
- Trend analysis and forecasting
To get started with the project:
- Clone this repository:
  git clone https://github.com/Akshay-ST/RetailAnalysis.git
- Install the required dependencies:
  pip install -r requirements.txt
- Run the file application_main.py in the root directory to explore the analysis and insights.
The project includes a complete testing framework implemented with PyTest. Run the tests using the `pytest` command.
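As a minimal sketch of what a PyTest test looks like, the example below checks a small aggregation helper. The function `total_spend_per_customer` and the sample orders are hypothetical and are not taken from this repository. The project's own tests presumably work on PySpark DataFrames; plain Python is used here only so the sketch runs without a Spark session.

```python
# Hypothetical example: none of these names come from the repository.
from collections import defaultdict


def total_spend_per_customer(orders):
    """Sum order amounts per customer from (customer_id, amount) pairs."""
    totals = defaultdict(float)
    for customer_id, amount in orders:
        totals[customer_id] += amount
    return dict(totals)


# PyTest collects functions whose names start with "test_".
def test_total_spend_per_customer():
    orders = [(1, 10.0), (2, 5.5), (1, 2.5)]
    assert total_spend_per_customer(orders) == {1: 12.5, 2: 5.5}


def test_total_spend_per_customer_empty():
    # An empty input should produce an empty result, not an error.
    assert total_spend_per_customer([]) == {}
```

Saved in a file whose name starts with `test_`, these functions would be discovered and run by the `pytest` command.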
Contributions are welcome! Feel free to open issues or pull requests to suggest improvements or report bugs. For major changes, please open an issue first to discuss the proposed changes.
This project is licensed under the MIT License. See the LICENSE file for details.