A comprehensive course on advanced machine learning techniques. This course provides hands-on experience with various unsupervised learning algorithms and other methods through practical assignments with automated testing.
This assignment covers advanced topics in machine learning, with a particular focus on:
- Data Aggregation and Processing
- Clustering Algorithms
- Expectation-Maximization (EM) Algorithm
- Semi-supervised Learning
- Topic Modeling
The course is organized into several modules under the Unsupervised learning section:
- Intro: Introduction to unsupervised learning concepts
- AggregationPoints10: Techniques for data aggregation including:
- Majority voting
- Dawid-Skene method
- ClusterizationPoints20: Advanced clustering algorithms
- EMforDSPoints30: Expectation-Maximization algorithm for Data Science
- SemisupervisedPoints20: Semi-supervised learning techniques
- TopicModelingPoints20: Topic modeling and text analysis
Each module contains:
- Task description (task.md)
- Implementation file (task.py or main.py)
- Test suite (tests/test_task.py)
- Additional resources when needed
To complete a task:
- Read the task description in the respective module's task.md
- Implement the required functionality in the implementation file
- Run the tests to verify your solution
The course uses unittest framework for automated testing. To run tests for a specific task:
python -m unittest Unsupervised/ModuleName/tests/test_task.py
Aleksandr Avdiushenko