Applying Data Science to Real-World Problems
Organized by the Departmental Statistics Association, Department of Statistics · Faculty of Science, The Maharaja Sayajirao University of Baroda · Est. 1949
DataVerse-2026 is a data analytics competition designed to bridge the gap between raw data and real-world impact. Participants are challenged to apply data science, visualization, and analytical storytelling to solve authentic, scenario-based problems across multiple domains.
This competition spans three days across two rounds, a qualifying visualization sprint and a 9-hour grand finale datathon, culminating in live presentations before a panel of judges.
"Transforming Data into Insight · Insight into Impact"
DataVerse-2026/
│
├── ROUND - 1/          # Qualifying Round submissions
│   └── (Team notebooks & dashboards)
│
├── ROUND - 2/          # Finals submissions
│   └── (Team solutions, models & presentations)
│
├── index.html          # Website homepage
├── rounds.html         # Competition rounds page
├── winners.html        # Winners showcase
└── about.html          # About & rules
| Place | Team | Problem Statement | Repository |
|---|---|---|---|
| 🥇 1st | Alpha Analyst | The Urban Heat Mystery | View → |
| 🥇 1st | Peaky Blinders | The Safe Lending | View → |
| 🥇 1st | Entropy | The Promotional Paradox | View → |
All participants received a Certificate of Participation in recognition of their contribution.
Date: 26th February, 2026 · Duration: 3 Hours
Participants were provided with real-world datasets and challenged to demonstrate their data storytelling and visualization skills. Each team selected one domain, received a data dictionary and guiding questions, and submitted a visualization dashboard or notebook along with a short insight summary.
Evaluation Criteria:
- Clarity and relevance of visualizations
- Quality of insights and storytelling
- Accuracy and appropriate use of data
- Creativity and effective design choices
- Overall coherence and interpretability
Qualification: Top 3 teams per domain advanced to the Finals.
Date: 27th February, 2026 · Duration: 9 Hours · Presentations: 28th February, 2026
Finalist teams selected one of three scenario-based problem statements and worked through a 9-hour continuous datathon with mentor support. Solutions were presented live before a judge panel on the final day.
Problem Statements:
| # | Title | Domain |
|---|---|---|
| 1 | The Urban Heat Mystery | Climate & Environment (Navapur) |
| 2 | The Safe Lending | Finance & Risk (Lending Club) |
| 3 | The Promotional Paradox | Retail & Marketing (Mercato) |
Evaluation Criteria:
- Problem understanding
- Technical correctness
- Innovation
- Impact and feasibility
- Quality of presentation
Participants were free to use any tools or languages.
- Eligibility: Open to all students enrolled in any course under the Faculty of Science, MSU Baroda.
- Team Composition: Individual or teams of 2–4 members. Composition must remain the same across all rounds.
- AI Tools: Use of AI tools is permitted and encouraged, provided participants maintain academic integrity and transparency.
- Originality: All submissions must be original. Plagiarism or use of pre-existing solutions without attribution leads to disqualification.
- Submission Deadlines: All submissions must be made within the prescribed time limits and in the specified format. Late submissions are not accepted.
- Code of Conduct: Maintain discipline and professionalism. Misconduct may result in immediate disqualification.
- Judging & Decisions: Decisions by the panel of judges and organising committee are final and binding.
- Team Formation Support: Solo participants wishing to join a team could indicate so during registration; the committee facilitated team formation where possible.
The official event website is live at:
https://threed2y.github.io/DataVerse-2026/
Built with pure HTML, CSS, and JavaScript: fully static, no dependencies, deployable on GitHub Pages with zero configuration.
| Page | Description |
|---|---|
| index.html | Homepage with event overview and key details |
| rounds.html | Full breakdown of Round 1 and Round 2 |
| winners.html | Winners showcase with project links |
| about.html | About the event, rules, and contact |
The official competition dataset is hosted on Hugging Face, containing global development indicators spanning economic, environmental, health, digital, and governance dimensions across world regions from 2000–2020.
🤗 huggingface.co/datasets/sleepysaurus/DataVerse
| Field | Details |
|---|---|
| Observations | ~1,000+ region-year rows |
| Columns | 47 features across economic, environmental, health, digital & governance domains |
| License | MIT |
| Format | Parquet / CSV (HuggingFace datasets compatible) |
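Because the data is organized as region-year rows, a quick sanity check is to see which years each region actually covers. The following is a minimal sketch only, using the standard library on plain dict rows (the shape produced when iterating over a Hugging Face `datasets` split). The column names `region` and `year` and the helper `year_coverage` are assumptions made for illustration; check the data dictionary for the real names.

```python
from collections import defaultdict


def year_coverage(rows, region_key="region", year_key="year"):
    """Map each region to (first_year, last_year, row_count).

    region_key and year_key are hypothetical column names; adjust them
    to match the actual dataset schema.
    """
    years_by_region = defaultdict(list)
    for row in rows:
        years_by_region[row[region_key]].append(int(row[year_key]))
    return {
        region: (min(years), max(years), len(years))
        for region, years in years_by_region.items()
    }


# Toy rows standing in for the real data (values are made up).
sample = [
    {"region": "A", "year": 2000},
    {"region": "A", "year": 2001},
    {"region": "B", "year": 2020},
]
print(year_coverage(sample))
```

With the real data, `rows` could be any iterable of row dicts, such as a split of the loaded dataset, once the key names have been adjusted.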
# Load the competition dataset from the Hugging Face Hub
from datasets import load_dataset

ds = load_dataset("sleepysaurus/DataVerse")

| 📧 Email | dataversestats@gmail.com |
| 📞 Dharmik | +91 63540 93708 |
| 📞 Alok | +91 63539 08759 |
| 🏛️ Department | Dept. of Statistics, Faculty of Science, MSU Baroda |