This project analyzes global university rankings using multiple datasets including Times Higher Education and Shanghai Rankings. We examine various ranking metrics including educational quality, employability, faculty resources, and research output.
-
Times Higher Education Dataset: Contains 2000 universities with metrics like:
- World Rank
- Educational Rank
- Employability Rank
- Faculty Rank
- Research Rank
- Overall Score
-
Shanghai Ranking Dataset: Contains 1000 universities with:
- Global Ranking
- National/Regional Rank
- Total Score
-
Top Universities: Harvard consistently ranks #1 across different ranking systems
-
Employability Leaders:
- Harvard University (USA)
- INSEAD (France)
- Stanford University (USA)
-
Saudi Arabian Universities: Multiple institutions in global rankings, with positions ranging from 245 to 1979
-
Correlation Analysis:
- Strong correlation found between world rank and research output
- Moderate correlation between employability and overall score
-
Data Cleaning:
- Handled missing values
- Standardized ranking formats
- Removed duplicates
-
Analysis Techniques:
- Univariate analysis for rank distributions
- Correlation analysis between different ranking metrics
- Geographic distribution analysis
- Bar charts for top university comparisons
- Correlation heatmaps
- Scatter plots for rank relationships
- Geographic distribution plots
Below are ten key insights derived from the data along with the charts used to illustrate them:
- Python
- Pandas
- Plotly
- Seaborn
- Matplotlib
The analysis reveals strong patterns in global university rankings, with certain institutions consistently performing well across different ranking systems. Geographic distribution shows concentration of top universities in specific regions.



