Analyzing and overcoming the curse of dimensionality and exploring various gradient descent techniques with implementations in R
This project was completed in partial fulfilment of the course FOUNDATIONS OF DATA SCIENCE (CS F320) offered during Second Semester 2019-20 at BITS Pilani, Pilani Campus.
The aim of this project was to analyse high-dimensional data (HDD) by identifying and implementing techniques to mitigate the problems that occur with HDD. The subsequent part of the project dealt with common issues with Gradient Descent and ways to overcome them.
In this project we first looked at the various problems that arise with high-dimensional data and then proposed solutions to mitigate them.
These solutions were implemented in R, and the results indicate that they are effective in mitigating these problems.
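To make one of the core problems concrete, here is a small illustrative R sketch (not taken from the project code) showing distance concentration: as the dimension grows, the gap between the nearest and farthest pair of random points shrinks relative to the distances themselves, which undermines distance-based methods.

```r
# Illustrative sketch: distance concentration in high dimensions.
# For points drawn uniformly at random, the relative gap between the
# nearest and farthest pair shrinks as the dimension d grows.
set.seed(42)

relative_contrast <- function(d, n = 100) {
  X <- matrix(runif(n * d), nrow = n, ncol = d)  # n random points in [0,1]^d
  dists <- as.vector(dist(X))                    # all pairwise Euclidean distances
  (max(dists) - min(dists)) / min(dists)         # relative contrast
}

for (d in c(2, 10, 100, 1000)) {
  cat(sprintf("d = %4d : relative contrast = %.3f\n", d, relative_contrast(d)))
}
```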
We then shifted our focus to Gradient Descent (GD) and observed several problems: for a non-convex loss function the optimizer can get stuck at a local minimum, plain GD runs slowly on large datasets, and the learning rate must be chosen suitably.
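For context, a minimal sketch of plain (batch) gradient descent for least-squares linear regression might look like the following; the function and variable names are illustrative, not the project's actual code.

```r
# Minimal batch gradient descent for least-squares linear regression.
# X: design matrix (with intercept column), y: response, eta: learning rate.
gradient_descent <- function(X, y, eta = 0.01, n_iter = 1000) {
  w <- rep(0, ncol(X))                        # initialise weights at zero
  for (i in seq_len(n_iter)) {
    grad <- t(X) %*% (X %*% w - y) / nrow(X)  # gradient of mean squared error
    w <- w - eta * grad                       # step against the gradient
  }
  as.vector(w)
}

# Toy usage: recover the coefficients of y = 1 + 2x (plus noise).
set.seed(1)
x <- runif(200)
X <- cbind(1, x)
y <- 1 + 2 * x + rnorm(200, sd = 0.1)
gradient_descent(X, y, eta = 0.1, n_iter = 5000)  # close to c(1, 2)
```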
A few successful variations of GD exist to overcome these issues, such as stochastic gradient descent (SGD), mini-batch gradient descent, and gradient descent with momentum.
We saw that the cost is often highly sensitive to some directions in parameter space and insensitive to others. The momentum algorithm can mitigate these issues somewhat, but does so at the expense of introducing another hyperparameter.
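The standard momentum update is easy to express in the same sketch style; `alpha` below is the additional hyperparameter mentioned above (the decay rate of the accumulated velocity), and the sketch is again illustrative rather than the project's exact code.

```r
# Gradient descent with momentum: a velocity term accumulates past
# gradients, damping oscillations along steep directions and speeding
# up progress along flat ones. alpha is the extra hyperparameter.
gd_momentum <- function(X, y, eta = 0.01, alpha = 0.9, n_iter = 1000) {
  w <- rep(0, ncol(X))
  v <- rep(0, ncol(X))                          # velocity
  for (i in seq_len(n_iter)) {
    grad <- t(X) %*% (X %*% w - y) / nrow(X)
    v <- alpha * v - eta * grad                 # update velocity
    w <- w + v                                  # move along the velocity
  }
  as.vector(w)
}
```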
We then discussed a number of incremental (or mini-batch-based) methods that adapt the learning rates of model parameters; the best-known examples of this family are AdaGrad, RMSProp, and Adam.
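As one example of this family, here is an illustrative sketch of the Adam update rule in R, following the standard published formulation rather than necessarily the exact code used in the project; each parameter effectively receives its own step size, scaled by running estimates of the gradient's first and second moments.

```r
# Adam: adapts a per-parameter learning rate from running estimates of
# the first moment (m) and second moment (s) of the gradient.
adam <- function(X, y, eta = 0.001, beta1 = 0.9, beta2 = 0.999,
                 eps = 1e-8, n_iter = 5000) {
  w <- rep(0, ncol(X))
  m <- rep(0, ncol(X))   # first-moment estimate
  s <- rep(0, ncol(X))   # second-moment estimate
  for (t in seq_len(n_iter)) {
    grad <- as.vector(t(X) %*% (X %*% w - y) / nrow(X))
    m <- beta1 * m + (1 - beta1) * grad
    s <- beta2 * s + (1 - beta2) * grad^2
    m_hat <- m / (1 - beta1^t)   # bias correction
    s_hat <- s / (1 - beta2^t)
    w <- w - eta * m_hat / (sqrt(s_hat) + eps)
  }
  w
}

# Usage mirrors gradient_descent above, e.g. adam(X, y, eta = 0.01)
```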
Having implemented these solutions in R, we found that the family of algorithms with adaptive learning rates performed fairly robustly and emerged as the clear winner.
The choice of which algorithm to use, at this point, seems to depend largely on the user's familiarity with the algorithm (for ease of hyperparameter tuning).
Please read the report, as it contains detailed information about the datasets used, methods implemented, results obtained, and further discussion.
For any doubts, don't hesitate to contact me at [email protected]
If you find our work helpful, do not forget to ⭐ the repository!