Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
PULL REQUEST TITLE: UofT-DSI | LCR-Assignment 3
What changes are you trying to make? (e.g. Adding or removing code, refactoring existing code, adding reports)
Add code to display the number of many observations and features in the dataset.
Add code to apply K-means clustering to the wine dataset.
Add code to implement bootstrapping on the mean of color intensity, generating 10000 bootstrap samples, calculating the mean for each sample, and determine the 90% confidence interval.
What did you learn from the changes you have made?
How to implement bootstrapping, calculating the mean of the samples, and calculate the 90% confidence interval.
Was there another approach you were thinking about making? If so, what approach(es) were you thinking of?
N/A
Were there any challenges? If so, what issue(s) did you face? How did you overcome it?
Determining the lower and upper bound of the confidence interval. I overcame the challenge by reviewing the live coding the determine the calculations.
Determining the variability of the bootstrapped means. I overcame the challenge by plotting the histogram to look at the distribution of the mean values and compared the mean color intensity, mean of the bootstrap samples, and the mean values from the lower and upper confidence interval bounds.
How were these changes tested?
By generating 10000 bootstrap samples, calculating the mean for each sample, determining the 90% confidence interval, and plotting the histogram to determine the variability.
A reference to a related issue in your repository (if applicable)
Checklist