Skip to content

Commit 6ea485b

Browse files
committed
Add chapter 4 figures
1 parent a59ab1c commit 6ea485b

28 files changed

+38
-6
lines changed

README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -44,7 +44,7 @@ We delve into the world of image classification in a mere five lines of Keras co
4444

4545
We use transfer learning to reuse a previously trained network on a new custom classification task to get near state-of-the-art accuracy in a matter of minutes. We then slice and dice the results to understand how well is it classifying. Along the way, we build a common machine learning pipeline, which is repurposed throughout the book. Bonus: we hear from **Jeremy Howard**, co-founder of fast.ai, on how hundreds of thousands of students use transfer learning to jumpstart their AI journey.
4646

47-
[**Chapter 4 - Building a Reverse Image Search Engine: Understanding Embeddings**](https://github.com/practicaldl/Practical-Deep-Learning-Book/tree/master/code/chapter-4) | [Read online](https://learning.oreilly.com/library/view/practical-deep-learning/9781492034858/ch04.html)
47+
[**Chapter 4 - Building a Reverse Image Search Engine: Understanding Embeddings**](https://github.com/practicaldl/Practical-Deep-Learning-Book/tree/master/code/chapter-4) | [Read online](https://learning.oreilly.com/library/view/practical-deep-learning/9781492034858/ch04.html) | [Figures](figures/chapter-4)
4848

4949
Like Google Reverse Image Search, we explore how one can use embeddings—a contextual representation of an image to find similar images in under ten lines. And then the fun starts when we explore different strategies and algorithms to speed this up at scale, from thousands to several million images, and making them searchable in microseconds.
5050

figures/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ This directory contains references to the figures used within the book. All imag
1111
| [Chapter 1 - Exploring the Landscape of Artificial Intelligence](chapter-1/) |
1212
| [Chapter 2 - What’s in the Picture: Image Classification with Keras](chapter-2/) |
1313
| [Chapter 3 - Cats versus Dogs: Transfer Learning in 30 Lines with Keras](chapter-3/) |
14-
| Chapter 4 - Building a Reverse Image Search Engine: Understanding Embeddings |
14+
| [Chapter 4 - Building a Reverse Image Search Engine: Understanding Embeddings](chapter-4/) |
1515
| Chapter 5 - From Novice to Master Predictor: Maximizing Convolutional Neural Network Accuracy |
1616
| Chapter 6 - Maximizing Speed and Performance of TensorFlow: A Handy Checklist |
1717
| Chapter 7 - Practical Tools, Tips, and Tricks |

figures/chapter-1/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# Chapter 1: Exploring the Landscape of Artificial Intelligence
1+
# Chapter 1 - Exploring the Landscape of Artificial Intelligence
22

33
Note: All images in this directory, unless specified otherwise, are licensed under [CC BY-NC 4.0](https://creativecommons.org/licenses/by-nc/4.0/legalcode).
44

figures/chapter-2/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# Chapter 2: What's in the Picture: Image Classification with Keras
1+
# Chapter 2 - What's in the Picture: Image Classification with Keras
22

33
Note: All images in this directory, unless specified otherwise, are licensed under [CC BY-NC 4.0](https://creativecommons.org/licenses/by-nc/4.0/legalcode).
44

figures/chapter-3/README.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# Chapter 3: Cats Versus Dogs: Transfer Learning in 30 Lines with Keras
1+
# Chapter 3 - Cats Versus Dogs: Transfer Learning in 30 Lines with Keras
22

33
Note: All images in this directory, unless specified otherwise, are licensed under [CC BY-NC 4.0](https://creativecommons.org/licenses/by-nc/4.0/legalcode).
44

@@ -13,7 +13,7 @@ Note: All images in this directory, unless specified otherwise, are licensed und
1313
| [3-5](5-finetuning.png?raw=true) | Fine tuning a convolutional neural network | |
1414
| [3-6](6-folder-tree-structure.png?raw=true) | Example directory structure of the training and validation data for different classes | |
1515
| [3-7](7-overfitting-underfitting.png?raw=true) | Underfitting, overfitting, and ideal fitting for points close to a sine curve | |
16-
| [3-8](8-rabbit-augmentations.png?raw=true) | Possible image augmentations generated from a single image | |
16+
| [3-8](8-rabbit-augmentations.jpg?raw=true) | Possible image augmentations generated from a single image | |
1717
| [3-9](9-highest-probability-dogs.png?raw=true) | Images with the highest probability of containing dogs | |
1818
| [3-10](10-lowest-probability-dogs.png?raw=true) | Images with the lowest probability of containing dogs | |
1919
| [3-11](11-cats-highest-probability-containing-dogs.png?raw=true) | Images of cats with the highest probability of containing dogs | |

figures/chapter-4/1-rgb-histogram.png

1.22 MB
Loading

figures/chapter-4/10-tsne-grid.png

3.91 MB
Loading
Loading
Loading
Loading
46.7 KB
Loading
103 KB
Loading
92.2 KB
Loading
72.7 KB
Loading
Loading

figures/chapter-4/2-amazon-scan.png

855 KB
Loading
1.41 MB
Loading
3.07 MB
Loading
2.94 MB
Loading
3.16 MB
Loading

figures/chapter-4/3-tqdm.png

14.9 KB
Loading
270 KB
Loading

figures/chapter-4/5-first-result.png

270 KB
Loading
281 KB
Loading
1.94 MB
Loading

figures/chapter-4/8-tsne.png

172 KB
Loading

figures/chapter-4/9-tsne-clusters.png

8.24 MB
Loading

figures/chapter-4/README.md

+32
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,32 @@
1+
# Chapter 4 - Building a Reverse Image Search Engine: Understanding Embeddings
2+
3+
Note: All images in this directory, unless specified otherwise, are licensed under [CC BY-NC 4.0](https://creativecommons.org/licenses/by-nc/4.0/legalcode).
4+
5+
## Figure List
6+
7+
| Figure number | Description | Notes |
8+
|:---|:---|:---|
9+
| [4-1](1-rgb-histogram?raw=true) | RGB histogram-based "Similar Image Detector" program | |
10+
| [4-2](2-amazon-scan.png?raw=true) | Product scanner in Amazon app with visual features highlighted | |
11+
| [4-3](3-tqdm.png) | Progress bar shown with tqdm_notebook | |
12+
| [4-4](4-original-image.png?raw=true) |The query image from the Caltech-101 dataset | |
13+
| [4-5](5-first-result.png?raw=true) | The nearest neighbor to our query image | |
14+
| [4-6](6-second-closest.png?raw=true) | The second nearest neighbour of the queried image | |
15+
| [4-7](7-nearest-neighbors.png?raw=true) | Nearest neighbor for different images returns similar-looking images | |
16+
| [4-8](8-tsne.png?raw=true) | t-SNE visualizing clusters of image features, where each cluster represents one object class in the same color | |
17+
| [4-9](9-tsne-clusters.png?raw=true) | t-SNE visualization showing image clusters; similar images in the same cluster | |
18+
| [4-10](10-tsne-grid.png?raw=true) | t-SNE visualization with tiled images; similar images are close together | |
19+
| [4-11](11-tensorflow-embedding-projector.png?raw=true) | TensorFlow Embedding projector showing a 3D representation of 10,000 common English words and highlighting words related to "Beatles" | |
20+
| [4-12](12-variance-vs-num-pca-dimensions.png?raw=true) | Variance for each PCA dimension |
21+
| [4-13](13-cumulative-variance-vs-num-pca-dimensions.png?raw=true) | Cumulative variance with each PCA dimension | |
22+
| [4-14](14-test-time-vs-accuracy.png?raw=true) | Test time versus accuracy for each PCA dimension | |
23+
| [4-15](15-recall-vs-qps.png?raw=true) | Comparison of ANN libraries ([data source](http://ann-benchmarks.com/)) | |
24+
| [4-16](16-before-finetune.png?raw=true) | t-SNE visualization of feature vectors of least-accurate classes before fine tuning | |
25+
| [4-17](17-after-finetune.png?raw=true) | t-SNE visualization of feature vectors of least-accurate classes after fine tuning | |
26+
| [4-18](18-siamese-network-flowchart.png?raw=true) | A Siamese network for signature verification; note that the same CNN was used for both input images | |
27+
| [4-19](https://code.flickr.net/2017/03/07/introducing-similarity-search-at-flickr/) | Similar patterns of a desert photo | |
28+
| [4-20](https://labs.pinterest.com/user/themes/pin_labs/assets/paper/visual_search_at_pinterest.pdf) | The Similar Looks feature of the Pinterest application | |
29+
| [4-21](21-celebs-like-me.png?raw=true) | Testing our friend Pete Warden’s photo on the [celebslike.me](celebslike.me) website | |
30+
| [4-22](https://papers.nips.cc/paper/5004-deep-content-based-music-recommendation.pdf) | t-SNE visualization of the distribution of predicted usage patterns, using latent factors predicted from audio | Page 7 |
31+
| [4-23a](23a-image-captioning.png?raw=true), [4-23b](23b-image-captioning.png?raw=true), [4-23c](23c-image-captioning.png?raw=true) | Image captioning feature in Seeing AI: the Talking Camera App for the blind community | |
32+
| [4-24](https://arxiv.org/pdf/1608.08716.pdf) | Defining a CNN and visualizing the output of each layer during training in ConvNetJS | Page 2 |

0 commit comments

Comments
 (0)