Hi Mareeta,

In my understanding, the analysis transform is implemented by a convolutional neural network. This architecture uses a number of learnable filters to scan the input image and computes the output feature maps by convolution. The output size of each convolutional layer (including the bottleneck layer) is determined by the size of its input and the stride, so the CNN architecture adapts to any (large enough) input size.

During training, the input images must all be the same size because you train in batches: every image needs the same height and width so that the batch forms a 4D tensor before being fed into the network. However, images in most training sets come in all sizes, which is why cropping is necessary. You can change the patch size from 256x256 to any size you like, as long as you crop all images to the same size.

In testing, we want to evaluate the full image for each image in the test set so that the results are comparable to standard benchmarks. There should be no random cropping at test time, since that would essentially change your test set and give unreliable results. As mentioned, the CNN adapts to inputs of different sizes, so there is no constraint on the size of the test images (unless you want to test in patches as well, which is empirically faster).

Lingyu
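To make the "output size is determined by input size and stride" point concrete, here is a minimal sketch of how the bottleneck size follows from the input size for a stack of strided convolutions. The choice of 4 layers with stride 2 and "same" padding is an illustrative assumption (a common analysis-transform layout), not a value stated in this thread.

```python
import math

def analysis_output_size(input_size, num_layers=4, stride=2):
    """Spatial size of the bottleneck for a stack of strided conv layers.

    Assumes 'same' padding, so each layer maps n -> ceil(n / stride).
    num_layers and stride are illustrative assumptions.
    """
    size = input_size
    for _ in range(num_layers):
        size = math.ceil(size / stride)
    return size

# Any (large enough) input works; only the output size changes.
print(analysis_output_size(256))  # 16
print(analysis_output_size(512))  # 32
print(analysis_output_size(500))  # 32  (500 -> 250 -> 125 -> 63 -> 32)
```

This is why no architectural change is needed between 256x256 training patches and arbitrarily sized test images: the same filters simply slide over a larger input and produce a larger feature map.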
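The batching constraint can also be sketched in code: images of different sizes are randomly cropped to a common patch size so they stack into a single 4D array. This is a hypothetical NumPy illustration of the idea, not the training pipeline's actual cropping code; the 256 patch size matches the default mentioned above.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_crop(image, patch=256):
    """Randomly crop an HxWxC image to patch x patch (requires H, W >= patch)."""
    h, w = image.shape[:2]
    top = rng.integers(0, h - patch + 1)
    left = rng.integers(0, w - patch + 1)
    return image[top:top + patch, left:left + patch]

# Images of different sizes become a uniform 4D batch after cropping.
images = [rng.random((h, w, 3)) for h, w in [(300, 400), (512, 512), (257, 600)]]
batch = np.stack([random_crop(im) for im in images])
print(batch.shape)  # (3, 256, 256, 3)
```

At test time this step is skipped entirely: each full image is fed to the network on its own, so no two test images need to share a size.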