Skip to content

Question on random crop "End-to-end optimized image compressio" #87

Answered by lingyu98
Mareeta26 asked this question in Q&A
Discussion options

You must be logged in to vote

Hi Mareeta,

In my understanding, the analysis transform is implemented by a convolutional neural network. This architecture utilizes a number of learnable filters to scan the input image and computes the output feature maps by convolution. The output size of each convolutional layer(including the size of the bottleneck layer) is determined by the size of the input layer and the striding length. In other words, the CNN architecture is adaptable to any (large enough) sized input.

During the training period, the input image sizes are required to be the same because you are training in batches, every image has to have the same height and widths so that it forms a 4D matrix before being fed in…

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@Mareeta26
Comment options

@jonaballe
Comment options

Answer selected by jonaballe
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants