Hi, unfortunately our published pre-trained models can't easily be fine-tuned. To make the computations cross-platform compatible and independent of specific TensorFlow versions, we provide them as constant-folded computation graphs with all unnecessary parts removed. No TensorFlow variables remain in the graph, and it no longer maps directly to Python code. The variables representing the continuous versions of the probability models are also missing from the graphs, since they are not needed at inference time. So even if you reverse engineered the graphs to extract the kernels/biases, there would be no way to continue training the differentiable proxy for the rate; you'd have to reinitialize that part of the model. Right now, I think your best option would be to train a model from scratch, based on the provided code in the repository, roughly along the lines of the sketch below.
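In case it helps, here is a minimal sketch of what training from scratch looks like. It is not the repository's actual bmshj2018 model; it assumes the TF1-era tensorflow_compression API (`tfc.SignalConv2D`, `tfc.GDN`, `tfc.EntropyBottleneck`), and the layer counts, filter sizes, and hyperparameters are illustrative only:

```python
# Minimal rate-distortion training sketch (assumes TF1-era tensorflow_compression).
import numpy as np
import tensorflow.compat.v1 as tf
import tensorflow_compression as tfc

tf.disable_eager_execution()

lmbda = 0.01       # rate-distortion trade-off; larger -> higher quality, higher rate
num_filters = 128  # illustrative; real models vary this with quality level

x = tf.placeholder(tf.float32, [None, 256, 256, 3])  # training patches in [0, 1]
num_pixels = tf.cast(tf.reduce_prod(tf.shape(x)[:-1]), tf.float32)

# Analysis transform: image -> latent y.
analysis = tf.keras.Sequential([
    tfc.SignalConv2D(num_filters, (5, 5), corr=True, strides_down=2,
                     padding="same_zeros", activation=tfc.GDN()),
    tfc.SignalConv2D(num_filters, (5, 5), corr=True, strides_down=2,
                     padding="same_zeros", activation=None),
])

# Synthesis transform: quantized latent -> reconstruction.
synthesis = tf.keras.Sequential([
    tfc.SignalConv2D(num_filters, (5, 5), corr=False, strides_up=2,
                     padding="same_zeros", activation=tfc.GDN(inverse=True)),
    tfc.SignalConv2D(3, (5, 5), corr=False, strides_up=2,
                     padding="same_zeros", activation=None),
])

# The entropy bottleneck holds the trainable continuous probability model --
# exactly the part that is missing from the published constant-folded graphs.
entropy_bottleneck = tfc.EntropyBottleneck()

y = analysis(x)
y_tilde, likelihoods = entropy_bottleneck(y, training=True)
x_tilde = synthesis(y_tilde)

# Rate: average bits per pixel, via the differentiable proxy (likelihoods).
bpp = tf.reduce_sum(tf.log(likelihoods)) / (-np.log(2) * num_pixels)
# Distortion: mean squared error of the reconstruction.
mse = tf.reduce_mean(tf.squared_difference(x, x_tilde))
loss = bpp + lmbda * mse

# The bottleneck's auxiliary loss fits the CDF tables used by the range coder.
main_step = tf.train.AdamOptimizer(1e-4).minimize(loss)
aux_step = tf.train.AdamOptimizer(1e-3).minimize(entropy_bottleneck.losses[0])
train_op = tf.group(main_step, aux_step, entropy_bottleneck.updates[0])
```

The full bmshj2018-hyperprior model additionally has a hyperprior over y; this sketch omits it for brevity, but the same pattern extends to it.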
Hi, I am wondering whether there is a convenient way to fine-tune the pre-trained models on a new dataset. Alternatively, could you provide the parameter setup for the different pre-trained models? I am interested in the setups for the models ranging from bmshj2018-hyperprior-msssim-1 to bmshj2018-hyperprior-msssim-8. Thanks in advance for your help.