Merging quantized model with pass through

Is there a way to do this?

I understand why F16 is required for linear and slerp, but can we do passthrough of quantized layer, as currently it necessary to go via huge models and requantize, which is a big pain point.