During model warm-up, especially in the first few steps, GPU memory usage fluctuates significantly and the process takes a long time, up to around twenty seconds. Neither GPU memory usage nor latency is very stable. Are there any suggestions? Thanks.