Speed up RIFE by 45x (36.4 seconds -> 0.8 seconds)#102
Open
JohnAlcatraz wants to merge 1 commit into Fannovel16:main from
Conversation
Working great on my RTX 5090.

It worked well on a 4070 Ti Super 16GB, thanks.

I'll be showing this vibe-coding example in my lectures, great job!

@Fannovel16 what about merging this?
marduk191 added a commit to marduk191/ComfyUI-Frame-Interpolation that referenced this pull request on Feb 19, 2026
…ode opts

Incorporates the approach from PR Fannovel16#102 (JohnAlcatraz), which replaces the generic_frame_loop with an inline task-list loop. This enables true GPU-level batching: multiple (pair_index, timestep) tasks are stacked into a single batched tensor and processed in one IFNet forward pass, since IFNet already supports batched tensor timesteps.

Combined with our existing optimisations:

- dtype widget (float32/float16/bfloat16): model and inputs are cast to the chosen precision; output is always returned as float32 for downstream compatibility
- torch_compile widget: optional torch.compile() wrapping for a 10-30% speedup
- batch_size widget: now controls true task-level batching; each task is one intermediate frame, and multiple tasks share a single forward pass
- torch.inference_mode(): wraps the entire inference loop

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
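The task-level batching described in the commit message can be sketched as follows. This is a minimal illustration, not the PR's actual code: `build_task_batches` and `run_batched` are hypothetical names, and the sketch assumes a model callable that, like IFNet per the commit, accepts batched image tensors together with a per-sample timestep tensor.

```python
import torch

def build_task_batches(frames, multiplier, batch_size):
    """Enumerate (pair_index, timestep) tasks and chunk them into batches.

    Each task is one intermediate frame between frames[i] and frames[i+1]
    at a fractional timestep (hypothetical helper, for illustration).
    """
    tasks = []
    for pair_index in range(len(frames) - 1):
        for k in range(1, multiplier):
            tasks.append((pair_index, k / multiplier))
    # Chunks of tasks that will share a single forward pass.
    return [tasks[i:i + batch_size] for i in range(0, len(tasks), batch_size)]

def run_batched(model, frames, multiplier, batch_size):
    """Run all interpolation tasks with one model call per batch.

    Assumes `model(img0, img1, timestep)` accepts batched tensors and a
    per-sample timestep tensor (an assumption about the IFNet interface).
    """
    outputs = {}
    with torch.inference_mode():  # no autograd bookkeeping during inference
        for batch in build_task_batches(frames, multiplier, batch_size):
            img0 = torch.stack([frames[p] for p, _ in batch])
            img1 = torch.stack([frames[p + 1] for p, _ in batch])
            t = torch.tensor([ts for _, ts in batch]).view(-1, 1, 1, 1)
            mids = model(img0, img1, t)  # one forward pass for the whole chunk
            for (p, ts), mid in zip(batch, mids):
                outputs[(p, ts)] = mid
    return outputs
```

The point of the restructuring is that the per-task Python loop collapses into a few tensor stacks plus one GPU kernel launch per chunk, which is where the bulk of the claimed speedup in the loop itself comes from.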
marduk191 added a commit to marduk191/ComfyUI-Frame-Interpolation that referenced this pull request on Feb 19, 2026
Add a module-level _model_cache dict keyed by (ckpt_name, dtype, torch_compile). On repeated runs with the same settings, the model weight load, dtype cast, device transfer, and torch.compile() step are all skipped; only the inference loop runs. The cache is invalidated automatically when any of the three key parameters change.

This is the primary source of the speedup claimed by PR Fannovel16#102: the original code reloaded the model from disk on every execution.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
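The caching scheme the commit describes can be sketched like this. It is a simplified illustration, not the node's actual code: `get_model` and the `loader` callable are hypothetical, and the key tuple mirrors the (ckpt_name, dtype, torch_compile) key named in the commit message.

```python
import torch

# Module-level cache so repeated runs with identical settings skip the
# load -> cast -> compile pipeline entirely. Keys are illustrative.
_model_cache = {}

def get_model(ckpt_name, dtype, use_compile, loader):
    """Return a ready-to-run model, rebuilding it only on a cache miss.

    `loader` is a hypothetical callable that constructs the model from
    disk. Because the key covers every parameter that changes the
    prepared model, changing any of them is an automatic invalidation:
    the new key simply misses and a fresh model is built.
    """
    key = (ckpt_name, dtype, use_compile)
    if key not in _model_cache:
        model = loader(ckpt_name).to(dtype=dtype).eval()
        if use_compile:
            # optional torch.compile() wrapping, per the commit message
            model = torch.compile(model)
        _model_cache[key] = model
    return _model_cache[key]
```

One design note: the stale entries for old keys are kept rather than evicted, which trades a little memory for instant switching back to a previously used configuration; a real node might cap the cache size instead.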
Contributor

Rolled this into another PR that is merged. Thanks a lot!
Processing without this PR: 36.4 seconds
Processing with this PR: 0.8 seconds
Tested on an RTX 5090 with an 832x480 video, 81 frames.
To be clear, this is ChatGPT Agent's work; I'm not a Python programmer. But it works very well :) There is no difference in quality, and it's way faster.