models : Added support for RND1 Diffusion Language Model #17433

wp4032 · 2025-11-21T18:18:23Z

RND1 works on llama.cpp, can run:

llama-diffusion-cli -m RND1-Base-0910.gguf -p "write code to train MNIST in pytorch" -ub 256 --diffusion-algorithm 1 --diffusion-steps 256 --diffusion-visual --temp 0.5

Model Card

https://huggingface.co/radicalnumerics/RND1-Base-0910

Instructions

# Create conda env
cd llama.cpp && conda create --name rnd1 python=3.12
conda activate rnd1
pip install -r requirements.txt

# Converting to gguf
huggingface-cli download radicalnumerics/RND1-Base-0910 --local-dir RND1-Base-0910
python llama.cpp/convert_hf_to_gguf.py RND1-Base-0910/ --outfile RND1-Base-0910.gguf --outtype bf16 

# Building diffusion cli
cmake -B build    # Will build with Metal automatically
cmake --build llama.cpp/build --target llama-diffusion-cli -j
llama.cpp/build/bin/llama-diffusion-cli -m RND1-Base-0910.gguf -p "What is a GPU?" -ub 32 --temp 0.01 -ngl 999 -fa on --seed 1234 --verbose

Results

Works on GB200, non-causal, BF16, 512 context len, 256 diffusion steps:

total time: 55793.55ms, time per step: 217.94ms, sampling time per step: 52.18ms

Works on H100, non-causal, BF16, 512 context len, 256 diffusion steps:

total time: 55602.47ms, time per step: 217.20ms, sampling time per step: 63.52ms

Works on Mac Studio M3 Ultra, non-causal, BF16, 512 context len, 256 diffusion steps:

total time: 65817.30ms, time per step: 257.10ms, sampling time per step: 16.65ms

convert_hf_to_gguf.py

src/models/models.h

convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <[email protected]>

CISC

Nice, thank you!

am17an · 2025-11-22T00:26:43Z

Please update or close the issue with a comment #17291

examples/diffusion/README.md

src/models/rnd1.cpp

wp4032 · 2025-11-22T16:25:26Z

Please update or close the issue with a comment #17291

Regarding this, I am still looking for the cause of the issue, however, I am quite confident it is not to do with RND1's implementation. If you really want me to close the issue #17291, I can.

src/models/rnd1.cpp

am17an · 2025-11-23T01:55:47Z

Regarding this, I am still looking for the cause of the issue, however, I am quite confident it is not to do with RND1's implementation. If you really want me to close the issue #17291, I can.

I don't really want you to close the issue, however if it is a correctness bug as you mention there then the outputs for RND1 will also suffer from the same issue, I'm not sure why you are saying it is not to do with RND1's implementation as that is the same implementation as Qwen2. I would rather see that resolved before merging.

wp4032 added 5 commits November 4, 2025 20:49

Converted RND1 model to GGUF weights

cfee32d

RND1 llama.cpp support v1

15d938c

RND1 llama.cpp support v2 non causal bug

0911acf

RND1 llama.cpp support v3 doccumentation

5f36f0a

RND1 llama.cpp support v4 clean code

d960ace

wp4032 requested review from CISC, am17an and ggerganov as code owners November 21, 2025 18:18

github-actions bot added model Model specific examples python python script changes labels Nov 21, 2025

Merge branch 'master' into rnd1-llama-cpp, fix merge conflicts

e40d24b

loci-dev mentioned this pull request Nov 21, 2025

UPSTREAM PR #17433: models : Added support for RND1 Diffusion Language Model auroralabs-loci/llama.cpp#284

Open

linting issues

e02174b

CISC reviewed Nov 21, 2025

View reviewed changes

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

src/models/models.h Outdated Show resolved Hide resolved

RND1 pr fixes v1

53a517b

CISC reviewed Nov 21, 2025

View reviewed changes

convert_hf_to_gguf.py Show resolved Hide resolved

convert_hf_to_gguf.py Show resolved Hide resolved

RND1 pr fixes v2

a877fe3

Co-authored-by: Sigbjørn Skjæret <[email protected]>

CISC approved these changes Nov 22, 2025

View reviewed changes

am17an requested changes Nov 22, 2025

View reviewed changes

examples/diffusion/README.md Outdated Show resolved Hide resolved

examples/diffusion/README.md Outdated Show resolved Hide resolved

examples/diffusion/README.md Outdated Show resolved Hide resolved

src/models/rnd1.cpp Show resolved Hide resolved

Diffusion documentation edits

bf6d002

am17an reviewed Nov 23, 2025

View reviewed changes

src/models/rnd1.cpp Show resolved Hide resolved

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

models : Added support for RND1 Diffusion Language Model #17433

models : Added support for RND1 Diffusion Language Model #17433

wp4032 commented Nov 21, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CISC left a comment

Uh oh!

am17an commented Nov 22, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wp4032 commented Nov 22, 2025 •

edited

Loading

Uh oh!

Uh oh!

am17an commented Nov 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

models : Added support for RND1 Diffusion Language Model #17433

Are you sure you want to change the base?

models : Added support for RND1 Diffusion Language Model #17433

Conversation

wp4032 commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Model Card

Instructions

Results

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CISC left a comment

Choose a reason for hiding this comment

Uh oh!

am17an commented Nov 22, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wp4032 commented Nov 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

am17an commented Nov 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

wp4032 commented Nov 21, 2025 •

edited

Loading

wp4032 commented Nov 22, 2025 •

edited

Loading