ADD RWKV7 #2421

pass-lin · 2025-09-28T15:11:13Z

RWKV7 is one of the strongest RNN models available today, and we now provide a full implementation for it in keras_hub.

📚 References

🔗 Pre-trained Checkpoints (ModelScope)

Numerical-verification and Inference Example notebook

This is the first modern RNN architecture in keras_hub. With the resurgence of recurrent models, more pre-trained RNN backbones will follow; hence this PR also serves as a reference implementation for future work.

Current progress

[✅] backbone implementation
[✅] checkpoint-conversion script
[✅] tokenizer implementation
[✅] unit tests / examples
[✅] complete CausalLM task wrapper
[✅] add documentation

gemini-code-assist · 2025-09-28T15:11:32Z

Summary of Changes

Hello @pass-lin, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the keras_hub library by integrating the RWKV7 model, a cutting-edge recurrent neural network. This addition not only provides a robust new model for users but also serves as a foundational reference implementation, encouraging the future inclusion of more modern RNN architectures within the library.

Highlights

RWKV7 Model Integration: Introduced the RWKV7 model, a powerful RNN architecture, into keras_hub, marking a significant expansion of the library's capabilities.
Comprehensive Implementation: The pull request includes a full implementation of the RWKV7 backbone, its dedicated tokenizer, and a causal language model wrapper.
Foundation for Modern RNNs: This is the first modern RNN architecture added to keras_hub, serving as a reference implementation and paving the way for future recurrent model additions.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page; here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

gemini-code-assist

Code Review

This PR introduces the RWKV-7 model, a powerful RNN architecture, to keras_hub. The contribution is significant and includes the backbone, tokenizer, preprocessor, an incomplete task model, and a checkpoint conversion script. The implementation follows the modular structure of keras_hub.

However, there are several critical issues that must be addressed before this PR can be merged:

Missing Tests: The PR lacks unit tests for all new components. According to the contribution guidelines, testing is a mandatory requirement.[^1]
Incomplete CausalLM Task: The RWKV7CausalLM task model is a stub with TODOs, making it non-functional for generation.
Critical Bugs: There are critical bugs in the tokenizer and preprocessor implementations that will cause runtime errors.
Style Guide Violations: There are numerous style guide violations, including a filename typo, missing docstrings, and inconsistencies with the recommended model input structure.

I've left detailed comments on these issues. Once these are resolved, this will be a great addition to the library.

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

keras_hub/src/models/rwkv7/rwkv7_casual_lm.py

tools/checkpoint_conversion/convert_rwkv7_checkpoints.py

keras_hub/src/models/rwkv7/rwkv7_causal_lm_preprocessor.py

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

keras_hub/src/models/rwkv7/rwkv7_layer.py

keras_hub/src/models/rwkv7/rwkv7_backbone.py

keras_hub/src/models/rwkv7/rwkv7_causal_lm_preprocessor.py

keras_hub/src/models/rwkv7/rwkv7_backbone.py

pass-lin · 2025-10-19T02:28:53Z

@divyashreepathihalli @mattdangerw Could anybody review my code?

pass-lin · 2025-10-20T10:17:06Z

@sachinprasadhs

gemini-code-assist

Code Review

This pull request introduces a comprehensive implementation of the RWKV-7 model, a modern RNN architecture, into keras_hub. The contribution is well-structured, following the repository's modular design with a backbone, causal LM task, preprocessor, tokenizer, and a checkpoint conversion script. The code is generally of high quality.

My review highlights a few areas for improvement. There are critical bugs in the tokenizer's asset loading and saving logic that need to be addressed. The custom layers in rwkv7_layer.py have inconsistent return signatures, which could lead to runtime errors and make the code harder to maintain. Additionally, the test files do not use the standardized helper methods from TestCase as required by the repository's style guide, which is a significant deviation. I have also pointed out some minor typos and opportunities for code clarification.

Overall, this is an excellent and valuable addition. Addressing the feedback will improve the robustness, correctness, and maintainability of the new model.

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

keras_hub/src/models/rwkv7/rwkv7_layer.py

keras_hub/src/models/rwkv7/rwkv7_backbone.py

keras_hub/src/models/rwkv7/rwkv7_causal_lm.py

tools/checkpoint_conversion/convert_rwkv7_checkpoints.py

pass-lin · 2025-10-29T07:39:28Z

@mattdangerw @divyashreepathihalli @sachinprasadhs
It's been almost a month, is there anyone willing to pay attention to me?

sachinprasadhs · 2025-11-04T21:13:41Z

Apologies for the delay in review, taking a look into this. Will add my comments.

sachinprasadhs · 2025-11-04T21:13:49Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces the RWKV-7 model, a modern RNN architecture, to keras_hub. The implementation is comprehensive, covering the backbone, causal LM task, preprocessor, tokenizer, and a checkpoint conversion script. The code is well-structured and follows the modular design principles of the repository.

My review focuses on ensuring adherence to the repository's style guide, particularly regarding testing practices and code style conventions. I've identified several areas for improvement:

The testing for the new components should be updated to use the standardized helper methods from the base TestCase. Some tests also contain incorrect assertions.
There are a few deviations from the coding style, such as the use of type hints in function signatures and a few hardcoded values that could be made more flexible.
The backbone implementation should be updated to accept a padding_mask as input, aligning with the repository's conventions.
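
As a rough illustration of the convention referenced in the last point, a backbone that accepts a padding mask typically receives a dictionary with `token_ids` and `padding_mask` entries. The sketch below is not taken from this PR: the helper name `pad_batch`, the pad id `0`, and right-side padding are all assumptions made for illustration, and plain Python lists stand in for tensors.

```python
def pad_batch(sequences, sequence_length, pad_id=0):
    """Right-pad token id lists and build a matching padding mask.

    `pad_batch` and `pad_id=0` are hypothetical; the real preprocessor
    may pad differently (for example on the left).
    """
    token_ids, padding_mask = [], []
    for seq in sequences:
        seq = list(seq)[:sequence_length]  # truncate over-long inputs
        n_pad = sequence_length - len(seq)
        token_ids.append(seq + [pad_id] * n_pad)
        # 1 marks a real token, 0 marks padding.
        padding_mask.append([1] * len(seq) + [0] * n_pad)
    return {"token_ids": token_ids, "padding_mask": padding_mask}


batch = pad_batch([[5, 8, 13], [7]], sequence_length=4)
```

With both entries present, a model can ignore padded positions instead of inferring them from the token ids alone.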

Addressing these points will improve the consistency, correctness, and maintainability of the new model. Overall, this is a great contribution, adding a powerful and interesting architecture to the library.

keras_hub/src/models/rwkv7/rwkv7_backbone.py

keras_hub/src/models/rwkv7/rwkv7_backbone_test.py

keras_hub/src/models/rwkv7/rwkv7_causal_lm_preprocessor_test.py

keras_hub/src/models/rwkv7/rwkv7_causal_lm_test.py

keras_hub/src/models/rwkv7/rwkv7_causal_lm.py

keras_hub/src/models/rwkv7/rwkv7_layer.py

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

pass-lin · 2025-11-11T06:21:39Z

> Thanks for providing the details, will again circle back with the team and get back to you on this one.

Finally, please allow me to add one more point.

From February 26, 2025, to today, RWKV-LM has gained 1,000 stars (from 13.1k to 14.1k). It has also increased by 100 stars from October 16 to today. You can see this trend of star growth at the following link.

I believe this demonstrates that RWKV is a very popular and highly active community.

tempdragon · 2025-11-11T10:09:29Z

BTW, RWKV is also mentioned here by the Linux Foundation.
https://lfaidata.foundation/projects/rwkv/

divyashreepathihalli · 2025-11-14T20:33:32Z

Thank you @pass-lin!!

pass-lin · 2025-11-15T03:57:21Z

> Thank you @pass-lin!!

I want to know if the Keras team thinks RWKV is suitable to be merged into Keras Hub.

sachinprasadhs · 2025-11-17T17:46:26Z

@pass-lin, we can add it, since a lot of effort has already gone into it.
I will add my review for the other files.
Going forward, please create an issue with the details of the model, along with the download trend etc.

sachinprasadhs

There are still many unresolved comments. Please go through them carefully and let us know once this is ready for review again.
Also, match the coding style to the Keras Hub standard implementation.
Refer to our model and contribution guidelines.

keras_hub/src/models/rwkv7/rwkv7_backbone_test.py

tools/checkpoint_conversion/convert_rwkv7_checkpoints.py

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

keras_hub/src/models/rwkv7/rwkv7_tokenizer_test.py

keras_hub/src/models/rwkv7/rwkv7_tokenizer.py

pass-lin · 2025-11-26T04:17:23Z

Following your review, I reworked rwkv7_tokenizer. Please note that I kept the recursion, because rashly modifying the recursive code written by the original RWKV author would be too bug-prone. Since the maximum depth of the trie for the RWKV vocabulary is 80, a stack overflow is unlikely, so I believe the recursion should be kept.
@sachinprasadhs
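
To make the depth argument concrete, here is a minimal sketch of a recursive trie and a depth check against Python's recursion limit. It is not the tokenizer code from this PR: the nested-dict layout, the `insert` and `max_depth` helpers, and the toy vocabulary are assumptions for illustration only.

```python
import sys


def insert(node, token, value, depth=0):
    """Recursively insert `token` (bytes) into a nested-dict trie."""
    if depth == len(token):
        node["value"] = value  # mark the end of a complete token
        return
    # Indexing bytes yields an int, so it never collides with "value".
    child = node.setdefault(token[depth], {})
    insert(child, token, value, depth + 1)


def max_depth(node):
    """Return the number of edges on the longest root-to-leaf path."""
    children = [v for k, v in node.items() if k != "value"]
    if not children:
        return 0
    return 1 + max(max_depth(child) for child in children)


# Toy vocabulary; the real RWKV vocabulary is far larger.
vocab = {b"a": 1, b"ab": 2, b"abc": 3, b"b": 4}
root = {}
for token, token_id in vocab.items():
    insert(root, token, token_id)

depth = max_depth(root)
limit = sys.getrecursionlimit()
print(depth, limit)
```

The recursion depth of both helpers is bounded by the longest token, so a maximum depth of 80 sits well below CPython's default recursion limit, which is usually 1000.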

sachinprasadhs · 2025-11-26T05:04:38Z

Okay. Also, once you address a comment, please mark it as resolved.

pass-lin · 2025-11-26T06:31:16Z

> There are still many unresolved comments. Please go through them carefully and let us know once this is ready for review again. Also, match the coding style to the Keras Hub standard implementation. Refer to our model and contribution guidelines.

I think all the current issues have been resolved, and we can proceed to the next step.

sachinprasadhs · 2025-11-26T18:43:30Z

Each review comment has a "Resolve conversation" button. Could you please click it once the comment is resolved? There are 100-plus comments and many are still showing as open.

pass-lin · 2025-11-27T02:57:00Z

> Each review comment has a "Resolve conversation" button. Could you please click it once the comment is resolved? There are 100-plus comments and many are still showing as open.

OK, I have resolved all conversations.

pass-lin · 2025-12-04T15:27:37Z

@sachinprasadhs Can you review the current code?

pass-lin added 3 commits September 28, 2025 22:48

add RWKV

195ef79

fix

7bc36b5

fix

7d4a7a1

gemini-code-assist bot reviewed Sep 28, 2025

pass-lin added 7 commits October 7, 2025 23:15

add inference

e5bb446

add inference

afcff31

add tokenizer doc

ec0baf3

add doc

bd6c618

add test case

4201a7f

fix test

897a64b

fix doc

ff11f94

divyashreepathihalli requested a review from sachinprasadhs October 19, 2025 18:38

gemini-code-assist bot reviewed Oct 20, 2025

pass-lin added 3 commits October 20, 2025 18:44

fix gemini review.

ce13d54

format.

0e36b4a

format.

7218888

pass-lin added 5 commits October 29, 2025 16:02

save tokenizer

cc5815b

fix tokenizer load

dd80464

fix save

5e8723d

renew preset

f223002

renew perset.

b2b1573

pass-lin force-pushed the rwkv branch from c2afdde to b2b1573 Compare November 3, 2025 09:11

debug for remat

c5ebeec

gemini-code-assist bot reviewed Nov 4, 2025

pass-lin added 3 commits November 13, 2025 13:46

modify RWKV7CausalLMPreprocessor

75c8a88

modify RWKV7CausalLMPreprocessor

eac1505

modify RWKV7CausalLMPreprocessor

06ec6c5

modify variable shape

f67d37a

pass-lin mentioned this pull request Nov 18, 2025

Add RWKV7 #2457

Open

remove vfrist at layer0

3d79d2f

sachinprasadhs reviewed Nov 25, 2025

modify tokenizer

4e9e6e7

pass-lin added 2 commits November 26, 2025 12:18

remove vscode file

36ebd7f

remove typing

fb5aef5

pass-lin and others added 7 commits November 28, 2025 15:35

Merge branch 'keras-team:master' into rwkv

b8971ca

recover .vscode file.

89a311f

add faster inference op.

c7cf4de

add faster inference op.

d890ec6

limit cuda kernel at jax and torch backend

55dc91f

fix tensorflow inference bug

1675d0f

update new preset.

e06724b

pass-lin added 2 commits December 6, 2025 15:50

modify preprocessor

2aecaa5

modify code .

8cc1e29
