Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I want to provide the Keras_hub implementation of RoFormer. #2118

Open
pass-lin opened this issue Feb 27, 2025 · 2 comments
Open

I want to provide the Keras_hub implementation of RoFormer. #2118

pass-lin opened this issue Feb 27, 2025 · 2 comments
Assignees
Labels
type:feature New feature or request

Comments

@pass-lin
Copy link
Contributor

https://github.com/ZhuiyiTechnology/roformer
Roformer is a BERT-like model. It adds the now very commonly used Rope position encoding on top of BERT. In fact, this is the first practical application of Rope position encoding.
I found that Keras_hub lacks a powerful Chinese BERT-like model. And RoFormer happens to be a native Chinese BERT model, and its architecture is very similar to that of Modern BERT. This will also be helpful for future implementations related to Modern BERT.

@pass-lin
Copy link
Contributor Author

Currently, there are two models, Bert and XLMroberta, which have Chinese and multilingual versions. However, one problem is that they have a limit on the length, making it difficult to meet the needs of many modern long-text applications.

@pass-lin
Copy link
Contributor Author

pass-lin commented Feb 27, 2025

https://github.com/ZhuiyiTechnology/roformer-v2 (Sorry, the webpage only has a Chinese interface.)
I found that RoFormer also has a more powerful v2 version, which doesn't have a corresponding implementation in HF, but its performance is better.

I tend to think we can directly implement this version, which offers a large base and a small version. Compared to previous versions, it may be more applicable.

@dhantule dhantule added the type:feature New feature or request label Mar 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type:feature New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants