Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Streaming infer #44

Open
wntg opened this issue Oct 17, 2024 · 3 comments
Open

Streaming infer #44

wntg opened this issue Oct 17, 2024 · 3 comments

Comments

@wntg
Copy link

wntg commented Oct 17, 2024

I want to use wavtokenizer to speech AI. Is avtokenizer apply streaming infer?

@jishengpeng
Copy link
Owner

I want to use wavtokenizer to speech AI. Is avtokenizer apply streaming infer?

Since WavTokenizer follows the same paradigm as Encodec and DAC, it can be adapted for streaming. However, this requires retraining with causal convolution. Based on empirical evidence, this retraining is unlikely to result in performance degradation.

@handsomelys
Copy link

I want to use wavtokenizer to speech AI. Is avtokenizer apply streaming infer?

Since WavTokenizer follows the same paradigm as Encodec and DAC, it can be adapted for streaming. However, this requires retraining with causal convolution. Based on empirical evidence, this retraining is unlikely to result in performance degradation.

Do I need to replace all convolutional layers involved in the WavTokenizer network with casual convolution?

@jishengpeng
Copy link
Owner

I want to use wavtokenizer to speech AI. Is avtokenizer apply streaming infer?

Since WavTokenizer follows the same paradigm as Encodec and DAC, it can be adapted for streaming. However, this requires retraining with causal convolution. Based on empirical evidence, this retraining is unlikely to result in performance degradation.

Do I need to replace all convolutional layers involved in the WavTokenizer network with casual convolution?

yes, and attention part.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants