Use pre-built wheel for flash-attn on Linux & Resolve version mismatch for torchaudio#1848
Use pre-built wheel for flash-attn on Linux & Resolve version mismatch for torchaudio#18482vibeornot2vibe wants to merge 3 commits into
Conversation
|
I added two more patches:
|
…d, works on Volta)
|
thx but you cant replace pytorch 2.7 with pytorch 2.10 as it invalidates all the precompiled wheels for the various kernels provided with wangp: there is no combination python 3.10 / pytorch 2.10 / cuda 12.8. |
Thanks for the reply. The reason I updated the CUDA 12.8 config to pytorch 2.10 was that I saw only the GTX_10 profile (aka pre-Turin cards) used it. And this profile doesn't install any kernels (they aren't supported anyways). |
Since this project uses Python 3.11 + CUDA 13.0 + PyTorch 2.10, and there's no pre-built flash-attn wheel for this combination on PyPI, it will default to building the package, which will then fail due to build isolation issues. Even with build isolation disabled, it will still take a very long time. The windows config already uses a pre-built wheel, so this PR essentially does the same for Linux.