While running the script prepare_wikipedia.py, I encountered the following error:
It appears that the module fairseq2.assets.error is being imported but is not present in the repository.
Steps to Reproduce:
- Clone the repository.
- Install the dependencies
- Run
python prepare_wikipedia.py.
- The error occurs.
The Error
2025-04-02 03:43 INFO 15260:numexpr.utils - NumExpr defaulting to 2 threads. 2025-04-02 03:43:07.263072: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:477] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered WARNING: All log messages before absl::InitializeLog() is called are written to STDERR E0000 00:00:1743565387.283834 15260 cuda_dnn.cc:8310] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered E0000 00:00:1743565387.291319 15260 cuda_blas.cc:1418] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered 2025-04-02 03:43:07.312556: I tensorflow/core/platform/cpu_feature_guard.cc:210] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations. To enable the following instructions: AVX2 AVX512F FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags. Traceback (most recent call last): File "/content/large_concept_model/scripts/prepare_wikipedia.py", line 13, in <module> from stopes.modules.preprocess.sonar_text_embedding import ( File "/usr/local/lib/python3.11/dist-packages/stopes/modules/preprocess/sonar_text_embedding.py", line 17, in <module> from fairseq2.assets.error import AssetError ModuleNotFoundError: No module named 'fairseq2.assets.error'
Thank you.
While running the script
prepare_wikipedia.py, I encountered the following error:It appears that the module
fairseq2.assets.erroris being imported but is not present in the repository.Steps to Reproduce:
python prepare_wikipedia.py.The Error
2025-04-02 03:43 INFO 15260:numexpr.utils - NumExpr defaulting to 2 threads. 2025-04-02 03:43:07.263072: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:477] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered WARNING: All log messages before absl::InitializeLog() is called are written to STDERR E0000 00:00:1743565387.283834 15260 cuda_dnn.cc:8310] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered E0000 00:00:1743565387.291319 15260 cuda_blas.cc:1418] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered 2025-04-02 03:43:07.312556: I tensorflow/core/platform/cpu_feature_guard.cc:210] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations. To enable the following instructions: AVX2 AVX512F FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags. Traceback (most recent call last): File "/content/large_concept_model/scripts/prepare_wikipedia.py", line 13, in <module> from stopes.modules.preprocess.sonar_text_embedding import ( File "/usr/local/lib/python3.11/dist-packages/stopes/modules/preprocess/sonar_text_embedding.py", line 17, in <module> from fairseq2.assets.error import AssetError ModuleNotFoundError: No module named 'fairseq2.assets.error'Thank you.