| license | mit |
|---|
The proposed LocustLens is a multimodal large language model system designed for global desert locust (Schistocerca gregaria) presence risk prediction. Built upon Qwen3-VL-4B, LocustLens MLLM is fine-tuned through a two-stage training pipeline combining lightweight supervised fine-tuning and reinforcement learning on a curated 43-year locust event dataset.
- Early warning systems for locust outbreak risk assessment
- Decision support for desert locust surveillance and monitoring
- Multimodal reasoning research on spatiotemporal environmental data
This repository is built on top of the VERL framework. We would like to express our sincere gratitude to the VERL team for their outstanding work.
The used dataset can be download here
Please unzip the RAR file into your specific folder.
pip install uv
uv venv --python 3.12 --seed
source .venv/bin/activate
uv pip install "sglang[all]==0.5.2" --no-cache-dir && pip install torch-memory-saver --no-cache-dir
uv pip install --no-cache-dir "vllm==0.11.0"
uv pip install pytest
uv pip install "transformers[hf_xet]>=4.51.0" accelerate datasets peft hf-transfer \
"numpy<2.0.0" "pyarrow>=15.0.0" pandas "tensordict>=0.8.0,<=0.10.0,!=0.9.0" torchdata \
ray[default] codetiming hydra-core pylatexenc qwen-vl-utils wandb dill pybind11 liger-kernel mathruler \
pytest py-spy pre-commit ruff tensorboard
uv pip install "nvidia-ml-py>=12.560.30" "fastapi[standard]>=0.115.0" "optree>=0.13.0" "pydantic>=2.9" "grpcio>=1.62.1"
uv pip install --no-cache-dir flashinfer-python==0.3.1
uv pip install opencv-python
uv pip install opencv-fixer && \
python -c "from opencv_fixer import AutoFix; AutoFix()"
git clone https://github.com/zza234s/LocustPrediction.git
cd verl
uv pip install --no-deps -e .
Note: Please update data.train_files and data.val_files in the following shell scripts so that they point to the actual paths of your local training and validation dataset files.
bash LocustLens-SFT-RL-4B_test.shbash LocustLens_RL_traininig.sh 

