Optimized Grounded-Segment-Anything

This is a public fork of Grounded-Segment-Anything with some optimizations fro faster inference.

🚀 Optimizations

Ram and SAM:

First iteration 4 its/s 1% gpu util, 20% gpu mem taken: 4.13 s
Don't save raw image, 4.03 s
Add torch cuda sync for measure. Do NMS on gpu, 4.05 s
rejig image loading, 4.03 s
move sam definition out LOL 😱, time taken: 0.86 s
torch inference mode, 0.79 s
compile sam, 0.48 s
compile ram, 0.47 s
fast-sam and autocast, 0.2 s
removed torch.cuda.synchronizes and used turbojpeg 0.1-0.16
add scaled dot product attention to RAM swin. SDPA, 87images/s
batch everything -most important - 10x. Batch 512, workers=8, 256images/s
Move to gpu in int8 and normalise on gpu 420 images/s. 🔥 🧑‍🍳 Now we're cooking son
Move all to compile 593.92 images/s peak
Using autocast, can try with fp16 direct. FP16 732 images/s ✅

Grounding Dino:

On H100 it takes 15hrs for 1M images ~18 images/s

Installation

The code requires python>=3.8, as well as pytorch>=1.7 and torchvision>=0.8.

TODO

Marco Forte, ML Team @ Photoroom

Name		Name	Last commit message	Last commit date
Latest commit History 295 Commits
EfficientSAM		EfficientSAM
GroundingDINO		GroundingDINO
VISAM @ d7c3823		VISAM @ d7c3823
assets		assets
grounded-sam-osx @ 6688b03		grounded-sam-osx @ 6688b03
playground		playground
recognize-anything		recognize-anything
segment_anything		segment_anything
voxelnext_3d_box		voxelnext_3d_box
.gitignore		.gitignore
.gitmodules		.gitmodules
CITATION.cff		CITATION.cff
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
automatic_label_demo.py		automatic_label_demo.py
automatic_label_ram_demo.py		automatic_label_ram_demo.py
automatic_label_simple_demo.py		automatic_label_simple_demo.py
automatic_label_tag2text_demo.py		automatic_label_tag2text_demo.py
chatbot.py		chatbot.py
cog.yaml		cog.yaml
gradio_app.py		gradio_app.py
grounded_sam.ipynb		grounded_sam.ipynb
grounded_sam_3d_box.ipynb		grounded_sam_3d_box.ipynb
grounded_sam_colab_demo.ipynb		grounded_sam_colab_demo.ipynb
grounded_sam_demo.py		grounded_sam_demo.py
grounded_sam_inpainting_demo.py		grounded_sam_inpainting_demo.py
grounded_sam_osx_demo.py		grounded_sam_osx_demo.py
grounded_sam_simple_demo.py		grounded_sam_simple_demo.py
grounded_sam_visam.py		grounded_sam_visam.py
grounded_sam_whisper_demo.py		grounded_sam_whisper_demo.py
grounded_sam_whisper_inpainting_demo.py		grounded_sam_whisper_inpainting_demo.py
grounding_dino_demo.py		grounding_dino_demo.py
predict.py		predict.py
requirements.txt		requirements.txt