mihirp1998

Follow

Mihir Prabhudesai mihirp1998

Follow

researcher at CMU MLD

42 followers · 0 following

Pittsburgh

Achievements

Achievements

Highlights

Pro

Pinned Loading

AlignProp AlignProp Public

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…

Python 254 8
VADER VADER Public

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 227 14
Diffusion-TTA Diffusion-TTA Public

Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.

Python 64 5
Slot-TTA Slot-TTA Public

Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.

Python 26 3
Disentangling-3D-Prototypical-Nets Disentangling-3D-Prototypical-Nets Public

We present neural architectures that disentangle RGB-D images into objects' shapes and styles and a map of the background scene, and explore their applications for few-shot 3D object detection and …

Python 11
huggingface/trl huggingface/trl Public

Train transformer language models with reinforcement learning.

Python 10.6k 1.4k