The repository for my bachelor thesis written at TUM. It contains a Stable Diffusion pipeline based on ALDM to convert semantic segmentation maps into realistic images.
The pipeline revolves around a core class, Pipeline, which uses a builder pattern to construct and execute image processing tasks. It encapsulates data in a stream that contains the image, metadata, and additional information such as bounding boxes or depth data. The data stream is represented as a dictionary or tuple, ensuring an ordered and immutable data flow through the pipeline. Several methods of the Pipeline class provide further functionality, such as looping over sub-pipelines, running processes in parallel, and handling multiple image inputs.
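As a rough illustration of this builder-style design, the sketch below shows how such a pipeline could be chained. All class, method, and field names here are illustrative assumptions for explanation purposes and do not reflect the repository's actual API.

```python
# Illustrative sketch only: names are assumptions, not the repository's real API.
from typing import Any, Callable, Dict, List, Tuple

# The data stream: an ordered, immutable tuple of dictionaries holding the
# image, metadata, and extras such as bounding boxes or depth maps.
Stream = Tuple[Dict[str, Any], ...]


class Pipeline:
    """Builder-style pipeline: each call registers a step, run() executes them."""

    def __init__(self) -> None:
        self._steps: List[Callable[[Stream], Stream]] = []

    def add(self, step: Callable[[Stream], Stream]) -> "Pipeline":
        self._steps.append(step)
        return self  # returning self enables method chaining (builder pattern)

    def loop(self, sub: "Pipeline", times: int) -> "Pipeline":
        """Register a step that runs a sub-pipeline repeatedly on the stream."""
        def _loop(stream: Stream) -> Stream:
            for _ in range(times):
                stream = sub.run(stream)
            return stream
        return self.add(_loop)

    def run(self, stream: Stream) -> Stream:
        for step in self._steps:
            stream = step(stream)
        return stream


# Hypothetical usage: attach metadata to every item in the stream.
def add_metadata(stream: Stream) -> Stream:
    return tuple({**item, "meta": {"source": "segmentation"}} for item in stream)


result = Pipeline().add(add_metadata).run(({"image": None, "boxes": []},))
```

Because every step consumes and returns a new stream instead of mutating it, intermediate results stay reproducible and steps can be freely recombined.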
This project was tested on Ubuntu and heavily relies on CUDA. Please make sure to have CUDA installed before the next steps.
To install this project, first set up the Conda environment defined in environment.yml:
```bash
conda env create -y -f environment.yml
conda activate sdpipeline
```
For the next step, PDM has to be installed. Please refer to the official page for installation instructions or use:

```bash
conda install -c conda-forge pdm
```
After that, the project can be installed with the following command:

```bash
python -m pdm install
```
This should install all modules inside this repository in editable mode. If you want to install them in production mode, please use:

```bash
python -m pdm install --prod
```
To install only selected modules, please modify the pyproject.toml file accordingly.
To make the different large modules work, configuration files often have to be placed in the data folder. Please refer to the individual modules for detailed instructions.
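Since the pipeline heavily relies on CUDA (see above), a quick way to verify that the GPU is visible from inside the freshly created environment is the snippet below. It assumes that PyTorch is installed through environment.yml, which a Stable Diffusion pipeline requires; adapt it if your setup differs.

```python
# Minimal CUDA sanity check (assumes PyTorch was installed via environment.yml).
import torch

if torch.cuda.is_available():
    print(f"CUDA available, using: {torch.cuda.get_device_name(0)}")
else:
    print("CUDA not available - check your driver and CUDA installation.")
```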
To reproduce the results from the bachelor thesis yourself, you need all modules installed. Use the scripts
- 9_aldm_large.py
- 10_aldm+I2I_seg_large.py
- 11_aldm+I2I_seg_dep_large.py
- 12_aldm+I2I_seg_dep_large_first_person.py
for generation,
for dataset generation and
for the object detection via YOLOX.
All resulting YOLOX checkpoints can also be found via this link.
for image generation and
for testing. The results can also be found in the scripts (for the RMSE, in the plot.py file).
A deployed version of this code can be found on the server of the Institute for Human-Machine Communication. There you can also find all scripts, datasets and other results like the YOLOX weights. The Conda environment has the name "application". For more information feel free to contact me :)
The script folder contains many different scripts used for converting datasets into different formats or for the video generation part. In the future, these could also be integrated into the pipeline as modules.
The Sync and Share link contains all generated datasets.
A huge thanks to the Institute for Human-Machine Communication at TUM, Univ.-Prof. Dr.-Ing. habil. G. Rigoll and my advisors Philipp Wolters M.Sc. and Fabian Herzog Ph.D. for giving me a place to write my thesis and for the great support.
Also a huge thanks to Sonja Nagy and Fabian Lehr for proofreading :)