XaiR

XaiR: An XR Platform that Integrates Large Language Models with the Physical World

Setup Guide

Clone the repository onto your device:

git clone https://github.com/srutisrinidhi/XaiR.git

Clone apple's ferret multimodal LLM to a separate folder. Instructions can be found at Apple ML-Ferret

You can use the virtual env already created by Ferret, or create your own if you do not plan to use ferret

To create your own:

cd XaiR
conda create -n XaiR python=3.10 -y
conda activate XaiR

Download essential libraries

conda install --yes --file requirements.txt
pip install --upgrade pip  # enable PEP 660 support

Generate certs:

mkdir ssl
openssl req -x509 -newkey rsa:2048 -keyout ssl/key.pem -out ssl/cert.pem -days 365
openssl rsa -in ssl/key.pem -out ssl/newkey.pem && mv ssl/newkey.pem ssl/key.pem

Run Ferret

Enter the ferret directory and run

conda activate ferret

In a Terminal window, run

python -m ferret.serve.controller --host 0.0.0.0 --port 10000

In another Terminal window, run

CUDA_VISIBLE_DEVICES=0 python -m ferret.serve.model_worker --host 0.0.0.0 --controller http://localhost:10000 --port 40000 --worker http://localhost:40000 --model-path <PATH TO FERRET MODEL> --add_region_feature

Edit the ferret model name in model_interface/get_ferret_response.py

Run if you want to ask questions to the assistant:

python3 -m cognitive-assistant-server.scripts.server_with_llm --host <YOUR IP ADDRESS> --cert-file ssl/cert.pem --key-file ssl/key.pem

Run if you want to record what the user is doing and get a log of actions and aotomatically generated instructions:

python3 -m cognitive-assistant-server.scripts.server_with_llm_for_instructing --host <YOUR IP ADDRESS> --cert-file ssl/cert.pem --key-file ssl/key.pem

Run if you want to also follow a tutorial to do a task as well as ask questions:

Edit model_interface/tutorial_follower.py to have the correct path to the instructions to follow

Then, run

python3 -m cognitive-assistant-server.scripts.server_with_llm_for_instructing --host <YOUR IP ADDRESS> --cert-file ssl/cert.pem --key-file ssl/key.pem

Run if you want to also follow a tutorial to do a task as well as ask questions to a human:

Edit model_interface/tutorial_follower_human.py to have the correct path to the instructions to follow

Then, run

python3 -m cognitive-assistant-server.scripts.server_with_llm_for_instructing_human --host <YOUR IP ADDRESS> --cert-file ssl/cert.pem --key-file ssl/key.pem

On the ML2, set the Server Address of the WebRTCConnection component to https://<YOUR IP ADDRESS>:8000

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
cognitive-assistant-server		cognitive-assistant-server
docs		docs
results/Ego-centric Videos		results/Ego-centric Videos
static/images		static/images
xr-cognitive-assistant-unity		xr-cognitive-assistant-unity
.gitattributes		.gitattributes
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XaiR

Setup Guide

Run Ferret

Run if you want to ask questions to the assistant:

Run if you want to record what the user is doing and get a log of actions and aotomatically generated instructions:

Run if you want to also follow a tutorial to do a task as well as ask questions:

Run if you want to also follow a tutorial to do a task as well as ask questions to a human:

The unity application is written for the Magic Leap 2 and additional modifications will need to be made to run this on a different XR device.

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

XaiR

Setup Guide

Run Ferret

Run if you want to ask questions to the assistant:

Run if you want to record what the user is doing and get a log of actions and aotomatically generated instructions:

Run if you want to also follow a tutorial to do a task as well as ask questions:

Run if you want to also follow a tutorial to do a task as well as ask questions to a human:

The unity application is written for the Magic Leap 2 and additional modifications will need to be made to run this on a different XR device.

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages