interSeg3D-Studio

A web-based interactive 3D point cloud annotation tool that uses the AGILE3D click-based segmentation algorithm and Gemini for 3D object recognition and description.

Note

This project is still in development; the current version is a prototype and may contain bugs.


Features

  • Interactive 3D Annotation: Easily mark and annotate objects in 3D point clouds using positive and negative clicks
  • AI-Powered Segmentation: Click-based segmentation with the AGILE3D model (demonstrated in seg-demo.gif)
  • Object Analysis: Automatically identify and describe objects in your scene
  • Metadata Loading: Import previous annotations and object information from metadata files

Installation

Prerequisites

  • Node.js (v16 or newer)
  • Python 3.10 (recommended, but should work with 3.8+)
  • CUDA-compatible GPU (recommended)

Backend Setup

  1. Clone the repository:

    git clone https://github.com/zh-plus/interSeg3D-Studio
    cd interSeg3D-Studio
  2. Install AGILE3D according to the instructions at: https://github.com/ywyue/AGILE3D/blob/main/installation.md (don't clone that repository; just set up the environment, and use gcc-9 and g++-9 to build MinkowskiEngine).

  3. Download the pre-trained model weights from https://polybox.ethz.ch/index.php/s/RnB1o8X7g1jL0lM and place them in the src/backend/agile3d/weights directory.

  4. Create a .env file in the src/backend directory with:

    GOOGLE_API_KEY='your_google_api_key'  # For object recognition
    
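For reference, here is a minimal sketch of how the key might be read at startup, assuming the backend loads its .env with python-dotenv (the actual loading logic lives in app.py):

    import os

    from dotenv import load_dotenv

    # Load variables from src/backend/.env into the process environment.
    load_dotenv()

    api_key = os.getenv("GOOGLE_API_KEY")
    if not api_key:
        raise RuntimeError("GOOGLE_API_KEY is not set; object recognition will not work.")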

Frontend Setup

  1. Install Node.js dependencies:

    cd src/frontend
    npm install
  2. Create a .env file in the src/frontend directory with:

    VITE_API_BASE_URL=http://localhost:9500
    VITE_USE_PROXY=true
    

Running the Application

  1. Start the backend server:

    python src/backend/app.py
  2. In a new terminal, start the frontend development server:

    cd src/frontend
    npm run dev
  3. Open your browser and navigate to:

    http://localhost:3001
    
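To confirm the backend is up before using the UI, note that FastAPI serves auto-generated interactive API docs at /docs by default, so a quick check with the requests library looks like:

    import requests

    # FastAPI exposes interactive docs at /docs by default.
    resp = requests.get("http://localhost:9500/docs", timeout=5)
    print(resp.status_code)  # 200 means the backend is reachable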

Usage Guide

Interaction Modes

The application has three main interaction modes:

  1. Navigate Mode: Rotate, pan, and zoom the view
  2. Annotate Mode: Mark points on objects or background
  3. Select Mode: Select and edit segmented objects

Basic Workflow

  1. Upload a PLY file using the file upload panel (a quick way to sanity-check a PLY locally is sketched after this list)
  2. Load metadata to import previous annotations and object information (optional)
  3. Create objects by entering names and clicking "Create Object"
  4. Select an object from the list and switch to "Annotation Mode"
  5. Mark points on the object by clicking on the point cloud
  6. Mark background by switching to "Background" mode and clicking non-object areas
  7. Run segmentation by clicking the "RUN SEGMENTATION" button
  8. (Optional) Analyze objects to automatically identify and describe them (remember to click APPLY LABEL or APPLY ALL RESULTS)
  9. Switch to Select Mode to click on segmented objects and edit their labels and descriptions
  10. Save results to download the segmented point cloud and metadata
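As mentioned in step 1, if an upload fails it can help to verify the PLY file locally first. A minimal sketch using Open3D (the filename is a placeholder):

    import open3d as o3d

    # Load the PLY and confirm it has points and per-point colors.
    pcd = o3d.io.read_point_cloud("scene.ply")
    print(f"{len(pcd.points)} points, has colors: {pcd.has_colors()}")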

Keyboard Shortcuts

  • A: Toggle between navigation and annotation modes
  • S: Activate select mode
  • Enter: Run segmentation
  • N: Create a new object with default name "new obj"
  • Ctrl+Z: Undo the last click
  • Shift+Ctrl+Z: Redo an undone click
  • Ctrl+S: Save object information

Mouse Controls

  • Left-click + drag (Navigation Mode): Rotate the view
  • Right-click + drag: Pan the view
  • Scroll wheel: Zoom in/out
  • Left-click (Annotation Mode): Mark a point
  • Left-click (Select Mode): Select an object to edit its label and description

How it works

After the scene has been segmented interactively with clicks, each object is rendered from multiple viewpoints and analyzed by the AI object recognition model. The model calls the Gemini LLM API to identify each object and generate a description covering the object's name, color, and size.

In the rendered views, objects are outlined with convex hulls and the virtual cameras are positioned at the green spheres.

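A minimal sketch of this idea with Open3D and NumPy (not the project's actual view_rendering.py; the ring layout and camera count are illustrative):

    import numpy as np
    import open3d as o3d

    # Points belonging to one segmented object (placeholder data).
    points = np.random.rand(500, 3)
    obj = o3d.geometry.PointCloud()
    obj.points = o3d.utility.Vector3dVector(points)

    # The convex hull marks the object's extent.
    hull, _ = obj.compute_convex_hull()
    center = hull.get_center()
    radius = np.max(np.linalg.norm(np.asarray(hull.vertices) - center, axis=1))

    # Place virtual cameras on a ring around the hull (the "green spheres"),
    # all at a fixed distance from the hull's center.
    angles = np.linspace(0.0, 2.0 * np.pi, num=8, endpoint=False)
    camera_positions = [
        center + 2.5 * radius * np.array([np.cos(a), np.sin(a), 0.4])
        for a in angles
    ]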

Development

Project Structure

├── package.json
├── src
│   ├── backend
│   │   ├── agile3d               # Interactive Segmentation Model
│   │   ├── app.py                # FastAPI server
│   │   ├── inference.py          # Segmentation model
│   │   ├── view_rendering.py     # View generation
│   │   └── visual_obj_recognition.py  # AI object recognition
│   └── frontend
│       ├── App.vue               # Main application component
│       ├── components            # Vue components
│       ├── composables           # Vue composition functions
│       ├── services              # Service layer
│       ├── types                 # TypeScript type definitions
│       └── utils                 # Utility functions
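app.py is the server entry point. A hypothetical sketch of the shape such a FastAPI server takes (the route and payload below are illustrative, not the project's actual API):

    import uvicorn
    from fastapi import FastAPI, UploadFile

    app = FastAPI()

    # Hypothetical upload route; see app.py for the real endpoints.
    @app.post("/upload")
    async def upload(file: UploadFile):
        data = await file.read()
        return {"filename": file.filename, "bytes": len(data)}

    if __name__ == "__main__":
        uvicorn.run(app, host="0.0.0.0", port=9500)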

Contributing

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • AGILE3D for the segmentation models
  • Three.js for 3D rendering capabilities
  • Open3D for point cloud processing
