
Conversation

@SamratThapa120 commented Sep 9, 2025

This pull request introduces several improvements to the BEVFusion deployment pipeline, particularly for ONNX export. The changes focus on refactoring the model's forward and feature extraction logic to better handle image features, updating deployment configs for more flexible input shapes, and fixing geometry computation in the depth module. These updates enhance the modularity and exportability of the BEVFusion model, facilitating more efficient deployment and inference.

Model code refactoring and ONNX export support:

  • The bevfusion-cl model is split into two parts for deployment: image_backbone and main_body. The image_backbone takes the image input and outputs the image features, while the main_body performs the rest of the prediction.

  • Removed the old deployment scripts.

  • Refactored the BEVFusion model methods (_forward, predict, loss, and extract_feat) to consistently accept a new using_image_features flag, giving explicit control over whether precomputed image features are used during inference or export. Added a new get_image_backbone_features helper to modularize image feature extraction; a sketch of how these pieces fit together follows this list. [1] [2] [3] [4] [5] [6] [7] [8] [9]

  • Updated the logic in extract_feat to handle ONNX inference mode, including geometry feature handling and disabling point features when necessary. [1] [2]
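
As a rough illustration of how the image_backbone / main_body split and the using_image_features flag are meant to interact, here is a minimal sketch; the class, module, and dictionary key names below are simplified assumptions for illustration, not the actual code in this PR:

```python
import torch
from torch import nn


class BEVFusionSketch(nn.Module):
    """Toy stand-in for the real model, illustrating the image_backbone / main_body split."""

    def __init__(self):
        super().__init__()
        self.img_backbone = nn.Conv2d(3, 80, kernel_size=3, padding=1)  # stand-in image backbone
        self.main_head = nn.Conv2d(80, 10, kernel_size=1)               # stand-in for everything after it

    def get_image_backbone_features(self, imgs: torch.Tensor) -> torch.Tensor:
        # Exported as the "image_backbone" module: camera images in, image features out.
        return self.img_backbone(imgs)

    def extract_feat(self, batch_inputs: dict, using_image_features: bool = False) -> torch.Tensor:
        if using_image_features:
            # "main_body" path: precomputed image features arrive as an input,
            # e.g. produced by the already-exported image_backbone ONNX model.
            img_feats = batch_inputs["image_feats"]
        else:
            # End-to-end path used for training and ordinary inference.
            img_feats = self.get_image_backbone_features(batch_inputs["imgs"])
        return self.main_head(img_feats)


model = BEVFusionSketch().eval()
imgs = torch.rand(1, 3, 32, 32)
feats = model.get_image_backbone_features(imgs)                                # image_backbone export
preds = model.extract_feat({"image_feats": feats}, using_image_features=True)  # main_body consumes it
```

Splitting along this boundary lets the image backbone and the rest of the network be exported and profiled as separate ONNX / TensorRT engines.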

Deployment configuration updates:

  • Updated bevfusion_camera_backbone_tensorrt_dynamic.py and bevfusion_main_body_with_image_tensorrt_dynamic.py to use more flexible, parameterized input shapes (e.g., image_dims, depth_bins, feature_dims). Expanded the set of dynamic axes and input names to support a wider range of deployment scenarios, and added the missing inputs required for the main body with image features; a rough config sketch follows this list. [1] [2] [3] [4] [5] [6]

  • Adjusted bevfusion_main_body_lidar_only_tensorrt_dynamic.py to update the expected voxel and coordinate shapes for lidar-only inference.
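
For reference, a hedged sketch of what a parameterized input-shape block of this kind can look like. The voxel-related shapes mirror the diff hunks quoted later in this conversation, while feature_dims and bev_grid (and, analogously, image_dims and depth_bins in the camera-backbone config) are placeholder names and numbers, not the exact values in the updated files:

```python
# Placeholder parameterization; the concrete numbers are assumptions rather than
# values guaranteed to match the actual deployment configs.
feature_dims = 80       # channels of the precomputed image features
bev_grid = (180, 180)   # BEV feature map height, width

model_inputs = [
    dict(
        input_shapes=dict(
            voxels=dict(min_shape=[1, 10, 4], opt_shape=[64000, 10, 4], max_shape=[256000, 10, 4]),
            coors=dict(min_shape=[1, 3], opt_shape=[64000, 3], max_shape=[256000, 3]),
            num_points_per_voxel=dict(min_shape=[1], opt_shape=[64000], max_shape=[256000]),
            image_feats=dict(
                min_shape=[feature_dims, *bev_grid],
                opt_shape=[feature_dims, *bev_grid],
                max_shape=[feature_dims, *bev_grid],
            ),
        )
    )
]

# Every dynamic input also needs a matching entry in input_names and dynamic_axes
# so the exported ONNX model and the TensorRT profile can vary those dimensions.
input_names = ["voxels", "coors", "num_points_per_voxel", "image_feats"]
dynamic_axes = {
    "voxels": {0: "num_voxels"},
    "coors": {0: "num_voxels"},
    "num_points_per_voxel": {0: "num_voxels"},
}
```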

Depth module geometry computation fix:

  • Fixed the placement of geometry computation logic in depth_lss.py, ensuring that transformations and matrix inverses are only computed when not using precomputed geometry features. [1] [2]
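
The gist of the fix, as a minimal sketch (the function and argument names are illustrative, not copied from depth_lss.py): the camera-to-LiDAR transform and the matrix inverses are evaluated only when precomputed geometry features are not provided, so the export path that feeds geometry in as an input skips them entirely.

```python
import torch


def get_geometry_sketch(frustum, post_rots, post_trans, camera2lidar, intrinsics, geom_feats=None):
    """Illustrative only: lift frustum points into the LiDAR frame unless geometry is precomputed."""
    if geom_feats is not None:
        # Precomputed geometry (e.g. supplied as an ONNX input): no transforms
        # or matrix inverses end up inside the exported graph.
        return geom_feats

    # Undo image-space augmentation, then transform camera frustum points into the LiDAR frame.
    points = frustum - post_trans.view(-1, 1, 1, 1, 3)
    points = torch.inverse(post_rots).view(-1, 1, 1, 1, 3, 3).matmul(points.unsqueeze(-1))
    combine = camera2lidar.matmul(torch.inverse(intrinsics))
    points = combine.view(-1, 1, 1, 1, 3, 3).matmul(points).squeeze(-1)
    return points
```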

Documentation updates:

  • Updated README.md deployment command examples to include the --module argument, clarifying how to export specific model components. [1] [2]

Note to reviewers:

Please try running the deployment script using the models included in this PR #88.

@SamratThapa120 changed the title from "feat(fix): fix bevfusion ci deployment script" to "feat(fix): fix bevfusion deployment script" Sep 9, 2025
@SamratThapa120 marked this pull request as ready for review September 9, 2025 01:19

feats = feats.sum(dim=1, keepdim=False) / sizes.type_as(feats).view(-1, 1)
# feats = batch_inputs_dict["voxels"]["voxels"]

Remove

@@ -15,8 +15,8 @@
model_inputs=[
dict(
input_shapes=dict(
- voxels=dict(min_shape=[1, 5], opt_shape=[64000, 5], max_shape=[256000, 5]),
- coors=dict(min_shape=[1, 4], opt_shape=[64000, 4], max_shape=[256000, 4]),
+ voxels=dict(min_shape=[1, 10, 4], opt_shape=[64000, 10, 4], max_shape=[256000, 10, 4]),

The shape should be [M, maximum number of points, features], so it would be [M, 10, 5] if we are using intensity, right?

export_params=True,
input_names=input_names,
output_names=output_names,
opset_version=opset_version,
dynamic_axes=dynamic_axes,
keep_initializers_as_inputs=keep_initializers_as_inputs,
verbose=verbose,
do_constant_folding=False,

Why disable it, though? I believe constant folding can speed up inference.

@@ -18,7 +21,35 @@
voxels=dict(min_shape=[1, 10, 4], opt_shape=[64000, 10, 4], max_shape=[256000, 10, 4]),
coors=dict(min_shape=[1, 3], opt_shape=[64000, 3], max_shape=[256000, 3]),
num_points_per_voxel=dict(min_shape=[1], opt_shape=[64000], max_shape=[256000]),
image_feats=dict(min_shape=[80, 180, 180], opt_shape=[80, 180, 180], max_shape=[80, 180, 180]),
# TODO(TIERIV): Optimize. Now, using points will increase latency significantly

It's better to put TODO(YourName)

Is it because we want to compute the depth map, so we need to input the points? This will actually run the internal voxelization again here:
https://github.com/tier4/AWML/blob/main/projects/BEVFusion/bevfusion/bevfusion.py#L172

@KSeangTan left a comment

LGTM overall, thanks for the great work. I believe we need to tidy up the code a bit more in another PR.
