Enable variable-length queries #25
base: main
Conversation
LGTM, you can solve the ruff issue and then we will merge :) I need to fix the CI so it can run the tests when you open a PR.
Pull Request Overview
This PR refactors the search functionality to accept queries as a vector of individual tensors instead of a single batched 3D tensor. This allows for more flexible handling of variable-length query sequences without requiring padding at the batch level.
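The contrast between the two input formats can be sketched with plain Python lists standing in for tensors (the shapes and values below are illustrative only, not the project's actual data or API):

```python
# Two variable-length queries, represented as nested lists standing in
# for 2D tensors of shape [tokens_per_query, dim] with dim = 2.
queries = [
    [[0.1, 0.2]],                           # query with 1 token
    [[0.3, 0.4], [0.5, 0.6], [0.7, 0.8]],   # query with 3 tokens
]

# Old approach: a single batched 3D tensor forces padding every query
# up to the longest token count before stacking.
max_len = max(len(q) for q in queries)
padded = [q + [[0.0, 0.0]] * (max_len - len(q)) for q in queries]
assert all(len(q) == max_len for q in padded)

# New approach: keep each query as its own entry in a list; no padding,
# so every query keeps its true token count.
lengths = [len(q) for q in queries]
print(lengths)  # [1, 3]
```

With a list of individual tensors, downstream code can distribute queries by slicing the list, which is what the PR does instead of chunking a padded batch tensor.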
Key changes:
- Changed query input format from a single 3D tensor to a vector of individual tensors
- Removed tensor batching/padding logic in favor of list slicing for query distribution
- Added a `cleanup_embeddings` helper function to normalize query embeddings
Reviewed Changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| rust/search/search.rs | Updated search_many to accept Vec<Tensor> instead of a single tensor; removed tensor shape validation; removed unused imports bail and IndexOp |
| rust/lib.rs | Updated load_and_search to accept Vec<PyTensor> and convert each tensor individually to Kind::Half |
| python/fast_plaid/search/fast_plaid.py | Added cleanup_embeddings helper; replaced tensor operations (chunk, split, pad_sequence) with list slicing; updated type hints to list[torch.Tensor] |
Comments suppressed due to low confidence (1)
rust/search/search.rs:157
- The documentation is outdated. The parameter is now `&Vec<Tensor>`, where each tensor represents an individual query, not a single 3D tensor.

```rust
/// * `queries` - A 3D tensor of query embeddings with shape `[num_queries, tokens_per_query, dim]`.
```
```python
    dim=0,
)
queries_embeddings_splits = [
    queries_embeddings[i:i + num_cpus] for i in range(0, num_queries, num_cpus)
```
Copilot AI · Oct 30, 2025
The list comprehension creates chunks of size num_cpus starting at increments of num_cpus, which is incorrect. The step should match the chunk size to avoid overlap. Use `queries_embeddings[i*num_cpus:(i+1)*num_cpus] for i in range((num_queries + num_cpus - 1) // num_cpus)` or similar logic to properly partition the list.
Suggested change:

```diff
- queries_embeddings[i:i + num_cpus] for i in range(0, num_queries, num_cpus)
+ queries_embeddings[i*num_cpus:(i+1)*num_cpus] for i in range((num_queries + num_cpus - 1) // num_cpus)
```
The list comprehension seems correct to me. Don't know why Copilot thinks it's wrong
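The maintainer is right: with the slice step equal to the chunk size, this pattern partitions a list with no overlap and no loss. A quick check with stand-in data (plain integers in place of query tensors, `num_cpus` as in the PR):

```python
num_cpus = 3
queries_embeddings = list(range(10))  # stand-in for 10 query tensors
num_queries = len(queries_embeddings)

# The pattern under discussion: chunk size and range step are both num_cpus,
# so consecutive slices start exactly where the previous one ended.
splits = [
    queries_embeddings[i:i + num_cpus]
    for i in range(0, num_queries, num_cpus)
]
print(splits)  # [[0, 1, 2], [3, 4, 5], [6, 7, 8], [9]]

# Flattening the chunks recovers the original list exactly:
# every element appears once, so the chunks are disjoint and complete.
flattened = [x for chunk in splits for x in chunk]
assert flattened == queries_embeddings
```

The same reasoning applies to the second flagged comprehension, which uses `len(self.devices)` for both the chunk size and the step.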
```python
queries_embeddings_splits = [
    queries_embeddings[i:i + len(self.devices)] for i in range(0, num_queries, len(self.devices))
]
```
Copilot AI · Oct 30, 2025
The list comprehension creates chunks of size len(self.devices) starting at increments of len(self.devices), which is incorrect. The step should match the chunk size to avoid overlap. Use proper chunking logic to partition the list without overlap.
Same here. I'm unsure what Copilot thinks is wrong
This PR allows variable-length queries for searching.