Skip to content

Data model: Create an individual embedding table per collection #12

@ckurze

Description

@ckurze

Issue you'd like to raise.

Currently, all embeddings are stored in one single "embeddings" table. This has various downsides:

  • All embeddings need the same vector length
  • Indexes get very large
  • Hard/Complex to manage privileges on database side
  • Unnecessary join on potentially large amounts of data.

Suggestion:

CrateDB should not create a table "collections" and "emgeddings", but crate a new table that gets the name of the provided collection name in langchain which holds all embeddings.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions