Some ideas about message field optimization

For service logs, it is quite common to have a message field that has full, long txt. Clickhouse does have tokenbf_v1 to skip scanning but quesma does not have a good way to support it. 

This is how i would think quesma could support it (However, a clean interface needs to be worked out): 

--------
- Users need to have another field that mirrors message field but this field needs to be in all lower case. Let's call it _lower_case_messge
- tokenbf_v1 needs to be built against this _lower_case_messge.
- At transformation time. for filter such as message ILIKE 'token'. rewrite it to _lower_case_messge LIKE '%searchword%' AND hasToken(_lower_case_messge, searchword)
--------

Does this make sense? Is this something quesma would be interested to have. I have a hacky implementation and i want to get signals before moving to a PR/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Some ideas about message field optimization #1449

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Some ideas about message field optimization #1449

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions