Use sort ordering on timestamp array #443

trueleo · 2023-06-24T14:54:38Z

Fixes #430.

Description

Write timestamp sortedness metadata to parquet and provide external sort information to datafusion. This way the SortExec can be avoided in execution plan with most queries which use order by p_timestamp.

Example

explain select p_timestamp from {{stream_name}} order by p_timestamp asc

In physical plan it is visible that SortExec is eliminated as output_ordering is pushed to ParquetExec node

"plan": "SortPreservingMergeExec: [p_timestamp@0 ASC NULLS LAST]
  ParquetExec: file_groups={4 groups: [.....]}, projection=[p_timestamp], output_ordering=[p_timestamp@0 ASC NULLS LAST]",

Note:

This is still not the most optimized version of this query as SortPreservingExec is not really needed here. The issue here is that the Datafusion is not aware that the partitions / files are non overlapping when considering timestamp

Also if the target partition limit is crossed then datafusion again adds SortExec to physical plan.

This PR has:

been tested to ensure log ingestion and log query works.
added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
added documentation for new or modified features or behaviors.

nitisht · 2023-06-25T04:44:05Z

server/src/storage/localfs.rs

            infinite_source: false,
            format: Arc::new(file_format),
            table_partition_cols: vec![],
            collect_stat: true,
-            target_partitions: 1,
+            target_partitions: 32,


What the significance of changing this field target_partitions here?

Roughly the partition here is number of parallel streams that is generated by datafusion during execution. Having this 1 was causing all files to be grouped in one partition and datafusion is unable to use external sort information for files in a group as it cannot infer order between grouped files and if they are overlapping in time range or not.

So at max datafusion will hold 32 streams to calculate the output and merge them back using SortPreservingMerge

nitisht · 2023-06-25T04:44:46Z

server/src/storage/s3.rs

            infinite_source: false,
            format: Arc::new(file_format),
            table_partition_cols: vec![],
            collect_stat: true,
-            target_partitions: 1,
+            target_partitions: 32,


This section is repeated for local and s3 mode. Can we move it to the common abstraction?

Yes, this needs refactoring

nitisht · 2023-06-25T05:22:10Z

Fixes #418.

Shouldn't this be #430? #418 is related to staging query

Use sort ordering on timestamp array

f7726a4

nitisht reviewed Jun 25, 2023

View reviewed changes

Refactor

122d135

nitisht mentioned this pull request Jun 25, 2023

Slow performace when using ORDER BY #430

Closed

nitisht merged commit 3e5548d into parseablehq:main Jun 25, 2023

github-actions bot locked and limited conversation to collaborators Jun 25, 2023

trueleo deleted the sort_event branch July 4, 2023 05:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Use sort ordering on timestamp array #443

Use sort ordering on timestamp array #443

Uh oh!

trueleo commented Jun 24, 2023 •

edited

Loading

Uh oh!

nitisht Jun 25, 2023 •

edited

Loading

Uh oh!

trueleo Jun 25, 2023

Uh oh!

trueleo Jun 25, 2023 •

edited

Loading

Uh oh!

nitisht Jun 25, 2023

Uh oh!

trueleo Jun 25, 2023

Uh oh!

nitisht commented Jun 25, 2023

Uh oh!

Uh oh!

Uh oh!

Use sort ordering on timestamp array #443

Use sort ordering on timestamp array #443

Uh oh!

Conversation

trueleo commented Jun 24, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Example

Note:

Uh oh!

nitisht Jun 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

trueleo Jun 25, 2023

Choose a reason for hiding this comment

Uh oh!

trueleo Jun 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nitisht Jun 25, 2023

Choose a reason for hiding this comment

Uh oh!

trueleo Jun 25, 2023

Choose a reason for hiding this comment

Uh oh!

nitisht commented Jun 25, 2023

Uh oh!

Uh oh!

trueleo commented Jun 24, 2023 •

edited

Loading

nitisht Jun 25, 2023 •

edited

Loading

trueleo Jun 25, 2023 •

edited

Loading