Open
Labels
EPIC (A larger project, actively underway, with sub tasks), performance (Make DataFusion faster), physical-plan (Changes to the physical-plan crate)
Description
I have a number of things I'd like to keep improving for sort-merge join (SMJ), particularly because DataFusion Comet still relies on it heavily for larger-than-memory joins.
- Specialized stream for semi/anti SMJ (PR "perf: specialized SemiAntiSortMergeJoinStream", #20806)
- Remove semi/anti logic from existing SMJ stream, and optimize simplified logic
- More use of Arrow-rs `DynComparator` where appropriate to reduce per-row type matching
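
To make the last item concrete, here is a minimal std-only sketch of the idea, not DataFusion or arrow-rs code. In arrow-rs, `DynComparator` is a boxed closure that takes two row indices and returns an `Ordering`. The `Column` enum, `make_comparator`, and `semi_anti` below are hypothetical stand-ins that only mimic that shape. The column types are matched once when the comparator is built, so the merge loop does no per-row type dispatch. The loop itself is a simplified semi/anti scan over sorted, non-null keys. It only needs to know whether a match exists, which is why a dedicated semi/anti stream can be simpler than a general join stream.

```rust
use std::cmp::Ordering;

// Hypothetical stand-in for a typed Arrow array; not an arrow-rs type.
enum Column {
    Int64(Vec<i64>),
    Utf8(Vec<String>),
}

// Same shape as a dynamic comparator: compares left[i] with right[j].
type Comparator<'a> = Box<dyn Fn(usize, usize) -> Ordering + 'a>;

// Match on the column types once, then return a closure specialized for them.
// Returns None when the two key columns have different types.
fn make_comparator<'a>(left: &'a Column, right: &'a Column) -> Option<Comparator<'a>> {
    match (left, right) {
        (Column::Int64(l), Column::Int64(r)) => {
            Some(Box::new(move |i: usize, j: usize| l[i].cmp(&r[j])))
        }
        (Column::Utf8(l), Column::Utf8(r)) => {
            Some(Box::new(move |i: usize, j: usize| l[i].cmp(&r[j])))
        }
        _ => None,
    }
}

// Returns the indices of left rows that have a match on the right (semi)
// or that have no match (anti). Both sides must be sorted ascending on the
// key; nulls and multi-column keys are out of scope for this sketch.
fn semi_anti(left_len: usize, right_len: usize, cmp: &Comparator<'_>, anti: bool) -> Vec<usize> {
    let (mut i, mut j) = (0, 0);
    let mut out = Vec::new();
    while i < left_len {
        // Skip right rows whose key is smaller than the current left key.
        while j < right_len && cmp(i, j) == Ordering::Greater {
            j += 1;
        }
        // The right cursor is not advanced on a match, so duplicate left
        // keys each see the same matching right row.
        let matched = j < right_len && cmp(i, j) == Ordering::Equal;
        if matched != anti {
            out.push(i);
        }
        i += 1;
    }
    out
}
```

With left keys `[1, 2, 2, 4, 7]` and right keys `[2, 3, 7, 7]`, the semi variant selects left indices 1, 2, and 4, and the anti variant selects 0 and 3. Neither variant materializes joined pairs, so the right cursor never has to rewind.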