feature - delta lake v2 protocol. tested #3972
Draft
Add Delta Lake Protocol V2 Features Support
Summary
This PR adds support to delta-rs for four Delta Lake Protocol V2 features: Deletion Vectors, Row Tracking, Liquid Clustering, and V2 Checkpoints.
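As a rough illustration of how support for these features is typically gated, the sketch below checks a table's declared features against the set a client implements. The feature name strings follow the Delta protocol specification; the `Protocol` struct and `unsupported_features` helper are hypothetical and simplified, and are not the types used in this PR.

```rust
use std::collections::HashSet;

// Feature names as spelled in the Delta protocol specification.
const DELETION_VECTORS: &str = "deletionVectors";
const V2_CHECKPOINT: &str = "v2Checkpoint";
const ROW_TRACKING: &str = "rowTracking";
const CLUSTERING: &str = "clustering";

/// Hypothetical, simplified view of a table's protocol action.
/// In the spec, `deletionVectors` and `v2Checkpoint` appear in both lists,
/// while `rowTracking` and `clustering` are writer-only features.
struct Protocol {
    reader_features: HashSet<String>,
    writer_features: HashSet<String>,
}

/// Returns the features the table requires that the client does not
/// implement, sorted so the result is deterministic.
fn unsupported_features(required: &HashSet<String>, supported: &[&str]) -> Vec<String> {
    let mut missing: Vec<String> = required
        .iter()
        .filter(|f| !supported.contains(&f.as_str()))
        .cloned()
        .collect();
    missing.sort();
    missing
}

/// A client may read the table only if no reader feature is missing.
fn can_read(protocol: &Protocol, supported: &[&str]) -> bool {
    unsupported_features(&protocol.reader_features, supported).is_empty()
}

/// A client may write only if no writer feature is missing.
fn can_write(protocol: &Protocol, supported: &[&str]) -> bool {
    unsupported_features(&protocol.writer_features, supported).is_empty()
}
```

Under these assumptions, a client that implements only `deletionVectors` would refuse to read a table that also lists `v2Checkpoint` as a reader feature, rather than silently returning incomplete results.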
Motivation
Delta Lake Protocol V2 introduces critical features for modern data lake operations:
These features are essential for production workloads, especially when integrating with Databricks Unity Catalog and modern Delta Lake environments.
Changes Made
Core Changes (1,931+ lines)
New Module: crates/core/src/kernel/deletion_vector.rs (466 lines)
New Module: crates/core/src/kernel/liquid_clustering.rs (276 lines)
New Module: crates/core/src/delta_datafusion/dv_filter.rs (398 lines)
Enhanced: crates/core/src/kernel/snapshot/iterators.rs
Enhanced: crates/core/src/kernel/transaction/protocol.rs
Enhanced: crates/core/src/delta_datafusion/table_provider.rs
Testing
crates/core/tests/deletion_vector_test.rs (223 lines)
Documentation
DELTA_V2_IMPLEMENTATION_SPEC.md (446 lines)
Dependencies
roaring crate for efficient bitmap operations (used in deletion vectors)
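To make the role of the bitmap concrete, here is a minimal sketch of how a deletion vector can mask rows when a data file is read. It uses only the standard library: `BTreeSet<u64>` stands in for the roaring bitmap, and `DeletionVector`, `keep_mask`, and `filter_rows` are hypothetical names for illustration, not the API in deletion_vector.rs or dv_filter.rs.

```rust
use std::collections::BTreeSet;

/// Hypothetical stand-in for a deserialized deletion vector.
/// A real implementation would likely wrap a roaring bitmap instead
/// of a BTreeSet, which is used here only to avoid extra dependencies.
struct DeletionVector {
    /// Zero-based positions of deleted rows within a single data file.
    deleted: BTreeSet<u64>,
}

impl DeletionVector {
    fn from_indexes(indexes: impl IntoIterator<Item = u64>) -> Self {
        Self {
            deleted: indexes.into_iter().collect(),
        }
    }

    fn is_deleted(&self, row_index: u64) -> bool {
        self.deleted.contains(&row_index)
    }

    /// Builds a boolean mask for a file with `num_rows` rows:
    /// `true` means the row is still live and should be returned.
    fn keep_mask(&self, num_rows: u64) -> Vec<bool> {
        (0..num_rows).map(|i| !self.is_deleted(i)).collect()
    }
}

/// Returns only the rows that the deletion vector does not mark as deleted.
fn filter_rows<T: Clone>(rows: &[T], dv: &DeletionVector) -> Vec<T> {
    rows.iter()
        .enumerate()
        .filter(|(i, _)| !dv.is_deleted(*i as u64))
        .map(|(_, row)| row.clone())
        .collect()
}
```

With rows 1 and 3 marked as deleted in a five-row file, `keep_mask(5)` yields `[true, false, true, false, true]`. A compressed bitmap serves the same purpose while staying compact for large files with many deleted rows.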
✅ All existing tests pass
✅ New deletion vector test suite covers:
✅ Verified with production data from Databricks Unity Catalog tables
Compatibility
Integration Status
Successfully tested with:
Next Steps
Future enhancements could include:
Related Issues
Testing Environment:
Files Changed: