Kafka Connect: Add source timestamp to Debezium transform CDC metadata #16772
Open
rick020 wants to merge 1 commit into
The DebeziumTransform SMT only kept the top-level ts_ms (the time the connector processed the event) as _cdc.ts. The source database commit timestamp (source.ts_ms) was dropped, even though it is often the more useful value for tracking creation/update times. Add it as a new optional _cdc.source_ts field so both timestamps are kept.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
The DebeziumTransform SMT only kept the top-level ts_ms (the time the connector processed the event) as _cdc.ts, dropping the source database commit timestamp (source.ts_ms), which is often the more useful value for tracking creation/update times. This adds it as a new optional _cdc.source_ts field so both timestamps are kept, mirroring how _cdc.ts is handled in both the schema and schemaless paths. Tests now assert both timestamps against fixed values, and the Kafka Connect docs are updated to describe the new field.

Resolves the request in databricks/iceberg-kafka-connect#189 (filed against the predecessor repo before the connector was donated to Apache Iceberg).
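For reviewers who want a concrete picture of the schemaless path, the mapping described above might look roughly like the sketch below. It is not the code in this PR: the class name CdcMetadataSketch and the method cdcMetadata are hypothetical, it uses plain java.util maps instead of Kafka Connect types, and it assumes the timestamps are carried through as raw epoch-millisecond values.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; not the DebeziumTransform implementation.
public class CdcMetadataSketch {

    // Builds a _cdc-style metadata map from a schemaless Debezium envelope.
    // "ts" mirrors the top-level ts_ms (connector processing time).
    // "source_ts" mirrors source.ts_ms (source database commit time) and is
    // treated as optional: it is omitted when the envelope does not carry it.
    static Map<String, Object> cdcMetadata(Map<String, Object> envelope) {
        Map<String, Object> cdc = new HashMap<>();
        cdc.put("ts", envelope.get("ts_ms"));

        Object source = envelope.get("source");
        if (source instanceof Map) {
            Object sourceTs = ((Map<?, ?>) source).get("ts_ms");
            if (sourceTs != null) {
                cdc.put("source_ts", sourceTs);
            }
        }
        return cdc;
    }

    public static void main(String[] args) {
        // Made-up timestamps, chosen only so the two values differ.
        Map<String, Object> source = new HashMap<>();
        source.put("ts_ms", 1700000000000L);

        Map<String, Object> envelope = new HashMap<>();
        envelope.put("ts_ms", 1700000000123L);
        envelope.put("source", source);

        System.out.println(cdcMetadata(envelope));
    }
}
```

Leaving source_ts out when source.ts_ms is absent is one way to read "optional" here; the schema path would presumably declare the field as optional in the _cdc struct instead.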