Skip to content

Conversation

@czy006
Copy link
Contributor

@czy006 czy006 commented Jan 29, 2026

This PR introduces a framework for executing Paimon table maintenance operations (snapshot expiration, orphan file cleanup) remotely on Spark optimizers.

Changes:

  • MaintainerInput/Output interfaces and base implementations in amoro-common
  • MaintainerExecutor/Factory interfaces for remote execution pattern
  • New amoro-optimizer-paimon-spark module with SparkMaintainerExecutor
  • PaimonSnapshotExpire* components for snapshot expiration operations
  • Placeholder SparkOptimizer for future Paimon optimizing support

Key Design:

  • Follows existing Optimizer pattern for consistency
  • Multi-format compatible (Iceberg, Paimon)
  • Remote-only execution (no local AMS maintainer integration)

This commit introduces a framework for executing Paimon table maintenance
operations (snapshot expiration, orphan file cleanup) remotely on Spark
optimizers, following the existing Optimizer pattern.

Changes:
- Add MaintainerInput/Output interfaces and base implementations
- Add MaintainerExecutor/Factory interfaces for remote execution
- Create amoro-optimizer-paimon-spark module with SparkMaintainerExecutor
- Implement PaimonSnapshotExpire* components for snapshot expiration
- Add placeholder SparkOptimizer for future Paimon optimizing support

Co-Authored-By: Claude (glm-4.7) <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant