[Flink] Add sourceParallelismUpperBound metric for auto-scaling systems by wzhero1 · Pull Request #7117 · apache/paimon

wzhero1 · 2026-01-26T07:06:08Z

Purpose

This PR adds a new metric, sourceParallelismUpperBound, to the Flink Source Enumerator. The metric reports a recommended upper bound on source parallelism that auto-scaling systems can use to optimize resource allocation.

Motivation

Auto-scaling systems need to understand the optimal parallelism for Paimon sources to:

Avoid over-provisioning resources for fixed-bucket tables (where parallelism shouldn't exceed bucket count)
Make informed scaling decisions for dynamic-bucket tables

The metric value:

For fixed bucket tables: equals the bucket number
For dynamic (bucket = -1) or postpone bucket tables: equals the max parallelism

Note: This is a recommendation, not a hard limit - users can still configure higher parallelism manually if needed.
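The rule above can be sketched in a few lines of Java. This is a minimal illustration, not the code from this PR: the class name, both method names, and the clamping step an auto-scaler might apply are all hypothetical, and treating any non-positive bucket value as "no fixed bucket count" is an assumption of the sketch.

```java
/** Hypothetical helper for illustration only; not the implementation in this PR. */
final class ParallelismUpperBoundSketch {

    private ParallelismUpperBoundSketch() {}

    /**
     * Returns the recommended upper bound described above.
     *
     * @param bucket configured bucket number; a non-positive value (such as -1) is
     *     treated as "no fixed bucket count" (assumption of this sketch)
     * @param maxParallelism fallback bound used when there is no fixed bucket count
     */
    static int sourceParallelismUpperBound(int bucket, int maxParallelism) {
        return bucket > 0 ? bucket : maxParallelism;
    }

    /**
     * One way an auto-scaler could consume the metric: cap the desired parallelism
     * at the recommended bound while never going below 1.
     */
    static int clampDesiredParallelism(int desired, int upperBound) {
        return Math.max(1, Math.min(desired, upperBound));
    }

    public static void main(String[] args) {
        // Fixed-bucket table with 8 buckets: the bound is the bucket number.
        System.out.println(sourceParallelismUpperBound(8, 128));
        // Dynamic-bucket table (bucket = -1): the bound falls back to max parallelism.
        System.out.println(sourceParallelismUpperBound(-1, 128));
        // An auto-scaler that wanted 32 subtasks would be advised to use 8.
        System.out.println(clampDesiredParallelism(32, 8));
    }
}
```

Because the metric is advisory, a scaler that applies a clamp like this should still let an explicit user-configured parallelism take precedence.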

Tests

Added unit tests in FileStoreSourceMetricsTest.java:

continuousFileStoreFixBucketEnumeratorMetricsTest() - Verifies metric equals bucket number for fixed bucket tables
continuousFileStoreDynBucketEnumeratorMetricsTest() - Verifies metric equals current parallelism for dynamic bucket tables

Also added TestingMetricUtils.getGauge() helper method for testing Gauge metrics.
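To show the general shape of a gauge assertion, here is a self-contained sketch. It does not reproduce TestingMetricUtils.getGauge(), whose signature is not shown in this description. The Gauge interface below is a local stand-in with the same single getValue() method as Flink's Gauge, and the in-memory registry, its register and getGauge methods, and the bucket count of 4 are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for a metric lookup helper; not the helper added in this PR. */
final class GaugeLookupSketch {

    /** Local stand-in mirroring the shape of Flink's Gauge: one getValue() method. */
    interface Gauge<T> {
        T getValue();
    }

    // In-memory registry used only by this sketch; real tests would read the metric group.
    private final Map<String, Gauge<?>> gauges = new HashMap<>();

    <T> void register(String name, Gauge<T> gauge) {
        gauges.put(name, gauge);
    }

    /** Looks up a gauge by name and fails loudly if it was never registered. */
    @SuppressWarnings("unchecked")
    <T> Gauge<T> getGauge(String name) {
        Gauge<?> gauge = gauges.get(name);
        if (gauge == null) {
            throw new IllegalArgumentException("No gauge registered: " + name);
        }
        return (Gauge<T>) gauge;
    }

    public static void main(String[] args) {
        GaugeLookupSketch registry = new GaugeLookupSketch();
        int bucketNum = 4; // assumed bucket count of a fixed-bucket test table
        registry.<Integer>register("sourceParallelismUpperBound", () -> bucketNum);

        // The fixed-bucket check boils down to: gauge value == bucket number.
        Gauge<Integer> gauge = registry.getGauge("sourceParallelismUpperBound");
        if (gauge.getValue() != bucketNum) {
            throw new AssertionError("unexpected gauge value: " + gauge.getValue());
        }
    }
}
```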

API and Format

No API or storage format changes. This only adds a new metric.

Documentation

Updated docs/content/maintenance/metrics.md with the new metric description.

yunfengzhou-hub

Thanks for the PR. Left some comments as below.

docs/content/maintenance/metrics.md

...mon-flink-common/src/main/java/org/apache/paimon/flink/source/ContinuousFileStoreSource.java

yunfengzhou-hub

+1

* upstream/master: (33 commits) [core] Fix merge adjacent files in DataEvolutionCompactCoordinator [python] Rename list_tag to list_tags [python] add list tag for TagManager (apache#7264) [core][python] Introduce DataFileMeta.nonNullRowIdRange to unify codes [python] with_shard should be evenly distributed for data evolution mode (apache#7271) [core] Remove useless version in Varant [core] Should work with Split in DataTableBatchScan [core] Fix paimon_incremental_query with limit push down (apache#7269) [rest] Improve RestCatalog OpenAPI nonce generation (apache#7270) [cdc] Avoid sending empty schema change events to Schema Evolution (apache#7261) [python] Fix avro write timestamp without timezone wrongly (apache#7259) [doc] add doc for filter by _ROW_ID on data evolution (apache#7262) [fs] Extract jindo dls to separate module (apache#7263) [core] Add listTableDetails method to Catalog interface (apache#7266) [python] Support filter by _ROW_ID for data evolution (apache#7252) [Flink] Add sourceParallelismUpperBound metric for auto-scaling systems (apache#7117) [github] Add whether it is an AI-generated tag in the PR template (apache#7257) [core] Improve HttpClient error response handling (apache#7254) [python] Light refactor: move _is_blob_file check into DataFileMeta (apache#7256) [core] RowIdPredicateVisitor supports converting between statement (apache#7255) ...

wzhero1 force-pushed the feat/paimon-flink-available-max-parallelism-metrics branch 2 times, most recently from affa652 to 438e986, on January 26, 2026 07:25

wzhero1 force-pushed the feat/paimon-flink-available-max-parallelism-metrics branch from 438e986 to edbdea3 on February 10, 2026 02:42

wzhero1 changed the title from "[Flink] Add sourceScalingMaxParallelism metric for auto-scaling systems" to "[Flink] Add sourceParallelismUpperBound metric for auto-scaling systems" on Feb 10, 2026

[flink] Add sourceParallelismUpperBound metric for auto-scaling systems

166deaa

wzhero1 force-pushed the feat/paimon-flink-available-max-parallelism-metrics branch from edbdea3 to 166deaa on February 10, 2026 02:54

yunfengzhou-hub reviewed Feb 10, 2026

[flink] change sourceParallelismUpperBound metric to SplitEnumerator

b05b083

yunfengzhou-hub approved these changes Feb 10, 2026

yunfengzhou-hub merged commit 959170d into apache:master Feb 11, 2026
14 checks passed

yunfengzhou-hub merged 2 commits into apache:master from wzhero1:feat/paimon-flink-available-max-parallelism-metrics
