Core: Fix InMemoryLockManager shared scheduler shutdown by fightBoxing · Pull Request #15894 · apache/iceberg

fightBoxing · 2026-04-05T15:26:24Z

Description

Problem

BaseLockManager uses a JVM-wide shared static ScheduledExecutorService for heartbeats, but each InMemoryLockManager instance calls close() independently. When multiple managers exist (e.g., in tests or multi-catalog JVM usage), one manager's close() shuts down the shared scheduler via shutdownNow(), causing RejectedExecutionException in other live managers that still need it for heartbeat scheduling.
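The failure mode can be reproduced in isolation with plain JDK executors. The sketch below is a simplified stand-in, not the Iceberg code: `NaiveManager` and `SharedSchedulerRepro` are hypothetical names, and the class assumes only what the description states, namely a static shared scheduler and an unconditional `shutdownNow()` in `close()`.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical stand-in for a lock manager that shares one static scheduler.
class NaiveManager implements AutoCloseable {
  private static ScheduledExecutorService shared;

  ScheduledExecutorService scheduler() {
    synchronized (NaiveManager.class) {
      if (shared == null) {
        shared = Executors.newScheduledThreadPool(1);
      }
      return shared;
    }
  }

  @Override
  public void close() {
    synchronized (NaiveManager.class) {
      if (shared != null) {
        // Unconditional shutdown: no check for other live managers.
        shared.shutdownNow();
      }
    }
  }
}

class SharedSchedulerRepro {
  // Returns true if closing manager A makes manager B's next submission fail.
  static boolean closeBreaksOtherManager() {
    NaiveManager a = new NaiveManager();
    NaiveManager b = new NaiveManager();
    ScheduledExecutorService heldByB = b.scheduler();
    a.scheduler();
    a.close();
    try {
      // B tries to schedule a heartbeat on the executor that A just shut down.
      heldByB.schedule(() -> {}, 1, TimeUnit.SECONDS);
      return false;
    } catch (RejectedExecutionException e) {
      return true;
    }
  }
}
```

Calling `SharedSchedulerRepro.closeBreaksOtherManager()` exercises the scenario from the description: two managers, one closed, the other still trying to schedule.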

Root Cause

No reference counting on the shared scheduler. Any single close() call destroys it for all instances.

Fix

Added an AtomicInteger reference counter (schedulerRefCount) to BaseLockManager
Added a per-instance schedulerInitialized flag to track whether this instance has incremented the ref count
In scheduler(): increment ref count on first access per instance
In close(): only shutdownNow() the scheduler when the ref count reaches zero (last active manager)
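The four steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions rather than the code in the PR: `RefCountedManager` and `refCount()` are hypothetical, the pool size of 1 is arbitrary, and only the names `schedulerRefCount` and `schedulerInitialized` are taken from the description.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.atomic.AtomicInteger;

// Simplified sketch of the reference-counting idea; not the actual BaseLockManager.
class RefCountedManager implements AutoCloseable {
  // Number of live instances that have requested the shared scheduler.
  private static final AtomicInteger schedulerRefCount = new AtomicInteger(0);
  private static ScheduledExecutorService sharedScheduler;

  // Whether this instance has already been counted.
  private boolean schedulerInitialized = false;

  ScheduledExecutorService scheduler() {
    synchronized (RefCountedManager.class) {
      if (!schedulerInitialized) {
        schedulerRefCount.incrementAndGet();
        schedulerInitialized = true;
      }
      if (sharedScheduler == null) {
        sharedScheduler = Executors.newScheduledThreadPool(1);
      }
      return sharedScheduler;
    }
  }

  @Override
  public void close() {
    synchronized (RefCountedManager.class) {
      if (!schedulerInitialized) {
        return; // never counted, or already closed: nothing to release
      }
      schedulerInitialized = false;
      // Only the last active manager shuts the shared scheduler down.
      if (schedulerRefCount.decrementAndGet() == 0 && sharedScheduler != null) {
        sharedScheduler.shutdownNow();
        sharedScheduler = null; // lets a later manager create a fresh scheduler
      }
    }
  }

  // Helper for inspection in this sketch only.
  static int refCount() {
    return schedulerRefCount.get();
  }
}
```

In this sketch the class-level lock already serializes every access, so the `AtomicInteger` is kept mainly to mirror the description. Tracking `schedulerInitialized` per instance also makes a repeated `close()` on the same manager a no-op, so the count cannot go negative.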

Testing

Added testClosingOneManagerDoesNotAffectAnother: creates two managers, closes one, verifies the other can still acquire locks
Added testClosingAllManagersShutsDownScheduler: closes all managers, then verifies a new manager can create a fresh scheduler
All existing tests pass

Notes

The fix is backward compatible - no API changes
Thread safety is maintained via synchronized blocks and volatile fields
The AtomicInteger counter ensures correct behavior even under concurrent close operations
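The concurrency claim rests on a property of `AtomicInteger.decrementAndGet()`: when several threads decrement a counter concurrently, each receives a distinct result, so exactly one of them observes zero. The following sketch illustrates only that property, using hypothetical names (`LastCloserDemo`, `countZeroObservers`); it does not model the rest of the manager.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

class LastCloserDemo {
  // Simulates `managers` concurrent close() calls and counts how many
  // of them see the reference count reach zero.
  static int countZeroObservers(int managers) {
    AtomicInteger refCount = new AtomicInteger(managers);
    AtomicInteger sawZero = new AtomicInteger();
    CountDownLatch start = new CountDownLatch(1);
    ExecutorService pool = Executors.newFixedThreadPool(managers);
    for (int i = 0; i < managers; i++) {
      pool.execute(() -> {
        try {
          start.await(); // release all threads at roughly the same time
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();
          return;
        }
        if (refCount.decrementAndGet() == 0) {
          sawZero.incrementAndGet(); // this thread would perform the shutdown
        }
      });
    }
    start.countDown();
    pool.shutdown();
    try {
      if (!pool.awaitTermination(10, TimeUnit.SECONDS)) {
        throw new IllegalStateException("workers did not finish in time");
      }
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException(e);
    }
    return sawZero.get();
  }
}
```

Because only one thread can be the "last closer", the shutdown path runs at most once per scheduler lifetime, whichever order the managers close in.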

Add reference counting to BaseLockManager's shared ScheduledExecutorService to prevent one manager instance from shutting down the scheduler while other active instances still need it for heartbeats. Previously, any single close() call would shutdownNow() the JVM-wide shared scheduler, causing RejectedExecutionException in other live managers trying to schedule heartbeats. The fix tracks per-instance initialization state and uses an AtomicInteger reference counter. The scheduler is only shut down when the last active manager closes.

rockyyin and others added 3 commits January 30, 2026 14:11

[FLINK] Implement Iceberg lookup join functionality, and source code …

d8a6770

…and junit test code.

Merge branch 'apache:main' into main

02f64ff

fightBoxing mentioned this pull request Apr 5, 2026

Core: InMemoryLockManager can shut down the shared scheduler while another manager is still active #15861

Open

3 tasks

github-actions bot added core flink build labels Apr 5, 2026

fightBoxing wants to merge 3 commits into apache:main from fightBoxing:fix/15861-inmemory-lockmanager-shared-scheduler
