Retry DRBD adjust after stale bitmap attach failure by kvaps · Pull Request #491 · LINBIT/linstor-server

kvaps · 2026-04-03T19:45:36Z

Summary

When drbdadm adjust fails during local attach with:

already has a bitmap, this should not happen

it means DRBD still has stale local bitmap state for the target minor. In that case LINSTOR currently aborts the adjust and leaves the resource diskless even though peers may still be healthy.

This change teaches the satellite to:

detect that specific attach failure
extract the affected minor from stderr
run drbdsetup detach <minor> with a --force fallback
retry drbdadm adjust once
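
The four steps above can be sketched roughly as follows. This is an illustrative Python outline, not the satellite's actual Java code from this PR. The names `extract_stale_minor`, `adjust_with_retry`, and `run_command` are hypothetical. The `MINOR_PATTERN` regex assumes stderr contains a line of the form `<minor>: Failure: ...`, which is an assumption about the output format rather than something stated in this description. The command runner is injected so the flow can be exercised without a DRBD installation.

```python
import re
import subprocess
from typing import Callable, Optional, Sequence, Tuple

# Error text quoted in the PR description.
BITMAP_LEAK_MARKER = "already has a bitmap, this should not happen"

# ASSUMPTION: stderr carries a line like "1000: Failure: ..." naming the minor.
# The real format may differ; adapt the pattern to the actual output.
MINOR_PATTERN = re.compile(r"^(\d+): Failure:", re.MULTILINE)

Result = Tuple[int, str]  # (exit code, stderr)
Runner = Callable[[Sequence[str]], Result]


def run_command(argv: Sequence[str]) -> Result:
    """Run a command and return its exit code and stderr."""
    proc = subprocess.run(list(argv), capture_output=True, text=True)
    return proc.returncode, proc.stderr


def extract_stale_minor(stderr: str) -> Optional[int]:
    """Return the affected minor if stderr shows the stale-bitmap failure."""
    if BITMAP_LEAK_MARKER not in stderr:
        return None
    match = MINOR_PATTERN.search(stderr)
    return int(match.group(1)) if match else None


def adjust_with_retry(resource: str, run: Runner = run_command) -> Result:
    """Run `drbdadm adjust`, retrying once after detaching a stale minor."""
    code, stderr = run(["drbdadm", "adjust", resource])
    if code == 0:
        return code, stderr

    minor = extract_stale_minor(stderr)
    if minor is None:
        # Some other failure: report it unchanged, no retry.
        return code, stderr

    # Plain detach first; fall back to --force only if that fails.
    detach_code, _ = run(["drbdsetup", "detach", str(minor)])
    if detach_code != 0:
        run(["drbdsetup", "detach", str(minor), "--force"])

    # Exactly one retry, whatever the detach outcome was.
    return run(["drbdadm", "adjust", resource])
```

Matching on the exact error string keeps the retry narrow: any other adjust failure is returned as-is, so unrelated errors are not masked by a detach.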

Why

I hit this while recovering DRBD resources after a cluster incident. In practice this looked like an unintentional diskless resource in LINSTOR while drbdadm status still showed a healthy Primary with peer-disk:UpToDate on other nodes.

The detach + retry path was enough to resynchronize LINSTOR with the actual DRBD device state and allow the local disk to be reattached.

Validation

reproduced against LINSTOR 1.33.1
verified on a live cluster during recovery
added unit coverage for bitmap leak detection / minor extraction

kvaps · 2026-04-03T20:58:26Z

We've integrated this change into Cozystack as part of cozystack/cozystack#2331

Retry DRBD adjust after stale bitmap attach failure

51ae50a

kvaps wants to merge 1 commit into LINBIT:master from kvaps:kvaps/retry-adjust-after-stale-bitmap
