feat: implement race condition handling for task dequeue and status updates by ronenkapelian · Pull Request #237 · MapColonies/jobnik-manager

ronenkapelian · 2026-02-19T16:38:10Z

Question	Answer
Bug fix	✔
New feature	✔
Breaking change	✖
Deprecations	✖
Documentation	✔
Tests added	✔
Chore	✖

Related issues:

Further information:
Enhance the system to handle race conditions during task dequeue and status updates. Introduce appropriate error handling and response codes to manage conflicts when multiple workers attempt to modify the same task or stage simultaneously. This includes adding timeouts for transactions and updating the API documentation to reflect these changes.

…pdates

github-actions · 2026-02-19T16:40:39Z

Coverage Report

Status	Category	Percentage	Covered / Total
🔵	Lines	100% (🎯 80%)	765 / 765
🔵	Statements	100% (🎯 80%)	782 / 782
🔵	Functions	100% (🎯 80%)	111 / 111
🔵	Branches	100% (🎯 80%)	217 / 217

File Coverage

File	Stmts	Branches	Functions	Lines
Changed Files
src/api/v1/tasks/controller.ts	100%	100%	100%	100%
src/stages/models/manager.ts	100%	100%	100%	100%
src/tasks/DAL/taskRepository.ts	100%	100%	100%	100%
src/tasks/models/helper.ts	100%	100%	100%	100%
src/tasks/models/manager.ts	100%	100%	100%	100%

Generated in workflow #723 for commit 80455ca by the Vitest Coverage Report Action

…ction handling

CptSchnitz · 2026-02-24T06:17:24Z

+    // Note: $queryRaw returns raw database values, not Prisma-mapped values
+    // We need to re-fetch the task using Prisma to get properly mapped enum values
+    const rawTask = tasks[0]!;
+    const task = await tx.task.findUnique({
+      where: { id: rawTask.id },
+    });


its bad practice to query again as it increases the load on the database. prisma recommends using TypedSql in their docs.
https://www.prisma.io/docs/orm/prisma-client/using-raw-sql/typedsql

CptSchnitz · 2026-02-24T07:42:53Z

+        async (newTx) => {
+          await this.executeUpdateStatus(jobId, status, newTx);
+        },
+        { timeout: TX_TIMEOUT_MS }


change to global timeout for transactions (pretty sure its a thing)

changed on most, it is not working on some parts, I checked that there is a transaction that must be mentioned explicitly

is it a bug? i dont see why a pg option wont work

CptSchnitz · 2026-02-24T07:45:18Z

+    // This prevents errors during race conditions where multiple workers
+    // try to set the same status (e.g., multiple tasks setting stage to IN_PROGRESS)
+    /* v8 ignore next 4 -- @preserve */
+    if (stage.status === status) {


i know its already exists, but maybe a name like newStatus/wantedStatus would be better?

CptSchnitz · 2026-02-24T08:28:43Z

+   * Uses SELECT FOR UPDATE SKIP LOCKED for pessimistic locking:
+   * - FOR UPDATE: Locks the row so other transactions wait
+   * - SKIP LOCKED: Skip rows that are already locked (instead of waiting)


this is too digging

CptSchnitz · 2026-02-24T08:42:03Z

+   * Always includes the current status to implement optimistic locking.
+   * This ensures updates only succeed if the task is still in the expected state.
+   *
+   * **Why this is necessary:**
+   * In high-concurrency scenarios with multiple workers, race conditions can occur:
+   *
+   * Scenario 1: Concurrent dequeue operations
+   * - Worker A and B both read Task1 as PENDING
+   * - Worker A updates: WHERE id=X AND status=PENDING → IN_PROGRESS (succeeds)
+   * - Worker B updates: WHERE id=X AND status=PENDING → IN_PROGRESS (fails - optimistic lock)
+   *
+   * Scenario 2: Dequeue during update
+   * - Task1 is PENDING
+   * - Worker A calls updateStatus(Task1, COMPLETED) - reads task as PENDING
+   * - Worker B calls dequeue() - reads Task1 as PENDING
+   * - Worker B commits: WHERE id=X AND status=PENDING → IN_PROGRESS (succeeds)
+   * - Worker A commits: WHERE id=X AND status=PENDING → COMPLETED (fails - status is now IN_PROGRESS)
+   *
+   * Scenario 3: Double completion
+   * - Task1 is IN_PROGRESS
+   * - Worker A and B both try to update to COMPLETED
+   * - Worker A updates: WHERE id=X AND status=IN_PROGRESS → COMPLETED (succeeds)
+   * - Worker B updates: WHERE id=X AND status=IN_PROGRESS → COMPLETED (fails - status is now COMPLETED)
+   *
+   * Without status check, these scenarios would succeed silently, causing data inconsistency.
+   * With status check (optimistic locking), the second update fails with TASK_STATUS_UPDATE_FAILED.


Co-authored-by: Ofer <12687466+CptSchnitz@users.noreply.github.com>

…y for race condition handling

…tus method for clarity

…thod

…configuration

CptSchnitz · 2026-03-18T07:14:05Z

+        async (newTx) => {
+          await this.executeUpdateStatus(jobId, status, newTx);
+        },
+        { timeout: TX_TIMEOUT_MS }


is it a bug? i dont see why a pg option wont work

CptSchnitz · 2026-03-18T08:07:27Z

+    ...raw,
+    stageId: raw.stage_id, // Handle camelCase conversion
+    status: raw.status.toUpperCase() as TaskPrismaObject['status'],
+    data: (raw.data ?? {}) as Record<string, unknown>,


isnt data not null? so we always get a value

…OperationStatus.PENDING

…es script

feat: implement race condition handling for task dequeue and status u…

025581a

…pdates

ronenkapelian requested a review from CptSchnitz February 19, 2026 16:38

ronenkapelian self-assigned this Feb 19, 2026

feat: implement task repository for dequeue operations and add transa…

401d504

…ction handling

CptSchnitz requested changes Feb 24, 2026

View reviewed changes

ronenkapelian and others added 6 commits February 24, 2026 11:22

refactor: Update openapi3.yaml

ad2acbd

Co-authored-by: Ofer <12687466+CptSchnitz@users.noreply.github.com>

refactor: Update openapi3.yaml

30b93ab

Co-authored-by: Ofer <12687466+CptSchnitz@users.noreply.github.com>

refactor: improve task dequeue documentation and optimize update quer…

3910770

…y for race condition handling

refactor: rename status parameter to targetStatus in executeUpdateSta…

0d1260b

…tus method for clarity

refactor: remove transaction timeout from JobManager's transaction me…

09e6719

…thod

refactor: update Docker build command and increase timeout values in …

9c40b43

…configuration

CptSchnitz requested changes Mar 18, 2026

View reviewed changes

ronenkapelian added 6 commits March 23, 2026 08:29

refactor: update .gitignore to preserve TypedSQL types for Docker builds

1b13495

refactor: update .gitignore to preserve TypedSQL types for Docker builds

4419908

refactor: update status in findAndLockTaskForDequeue test to use Task…

0a274ac

…OperationStatus.PENDING

chore: update version to 0.2.0 in openapi_v1.yaml

95a13dc

build: update Docker build command and add migration generate SQL typ…

10ce8bb

…es script

refactor: simplify transaction handling in JobManager and TaskManager

80455ca

CptSchnitz approved these changes Mar 24, 2026

View reviewed changes

ronenkapelian merged commit e05f283 into master Mar 24, 2026
10 of 11 checks passed

ronenkapelian deleted the fix/dequeue/bug branch March 24, 2026 14:13

mapcolonies-devops mentioned this pull request Mar 24, 2026

chore(master): release 0.2.1 #241

Merged

Uh oh!

Conversation

ronenkapelian commented Feb 19, 2026

Uh oh!

github-actions Bot commented Feb 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Coverage Report

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions Bot commented Feb 19, 2026 •

edited

Loading