PR #575: Fix run_xgboost_tasklet — exclude seed rows, fix graded_at crash, fix DB persist by jaayslaughter-cpu · Pull Request #445 · jaayslaughter-cpu/mework

jaayslaughter-cpu · 2026-05-16T07:33:13Z

Root Cause Analysis

xgb_model_store has been empty since day 1 — XGBoost has never successfully trained.
Three compounding bugs in run_xgboost_tasklet():

Bug 1: `graded_at = NULL` crash (seed rows) — the smoking gun

885,672 seed rows have graded_at = NULL. The sample_weights calculation does:
datetime.datetime.fromisoformat(str(None)) → fromisoformat("None") → ValueError every 2:30 AM.

Bug 2: Training on 885K synthetic rows instead of 86 real ones

Seed rows have model_prob = 55.0 (league fallback) for ALL rows — pure noise. Live rows have real features_json arrays, real model_prob (68.3% avg), and 59.3% win rate.

Bug 3: DB persist reads from ephemeral JSON file

with open(model_path, "r") reads from /app/api/models/prop_model_v1.json — fails on Railway (ephemeral FS), leaving xgb_model_store empty after every restart.

Fixes (4 surgical edits)

AND agent_name NOT ILIKE '%seed%' — train on 86 real legs only
Threshold < 200 → < 50 — 86 rows now qualify
_parse_graded_at() helper — NULL-safe graded_at (returns 30-days-ago for NULL)
DB persist via base64(pickle(model)) — no file path dependency

After this PR

2:30 AM retrain tonight trains on real data → xgb_model_store populates → K-blend + hit-blend activate.

Summary by cubic

Fixes nightly XGBoost training by excluding seed rows, handling NULL graded_at, and persisting the model in Postgres as a base64 pickle. Training now uses 86 live legs and xgb_model_store will populate and survive restarts.

Bug Fixes
- Exclude seed data: add agent_name NOT ILIKE '%seed%'; lower training minimum from 200 to 50 so 86 real rows qualify.
- Prevent crash on NULL dates: _parse_graded_at() handles NULL/invalid values and defaults to 30 days ago for recency weights.
- Durable persistence: write base64-encoded pickle to xgb_model_store (adds prop_type and n_samples) and keep only the last 3 models.

^{Written for commit 7bdd109. Summary will update on new commits. Review in cubic}

…rash, fix DB persist

coderabbitai · 2026-05-16T07:33:19Z

Warning

Rate limit exceeded

@jaayslaughter-cpu has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 38 minutes and 42 seconds before requesting another review.

You’ve run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: ad01e026-e331-4c85-b3ed-957035252600

📥 Commits

Reviewing files that changed from the base of the PR and between 523ad26 and 7bdd109.

📒 Files selected for processing (1)

tasklets.py

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix/xgboost-training-pr575

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

deepsource-io · 2026-05-16T07:33:35Z

DeepSource Code Review

We reviewed changes in 523ad26...7bdd109 on this pull request. Below is the summary for the review, and you can see the individual issues we found as inline review comments.

See full review on DeepSource ↗

PR Report Card

Overall Grade	Security Reliability Complexity Hygiene

Code Review Summary

Analyzer	Updated (UTC)	Details
Docker	May 16, 2026 7:33a.m.	Review ↗
JavaScript	May 16, 2026 7:33a.m.	Review ↗
Python	May 16, 2026 7:33a.m.	Review ↗
SQL	May 16, 2026 7:33a.m.	Review ↗
Secrets	May 16, 2026 7:33a.m.	Review ↗

Important

AI Review is run only on demand for your team. We're only showing results of static analysis review right now. To trigger AI Review, comment @deepsourcebot review on this thread.

codacy-production · 2026-05-16T07:33:50Z

Up to standards ✅

🟢 Issues 0 issues

Results:
0 new issues

View in Codacy

_{NEW Get contextual insights on your PRs based on Codacy's metrics, along with PR and Jira context, without leaving GitHub. Enable AI reviewer}
_{TIP This summary will be updated as you push new changes.}

gemini-code-assist

Code Review

This pull request updates the XGBoost tasklet by excluding 'seed' agents from the training set, reducing the minimum training data requirement to 50 rows, and improving the robustness of the recency decay calculation. It also transitions the model persistence logic to store base64-encoded pickles in the database. Feedback highlights a critical TypeError in the datetime subtraction logic due to mixed timezone awareness and a potential breaking change where the new storage format may be incompatible with existing model loaders.

gemini-code-assist · 2026-05-16T07:35:07Z

    now_utc = datetime.datetime.now(datetime.timezone.utc)
+    _default_graded = now_utc - datetime.timedelta(days=30)  # PR #575: default for NULL graded_at
+    def _parse_graded_at(v):
+        if v is None:
+            return _default_graded
+        if isinstance(v, datetime.datetime):
+            return v
+        try:
+            return datetime.datetime.fromisoformat(str(v))
+        except Exception:
+            return _default_graded
    sample_weights = np.array([
-        np.exp(-0.01 * max((now_utc - (
-            r[2] if isinstance(r[2], datetime.datetime)
-            else datetime.datetime.fromisoformat(str(r[2]))
-        ).replace(tzinfo=None)).days, 0))
+        np.exp(-0.01 * max((now_utc - _parse_graded_at(r[2]).replace(tzinfo=None)).days, 0))
        for r in rows
    ], dtype=np.float32)


The sample_weights calculation will raise a TypeError because it attempts to subtract an offset-naive datetime from an offset-aware one. now_utc is initialized as an aware datetime (UTC), but the expression _parse_graded_at(r[2]).replace(tzinfo=None) explicitly produces a naive datetime.

To fix this, you should make now_utc naive before using it in the subtraction.

Suggested change

now_utc = datetime.datetime.now(datetime.timezone.utc)

_default_graded = now_utc - datetime.timedelta(days=30) # PR #575: default for NULL graded_at

def _parse_graded_at(v):

if v is None:

return _default_graded

if isinstance(v, datetime.datetime):

return v

try:

return datetime.datetime.fromisoformat(str(v))

except Exception:

return _default_graded

sample_weights = np.array([

np.exp(-0.01 * max((now_utc - (

r[2] if isinstance(r[2], datetime.datetime)

else datetime.datetime.fromisoformat(str(r[2]))

).replace(tzinfo=None)).days, 0))

np.exp(-0.01 * max((now_utc - _parse_graded_at(r[2]).replace(tzinfo=None)).days, 0))

for r in rows

], dtype=np.float32)

now_utc = datetime.datetime.now(datetime.timezone.utc).replace(tzinfo=None)

_default_graded = now_utc - datetime.timedelta(days=30) # PR #575: default for NULL graded_at

def _parse_graded_at(v):

if v is None:

return _default_graded

if isinstance(v, datetime.datetime):

return v

try:

return datetime.datetime.fromisoformat(str(v))

except Exception:

return _default_graded

sample_weights = np.array([

np.exp(-0.01 * max((now_utc - _parse_graded_at(r[2]).replace(tzinfo=None)).days, 0))

for r in rows

], dtype=np.float32)

gemini-code-assist · 2026-05-16T07:35:07Z

+        import base64 as _b64  # noqa: PLC0415
+        _model_bytes  = pickle.dumps(model)
+        _model_b64str = _b64.b64encode(_model_bytes).decode("utf-8")


Changing the persistence format to a base64-encoded pickle string in the model_json column introduces an incompatibility with the existing model loader _load_xgb_model (line 1692), which expects this column to contain a raw XGBoost JSON string. While xgb_k_layer might expect a pickle, the global model used by agents will fail to load after a system restart.

Consider maintaining compatibility by storing the model in the format expected by the primary loader, or ensuring that _load_xgb_model is updated to handle the new format.

PR #575: Fix run_xgboost_tasklet — exclude seed rows, fix graded_at c…

7bdd109

…rash, fix DB persist

jaayslaughter-cpu merged commit a6c7b8d into main May 16, 2026
8 of 9 checks passed

gemini-code-assist Bot reviewed May 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PR #575: Fix run_xgboost_tasklet — exclude seed rows, fix graded_at crash, fix DB persist#445

PR #575: Fix run_xgboost_tasklet — exclude seed rows, fix graded_at crash, fix DB persist#445
jaayslaughter-cpu merged 1 commit into
mainfrom
fix/xgboost-training-pr575

jaayslaughter-cpu commented May 16, 2026 •

edited by cubic-dev-ai Bot

Loading

Uh oh!

coderabbitai Bot commented May 16, 2026

Rate limit exceeded

Uh oh!

deepsource-io Bot commented May 16, 2026 •

edited

Loading

Uh oh!

codacy-production Bot commented May 16, 2026

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 16, 2026

Uh oh!

gemini-code-assist Bot May 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jaayslaughter-cpu commented May 16, 2026 • edited by cubic-dev-ai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Root Cause Analysis

Bug 1: graded_at = NULL crash (seed rows) — the smoking gun

Bug 2: Training on 885K synthetic rows instead of 86 real ones

Bug 3: DB persist reads from ephemeral JSON file

Fixes (4 surgical edits)

After this PR

Summary by cubic

Uh oh!

coderabbitai Bot commented May 16, 2026

Rate limit exceeded

Uh oh!

deepsource-io Bot commented May 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

DeepSource Code Review

PR Report Card

Code Review Summary

Uh oh!

codacy-production Bot commented May 16, 2026

Up to standards ✅

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 16, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

jaayslaughter-cpu commented May 16, 2026 •

edited by cubic-dev-ai Bot

Loading

Bug 1: `graded_at = NULL` crash (seed rows) — the smoking gun

deepsource-io Bot commented May 16, 2026 •

edited

Loading