Skip to content

Conversation

@ksy36
Copy link
Collaborator

@ksy36 ksy36 commented Feb 4, 2026

Changes in this PR:
import_reports_from_bigquery command assigns only low quality reports to a default by domain bucket, the rest of reports are saved to db without cluster_id or bucket_id
triage_new_reports command gets reports that don't have bucket_id and attempts to cluster and bucket them (runs every hour at the moment)

I think once we import live reports the frequency of triaging can be increased.

@ksy36 ksy36 force-pushed the incoming_clustering branch from 9984818 to 1ce7f27 Compare February 9, 2026 20:40
@ksy36 ksy36 force-pushed the incoming_clustering branch from 1ce7f27 to c515179 Compare February 10, 2026 20:43
@ksy36 ksy36 marked this pull request as ready for review February 10, 2026 20:47
@ksy36 ksy36 requested a review from jgraham February 10, 2026 20:47
@ksy36
Copy link
Collaborator Author

ksy36 commented Feb 10, 2026

I need to find a way to run cluster_reports command once to cluster existing reports on production before these changes are deployed :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant