Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 4 additions & 0 deletions claims/al_jazeera/drc_faces_deadly_ebola_resurge.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "d77fe702137b27c8c8f04538dde3a2a48a1c9849d5e116e58aec80b229f4632f"
"summary": "The new Ebola outbreak has been reported in the towns of Rwampara, Mongwalu, and Bunia in northeastern DRC"
"title": "DRC faces deadly Ebola resurgence amid worsening humanitarian crisis"
"uri": "https://www.aljazeera.com/features/2026/5/17/drc-faces-deadly-ebola-resurgence-amid-worsening-humanitarian-crisis"
4 changes: 4 additions & 0 deletions claims/bbc/uae_reports_drone_strike_near.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "ff03f7306b36e74d5dad4a7b18f821862b90afd53cc6f01dc7fd3b07796b1404"
"summary": "The country is investigating the source of the strike, which it says came from its western border."
"title": "UAE reports drone strike near Abu Dhabi nuclear power plant"
"uri": "https://www.bbc.com/news/articles/cwy27pkj1l1o"
Comment on lines +1 to +4
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🔴 Critical | ⚡ Quick win

Invalid YAML mapping syntax.

The quoted keys break the YAML mapping structure. Lines like "sourceUriDigest": "value" are parsed as scalar strings, not key-value pairs, and will cause YAML parsing to fail.

🔧 Proposed fix to correct YAML syntax
-"sourceUriDigest": "ff03f7306b36e74d5dad4a7b18f821862b90afd53cc6f01dc7fd3b07796b1404"
-"summary": "The country is investigating the source of the strike, which it says came from its western border."
-"title": "UAE reports drone strike near Abu Dhabi nuclear power plant"
-"uri": "https://www.bbc.com/news/articles/cwy27pkj1l1o"
+sourceUriDigest: "ff03f7306b36e74d5dad4a7b18f821862b90afd53cc6f01dc7fd3b07796b1404"
+summary: "The country is investigating the source of the strike, which it says came from its western border."
+title: "UAE reports drone strike near Abu Dhabi nuclear power plant"
+uri: "https://www.bbc.com/news/articles/cwy27pkj1l1o"
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
"sourceUriDigest": "ff03f7306b36e74d5dad4a7b18f821862b90afd53cc6f01dc7fd3b07796b1404"
"summary": "The country is investigating the source of the strike, which it says came from its western border."
"title": "UAE reports drone strike near Abu Dhabi nuclear power plant"
"uri": "https://www.bbc.com/news/articles/cwy27pkj1l1o"
sourceUriDigest: "ff03f7306b36e74d5dad4a7b18f821862b90afd53cc6f01dc7fd3b07796b1404"
summary: "The country is investigating the source of the strike, which it says came from its western border."
title: "UAE reports drone strike near Abu Dhabi nuclear power plant"
uri: "https://www.bbc.com/news/articles/cwy27pkj1l1o"
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@claims/bbc/uae_reports_drone_strike_near.yaml` around lines 1 - 4, The YAML
file uses quoted full-line strings like "sourceUriDigest": "..." which are
parsed as scalars instead of mappings; change each quoted key to a proper YAML
mapping key (e.g., sourceUriDigest: "value") for the entries sourceUriDigest,
summary, title, and uri so they become valid key: value pairs; ensure keys are
unquoted (or use bare keys) and values remain quoted if they contain special
characters.

4 changes: 4 additions & 0 deletions claims/fox_news/us_nigeria_strike_isis_fighte.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "3bc8be7f5c80be275ea61e7aee0ba9cb8dac3b4536e39515942985f33e9d53b8"
"summary": "U.S. and Nigerian forces struck ISIS targets in Nigeria days after President Trump announced the killing of global ISIS leader Abu-Bilal al-Minuki."
"title": "US, Nigeria strike ISIS fighters again from the air after killing senior leader"
"uri": "https://www.foxnews.com/world/us-nigeria-strike-isis-fighters-air-killing-senior-leader"
4 changes: 4 additions & 0 deletions claims/hindustan_times/at_least_nine_dead_in_china_af.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "16e7233814ff26725174c87e80e6da033de4feb7ea3fa5a904ef75dbc17541f3"
"summary": "The passengers were returning from planting sweet potatoes on Saturday and the truck fell into the river while crossing a bridge."
"title": "At least nine dead in China after vehicle swept into flooded river during heavy rain"
"uri": "https://www.hindustantimes.com/world-news/at-least-nine-dead-in-china-after-vehicle-swept-into-flooded-river-during-heavy-rain-101779124029765.html"
Comment on lines +1 to +4
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🔴 Critical | ⚡ Quick win

Invalid YAML mapping syntax.

The quoted keys break the YAML mapping structure. Lines like "sourceUriDigest": "value" are parsed as scalar strings, not key-value pairs, and will cause YAML parsing to fail.

🔧 Proposed fix to correct YAML syntax
-"sourceUriDigest": "16e7233814ff26725174c87e80e6da033de4feb7ea3fa5a904ef75dbc17541f3"
-"summary": "The passengers were returning from planting sweet potatoes on Saturday and the truck fell into the river while crossing a bridge."
-"title": "At least nine dead in China after vehicle swept into flooded river during heavy rain"
-"uri": "https://www.hindustantimes.com/world-news/at-least-nine-dead-in-china-after-vehicle-swept-into-flooded-river-during-heavy-rain-101779124029765.html"
+sourceUriDigest: "16e7233814ff26725174c87e80e6da033de4feb7ea3fa5a904ef75dbc17541f3"
+summary: "The passengers were returning from planting sweet potatoes on Saturday and the truck fell into the river while crossing a bridge."
+title: "At least nine dead in China after vehicle swept into flooded river during heavy rain"
+uri: "https://www.hindustantimes.com/world-news/at-least-nine-dead-in-china-after-vehicle-swept-into-flooded-river-during-heavy-rain-101779124029765.html"
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
"sourceUriDigest": "16e7233814ff26725174c87e80e6da033de4feb7ea3fa5a904ef75dbc17541f3"
"summary": "The passengers were returning from planting sweet potatoes on Saturday and the truck fell into the river while crossing a bridge."
"title": "At least nine dead in China after vehicle swept into flooded river during heavy rain"
"uri": "https://www.hindustantimes.com/world-news/at-least-nine-dead-in-china-after-vehicle-swept-into-flooded-river-during-heavy-rain-101779124029765.html"
sourceUriDigest: "16e7233814ff26725174c87e80e6da033de4feb7ea3fa5a904ef75dbc17541f3"
summary: "The passengers were returning from planting sweet potatoes on Saturday and the truck fell into the river while crossing a bridge."
title: "At least nine dead in China after vehicle swept into flooded river during heavy rain"
uri: "https://www.hindustantimes.com/world-news/at-least-nine-dead-in-china-after-vehicle-swept-into-flooded-river-during-heavy-rain-101779124029765.html"
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@claims/hindustan_times/at_least_nine_dead_in_china_af.yaml` around lines 1 -
4, The YAML file uses quoted full-line strings like "sourceUriDigest": which are
parsed as scalars rather than keys; update the mapping so each entry is a proper
YAML key: value pair (no surrounding quotes around the entire key+colon token) —
e.g. convert the lines containing "sourceUriDigest":, "summary":, "title":, and
"uri": into valid YAML mappings by placing the key (sourceUriDigest, summary,
title, uri) followed by a colon, a space, and the quoted or plain value as
appropriate.

4 changes: 4 additions & 0 deletions claims/india_times/farm_fires_punjab_reports_115.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "6472f88917d973b804e99cf32fcb87269d2557252739bc7bc137d8aa2da7b581"
"summary": "None"
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Replace placeholder with actual summary content.

The summary field contains the string "None" instead of a descriptive summary of the article content. This appears to be missing data that should be populated before ingestion.

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@claims/india_times/farm_fires_punjab_reports_115.yaml` at line 2, The
"summary" field in claims/india_times/farm_fires_punjab_reports_115.yaml
currently contains the placeholder string "None"; replace that placeholder with
a concise descriptive summary of the article (one or two sentences) that
captures the main claim/details, updating the "summary" value in the YAML so it
is not "None" anymore.

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1: The summary field contains the string "None", which is almost certainly a data pipeline serialization artifact. This will result in posting a claim with a meaningless summary to the API instead of actual article content. The pipeline should either generate a proper summary from the article or handle the null case correctly.

Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At claims/india_times/farm_fires_punjab_reports_115.yaml, line 2:

<comment>The summary field contains the string "None", which is almost certainly a data pipeline serialization artifact. This will result in posting a claim with a meaningless summary to the API instead of actual article content. The pipeline should either generate a proper summary from the article or handle the null case correctly.</comment>

<file context>
@@ -0,0 +1,4 @@
+"sourceUriDigest": "6472f88917d973b804e99cf32fcb87269d2557252739bc7bc137d8aa2da7b581"
+"summary": "None"
+"title": "Farm fires: Punjab reports 115 new cases"
+"uri": "https://timesofindia.indiatimes.com/city/chandigarh/farm-fires-punjab-reports-115-new-cases/articleshow/131184684.cms"
</file context>

"title": "Farm fires: Punjab reports 115 new cases"
"uri": "https://timesofindia.indiatimes.com/city/chandigarh/farm-fires-punjab-reports-115-new-cases/articleshow/131184684.cms"
4 changes: 4 additions & 0 deletions claims/ndtv/4_dead_8_injured_as_gunman_op.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "bc65553a778a1c832967d1ff1fb318597ff5d87f5f935f6e76820cc514fb287f"
"summary": "None"
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Replace placeholder with actual summary content.

The summary field contains the string "None" instead of a descriptive summary of the article content. This appears to be missing data that should be populated before ingestion.

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@claims/ndtv/4_dead_8_injured_as_gunman_op.yaml` at line 2, The "summary"
field currently contains the placeholder string "None"; replace it with a
concise descriptive summary of the article (one clear sentence capturing the
who/what/where/when/why) by editing the "summary" value in the YAML
(claims/ndtv/4_dead_8_injured_as_gunman_op.yaml). Use the article body to craft
the sentence, ensure it is not the literal "None" and follows the same
quoting/formatting as other entries, and save the file so ingestion sees the
populated summary.

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1: Replace "summary": "None" with a meaningful description of the claim. The literal string "None" is a Python serialization artifact, not an actual summary — it provides zero information about the claim content and will degrade the quality of the data posted to the API.

Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At claims/ndtv/4_dead_8_injured_as_gunman_op.yaml, line 2:

<comment>Replace `"summary": "None"` with a meaningful description of the claim. The literal string "None" is a Python serialization artifact, not an actual summary — it provides zero information about the claim content and will degrade the quality of the data posted to the API.</comment>

<file context>
@@ -0,0 +1,4 @@
+"sourceUriDigest": "bc65553a778a1c832967d1ff1fb318597ff5d87f5f935f6e76820cc514fb287f"
+"summary": "None"
+"title": "4 Dead, 8 Injured As Gunman Opens Fire In Southern Turkey: Report"
+"uri": "https://www.ndtv.com/world-news/4-dead-8-injured-as-gunman-opens-fire-in-southern-turkey-report-11513477"
</file context>
Suggested change
"summary": "None"
"summary": "Four people were killed and eight others injured after a gunman opened fire in southern Turkey, according to reports."

"title": "4 Dead, 8 Injured As Gunman Opens Fire In Southern Turkey: Report"
"uri": "https://www.ndtv.com/world-news/4-dead-8-injured-as-gunman-opens-fire-in-southern-turkey-report-11513477"
Comment on lines +1 to +4
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🔴 Critical | ⚡ Quick win

Invalid YAML mapping syntax.

The quoted keys break the YAML mapping structure. Lines like "sourceUriDigest": "value" are parsed as scalar strings, not key-value pairs, and will cause YAML parsing to fail.

🔧 Proposed fix to correct YAML syntax
-"sourceUriDigest": "bc65553a778a1c832967d1ff1fb318597ff5d87f5f935f6e76820cc514fb287f"
-"summary": "None"
-"title": "4 Dead, 8 Injured As Gunman Opens Fire In Southern Turkey: Report"
-"uri": "https://www.ndtv.com/world-news/4-dead-8-injured-as-gunman-opens-fire-in-southern-turkey-report-11513477"
+sourceUriDigest: "bc65553a778a1c832967d1ff1fb318597ff5d87f5f935f6e76820cc514fb287f"
+summary: "None"
+title: "4 Dead, 8 Injured As Gunman Opens Fire In Southern Turkey: Report"
+uri: "https://www.ndtv.com/world-news/4-dead-8-injured-as-gunman-opens-fire-in-southern-turkey-report-11513477"
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
"sourceUriDigest": "bc65553a778a1c832967d1ff1fb318597ff5d87f5f935f6e76820cc514fb287f"
"summary": "None"
"title": "4 Dead, 8 Injured As Gunman Opens Fire In Southern Turkey: Report"
"uri": "https://www.ndtv.com/world-news/4-dead-8-injured-as-gunman-opens-fire-in-southern-turkey-report-11513477"
sourceUriDigest: "bc65553a778a1c832967d1ff1fb318597ff5d87f5f935f6e76820cc514fb287f"
summary: "None"
title: "4 Dead, 8 Injured As Gunman Opens Fire In Southern Turkey: Report"
uri: "https://www.ndtv.com/world-news/4-dead-8-injured-as-gunman-opens-fire-in-southern-turkey-report-11513477"
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@claims/ndtv/4_dead_8_injured_as_gunman_op.yaml` around lines 1 - 4, The YAML
uses quoted whole-line strings like "sourceUriDigest": "..." which YAML treats
as scalars, not mappings; update the document so keys are proper YAML mappings
by removing the surrounding quotes from keys and using key: value pairs (e.g.,
sourceUriDigest:
bc65553a778a1c832967d1ff1fb318597ff5d87f5f935f6e76820cc514fb287f, summary:
"None", title: "4 Dead, 8 Injured As Gunman Opens Fire In Southern Turkey:
Report", uri: "https://...") for the keys sourceUriDigest, summary, title and
uri so the file parses as a valid mapping.

Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "3295a82ed433fc6cfd4595d65a877b3d3bf55b313f467f2b584658e0df6ec3ee"
"summary": "For years, mobile carriers have billed users by the gigabyte. Now, China’s telecoms giants are seeking to monetise the artificial intelligence boom by introducing a new unit of measurement to monthly bills: the AI token. On Sunday, China Telecom unveiled nationwide, token-based pricing packages aimed at a range of customers – from casual users to developers and businesses – according to information listed on the company’s app. Consumer packages, tailored for everyday tasks, start at 9.9 yuan..."
"title": "Forget cellular data: China’s telecoms giants are selling AI token plans"
"uri": "https://www.scmp.com/tech/tech-trends/article/3353954/forget-cellular-data-chinas-telecoms-giants-are-selling-ai-token-plans"
4 changes: 4 additions & 0 deletions claims/the_guardian/california_island_fire_linked.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "619af5b3da55bc1a969dade318bdb7cfa47011fe471235e97e162388450c01a2"
"summary": "Fire on Santa Rosa Island in Channel Islands national park becomes state’s largest this year and threatens rare plants A wildfire that broke out on an island in the Channel Islands national park has become California’s largest wildfire so far this year, burning through more than 10,000 acres, destroying historic structures and endangering rare plant communities that conservationists had struggled to reclaim. About six dozen firefighters have been deployed to control the blaze, which broke out on Friday, but their efforts have been undermined by strong winds. The fire is currently at 0% containment, according to a Cal Fire incident report . Continue reading..."
"title": "California island fire linked to sailor’s distress flare scorches 10,000 acres"
"uri": "https://www.theguardian.com/us-news/2026/may/18/california-national-park-fire-santa-rosa-island"
4 changes: 4 additions & 0 deletions claims/the_new_york_times/meta_reassigns_7_000_employees.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "f217d823160d39ca379a09044e1242ac641c8b1d79a1d6f499c59f08a1b67500"
"summary": "The company announced the changes two days before it plans to lay off 10 percent of its work force, or about 8,000 employees."
"title": "Meta Reassigns 7,000 Employees to Focus on A.I."
"uri": "https://www.nytimes.com/2026/05/18/technology/meta-reassigns-7000-employees-ai.html"
4 changes: 4 additions & 0 deletions claims/yahoo_finance/michael_burry_says_today_s_top.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
"sourceUriDigest": "844adf50f74b30951a08a382fffc9f1a904acd2d62e062db41d2647c72f904a4"
"summary": "None"
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Replace placeholder summary value on Line 2.

Using "None" as a literal summary weakens claim quality for downstream ingestion/search. Use a real summary, or set schema-compliant null/empty value (per oapi.yaml) instead.

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@claims/yahoo_finance/michael_burry_says_today_s_top.yaml` at line 2, The
"summary" field currently contains the literal string "None"; replace it with a
meaningful short summary of the claim or, if intentionally empty, set a
schema-compliant null/empty value per oapi.yaml (e.g., remove the key or use the
approved null representation). Update the "summary" key in
michael_burry_says_today_s-top.yaml so it is not the literal "None" and conforms
to the API schema.

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2: summary is set to the literal string "None", which is a placeholder value, not a meaningful summary of the claim. Replace it with an actual description of the claim content before merge.

Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At claims/yahoo_finance/michael_burry_says_today_s_top.yaml, line 2:

<comment>`summary` is set to the literal string "None", which is a placeholder value, not a meaningful summary of the claim. Replace it with an actual description of the claim content before merge.</comment>

<file context>
@@ -0,0 +1,4 @@
+"sourceUriDigest": "844adf50f74b30951a08a382fffc9f1a904acd2d62e062db41d2647c72f904a4"
+"summary": "None"
+"title": "Michael Burry Says Today’s Top 10 Stocks Surged 784% in a Year. The Pre-Dot-Com Number Was 622%."
+"uri": "https://finance.yahoo.com/markets/stocks/articles/michael-burry-says-today-top-131526825.html"
</file context>

"title": "Michael Burry Says Today’s Top 10 Stocks Surged 784% in a Year. The Pre-Dot-Com Number Was 622%."
"uri": "https://finance.yahoo.com/markets/stocks/articles/michael-burry-says-today-top-131526825.html"
2 changes: 1 addition & 1 deletion scripts/helper.py
Original file line number Diff line number Diff line change
Expand Up @@ -81,5 +81,5 @@ def patch_sources(api_key: str, base_url: str, uriDigest: str, body: dict):
endpoint = f"{base_url}/api/v1/source/{uriDigest}"
headers = {"X-API-Key": api_key, "Content-Type": "application/json"}
status, _ = post_request(endpoint, headers, body, timeout=90, method="PATCH")
if status != 200:
if status != 204:
print(f"Error: failed to patch source {uriDigest}: {status}", file=sys.stderr)
4 changes: 2 additions & 2 deletions scripts/newsdata_io.py
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,7 @@ def get_claims(src_domain_url: str):
if response.status_code != 200:
print(f"Error: couldn't fetch claims for {src_domain_url}: {response.status_code}")
resp_body = response.json()
if resp_body.get("results") is not None and resp_body.get("results").get("suggestion") is not None:
print(f"Suggested domain url(s) for {src_domain_url}: {resp_body['results']['suggestion']}")
if resp_body.get("results") is not None and resp_body.get("results")[0].get("suggestion") is not None:
print(f"Suggested domain url(s) for {src_domain_url}: {resp_body['results'][0]['suggestion']}")
return None
return response.json()["results"]
Loading