Skip to content

Commit 8352e94

Browse files
committed
Daily scrape data update
1 parent 38051f6 commit 8352e94

3 files changed

Lines changed: 341 additions & 0 deletions

File tree

data/hn_nontech_2026-05-15.json

Lines changed: 149 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,149 @@
1+
{
2+
"scraped_date": "2026-05-15",
3+
"source": "hacker_news",
4+
"total_scraped": 74,
5+
"nontech_count": 9,
6+
"posts": [
7+
{
8+
"id": "48060054",
9+
"title": "Ask HN: We just had an actual UUID v4 collision...",
10+
"link": "https://news.ycombinator.com/item?id=48060054",
11+
"domain": "news.ycombinator.com",
12+
"author": "mittermayr",
13+
"score": 472,
14+
"comment_count": 341,
15+
"created_ts": 1778227034,
16+
"is_internal": true,
17+
"post_text": "I know what you&#x27;re thinking... and I still can&#x27;t believe it, but...<p>This morning, our database flagged a duplicate UUID (v4). I checked, thinking it may have been a double-insert bug or something, but no.<p>The original UUID was from a record added in 2025 (about a year ago), and today the system inserted a new document with a fresh UUIDv4 and it came up with the exact same one:<p>b6133fd6-70fe-4fe3-bed6-8ca8fc9386cd<p>We&#x27;re using this:\nhttps:&#x2F;&#x2F;www.npmjs.com&#x2F;package&#x2F;uuid<p>I thought this is technically impossible, and it will never happen, and since we&#x27;re not modifying the UUIDs in any way, I really wonder how that.... is possible!? We&#x27;re literally only calling:<p>import { v4 as uuidv4 } from &quot;uuid&quot;;<p>const document_id = uuidv4();<p>... and then insert into the database, that&#x27;s it.<p>Additionally, the database only has about 15.000 records, and now one collision. Statistically... impossible.<p>Has that ever happened to anyone?! What in the...",
18+
"is_ask_hn": true,
19+
"matched_keywords": [],
20+
"comments": []
21+
},
22+
{
23+
"id": "48085993",
24+
"title": "Ask HN: What are you working on? (May 2026)",
25+
"link": "https://news.ycombinator.com/item?id=48085993",
26+
"domain": "news.ycombinator.com",
27+
"author": "david927",
28+
"score": 282,
29+
"comment_count": 1083,
30+
"created_ts": 1778434481,
31+
"is_internal": true,
32+
"post_text": "What are you working on? Any new ideas that you&#x27;re thinking about?",
33+
"is_ask_hn": true,
34+
"matched_keywords": [],
35+
"comments": []
36+
},
37+
{
38+
"id": "48122103",
39+
"title": "Ask HN: What are you working on (non-AI)?",
40+
"link": "https://news.ycombinator.com/item?id=48122103",
41+
"domain": "news.ycombinator.com",
42+
"author": "BrunoBernardino",
43+
"score": 33,
44+
"comment_count": 43,
45+
"created_ts": 1778681272,
46+
"is_internal": true,
47+
"post_text": "Please don&#x27;t turn this into an inflammatory post.<p>Regardless of &quot;AI&quot; being good or bad (it&#x27;s not even just one thing as many of you know), I feel like the &quot;What are you working on?&quot; posts are drowning in things that use AI for something and&#x2F;or are clearly &quot;AI slop&quot;.<p>I&#x27;d like to look at things other humans have been doing (even if they used a bit of some kind of AI for assistance), that aren&#x27;t a product or tool that uses AI for something.<p>I know it exists (and I use and build some), but it&#x27;s incredibly hard to find nowadays. Can you help me?<p>Thank you.",
48+
"is_ask_hn": true,
49+
"matched_keywords": [],
50+
"comments": []
51+
},
52+
{
53+
"id": "48087925",
54+
"title": "Ask HN: Will low quality AI customer support be the new normal?",
55+
"link": "https://news.ycombinator.com/item?id=48087925",
56+
"domain": "news.ycombinator.com",
57+
"author": "0-bad-sectors",
58+
"score": 27,
59+
"comment_count": 31,
60+
"created_ts": 1778446647,
61+
"is_internal": true,
62+
"post_text": "Now whenever I reach for customer&#x27;s support chat or phone I get an AI agent replying to me and I get into a useless loop for a couple of minutes before I start begging it to link me to a real person.<p>Will companies start losing customers because of that or people will eventually get used to this?",
63+
"is_ask_hn": true,
64+
"matched_keywords": [],
65+
"comments": []
66+
},
67+
{
68+
"id": "48103273",
69+
"title": "Ask HN: How to get started in electronic music",
70+
"link": "https://news.ycombinator.com/item?id=48103273",
71+
"domain": "news.ycombinator.com",
72+
"author": "A_Random_Nerd",
73+
"score": 16,
74+
"comment_count": 16,
75+
"created_ts": 1778550905,
76+
"is_internal": true,
77+
"post_text": "I&#x27;ve been interested in learning how to make electronic music (e.g. Geometry dash, Undertale, Daft Punk, C418), but there doesn&#x27;t seem to be any real documentation or tutorials on how to start.<p>To summarize my question:\na.) What are the first 3 steps (or more) to learn to compose music?\nb.) Any links, resources etc. to share?<p>Thanks for your time and interest.",
78+
"is_ask_hn": true,
79+
"matched_keywords": [],
80+
"comments": []
81+
},
82+
{
83+
"id": "48135807",
84+
"title": "Ask HN: What's the hardest part of building a SaaS that users keep paying for?",
85+
"link": "https://news.ycombinator.com/item?id=48135807",
86+
"domain": "news.ycombinator.com",
87+
"author": "specwiseai",
88+
"score": 10,
89+
"comment_count": 10,
90+
"created_ts": 1778768338,
91+
"is_internal": true,
92+
"post_text": "I&#x27;m Afrid, 14 years old, building SpecWise AI solo from Bangladesh. Struggling to get my first paying customer. Would love to hear what actually made users stick around for you.",
93+
"is_ask_hn": true,
94+
"matched_keywords": [],
95+
"comments": []
96+
},
97+
{
98+
"id": "48126435",
99+
"title": "Ask HN: Is Anthropic doing too much vibe coding?",
100+
"link": "https://news.ycombinator.com/item?id=48126435",
101+
"domain": "news.ycombinator.com",
102+
"author": "terabytest",
103+
"score": 7,
104+
"comment_count": 8,
105+
"created_ts": 1778701105,
106+
"is_internal": true,
107+
"post_text": "The Claude app and Claude Code have been an unusable and buggy mess for me lately, has anyone else been experiencing this? Most of my messages get swallowed after sending them or the responses get interrupted or dropped. Sometimes entire conversations disappear from the sidebar only to reappear later. I’ve learnt that when a message appears not to have gone through or have errored midway, it often comes back with a valid response if I wait a bit and then restart the app.<p>I wonder if it has anything to do with Anthropic eagerly embracing vibe coding.",
108+
"is_ask_hn": true,
109+
"matched_keywords": [],
110+
"comments": []
111+
},
112+
{
113+
"id": "48143205",
114+
"title": "XS Programming Language",
115+
"link": "https://news.ycombinator.com/item?id=48143205",
116+
"domain": "news.ycombinator.com",
117+
"author": "xs-lang",
118+
"score": 4,
119+
"comment_count": 0,
120+
"created_ts": 1778806709,
121+
"is_internal": true,
122+
"post_text": "Made my language called XS. It&#x27;s a general-purpose language, currently at v1.2.15 as of this writing. Would like feedback and how I can improve it! Website it at https:&#x2F;&#x2F;xslang.org. There is a playground to try XS out in, and fully complete docs as well. Enjoy!",
123+
"is_ask_hn": false,
124+
"matched_keywords": [
125+
"purpose",
126+
"feedback"
127+
],
128+
"comments": []
129+
},
130+
{
131+
"id": "48143615",
132+
"title": "Ask HN: When will you be concerned on layoffs?",
133+
"link": "https://news.ycombinator.com/item?id=48143615",
134+
"domain": "news.ycombinator.com",
135+
"author": "piratesAndSons",
136+
"score": 4,
137+
"comment_count": 0,
138+
"created_ts": 1778810151,
139+
"is_internal": true,
140+
"post_text": "In recent weeks, I have noticed a lot of tech companies announcing layoffs due to AI. As this continues, when will you be concerned?<p>Microsoft laying off 10% of their workforce, 15%, 20%, 30%?\nGoogle laying off 10%, 15%, 20%, or more?<p>HN readers like to use the cope of &quot;it&#x27;s just pandemic over-hiring&quot; — when do you think that explanation will no longer hold?<p>When will you say: &quot;Now it has nothing to do with Covid-era over-hiring; the industry has fundamentally changed.&quot;<p>If this happens and a significant number of white-collar jobs are eliminated, who would buy things to keep the economy funded?",
141+
"is_ask_hn": true,
142+
"matched_keywords": [
143+
"hiring",
144+
"layoffs"
145+
],
146+
"comments": []
147+
}
148+
]
149+
}

data/newsletters_2026-05-15.json

Lines changed: 181 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,181 @@
1+
{
2+
"scraped_date": "2026-05-15",
3+
"source": "newsletters",
4+
"feeds_checked": [
5+
"simon_willison",
6+
"sean_goedecke",
7+
"rachel_by_the_bay",
8+
"mitchell_hashimoto",
9+
"matklad",
10+
"hillel_wayne",
11+
"paul_graham",
12+
"experimental_history",
13+
"anil_dash",
14+
"pragmatic_engineer",
15+
"leaddev",
16+
"staffeng",
17+
"engineering_managers",
18+
"software_lead_weekly",
19+
"steve_blank"
20+
],
21+
"total_articles": 253,
22+
"relevant_count": 9,
23+
"articles": [
24+
{
25+
"feed_id": "simon_willison",
26+
"feed_name": "Simon Willison's Weblog",
27+
"title": "Quoting Mo Bitar",
28+
"summary": "Now, if your CEO has never heard the phrase Ralph Loop, oh man, you are less than 30 days away from your next promotion. I'm not even exaggerating. Walk into his office, close the door, and say, hey chief, been experimenting with something. It's called Ralph Loops. And I think it could change literally everything. And he's gonna say, what's a Ralph loop? And you will say, give me $18,000 worth of API credits and I'll show you. Now you won't actually do anything, because you can't do anything. Be",
29+
"link": "https://simonwillison.net/2026/May/12/mo-bitar/#atom-everything",
30+
"published": "2026-05-12T22:59:58+00:00",
31+
"matched_keywords": [
32+
"promotion"
33+
],
34+
"relevance_score": 3,
35+
"feed_focus": [
36+
"ai leverage",
37+
"engineering strategy",
38+
"platform thinking"
39+
]
40+
},
41+
{
42+
"feed_id": "simon_willison",
43+
"feed_name": "Simon Willison's Weblog",
44+
"title": "Quoting New York Times Editors’ Note",
45+
"summary": "This article was updated after The Times learned that a remark attributed to Pierre Poilievre, the Conservative leader, was in fact an A.I.-generated summary of his views about Canadian politics that A.I. rendered as a quotation. The reporter should have checked the accuracy of what the A.I. tool returned. The article now accurately quotes from a speech delivered by Mr. Poilievre in April. [...] He did not refer to politicians who changed allegiances as turncoats in that speech. &mdash; New York",
46+
"link": "https://simonwillison.net/2026/May/10/new-york-times-editors-note/#atom-everything",
47+
"published": "2026-05-10T23:58:49+00:00",
48+
"matched_keywords": [
49+
"politics"
50+
],
51+
"relevance_score": 3,
52+
"feed_focus": [
53+
"ai leverage",
54+
"engineering strategy",
55+
"platform thinking"
56+
]
57+
},
58+
{
59+
"feed_id": "sean_goedecke",
60+
"feed_name": "Sean Goedecke",
61+
"title": "The left-wing case for AI",
62+
"summary": "In Many anti-AI arguments are conservative arguments I argued that left-wing anti-AI sentiment1 is partly a backlash to two unrelated events around the rise of ChatGPT: the crypto mania of 2022 and the pro-Donald-Trump push many big tech CEOs made in 2024. If the timing had been different, we could have had a real pro-AI faction on the left. What would that look like? I’m not going to respond to any of the popular anti-AI arguments (I’ve already done that here). I think it’s more interesting to ",
63+
"link": "https://seangoedecke.com/the-left-wing-case-for-ai/",
64+
"published": "2026-05-10T00:00:00+00:00",
65+
"matched_keywords": [
66+
"impact",
67+
"delivery",
68+
"conflict"
69+
],
70+
"relevance_score": 3,
71+
"feed_focus": [
72+
"large tech orgs",
73+
"career growth",
74+
"engineering culture"
75+
]
76+
},
77+
{
78+
"feed_id": "simon_willison",
79+
"feed_name": "Simon Willison's Weblog",
80+
"title": "Thoughts on GitLab's workforce reduction\" and \"structural and strategic decisions\"",
81+
"summary": "GitLab Act 2 There's a lot going on in this announcement from GitLab about the \"workforce reduction\" and \"structural and strategic decisions\" they are making with respect to the agentic era. They're \"planning to reduce the number of countries by up to 30% where we have small teams\". One of the most interesting things about GitLab is that they have employees spread across a large number of countries - 18 are listed in their public employee handbook but this post says they are \"operating in nearly",
82+
"link": "https://simonwillison.net/2026/May/11/gitlab-act-2/#atom-everything",
83+
"published": "2026-05-11T23:58:55+00:00",
84+
"matched_keywords": [
85+
"platform",
86+
"ownership"
87+
],
88+
"relevance_score": 2,
89+
"feed_focus": [
90+
"ai leverage",
91+
"engineering strategy",
92+
"platform thinking"
93+
]
94+
},
95+
{
96+
"feed_id": "simon_willison",
97+
"feed_name": "Simon Willison's Weblog",
98+
"title": "llm 0.32a2",
99+
"summary": "Release: llm 0.32a2 A bunch of useful stuff in this LLM alpha, but the most important detail is this one: Most reasoning-capable OpenAI models now use the /v1/responses endpoint instead of /v1/chat/completions. This enables interleaved reasoning across tool calls for GPT-5 class models. #1435 This means you can now see the summarized reasoning tokens when you run prompts against an OpenAI model, displayed in a different color to standard error. Use the -R or --hide-reasoning flags if you don't w",
100+
"link": "https://simonwillison.net/2026/May/12/llm/#atom-everything",
101+
"published": "2026-05-12T17:45:07+00:00",
102+
"matched_keywords": [
103+
"platform"
104+
],
105+
"relevance_score": 1,
106+
"feed_focus": [
107+
"ai leverage",
108+
"engineering strategy",
109+
"platform thinking"
110+
]
111+
},
112+
{
113+
"feed_id": "sean_goedecke",
114+
"feed_name": "Sean Goedecke",
115+
"title": "Thinking Machines and interaction models",
116+
"summary": "Thinking Machines just released Interaction Models. This is their first real AI model release1 after a year of work and two billion dollars of capital. What is an “interaction model”? First, it’s not a frontier model. Thinking Machines is not yet competing with OpenAI, Anthropic and Google. Instead, they’re working on the problem of better real-time interaction with models. Some parts of what they’re doing are not new at all, other parts are slightly-questionable benchmark gaming, and still othe",
117+
"link": "https://seangoedecke.com/interaction-models/",
118+
"published": "2026-05-12T00:00:00+00:00",
119+
"matched_keywords": [
120+
"feedback"
121+
],
122+
"relevance_score": 1,
123+
"feed_focus": [
124+
"large tech orgs",
125+
"career growth",
126+
"engineering culture"
127+
]
128+
},
129+
{
130+
"feed_id": "sean_goedecke",
131+
"feed_name": "Sean Goedecke",
132+
"title": "AI makes weak engineers less harmful",
133+
"summary": "Like other kinds of puzzle-solving, software engineering ability is strongly heavy-tailed. The strongest engineers produce way more useful output than the average, and the weakest engineers often are actively net-negative: instead of moving projects along, they create problems that their colleagues have to spend time solving. That’s why many tech companies try to build a small, ludicrously well-paid team instead of a large team of more average engineers, and why so far this seems to be a winning",
134+
"link": "https://seangoedecke.com/ai-makes-weak-engineers-less-harmful/",
135+
"published": "2026-05-09T00:00:00+00:00",
136+
"matched_keywords": [
137+
"visibility"
138+
],
139+
"relevance_score": 1,
140+
"feed_focus": [
141+
"large tech orgs",
142+
"career growth",
143+
"engineering culture"
144+
]
145+
},
146+
{
147+
"feed_id": "matklad",
148+
"feed_name": "matklad",
149+
"title": "Learning Software Architecture",
150+
"summary": "In reply to an email asking about learning software design skills as a researcher physicist:",
151+
"link": "https://matklad.github.io/2026/05/12/software-architecture.html",
152+
"published": "2026-05-12T00:00:00+00:00",
153+
"matched_keywords": [
154+
"architecture"
155+
],
156+
"relevance_score": 1,
157+
"feed_focus": [
158+
"strategic ic",
159+
"engineering systems",
160+
"deep thinking"
161+
]
162+
},
163+
{
164+
"feed_id": "leaddev",
165+
"feed_name": "LeadDev",
166+
"title": "The reality of being an engineering manager",
167+
"summary": "Managing teams, not just tickets. The post The reality of being an engineering manager appeared first on LeadDev.",
168+
"link": "https://leaddev.com/career-development/the-reality-of-being-an-engineering-manager?utm_source=leaddev&utm_medium=RSS",
169+
"published": "2026-05-11T09:00:00+00:00",
170+
"matched_keywords": [
171+
"engineering manager"
172+
],
173+
"relevance_score": 1,
174+
"feed_focus": [
175+
"engineering management",
176+
"org maturity",
177+
"leadership"
178+
]
179+
}
180+
]
181+
}

data/reddit_2026-05-15.json

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
{
2+
"scraped_date": "2026-05-15",
3+
"source": "reddit",
4+
"subreddits": [
5+
"experienceddevs",
6+
"cscareerquestions",
7+
"managers"
8+
],
9+
"total_posts": 0,
10+
"posts": []
11+
}

0 commit comments

Comments
 (0)