Hacker News | AlexCoventry's comments

Sounds like bitterness and resignation, to me.

Right? It's exactly how we train AIs, after all. It's not like this is mysterious.


How is this one a submarine? It is not even PR.

It's potentially PR (it hit the front page of HN, and probably not organically) for Trump, Oracle or SAP, just from reading the first few paragraphs.

I've been using them for decades without issue, FWIW.

You are either a scientific anomaly or a single data point. Or both.

Well, then I am another anomaly and data point. I've been using ear plugs for 30 years, at least 95% of nights. Never once had an ear infection.

And by induction, earplugs don't give anyone an ear infection. QED!

I am also a scientific anomaly. Seems there are a lot of us!

You're a cohort now!

I have found it depends on how comfortable the earplugs are. If they feel uncomfortable in the ear, there's a good chance I'll get an infection or inflammation in the next few days.

Me too: 20 years without a single ear infection, and without a single day without ear plugs.

He describes in detail how curl is software-engineered to within an inch of its life. Do you really think most code is that highly polished?

You don't think AI is going to be able to understand things and apply its ability to formulate solutions better than you, in the near future?

In 2000 I learned about this old technology called "neural networks".

AI really depends on long winters and rare breakthroughs. Deep neural networks were the most recent breakthrough.

The iterations you currently see are just adding more storage, but the fundamental neural-network structure doesn't change.

I'm confident AGI will not be achieved by the LLM architecture, and when the next AI breakthrough comes is anyone's guess. But if you take history into account, it will take a while.


Yes, same. From the late 90s through the early aughts, I was taught over and over again that neural networks were a dead-end concept and would never amount to anything.

Just like all the preceding AI booms, this one will hit its maximal point, the hype train will fizzle, the best parts will just become "normal", and then a couple of decades later something new will come to push the boundary again.


No, I don't. Do you? If so, why? Extrapolation from guesswork?

Yeah, I have an allergic reaction to tiktok being mixed up in any serious intellectual pursuit. :-)

Bandwidth is the killer, in distributed LLM training.
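A back-of-envelope sketch of why bandwidth dominates: in naive data-parallel training, every worker has to exchange a full gradient copy each step. The model size, precision, and link speeds below are illustrative assumptions, not measurements of any real system.

```python
# Back-of-envelope: per-step gradient traffic in naive data-parallel training.
# All numbers are illustrative assumptions, not benchmarks.

def sync_time_seconds(params: float, bytes_per_param: float, link_gbps: float) -> float:
    """Time to ship one full gradient copy over a link,
    ignoring overlap, compression, and all-reduce tricks."""
    grad_bytes = params * bytes_per_param
    link_bytes_per_s = link_gbps * 1e9 / 8
    return grad_bytes / link_bytes_per_s

params = 7e9   # assume a 7B-parameter model
fp16 = 2.0     # bytes per gradient value in half precision

home = sync_time_seconds(params, fp16, 1.0)        # 1 Gbps consumer link: minutes per step
fabric = sync_time_seconds(params, fp16, 900 * 8)  # ~900 GB/s datacenter-class fabric: milliseconds
```

The three-orders-of-magnitude gap between the consumer link and the datacenter fabric is the point: over the open internet, the sync time swamps the compute time per step.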

What’s the rush?

It depends on the purpose of the model. AFAIK, LLMs aren't particularly capable at researching answers, relying more on having 'truth' baked into their weights, so if it takes 12 months to train a crowd-trained LLM, it'll be 12 months behind the times.

How serious a risk is poisoned weights?

Can we leverage the cryptobros into using LLM training as a proof of work?


What? I use Qwen 3.5 35B-A3B and it definitely knows how and when to do web searches to fill in gaps in its knowledge.

Does Qwen3.5 know it needs to do this because the API in question has had loads of churn and much of its training data is on obsolete versions, or do you need to prompt it? How well does it handle having an API reference with sample code in its context window?

Having an LLM use a web search tool isn't the same thing as researching a topic, IMO, because it's so ephemeral and needs constant reinforcement. LLMs aren't learning machines, they're static ones.
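The "ephemeral" point can be sketched as a toy tool-use loop: the search result is injected into the context window for one exchange and never written back into the weights, so the next conversation starts from scratch. Every name here is a hypothetical stand-in, not any real model or API.

```python
# Toy sketch of ephemeral tool use: search results live only in the
# context of one exchange; the model's weights are never updated.
# All names are hypothetical, for illustration only.

def fake_search(query: str) -> str:
    """Stand-in for a real web-search tool."""
    return f"[top results for: {query}]"

def answer(question: str, weights_are_stale: bool) -> str:
    context = [question]
    if weights_are_stale:                      # model decides its baked-in knowledge is outdated
        context.append(fake_search(question))  # result goes into the context window only
    return " | ".join(context)                 # weights untouched; the next chat forgets all this
```

Contrast with actual learning: nothing in this loop persists, so the same gap has to be re-filled by a fresh search every session.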


How many facts change over time to create obsolete data? Unless you’re researching current events, I contend it’s a moot point.

You only need to train a range of small models in order to establish a plausible scaling law, IMO.
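One way to sketch that idea: train a few small models, fit a power law L(N) = a * N**(-b) in log-log space, and extrapolate. The runs below are synthetic points generated from an assumed exponent, purely to illustrate the fitting step.

```python
import math

# Sketch: recover a scaling-law exponent from a handful of small runs.
# The "runs" are synthetic, generated from an assumed law with b = 0.076,
# so this only illustrates the fit, not any real measurement.

runs = [(1e7, 10 * (1e7) ** -0.076),   # (param count, final loss)
        (1e8, 10 * (1e8) ** -0.076),
        (1e9, 10 * (1e9) ** -0.076)]

# A power law is a straight line in log-log space; fit its slope by least squares.
xs = [math.log(n) for n, _ in runs]
ys = [math.log(l) for _, l in runs]
k = len(runs)
slope = (k * sum(x * y for x, y in zip(xs, ys)) - sum(xs) * sum(ys)) / \
        (k * sum(x * x for x in xs) - sum(xs) ** 2)
b = -slope  # recovered exponent
```

With real (noisy) runs you'd want more points and error bars, but the shape of the procedure is the same: small models pin down the line, and the large model is read off the extrapolation.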

I don't think this is giving up. He's getting inside information on how Claude works, and a huge stream of Claude usage data. This will all inform future grok development, IMO.
