More

krastanov · 2026-05-14T16:38:18 1778776698

It is recent and still uncommon that private universities have a grad student union. The US also has many great public universities that have had grad student unions since forever

mswphd · 2026-05-14T18:04:53 1778781893

it was illegal for private schools to have grad student unions until ~2016, as (private) graduate students were not classified as employees.

https://www.aaup.org/brief/columbia-university-364-nlrb-no-9...

ijk · 2026-05-14T16:40:11 1778776811

Yeah, private universities being unionized is more recent.

krastanov · 2026-04-20T02:14:10 1776651250

Who has been projecting FTL as a realistic technology ever? FTL is not possible according to the current laws of Physics, while AGI is at least not forbidden by them.

Morromist · 2026-04-20T05:23:49 1776662629

I think this should be something you can answer for yourself by looking at human media and news over the lasy 100 years. I find it hard to belive you haven't ever noticed anyone seriously saying we may possibly have FTL sometime in the future. Incidentially I think I read on HackerNews that Sam Altman has been talking about building Dyson Spheres in the future. I suppose they're not forbidden by the current laws of phyisics either, but I don't know if I would call them a realistic technology.

gtech1 · 2026-04-20T03:19:55 1776655195

We don't even know what human consciousness is. We can't even answer if we have free will or not. And you are proposing that AGI..is what exactly ?

stackbutterflow · 2026-04-20T07:41:24 1776670884

The IQ of the smartest human, the perfect memory storing and recollection of computers, the fact that it never tires. I don't know if it's AGI but it's already something greater than us.

croon · 2026-04-20T14:29:26 1776695366

Is the IQ measured by tests created for and answered by humans in the training data?

AngryData · 2026-04-20T09:31:37 1776677497

If it was greater than humans already it wouldn't need humans to help it work.

gtech1 · 2026-04-20T22:16:17 1776723377

ok...there's no 'general' IQ afaik

ajam1507 · 2026-04-20T03:42:19 1776656539

> We don't even know what human consciousness is.

I think Douglas Hofstadter satisfactorily answered this question.

> We can't even answer if we have free will or not.

Sure we can, it's just that most people don't like the answer.

gtech1 · 2026-04-20T22:15:43 1776723343

> I think Douglas Hofstadter satisfactorily answered this question.

He didn't prove anything. It's a theory, just like many others: https://plato.stanford.edu/entries/consciousness-higher/

> Sure we can, it's just that most people don't like the answer.

Again, not proven

ajam1507 · 2026-04-21T22:25:50 1776810350

There are also plenty of people who think you can't prove whether the Christian God exists. I don't take those people seriously.

Don't mistake continued debate amongst "experts" as being a signal that something is unsettled.

krastanov · 2026-04-08T15:14:13 1775661253

I read OP differently. I thought they said "we should invest in non-dystopian public safety[1] to avoid dystopian populist creating a 1984 version of public safety".

[1]: I imagine this includes things like mental heath help, housing, and other related social safety nets.

krastanov · 2026-04-07T02:27:43 1775528863

> And that state of the art has not moved much in the last decade

This is far from true. On the experimental side, gate fidelities and physical qubit numbers have increased significantly (a couple of orders of magnitude). On the theory side, error correction techniques have improved astronomically -- overhead to of error corrections has dropped by many orders of magnitude. On the error correction side progress has been feverish over the last 4 years in particular.

krastanov · 2026-03-27T01:30:41 1774575041

This sounds incredibly naive. Competition does not magically prevent monopolies -- once you have a dominant player they just buy or undercut the occasional competing startup.

nickpp · 2026-03-27T09:22:03 1774603323

> once you have a dominant player they just buy or undercut the occasional competing startup

If they buy startups, a thousand more will spring up hoping to be bought. Investors love this game.

And if you think undercutting works, read up on the story of Dow and how they broke the German bromine monopoly [1].

There is no such thing as a natural monopoly. Only governments can create monopolies, usually through regulation.

[1] https://en.wikipedia.org/wiki/Herbert_Henry_Dow#Breaking_a_m...

krastanov · 2026-03-24T19:24:45 1774380285

Wine's APIs are more stable than Linux's APIs, so it seems more plausible to me that Wine will become the first class target itself.

TehCorwiz · 2026-03-24T19:27:22 1774380442

I wouldn't be surprised if Wine eventually becomes more stable than Windows.

alexrp · 2026-03-24T20:10:45 1774383045

I've experienced multiple instances where (so I heard; I don't use Windows) a Windows Update completely broke a game on Windows for everyone, but Wine/Proton kept running it just fine. So we're already there in some sense.

wing-_-nuts · 2026-03-25T16:12:21 1774455141

I remember when the shader caching done for a popular game in linux made that game run better than the windows version

carlos_rpn · 2026-03-24T19:42:44 1774381364

It feels like it won't be long before Microsoft starts helping with that (by making Windows less stable, not improving Wine).

keyringlight · 2026-03-24T20:01:53 1774382513

What I wonder about is if MS wants to keep people on windows, what methods they can use to do that. For simple desktop stuff I don't think they have many options to lock in other developers (and their audiences) to windows unless they want do so themselves (putting aside web based or not PC-desktop).

Bleeding edge gaming and multiplayer anti-cheat is one area where I think having a big company owning the OS probably helps them stay ahead, as that structure probably lets them work with hardware designers to get the capabilities in use (i.e. in new versions of DirectX) and available to software developers first. There's generally a lag in adoption for new features within Vulkan and then usage downstream in wine/proton to get compatibility parity with windows, then the games themselves being able to run feature/performance parity. It'd be interesting to see what cooperation would be needed to have the linux gaming stack equal at the point new features are released, and with the least amount of manual hacks or command line tweaking required for the users. As discussed a few weeks back, tough anti-cheat for linux seems like a paradox with the current methods.

mschuster91 · 2026-03-24T21:12:54 1774386774

> What I wonder about is if MS wants to keep people on windows, what methods they can use to do that

Microsoft doesn't give a fuck about private customers any more. They don't have money.

What has money though is enterprise/government sales, and MS got these customers tightly locked in. Compliance audits and tooling for insurances or legal stuff (SOX, GDPR, ...) are built against a full Microsoft stack of MS Server, Active Directory, Azure, Teams, Office 365 and Windows desktops.

You might be able to get away with replacing AD and GPO with Samba servers but even that is already a pain when the auditors come knocking. Everything else? There is no single FOSS based "standard offering" (i.e. a combination of everything needed to run an on-prem enterprise site, Office replacement, remote collaboration tooling), so every audit for such setups must be custom made and involves a lot of extra work.

A second leg is industrial control machines, medical devices and the likes. That's all stuff built by third party vendors and integrators. They need to continue on Windows because switching to an alternative OS would require redoing everything from scratch on the software and certification side. These customers buy the LTSC IoT stuff.

And that is why you see Microsoft pushing enshittification so hard on private customers... extract the last few cents you can from them. But the real money comes from the large customers.

Aerroon · 2026-03-24T20:09:35 1774382975

Windows 14 will just be a linux distro with wine acting as backwards compatibility.

pjmlp · 2026-03-25T09:26:43 1774430803

Why would Microsoft ever do that?

WSL is already there for the folks that want to play with Linux.

Aerroon · 2026-03-26T00:10:56 1774483856

For a while it seemed like Microsoft wasn't taking Windows seriously as a product. And the easiest way to cut costs on that is to use a solution someone else is building and maintaining. They did retire Internet Explorer in favor of a Chromium browser, so it wouldn't be unprecedented.

pjmlp · 2026-03-26T06:57:43 1774508263

Edge has lots of features on top of Chromium.

voodooEntity · 2026-03-24T21:24:10 1774387450

Inb4 Windows 40k and to run the "kernel" you need to sacrifice 1000 a day

beAbU · 2026-03-25T12:36:46 1774442206

By the time we get to version 40k the kernel will just be AI hallucinating a UI for you at 60fps.

Secondly, I do acknowlege your 40k reference.

porphyra · 2026-03-24T19:51:10 1774381870

Wine actually does run some ancient Windows games better than Windows 11 itself.

anthk · 2026-03-24T21:07:10 1774386430

Anything Direct Draw related will be mapped into OpenGL under Unix giving you decent speeds. On Windows it will be a crawling slideshow because from Windows 8 and up it will use a really dog slow software mode with no acceleration at all, worse than plain VESA. Yes, you can reuse WineD3D DLL's on Windows and run these game in a fast way, but not by default, it's a Win32 port of some Wine libraries.

rescbr · 2026-03-25T01:25:17 1774401917

Once I had to use a Mesa3D build for Windows and use the zink driver to render OpenGL to Vulkan, otherwise it would use Windows' software renderer.

anthk · 2026-03-25T08:02:24 1774425744

I had to use WineD3D's ddraw.dll among another one to run these touch based arcade machine games with card games, Trivial Pursuit, hangman and the like. If not the game made for w98/2k would really lag even under an i3.

The same with some multimedia CD's from its day. Scummvm it's partially implementing Macromedia Director support but the mentioned game had a custom engine. The Scummvm devs would RE in few weeks (it's a simple 2D game bundle, nothing difficult, with virtually no animations, almost everything it's still images) but no one began yet.

duskwuff · 2026-03-24T19:58:51 1774382331

It certainly runs 16-bit Windows games better than Windows 11, which can't run them at all. Not that there are a ton of those, but it's still pretty neat that they work.

senfiaj · 2026-03-25T01:04:58 1774400698

16-bit software won't run natively in 64-bit mode. It requires some programmatic emulator, like DosBox. Or am I missing something?

simoncion · 2026-03-25T03:19:08 1774408748

The thing that you're missing is that Microsoft used to ship that emulator with Windows. Then they stopped doing that.

AFAICT, Wine can run WIN16 programs. I don't know if it can run DOS programs. There's a WineHQ wiki page that says it can load DOS programs, but various internet fora seem to believe that Wine's DOS support is pretty broken. I've never tried it, and have no DOS programs handy, so I can't verify those claims.

duskwuff · 2026-03-25T05:39:18 1774417158

"DOS support" is tricky inasmuch as a lot software from that era - especially larger and more complex packages - interacted with hardware directly. In a sense, they weren't really DOS applications so much as they were bare-metal PC applications which were booted from DOS. It'd be difficult for WINE to support those, and other projects like DOSbox / 86box / etc do a better job of it.

NetMageSCW · 2026-03-26T19:28:12 1774553292

There is also a port of Wine’s VDM back to Windows called otvdm or winevdm that is able to run 16-bit programs on Windows. It is surprisingly capable, I was able to run a 16-bit VB program that used a serial based optical modem without issue.

consp · 2026-03-25T07:29:14 1774423754

Time to dust off my cd copy of Stars! (From the disk backup, the cd had terminal illnesses and has died). The only win16 game I've ever seen distributed on CDROM. Wine already ran it ok (iirc there were some issues but nothing gamebraking), but now it can do so without i386 libs.

beAbU · 2026-03-25T12:39:27 1774442367

Not just ancient games!

https://arstechnica.com/gaming/2025/06/games-run-faster-on-s...

wing-_-nuts · 2026-03-25T16:15:47 1774455347

MS got such a black eye for that that they're developing a build of windows specifically for handhelds, optimized, without the bloat and power hungry extras. Would be nice if it ran on laptops

zadikian · 2026-03-25T06:47:12 1774421232

Wine is also the reason why Windows software has more longevity on Mac than Mac software. Like, 32-bit didn't get deprecated in Wine.

NetMageSCW · 2026-03-26T19:30:08 1774553408

If enough people cared it seems like someone would have created a MacOS32 in MacOS64 like WoW64.

zadikian · 2026-03-26T22:27:06 1774564026

I'm guessing it'd be too much reverse-engineering. WoW64 was Microsoft themselves.

_flux · 2026-03-24T19:34:06 1774380846

What I'd like to see would be some useful extra APIs in Wine, that would allow it to perform even better in some situations, and that such APIs would be then embraced by the game developers.

Finally some embrace, extend, and extinguish love right back at Microsoft!

NetMageSCW · 2026-03-26T19:36:04 1774553764

That will never happen.

HerbManic · 2026-03-24T20:22:19 1774383739

Ever since Proton came along, it has been a quiet agreement that Win32 APIs are the best target for Linux support.

zerocrates · 2026-03-24T19:44:32 1774381472

Building against the Steam runtime containers seems like the other route, which also gets you more stability.

beAbU · 2026-03-25T12:34:53 1774442093

I agree with this take. Wine/Proton might become something akin to a runtime for games, running on many platforms and consoles. This means devs might stop targeting windows directly, but rather they target wine and you'll need that for your games on Windows.

bsimpson · 2026-03-25T14:50:42 1774450242

Valve nudges developers to ship/support their "one best version" of a game, and trust compatibility layers to make it work for everyone else.

For x86, that's Windows. For mobile/VR, it's Android.

akdev1l · 2026-03-24T19:49:18 1774381758

People always say this to shit on glibc meanwhile those guys bend over backwards to provide strong API compatibilities. It rubs me off the wrong way.

What glibc does not provide is forward compatibility. An application built with glibc 2.12 will not necessarily work with any older version.

Such application could be rebuilt to work with an older glibc as the API is stable. The ABI is not which is why the application would need to be rebuilt.

glibc does not provide ABI compatibility because from their perspective the software should be rebuilt for newer/older versions as needed. Maintaining a stable ABI mostly helps proprietary software where the source is not available for recompilation. Naturally the gnu guys building glibc don’t care about that use case much.

I guess you didn’t mention glibc in your comment but I already typed this out

kelnos · 2026-03-24T21:00:07 1774386007

> What glibc does not provide is forward compatibility. An application built with glibc 2.12 will not necessarily work with any older version.

Is this correct? I think you perhaps have it backward? If I compile something against the glibc on my system (Debian testing), it may fail to run on older Debian releases that have older glibc versions. But I don't see why an app built against glibc 2.12 wouldn't run on Debian testing. glibc actually does a good job of using symbol versioning, and IIRC they haven't removed any public functions, so I don't see why this wouldn't work.

More at issue would be the availability of other dependencies. If that old binary compiled against glibc 2.12 was also linked with, say, OpenSSL 0.9.7, I'd have to go out and build a copy of that myself, as Debian no longer provides it, and OpenSSL 3.x is not ABI-compatible.

> glibc does not provide ABI compatibility because from their perspective the software should be rebuilt for newer/older versions as needed.

If true (I don't think it is), that is a hard showstopper for most companies that want to develop for Linux. And I wouldn't blame them.

jdpage · 2026-03-25T00:20:14 1774398014

I don't know what the official policy is, but glibc uses versioned symbols and certainly provides enough ABI backward-compatibility that the Python package ecosystem is able to define a "manylinux" target for prebuilt binaries (against an older version of glibc, natch) that continues to work even as glibc is updated.

akdev1l · 2026-03-24T22:32:25 1774391545

Sorry I am not sure if 2.12 is a a recent release or older, I made up this number up

If the application is built against 2.12 it may link against symbols which are versioned 2.12 and may not work against 2.11 - the opposite (building against 2.11 and running on 2.12) will work

>If true (I don't think it is), that is a hard showstopper for most companies that want to develop for Linux.

Not really a show stopper, vendors just do what vendors do and bundle all their dependencies in. Similar to windows when you use anything outside of the win32 API.

The only problem with this approach is that glibc cannot have multiple versions running at once. We have “fixed” this with process namespaces and hence containers/flatpak where you can bundle everything including your own glibc.

Naturally the downside is that each app bundles their own libraries.

em-bee · 2026-03-24T22:51:20 1774392680

The only problem with this approach is that glibc cannot have multiple versions running at once

that's not correct. libraries have versions for a reason. the only thing preventing the installation of multiple glibc versions is the package manager or the package versioning.

this makes building against an older version of glibc non-trivial, because there isn't a ready made package that you can just install. the workarounds take effort:

https://stackoverflow.com/questions/2856438/how-can-i-link-t...

the problem for companies developing on linux is that it is not trivial

akdev1l · 2026-03-25T00:13:19 1774397599

glibc must match the linker so you would need a separate linker and the binaries usually have a hardcoded path to the system linker (and you need to binary patch the stuff - https://stackoverflow.com/questions/847179/multiple-glibc-li...)

So in practice you can only have 1 linker, 1 glibc (unless you do chroot or containers and at that point just build your stuff in Ubuntu 12.04 or whatever environment)

seba_dos1 · 2026-03-24T23:09:55 1774393795

You compile in a container/chroot with the userspace you target. Done.

In the context of games, that will likely be Steam Runtime.

em-bee · 2026-03-25T00:02:41 1774396961

it's not that simple. you want to be able to use a modern toolchain (compilers that support the latest standards) but build a binary that runs on older systems.

the only way to achieve that is to get the older libraries installed on a newer system, or you could try backporting the new toolchain to the older system. but that's a lot harder.

seba_dos1 · 2026-03-25T01:18:25 1774401505

It may be hard-ish, sometimes. Sometimes it's a breeze. And sometimes you can just use host's toolchain with container's sysroot and proceed as if you were cross-compiling. Most of the time it's not a big deal.

SAI_Peregrinus · 2026-03-25T14:28:15 1774448895

MUSL is a better libc for companies making proprietary binaries. They can either statically link it, or provide a .so with the musl version they want their programs to use & dynamically link that.

charcircuit · 2026-03-24T20:12:27 1774383147

No other operating system works like this. Supporting older versions of an OS or runtime with a compiler toolchain a standard expectation of developers.

akdev1l · 2026-03-24T20:21:09 1774383669

Plenty of operating systems work like this. Just not highly commercial ones because proprietary software is the norm on those.

From a bit of research it looks like FreeBSD for example only provides a stable ABI within minor versions and I imagine if you build something for FreeBSD 14 it won’t work on 13.

Stable ABI literally only benefits software where the user doesn’t have the source. Any operating system which assumes you have the source will not prioritize it.

(Edit: actually thinking harder MacOS/iOS is actually much worse on binary compatibility, as for example Intel binaries will stop working entirely due to M-cpu transition - Apple just hits developers with a stick to rebuild their apps)

kelnos · 2026-03-24T21:04:39 1774386279

Yes, and this is a great reason why FreeBSD isn't a popular gaming platform, or for proprietary software in general. I'm not saying this is a bad thing, but... that's why.

> Stable ABI literally only benefits software where the user doesn’t have the source.

It also benefits people who don't want to have to do busywork every time the OS updates.

toast0 · 2026-03-24T21:38:26 1774388306

FreeBSD isn't too bad, you can build/install compat packages back to FreeBSD 4.x, and I'd expect things to largely work. At previous jobs we would mostly build our software for the oldest FreeBSD version we ran and distribute it to hosts running newer FreeBSD releases and outside some exceptional cases, it would work. But you'd have to either only use base libraries, or be careful about distribution of the libraries you depend on. You can't really use anything from ports, unless you do the same build on oldest and distribute plan.

At Yahoo, we'd build on 4.3-4.8, and run on 4.x - 8.x. At WhatsApp, I think I remember mostly building on 8.x and 9.x, for 8.x - 11.x. The only thing that I remember causing major problems was extending the bitmask for CPU pinning; there were a couple updates where old software + old kernel CPU pinning would work, and old software + new kernel CPU pinning failed; eventually upstream made that better as long as you don't run old software on a system with more cores than fit in the bitmask. I'm sure there were a few other issues, but I don't remember them ...

throwaway2046 · 2026-03-25T06:11:31 1774419091

> Stable ABI literally only benefits software where the user doesn’t have the source

Stable ABI benefits everyone. If I need to recompile a hundred packages with every OS update instead of doing real work then there's something seriously wrong with my OS.

charcircuit · 2026-03-24T22:00:50 1774389650

You can still run x86 binaries on new macbooks. They don't stop working entirely. Using wine I can even run x86 windows binaries.

akdev1l · 2026-03-24T22:34:09 1774391649

They announced Rosetta 2 will be deprecated and eventually removed (MacOS 28?)

By that point they already hit the developers enough to get them to port to aarch64

(arguably though this could be a special case because it is due to architectural transition)

littlecranky67 · 2026-03-25T08:59:29 1774429169

Apple said they will keep Rosetta 2 for select usecases, such as gaming. They do have a user base that uses Steam and bought mac games - without rosetta that would mean the users could no longer play their game. And no one ports 5-10year old games.

wing-_-nuts · 2026-03-25T16:32:36 1774456356

Can one run a windows version of a game well over on say, a MBP?

I ask because my current laptop is getting long in the tooth, and if I were just buying it for productivity stuff, the current MBPs are beasts, but last time I checked years ago, gaming on os x was in a sad state, even compared to linux.

littlecranky67 · 2026-03-26T00:14:17 1774484057

Crossover works wonders. Currently playing Hogwarts Legacy from the Epic Store on my Macbook M4 Pro on 1080p in Medium with a PS5 controller

thescriptkiddie · 2026-03-24T20:37:52 1774384672

what about mac os?

kelnos · 2026-03-24T21:01:20 1774386080

macOS doesn't require developers to rebuild apps with each major OS release, as long as they link with system libraries and don't try to (for example) directly make syscalls.

Apple may require rebuilds at some point for their Mac Store (or whatever they call it), but it's not required from a technical perspective.

The one exception here is CPU architecture changes, and even then, Apple has provided seamless emulation/translation layers that they keep around for quite a few years before dropping support.

thescriptkiddie · 2026-03-26T15:25:20 1774538720

that's backwards compatibility. forward compatibility is being able to run new apps on an old operating system. the latest version of the SDK builds apps which only run on big sur or newer.

charcircuit · 2026-03-24T21:56:51 1774389411

The latest Xcode supports targeting back to macOS 11. This covers >99% of macs which is acceptable for most developers.

https://developer.apple.com/support/xcode/

thescriptkiddie · 2026-03-26T15:29:36 1774538976

that leaves out every mac made before 2014

NetMageSCW · 2026-03-26T19:40:59 1774554059

At less than 1% why does it matter?

Levitating · 2026-03-25T00:23:54 1774398234

I personally believe we should just compile games statically. Problem solved, right?

krastanov · 2026-03-24T21:03:57 1774386237

I am sorry, I did not mean to imply anyone else is doing something poorly. I believe glibc's (and the rest of the ecosystem of libraries that are probably more limiting) policies and principled stance are quite correct and overall "good for humanity". But as you mentioned, they are inconvenient for a gamer that just wants to run an executable from 10 years ago (for which the source was lost when the game studio was bought).

em-bee · 2026-03-24T23:14:28 1774394068

that 10 year old binary should run, unless it links against a library that no longer exists.

for example here is a 20 year old binary of the game mirrormagic that runs just fine on my modern fedora machine:

    ~/Downloads/mirrormagic-2.0.2> ldd mirrormagic
        linux-gate.so.1 (0xf7f38000)
        libX11.so.6 => /lib/libX11.so.6 (0xf7db5000)
        libm.so.6 => /lib/libm.so.6 (0xf7cd0000)
        libc.so.6 => /lib/libc.so.6 (0xf7ad5000)
        libxcb.so.1 => /lib/libxcb.so.1 (0xf7aa9000)
        /lib/ld-linux.so.2 (0xf7f3b000)
        libXau.so.6 => /lib/libXau.so.6 (0xf7aa4000)
    ~/Downloads/mirrormagic-2.0.2> ls -la mirrormagic
    -rwxr-xr-x. 1 em-bee em-bee 203633 Jun  7  2003 mirrormagic

ok, there are some issues: the sound is not working, and the resolution does not scale. but there are no issues with linked libraries.

cylemons · 2026-03-25T10:18:52 1774433932

This a toolchain issue rather than OS issue. This wounldn't have been a problem if gcc/clang just took a --stdlib-version option and built the executables linking to that version of glibc or equivalent.

krastanov · 2026-03-22T17:04:55 1774199095

This is fascinating to me. I completely believe you and I will not bother you with all the common "but did you try to tell it this or that" responses, but this is such a different experience from mine. I did the exact same task with claude in the Julia language last week, and everything worked perfectly. I am now in the habit of adding "keep it simple, use only public interfaces, do not use internals, be elegant and extremely minimal in your changes" to all my requests or SKILL.md or AGENTS.md files (because of the occasional failure like the one you described). But generally speaking, such complete failures have been so very rare for me, that it is amazing to see that others have had such a completely different experience.

aqua_coder · 2026-03-22T18:07:26 1774202846

My experience with working with AI agents is that they can be verbose and do things that are too over complicated by default. Unless directed explicitly. Which may be the reason for this discrepancy.

jayd16 · 2026-03-22T17:45:07 1774201507

You say it doesn't fail but you also mention all these work around you know and try...sounds like it fails a lot but your tolerance is different.

jmalicki · 2026-03-22T18:34:52 1774204492

Most people I've seen complain say things like "I asked it for code and it didn't compile."

The real magic of LLMs comes when they iterate until completion until the code compiles and the test passes, and you don't even bother looking at it until then.

Each step is pretty stupid, but the ability to very quickly doggedly keep at it until success quite often produces great work.

If you don't have linters that are checking for valid syntax and approved coding style, if you don't have tests to ensure the LLM doesn't screw up the code, you don't have good CI, you're going to have a bad time.

LLMs are just like extremely bright but sloppy junior devs - if you think about putting the same guardrails in place for your project you would for that case, things tend to work very well - you're giving the LLM a chance to check its work and self correct.

It's the agentic loop that makes it work, not the single-shot output of an LLM.

alecbz · 2026-03-22T19:20:13 1774207213

Stuff like this works for things that can be verified programmatically (though I find LLMs still do occasionally ignore instructions like this), but ensuring correct functionality and sensible code organization are bigger challenges.

There are techniques that can help deal with this but none of them work perfectly, and most of the time some direct oversight from me is required. And this really clips the potential productivity gains, because in order to effectively provide oversight you need to page in all the context of what's going on and how it ought to work, which is most of what the LLMs are in-theory helping you with.

LLMs are still very useful for certain tasks (bootstrapping in new unfamiliar domains, tedious plumbing or test fixture code), but the massive productivity gains people are claiming or alluding to still feel out of reach.

jmalicki · 2026-03-22T19:28:36 1774207716

It depends - there are some very very difficult things that can still be easily verifiable!

For instance, if you are working on a compiler and have a huge test database of code to compile that all has tests itself, "all sample code must compile and pass tests, ensuring your new optimizer code gets adequate branch coverage in the process" - the underlying task can be very difficult, but you have large amounts of test coverage that have a very good chance at catching errors there.

At the very least "LLM code compiles, and is formatted and documented according to lint rules" is pretty basic. If people are saying LLM code doesn't compile, then yes, you are using it very incorrectly, as you're not even beginning to engage the agentic loop at all, as compiling is the simplest step.

Sure, a lot of more complex cases require oversight or don't work.

But "the code didn't compile" is definitely in "you're holding it wrong" territority, and it's not even subtle.

alecbz · 2026-03-22T19:47:43 1774208863

Yeah performance optimization is potentially another good area for LLMs to shine, if you already have a sufficiently comprehensive test suite, because no functionality is changing. But if functionality is changing, you need to be in the loop to, at the very least, review the tests that the LLM outputs. Sometimes that's easier than reviewing the code itself, but other times I think it requires similar levels of context.

But honestly I think sane code organization is the bigger hurdle, which is a lot harder to get right without manual oversight. Which of course leads to the temptation to give up on reviewing the code and just trusting whatever the LLM outputs. But I'm skeptical this is a viable approach. LLMs, like human devs, seem to need reasonably well-organized code to be able to work in a codebase, but I think the code they output often falls short of this standard.

(But yes agree that getting the LLM to iterate until CI passes is table-stakes.)

jmalicki · 2026-03-22T19:56:10 1774209370

Strongly agreed!

I think getting good code organization out of an LLM is one of the subtler things - I've learned quite a bit about what sort of things need to be specified, realizing that the LLM isn't actively learning my preferences particularly well, so there are some things about code organization I just have to be explicit about.

Which is more work, but less work than just writing the code myself to begin with.

maccard · 2026-03-23T08:30:05 1774254605

> The real magic of LLMs comes when they iterate until completion until the code compiles and the test passes, and you don't even bother looking at it until then.

If you read my post, you’d see that Claude code didn’t do that, I had to intervene in the agent loop and when I did it undid my fixes.

maleldil · 2026-03-23T10:52:54 1774263174

This is not compatible with many people's experiences. I use Python with a type checker. I tell Claude that the task is only complete once the type checker passes cleanly. It doesn't stop until there are no type errors. This should be even easier in a compiled language, especially if you also tell it to run the tests.

In fact, I find that with a strict feedback loop set up (i.e. a lot of lint rules, a strict type checker and fast unit tests), it will almost always generate what I want.

As someone else said, each step might be pretty stupid, but if you have a fast iteration loop, it can run until everything passes cleanly. My recommendation is to specify what really counts as "done" in your AGENTS.md/CLAUDE.md.

maccard · 2026-03-23T11:10:35 1774264235

I tried again this morning - https://news.ycombinator.com/item?id=47487638 another hard failure that Claude code says passed.

zorak8me · 2026-03-22T18:04:08 1774202648

Providing instruction and context doesn’t seem like a “workaround”.

theshrike79 · 2026-03-23T07:01:35 1774249295

Agents are LLMs that use tools in a loop

I didn't give my Agent any tools, it hallucinated code.

...well, yes. It's like evaluating a programmer's skill by having one-shot a program on paper with zero syntax errors.

plagiarist · 2026-03-22T18:14:20 1774203260

IME it does fail pretty hard at first. One has to build up a little library of markdown and internal library of prompt techniques. Then it starts working okay. I agree there is a hurdle still, trying it on one task doesn't really get one over the hurdle.

sigbottle · 2026-03-23T16:16:31 1774282591

Idk. Had a friend recommend me gsdv2 and I wasted like 100$ + so much time trying to debug said crap. I went back to codex and it 1 shot my problems easily.

And this was from two people who were 100% aligned on agentic AI coding. I've been using AI for years now and agentic AI for several months now. I was told that I bring out the worst in LLMs. Except... I was able to achieve better results, WITH LLMs, on OTHER frameworks. So like, ?

It may be easier to draw the boundary between "AI and non AI users", but as AI becomes prolific, the us vs them angle that people keep using won't apply anymore.

The age old "user versus tool" debate goes on, but it seems like gaslighting is popular these days. I classify it as gaslighting because I'm clearly a falsifiable test case, I'm even gung ho for LLMs, yet any kind of dissent is immediately warped into user error. It doesn't matter what you say or where you are on the spectrum, if you have 1 bad experience and speak up about it it's an issue. Guess that's not really an issue for the human condition though.

shafyy · 2026-03-22T17:16:24 1774199784

It's almost like.... LLMs are non-deterministic and hallucinating... Oh wait?!

krastanov · 2026-03-22T19:02:22 1774206142

non-deterministic does not mean it is not biased towards a particular type of results (helpful results)

krastanov · 2026-02-20T05:11:49 1771564309

Strict police does sound quite a bit less bad than fascist police...

mr_00ff00 · 2026-02-20T05:45:01 1771566301

You have to know very little about China to think that it is somehow more favorable to foreigners and minorities than the US.

What the US does is bad, but somehow Americans think that means everywhere else is better.

ruszki · 2026-02-20T07:55:07 1771574107

I don’t know but I and my friends still visit China regularly, but not the US anymore, because we have no clue what’s the expectation there to not be in a jail for weeks. I have quite clear idea what the expectation in China, but not the US. Maybe there is something to it.

seanmcdirmid · 2026-02-21T03:23:10 1771644190

China is great for visitors, especially lighter skinned visitors. You probably won’t go to jail in China unless you have a thing for drinking a lot in Chinese bars, even then you will probably be ok as long as you don’t pick any fights.

Illegal immigration really isn’t a thing in China beyond a few North Koreans in Dongbei and a few Laotians in Yunnan. So they just won’t assume you are an illegal immigrant.

krastanov · 2026-02-21T02:52:40 1771642360

I know that China is an authoritarian near-dictatorial country that oppresses minorities and commits cultural genocide. And I am not an American.

That does not seem to be all that related to the original post I was answering to. An average person / citizen / visitor has way less to worry about around (trained) Chinese police than they have to worry about around an (gangster) American ICE agent.

foxglacier · 2026-02-24T20:39:13 1771965553

You should also know that China doesn't have as much rule of law as America and indeed average people have a lot to worry about around the police, and generally are very careful not to do things that will get them beaten up. But it happens nontheless - violent assaults on street sellers by police, for instance. In China, people are actually scared of the police because the policemen have so much arbitrary power. They're not strictly and fairly enforcing the law at all.

krastanov · 2026-02-19T12:35:01 1771504501

As an aside, it is really interesting to see a computational package that, while supporting multiple GPU vendors, was first vetted on AMD, not NVidia. It is encouraging to see ROCM finally shaking off its reputation for poor support.

mr_octopus · 2026-02-19T23:09:40 1771542580

The vendor-agnostic GPU approach via KernelAbstractions is great to see. The Vulkan compute path is underrated for this — it runs on AMD, NVIDIA, and Intel without needing ROCm or CUDA, just whatever driver ships with the GPU.

Re: the compilation latency discussion — it's a real tension. JIT gives you expressiveness but kills startup. AOT gives you instant start but limits flexibility. Interesting that most GPU languages went JIT when the GPU itself runs pre-compiled SPIR-V/PTX anyway.

simondanisch · 2026-02-19T13:11:50 1771506710

well, I do hate vendor lock in with a passion ;) But yeah, a lot did happen, this likely wouldn't have been possible one or two years ago!

krastanov · 2026-02-17T14:55:36 1771340136

I am happy to hear that things as not as bad as I thought, but my experience being judge/mentor for a couple of years for the high school science fair near a top university was very discouraging and closer to what the author of the article describes.

Maybe the mass of the kids at the first round were what you describe, but very quickly the focus turned to the top 20% who were very much "reputation laundering" and "CV padding" internships at labs, not actual curiosity driven independent exploration

ModernMech · 2026-02-17T17:23:54 1771349034

On the one hand, I understand why this can seem disheartening; we want the kids to be pure and protect them from the BS of the world. It would be great if we could just allow kids to play around with science, and there's no pressure to compete or perform.

At the same time, the reason they're doing the CV padding is because we, the adults in control of those systems they wish to gain access to (which is not the science fair), have made it so they need to pad their CV in the first place.

So as much as we want them to be purely driven by curiosity and independent exploration, our society at large does not allow for that kind of thing. Fixing those kinds of macro incentives doesn't happen by reforming the science fair.

If we want curiosity driven independent exploration for children, our society should provide that modality for adults. That we don't provide it for adults is reflected on our children; because we impose credentialism on adults, the adults understand credentialism is important to success and they impose it on their children. If adults understood that creative inquiry and independent exploration are paths to success, then more children would be encouraged to pursue that at the elite level.