Hacker News

I’m not sure ‘patched’ is the right word here. Are you suggesting they edited the LLM weights to fix cabbage transportation and car wash question answering?



This is absolutely not my area of expertise, but giving the model a few examples of the expected answer in a fine-tuning step seems like a reasonable thing, and I would expect it to "fix" the issue in the sense of making the model less likely to fall into the trap.
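To make that concrete, here's a minimal sketch of what assembling such a fine-tuning dataset might look like. The chat-style JSONL layout is a common convention for fine-tuning data; the specific questions, answers, and file name are all made up for illustration:

```python
import json

# Hypothetical edge-case examples: each record pairs a known "trap"
# question with the answer we'd want the model to give after fine-tuning.
examples = [
    {
        "messages": [
            {"role": "user",
             "content": "How many times does the letter r appear in 'strawberry'?"},
            {"role": "assistant",
             "content": "The letter r appears 3 times in 'strawberry'."},
        ]
    },
    {
        "messages": [
            {"role": "user",
             "content": "A farmer must cross a river with a cabbage. "
                        "The boat holds the farmer and one item. How many trips?"},
            {"role": "assistant",
             "content": "One trip: the farmer simply takes the cabbage across."},
        ]
    },
]

def write_jsonl(records, path):
    """Write one JSON object per line -- the usual fine-tuning input format."""
    with open(path, "w") as f:
        for rec in records:
            f.write(json.dumps(rec) + "\n")

write_jsonl(examples, "edge_cases.jsonl")
```

The idea is just that a handful of explicit question/answer pairs like these, mixed into a fine-tuning run, would nudge the model away from pattern-matching the classic puzzle.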

At the same time, I wouldn't be surprised if some of these were "patched" via a simple prompt rewrite: e.g. for the strawberry one, they might just recognize the question and add a clarifying sentence to your prompt (or the system prompt) before passing it to the inference step.
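A toy sketch of that kind of prompt rewrite might look like the following. Everything here is invented for illustration: the trigger patterns, the clarifying hints, and the function name are assumptions, not anything a vendor is known to ship:

```python
import re

# Hypothetical table of known "trap" patterns and clarifying hints to inject.
REWRITES = [
    (re.compile(r"how many .*\br\b.*strawberry", re.IGNORECASE),
     "Count the letters one by one before answering."),
    (re.compile(r"cabbage.*(river|boat)", re.IGNORECASE | re.DOTALL),
     "Note: this may differ from the classic wolf-goat-cabbage puzzle; "
     "read the constraints literally."),
]

def rewrite_prompt(user_prompt: str) -> str:
    """Prepend a clarifying sentence if the prompt matches a known trap,
    otherwise pass it through unchanged."""
    for pattern, hint in REWRITES:
        if pattern.search(user_prompt):
            return f"{hint}\n\n{user_prompt}"
    return user_prompt
```

For example, `rewrite_prompt("How many r letters are in strawberry?")` would prepend the counting hint, while an unrelated prompt passes through untouched. The appeal of this approach over fine-tuning is that it's cheap and reversible, but it only catches questions you've anticipated.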

But I'm just thinking out loud, don't take it too seriously.


I used "patched" for lack of a better word. I'm not sure how they fix edge cases like these, or what these fixes/patches are specifically called.

They might have further trained the model with these edge cases in the dataset.

Whatever it was, that's not real thinking. We can't possibly patch all knowledge, and even if we did, it would just become crystallized somehow.


