
There's a YouTube channel documenting the entire Japanese PC Engine game library: https://www.youtube.com/playlist?list=PL1sb8k4ZPagYwRxx-s3mr...

They previously did the entire Famicom library.


I highly doubt anyone would have expected a chat transcript from a fairly insular group to be made public in this way decades later...

That's why it's a good anecdotal example that, despite their reputations in some quarters, OpenBSD devs are real human beings who even have fun!

EDIT: clarify in context of my GP post.


MTP support is being added to llama.cpp, at least for the Qwen models (https://github.com/ggml-org/llama.cpp/pull/20533), and I'd imagine Gemma 4 will come soon.

The performance uplift on local/self-hosted models in both quality and speed has been amazing in the last few months.


There is a newer PR which will probably be merged soon: https://github.com/ggml-org/llama.cpp/pull/22673

Ohhhh geee!!! I just applied the patch to my local git copy. You need to use the model he submitted on the PR; the model is special because it has the extra information that allows the MTP to happen. I have two AMD GPUs, and qwen3.6 27B Q6_K does around 20t/s generation... If I run it on only one I get like 35t/s.

But with this patch I saw 46t/s with qwen3.6 27B q8... this is insane, well over double the original speed; there was no GPU I could upgrade to that would get that kind of boost, amazing!


Which AMD GPUs do you have, if I may ask?

Ollama merged a PR for MTP about 2 hours ago, as well:

https://github.com/ollama/ollama/pull/15980

Edit: Seems they also have a pre-release version out with the functionality added: https://github.com/ollama/ollama/releases/tag/v0.23.1-rc0


Sad:

  theturtle32@ai1:~$ ollama run gemma4:31b-coding-mtp-bf16
  pulling manifest
  Error: pull model manifest: 412: this model requires macOS


What's "sad" is how slow the ollama folks are being in vendoring newer versions of ggml into their codebase. That attitude just leaves them stranded without access to newer features.

A few days ago I switched again from Qwen3.6 to Gemma 4 - for personal use I've experienced better average performance with the 26B version of the latter than the 27B of the former.

For someone who's been running local models for a long while, these are very very exciting times.


Oh, that's fascinating. 3.6 27B is pretty damned good, but slow in wall-clock time on my DGX Spark-alike. It generates huge reams of thinking before it gets to the (usually correct!) answer, so wall-clock time is rough for tasks even at ~20tk/s.

I'm surprised the 26B-A4B is better? It should be faster too, interesting. I'm excited to try 31B with MTP, because MTP-2 is what makes 27B bearable on the GB10.

What are you using it for? Agent-based coding, or something else?


General purpose, mostly internet research in the form of slow-crawling. (Emphasis on slow - I've ultimately landed on Scrapling's API for seamless content rendering, and I use image support so as not to exclude informative images or weirdly rendered text.)

For coding I don't need image support, so I stuff the entire GPU with text-only mode. I don't have a workflow where I send LLMs off to generate thousands of lines of code, but what little coding I did, I did with Qwen3.6, and it was spectacular, as you suggest.


I've been thinking about doing more of this too. What spec machine are you running? And are you using long-running autonomous agents or more of the IDE/co-pilot style of collaboration?

I've been swapping between these two as well.

However I find Qwen unbeatable for tool calling. I think Gemma wasn't trained on that at all.


Gemma certainly was trained for tool calling, but the implementation in llama.cpp has been troubled because Gemma uses a different chat template format. The processor from the transformers library works fine though.

Oh I must've missed this.

The AI space moves so fast! I'll check it out again.


Don't forget to update the GGUF you have too; the templates in them were updated recently as well.

I'm using llama.cpp with Gemma and tool calling is mission critical. It's perfectly fine on my end.

There are definitely differences in the eagerness to tool-call that you'll need to manage. And for all local models I've ever used, I've had to micromanage the tools provided by servers to eliminate any possibility that they reach for something wonky or confusing.


> However I find Qwen unbeatable for tool calling. I think Gemma wasn't trained on that at all.

The Gemma 4 chat template seems to have had multiple issues, at least with llama.cpp; not sure they're all fixed yet. For example, it assumed simple types for parameters.


I'd love to see this in oMLX too. It has been a rather nice tool.

There's also growing interest in integrating DFlash: https://github.com/ggml-org/llama.cpp/issues/21978. I can't wait to see how it will compare against MTP.

Thanks for the link, it took qwen3.6-27B-q8 w/256k context on my RTX A6000 from ~20t/s to 55t/s. Prefill is mysteriously slower now, but prefill is still so much faster than generation that I think I'm bottlenecked on output most of the time.

Took 2x AMD MI50s to 50 t/s instead of 20 t/s for Q8 27B. Impressive.

I don’t exactly know where MTP inference fits within the inference stack, but does someone know whether it’s possible to implement it for the MLX universe?

I have a dumb performance question.

Why, when asking a model to change text in a minor way, are we not asking it to generate the operational transformations necessary to modify the text, and then just executing the OT on the existing text instead of reproducing every token? Maybe tools are doing that more than I realize?


The only thing a model can output is tokens; to achieve this, you need a tool that converts tokens into operational transformations. For example, I have an ast-grep skill: it instructs the model to generate ast-grep rules and then runs ast-grep to perform the file modifications.

I am saying to directly output the operational transformation instructions as the tokens. You’re essentially telling it to “write the diff” and then applying the patch.

[retain(8), delete(6), insert("very very"), retain(10)]


OpenAI models emit a format similar to a regular diff, but without the line numbers. Look at apply_patch

There is a model on OpenRouter doing exactly this; it generates diffs. Forgot the name though.

This is the approach I take with code edits to existing files at Code+=AI; I wrote a blog post with a simple example of AST modification to illustrate: https://codeplusequalsai.com/static/blog/prompting_llms_to_m...

A coding agent provides an edit tool to the model for making code changes, with this tool typically just providing a find/replace operation. The model may of course need to do a bunch of work (grepping etc) to figure out what to change, but the actual change will just be sent from model to agent as a "replace X with Y" edit tool request, with this "edit" then done locally by the agent.

It's interesting how the agent (at least in the case of Claude Code) then applies this find/replace "edit" to the requested file... Since the agent wants to be platform independent (Linux/Windows/Mac), it uses Node.js for file access, and performs the "edit" by itself: using Node.js to read the entire file, make the change, then write back an entire new file.
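A minimal sketch of what that agent-side "edit" can look like, in Python for brevity (the edit_file name and the uniqueness check are illustrative assumptions, not Claude Code's actual implementation):

    import pathlib

    def edit_file(path, old_string, new_string):
        # Read the whole file, apply the find/replace "edit", then
        # write the whole file back -- no platform-specific tools.
        p = pathlib.Path(path)
        text = p.read_text()
        if text.count(old_string) != 1:
            # Require unique surrounding context so the edit is unambiguous.
            raise ValueError('old_string must match exactly once')
        p.write_text(text.replace(old_string, new_string, 1))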


The simple answer is: because it is not necessary to achieve the same final output. Most LLMs today are trained as autoregressive token predictors. They fundamentally can't work any other way. But we know how to train them really well and they have many applications beyond editing text. Diffusion LLMs exist too, which work a bit closer to what you describe, but they are not yet at the same level of intelligence since training methods are not that mature and they are generally less flexible as well.

So predict the tokens of the operational transformation.

I just asked: Write the operational transformation sequence and command to turn “this is really beautiful” to “this is very very beautiful”

and in return got: You can map this out by moving a virtual cursor across the text and telling it what to keep, remove, or add. You start by retaining the first eight characters to keep "this is " untouched. Then you delete the next six characters to remove the word "really". In that exact spot, you insert the nine characters for "very very". You finish the operation by retaining the final ten characters, which preserves the space and the word "beautiful". You can code this specific command sequence as [retain(8), delete(6), insert("very very"), retain(10)].

In a large paragraph of text I would expect it to be way quicker and cheaper to generate “[retain(800), delete(6), insert("very very"), retain(10000)]” than repredict the entire remainder of the unedited text.
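For what it's worth, the applying side of that idea is tiny. A minimal sketch of an interpreter for such an operation list, assuming tuple-encoded ops like the ones above:

    def apply_ot(text, ops):
        # ops: e.g. [('retain', 8), ('delete', 6), ('insert', 'very very'), ('retain', 10)]
        out, pos = [], 0
        for kind, arg in ops:
            if kind == 'retain':       # keep the next `arg` characters
                out.append(text[pos:pos + arg])
                pos += arg
            elif kind == 'delete':     # skip the next `arg` characters
                pos += arg
            elif kind == 'insert':     # splice in new text
                out.append(arg)
        out.append(text[pos:])         # treat any tail as an implicit retain
        return ''.join(out)

    ops = [('retain', 8), ('delete', 6), ('insert', 'very very'), ('retain', 10)]
    print(apply_ot('this is really beautiful', ops))  # this is very very beautiful

The hard part isn't the interpreter; it's getting the model to emit offsets that are exactly right, which is why real tools tend to anchor edits on text snippets instead of character counts.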


Sounds easy, but isn't in practice. You can look at the edit-text-file tool in VS Code Copilot, for example, to see how complicated that can get: https://github.com/microsoft/vscode-copilot-chat/tree/9e668c...

I have no idea when I’m being lied to anymore but allegedly Aider and Cursor work the way I described, although cursor is using a second model to apply the edit.

They all do something similar under the hood. Patching files is not a trivial task when you only have the changed text content and not the actual file structure to work with. It kind of works, but is fundamentally limited by the LLM output architecture.

Cursor has a dedicated merge model. It takes input like this:

    class Foo {
        // ....
        int calculation() {
            return 42;
        }
    
        // more stuff
    }
where the main model emits something in a sort of casual, under-specified diff format and the merge model figures out how to interpret it as a patch.

I've seen Claude use sed to edit files on other hosts instead of copying the file back and forth to edit it. Not quite full blown OT but it's going in that direction.

How does this get added in practice?

According to the linked PR, the original model does come with MTP, which is another "head" (i.e. output path) in the same model that (supposedly) runs faster.

The current implementation ignores that head, but the PR lets the tool recognize it and does the proper integration (running the MTP head alongside the slower main path and then comparing the results, I believe).


The standard way of doing MTP is to run the drafter autoregressively for k steps, and then (not concurrently) use the larger model as a verifier for those k tokens at the same time. The larger model can then accept a prefix of those k tokens, and in any case generates one more token (which is needed in case you accepted zero tokens from the drafter). The larger model can effectively use this k as a "batch" dimension, reducing the penalty of large weight loading. Meanwhile the drafter is much smaller, so it's fine for _it_ to be autoregressive, as long as the main model is parallel.
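A sketch of that loop in Python pseudocode, with greedy acceptance for clarity (real implementations use rejection sampling to preserve the main model's sampling distribution, and the model interfaces here are hypothetical):

    def speculative_step(main_model, draft_model, ctx, k=4):
        # The drafter proposes k tokens autoregressively.
        draft = []
        for _ in range(k):
            draft.append(draft_model.next_token(ctx + draft))
        # One batched pass of the verifier scores every drafted position
        # at once, returning its own chosen token at each of k+1 slots.
        preds = main_model.predict_positions(ctx + draft)
        accepted = []
        for i, tok in enumerate(draft):    # accept the matching prefix
            if preds[i] != tok:
                break
            accepted.append(tok)
        # The verifier always contributes one more token, which also
        # covers the case where zero draft tokens were accepted.
        accepted.append(preds[len(accepted)])
        return accepted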

yet, still mostly useless.

Yeah, it's important conceptually to remember that MTP is kind of just more weights; speculative decoding is the runtime algorithm, and that's a significant addition to whatever code is serving the model.

That is... inaccurate.

How so? I'm not saying most of the work doesn't go into creating the drafting model or enabling a new head on the primary model, but the point is that, however cool it is, the result is more weights. Speculative decoding requires code that is aware of how this works at the inference level.

The HN story autoformatter changes the title case of well-known terms, which I'm betting is what happened here.

Yeah, but then you have about 5 minutes to change it to whatever you want.

Yeah, that's what I assumed as well.

Ideally the fees would be similar to the Norway model, where some tickets are tied to the income of the driver, in this case the pre-tax earnings of the company that created the driverless car.

That can make sense (opinions differ) for individuals, but it's not like the company is advertising with "we get you there at 1.2x legal speed". They're not competing on that; they're not choosing to do this on purpose like an individual might choose to speed (for example because of economic incentives if their hourly price is high)

If they were, then it makes sense to fine them to some multiple of the benefit they got from this advertising tactic, but as it is, I don't see why it should be different from anyone else's ticket. The company isn't likely to enjoy a flood of this administrative work, besides the cost of the actual fines, so they'll work to minimise them anyway


They may not advertise “getting you there at 1.2x legal speed” but the sooner they drop you off, the sooner they can get another fare. Over a whole fleet it will add up to changing the size of the required fleet.

If getting a ticket one ride in a thousand is cheaper than deploying another 2000 cars to make up for the increased trip time I’d expect them to keep getting tickets.

I'm also not sure they don't do it on purpose. Tesla's self-driving has an aggressive mode willing to speed and roll through stop signs. Those were deliberate, law-breaking choices.


Oh, I actually read an article about that this morning in bed but didn't imagine regulators would approve a mode that is programmed to ignore laws. Not sure how it works where you're from but in the Netherlands, where Tesla did the approval process to subsequently apply for an EU license for using the drive assistance mechanism, the regulator "hopes to not need more than a week to approve safety updates. Other updates can take longer". They test every change anew it sounds like, at least at first (there was some talk of relaxing reporting requirements if accident rates are as expected, that is, lower than human drivers when using the presently approved assist methods)

Seems foreign (heh) to me that a regulator would allow software to drive lethal machinery on public roads without checking that it at least stops at a stop sign. Fining them proportional to turnover makes a lot more sense suddenly. Or maybe a competent regulator, like, prevention rather than after the fact action


I think a rich person would then do the rational thing and hire a cheap driver who also owns and operates their car.

No one sane would be willing to assume basically unlimited liability for someone else's software.

Maybe that's a good thing - some work for humans after the robots take over, albeit as human legal shields ;)


> they're not choosing to do this on purpose

So the AV software is so unreliable that it can't adhere to speed limits?


Just like human_brain.exe right? I miss signs, and notice others missing signs, in unfamiliar areas all the time. So why not the same punishment and thus the same incentive not to do that?

A sibling comment mentioned that they did willfully build a variant that breaks laws btw. If there is apparently no regulator that needs to approve what makes it onto the road, I can understand a lot better why people are calling for insurmountable fines such that they have no choice but to remove that. Why not call for adequate approval mechanisms though, looking at other countries' examples where this problem doesn't exist... I dunno


Isn't Norway only for drunk driving? Finland has it for massive speed excesses, but it is based on net taxable income taking out business expenses for taxi drivers, and Waymo is still negative.

If they become profitable you'd want to normalize by the number of miles, unless you just want an incentive system that puts more people on the road (extra drivers) and increases the chance of humans suffering road injuries, to boost employment in an internal service sector.

But even then coming out with a more efficient fleet than a competitor for higher margin would be penalized. You'd rather disincentivize skimping on safety for margin and not disincentivize better maintenance and fuel economy.


Agreed. We need to look at reforming fines in general.

Fines should be scaled to income and the value of the vehicle, and should increase exponentially for reoffense within the same category of offense.


Assuming you divide it down to the earnings per car, that makes perfect sense. Of course right now they aren't making any profit at all, and by the time it is relevant it is likely that the cars will commit essentially no violations at all.

Driverless taxi services are all blowing through venture capital, so does having a negative income mean that they get a payment instead of a fine?

I think this could be a good compromise. Could have a floor value but the ceiling can vary accordingly.

It's like someone should make a file... maybe in /etc ... and put short names for services in it... maybe it could be called /etc/services...

And then they might code up some sort of service lookup tool thingy to use on the train wreck that is the modern web.

  $ getent services gopher
  gopher               70/tcp

And if they want name resolution, maybe even names that reflect the scope of its location like .localhost or .internal
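For what it's worth, that database is also exposed programmatically; a minimal sketch using Python's standard library:

    import socket

    # Both calls consult the system service database (typically
    # /etc/services), the same data `getent services` reads.
    print(socket.getservbyname('gopher', 'tcp'))  # -> 70
    print(socket.getservbyport(70, 'tcp'))        # -> 'gopher'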

Various services have existed, such as portmap(8), though NFS and similar services have often suffered from the "too complicated to debug" problem where devops (then sysadmins) would try turning the system off and then back on again in the hopes of resolving the issue du jour. You might get lucky and determine that node number three (of many) was cursed and leave it switched off for the Season of Mammon, more commonly known as Christmas, and to retire it quietly, later. Hypothetically.

Generally host and port mapping gets shoved somewhere into the configuration management layer and hopefully does not become too complicated (or grow too many security holes), as this could vary from "configuration files and a few scripts" to database and services layers that few can debug, especially not a sysadmin at 3 AM running on an hour of bad sleep. Hypothetically.


but you want security, so NIS+

Heck, maybe even `resolvectl service`?

This is a nice idea, but (I don't know why) on macOS if I do `nc -l 127.0.0.1 gopher` and then try to open the URL "http://127.0.0.1:gopher/", Safari does not open it; no requests are visible in the `nc` output.

And `curl -v http://127.0.0.1:gopher/` gives an error message:

  * URL rejected: Port number was not a decimal number between 0 and 65535
  * Closing connection
  curl: (3) URL rejected: Port number was not a decimal number between 0 and 65535
So the ports are named, which is nice, but in practice it does not make life easier.

> http://...:gopher

is it http or gopher? :)


I chose the gopher port just as an example. Try it with any other service name mapped to a port number in /etc/services and the result will be the same. The OP's goal was to use many http/https services, so we are talking about many http(s) services.

I just wanted to make the point that even if you have service names in /etc/services, it is not possible to use those names easily to host/access http(s) services.


The names are the kind of servers that listen on those ports (by default) like ssh, telnet, http, and smtp. They are not subdomains or for URI parsing.

The URI does contain ":port", though in practice it is only ever a decimal number.

The OP made a tool that helps them avoid using port numbers. People commented in a way that looked like they were laughing at him for reinventing the wheel, pointing at /etc/services. Well, OK; I decided to try using /etc/services for the purpose of using names instead of port numbers.

Would it be possible to add "myapp 60001/tcp" to /etc/services and then work with "http://localhost:myapp"? NO! Browsers do not translate these names into port numbers. netcat does; curl does not.

So probably the OP's solution is not that questionable and really solved their need? And the "good old friend /etc/services" is not useful for this? I don't know what it is useful for, as running services on non-standard ports actually helps with hiding from security/vuln scanners and is practiced widely.
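To be fair, a client that wants named ports can always resolve them itself before building the URL. A minimal sketch in Python (the myapp entry is the hypothetical "myapp 60001/tcp" line from above, which would have to exist in /etc/services):

    import socket
    import urllib.request

    def open_by_service_name(host, service, path='/'):
        # HTTP clients won't translate a named port for us, so do the
        # service-database lookup ourselves and build a numeric URL.
        port = socket.getservbyname(service, 'tcp')
        return urllib.request.urlopen(f'http://{host}:{port}{path}')

    # open_by_service_name('localhost', 'myapp')

But that only moves the problem into every client, which is rather the point of the complaint.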


Maybe the joke has gotten too far, but the point most people have been trying to make is that the issue the OP is trying to solve has been solved for years. It's called a reverse proxy. Doing the configuration automatically, like OP does, does not alter the fact that it is a reverse proxy. `/etc/services` serves a different purpose.

Also, URL parsing is a completely different matter. Browsers are primarily HTTP(S) clients. If you do not mention the ":port", they will try to connect to port 80 (HTTP) or 443 (HTTPS), because those are the default ports for a web server. Other services have different defaults; if your URL has the ftp scheme, the default port would be 21.


I know you're trying to be funny but ... technically it's 100% clear: You should talk HTTP, because that's the URL scheme here. The port makes no difference. You just happened to use a port by name. For all we know I run my HTTP server on some NFS related port so all the script kiddies try all the wrong exploits on it or something ;)

Well, the entire context of this is https so anything else is immaterial. The only reason it would be gopher is if you didn't read the post or don't understand the basics of https.

This is not possible since it is ambiguous. It is impossible to parse "http://127.0.0.1:gopher/" since that would be valid as either "scheme://user:host/" or "scheme://host:portname/".

"http://127.0.0.1:70/" is unambiguous since 70 is not a valid host.


Sounds like you need an AI agent that can determine whether http and gopher are the same protocol

If you configure sshd to listen on port 443, does it become an HTTPS server? I was just trying to demonstrate: pick any port from /etc/services and try to use the name instead of the port number. No, it does not work well when trying to use it for locally hosting http(s) services. So, to address the irony and sarcasm of the messages I was replying to:

  zdw: It's like someone should make a file... maybe in /etc ... and put short names for services in it... maybe it could be called /etc/services...

  tolciho: And then they might code up some sort of service lookup tool thingy to use on the train wreck that is the modern web.
  $ getent services gopher
  gopher               70/tcp

Many clients also do not support getservent(3) or portmap or DNS SRV records or NIS or LDAP or Active Directory, so one might wonder why there are so many half-baked, failed, or overly complicated attempts at solving whatever the problem is here, even before "AI has entered the chat".

As bandie pointed out, you're explicitly making an HTTP request. Duh.

nc is for generic connections and handles it well.


I know, but the OP's goal was to host/access http(s) services by name and avoid port numbers; the gopher service name was just the example I chose. My point was that /etc/services cannot be used for the OP's need.

If you host an http(s) service on port 11111 you can reach it with the URL http://127.1:11111, but the URL http://127.1:vce/ would not work in most software.

  $ grep 11111 /etc/services
  vce  11111/udp   # Viral Computing Environment (VCE)
  vce  11111/tcp   # Viral Computing Environment (VCE)

If I curl my phone number it doesn't connect, that's strange

Can you please explain what you mean by this? Technically I do not see any similarity in this "analogy".

But if you want to contact vce, why use "http"? It's not going to work.

Try http://127.0.0.1:hkp instead of http://127.0.0.1:11371 for an OpenPGP HTTP keyserver. HTTP will work, but using the service name won't. Does that make what they're trying to say clearer?

The question was "can we use port names from /etc/services instead of port numbers?"

How about "ssh://git@ssh.github.com:https/golang/go.git" instead of "ssh://git@ssh.github.com:443/golang/go.git"? Does not work either, hmm.


Perhaps we could even make the file the port itself, perhaps calling it a “socket”? A “unix socket” would be a great name. If we could place all these files behind a local reverse proxy then we could use localhost/jekyll or localhost/fastapi. It’s just a dream

That would mean not being able to vibe code up an entire app to deal with something as insurmountable as looking at a list of numbers and post it on HN for those sweet, sweet upvotes. Why would they not do that.

Sure, but they are running web apps they've vibe-coded (hence the .vibe TLD), and for that use case of many web apps running in Docker containers I use nginx-proxy [0]. All the container needs is a VIRTUAL_HOST environment variable with the domain, and all my router needs is an address entry for the wildcard subdomains. I even have nginx-proxy on an internet-accessible staging server.

[0] https://github.com/nginx-proxy/nginx-proxy


If the port number space was bigger, I wonder if we would have gotten a global naming service (a la DNS) for unique service names.

You can still publish port numbers along with addresses in DNS though (SRV records).
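For example, a lookup sketch assuming the third-party dnspython package and a domain that actually publishes an SRV record (example.com here is a placeholder):

    import dns.resolver  # third-party: dnspython

    # An SRV record maps _service._proto.domain to a priority, weight,
    # target host, and port.
    for rr in dns.resolver.resolve('_http._tcp.example.com', 'SRV'):
        print(rr.priority, rr.weight, rr.port, rr.target)

Mainstream browsers never adopted SRV for plain HTTP, though, which is part of why this stayed niche.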


Not modern enough. Unix is too low level, antiquated, and discriminates against those who just want to get shit done instead of reading manpages or documentation by hand.

This is the best example of Poe’s Law I’ve ever seen. Well done…?

This is exactly the problem I see with all of this vibe-coded software: in a few years everything will be super fragmented, and everyone will be using their own set of tools, or vibe coding them themselves. Communication between teams or even between team members will become very hard because of those differences. 'What do you mean production is down? On my vibe-coded dashboard everything is green!'

Why do people always assume that change is permanent?

It never is.

After decentralisation we always see centralisation. After a period of growth, a decline will follow. After the vibe coding hype, consolidation will follow. After rain comes sunshine.


Well, I am forty years old. In twenty years I won't be working. We are at the dawn of AI software. If this is the decentralisation phase, it will last at least 10 years. Probably more. So if in fifteen or twenty years things go back to 'our normal' as we see it today, I won't be that concerned about any of it anymore. Perspective changes considerably with age.

It’s the Lisp curse again.

“[X] is so powerful that problems which are technical issues in other programming languages are social issues in [X].”

— <https://www.winestockwebdesign.com/Essays/Lisp_Curse.html#ma...>


You have to be root to edit /etc/services ...

I am pretty convinced you need root on most systems to update the DNS resolution mechanism system-wide (e.g. to edit /etc/hosts or run a local DNS server and put that into /etc/resolv.conf).

Technically you can set the HOSTALIASES variable to point to a custom hosts file, but that only works with programs that use gethostbyname(3). (Which is most of them? IDK.)

Top reply, and clearly based on the article's title rather than its content, as are the follow-ups. You're making this site worse.

The article is short; go read it then come back and delete.


The article is about the dude not knowing which service is where, so he codes up a JSON mapping. He could just update his /etc/services for the same thing. Oh but wait, he mentioned AI agents, that changes everything!

Can you please demonstrate the workflow you are suggesting? Asking because I tried what you suggest, and it does not work.

Sounds like more of a problem with the title than the person you're attempting to insult.

[flagged]


You go and look in /etc/services for what is bound to port 5009. The article might not be the most useful, but these comments are completely off the mark and stupid.

The hosts file is enough; what is needed is a way to assign an IP address to a process/service, like you can with port numbers.

You can trivially have a server process bind a listening socket to, e.g., 127.4.3.19.
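A minimal sketch (on Linux the whole 127.0.0.0/8 block routes to loopback out of the box; on macOS you'd first have to add the alias to lo0):

    import socket

    # Every address in 127.0.0.0/8 is loopback, so each local service
    # can get its own IP while keeping a standard port.
    s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    s.bind(('127.4.3.19', 8080))  # binding port 80 would need privileges
    s.listen()
    print('listening on', s.getsockname())

Pair that with a hosts-file entry like "127.4.3.19 myapp" and you get http://myapp:8080 without touching a DNS server.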

[flagged]


Why?

What's up with the hate? It seemed like an interesting project to me, maybe not something I see myself using, but not something deserving hate.

Maybe not hate, but I think that kind of blog, and every single person posting AI slop to LinkedIn, deserve to be shamed publicly for it. It's just that no one does that, and those who do are frowned upon and downvoted to death, like here. The reason I asked "Why?" was to confirm that there are in fact others who think doing that is shameful, that I am not the only one. The outcome is disappointing.

What about identifying different instances of the same service?

URLs already have default ports for service names as a feature.

http:// means port 80 unless specified otherwise

https:// means port 443 unless specified otherwise

ftp:// means port 21 unless specified otherwise

sftp:// means port 22 unless specified otherwise

...

The practical solution for TFA is actually just an nginx server running on port 80 with proxy_pass

    location /blog/ {
        proxy_pass http://127.0.0.1:3000;
    }

    location /tensorboard/ {
        proxy_pass http://127.0.0.1:6006;
    }
...

How many little web servers work without issue when their root page is loaded from a path other than /?

If that's your concern you can also do this

    server {
        listen 80;
        server_name "tensorboard.localhost";
        location / {
            proxy_pass http://127.0.0.1:6006;
        }
    }

    server {
        listen 80;
        server_name "blog.localhost";
        location / {
            proxy_pass http://127.0.0.1:3000;
        }
    }
HTTP/1.1 and later have the browser supply the domain name that was used to access the site, and even though *.localhost names all resolve to 127.0.0.1, nginx will pluck out the correct configuration and proxy_pass to the correct backend.

That's because those are defined in /etc/services (really, the registry that /etc/services gets its mappings from). You're putting the cart before the horse.

The best way to find the right answer on the Internet is to slop-code a half-assed solution to a long-solved problem.

https://meta.wikimedia.org/wiki/Cunningham%27s_Law

Sidenote: A good AI would interject, Clippy-like, "It looks like you're trying to recreate /etc/services. Would you like me to explain what that is?"


> It's like someone should make a file... maybe in /etc ... and put short names for services in it... maybe it could be called /etc/services...

People shit-talk container orchestration systems like Kubernetes, but if anything they greatly simplified (if not completely eliminated) the need for this sort of network bookkeeping.


You forgot the /s at the end.

All our bookkeeping is now in YAML. Watch the spaces on your way out the door.

Learning NixOS has been a lot of fun for me.

Your comment unironically describes something I prefer, and one of my biggest pain points with Linux.

As a newb, I'm sure there's something with a mycommonproblemd-style name that has a stateful interface. But sometimes that all adds up to make things feel complex. And it lets me make stupid mistakes, like forgetting to close or open a port on firewalld, or disabling a container but forgetting to commit a change to my systemd units.

It's nice to just have a nice file called myservice.nix that tracks the firewall port, name, systemd startup and update scripts.


And don’t forget to quote your port assignments and version strings.

Hacker News loves to make snarky comments about everything to do with K8s and YAML, and yet in my experience the number of times an issue was caused by actual YAML can be counted on one hand.

Way more often it’s developers who can’t figure out that their http library only supports 2 concurrent connections, or emit garbage/malformed log lines and then bitch that they can’t see their app logs because we dropped them, or can’t be fucked to do “kubectl describe” in their own developer namespace that they have full permission for.

If you truly experience issues with just using YAML then you need to skill up probably.


Most of the issues with YAML are really issues with people who think that since "configuration as code" is good, that "code as configuration" must also be good.

No, go ahead. Tell me how just using /etc/services does what this does. Because I'm calling bullshit.

But go ahead. /etc/services, please, share with me how it's set up to do things like create the HTTPS, make it trusted, and set up the domain. Go ahead.

Go ahead. You can ONLY use /etc/services.

Or admit you don't actually have a clue as to what /etc/services does.


No unencumbered epub, no sale.

macOS has a Network location in the sidebar that will show other SMB devices discovered on the network.

That's a 16" (from the size of the speaker grille on each side of the keyboard), so even more claustrophobic.

IMO, Kubernetes isn't inevitable, and this seems to paint it as such.

K8s is well suited to dynamically scaling a SaaS product delivered over the web. When you get outside this scenario - for example, on-prem or single node "clusters" that are running K8s just for API compatibility, it seems like either overkill or a bad choice. Even when cloud deployed, K8s mostly functions as a batteries-not-included wrapper around the underlying cloud provider services and APIs.

There are also folks who understand the innards of K8s very well that have legitimate criticisms of it - for example, this one from the MetalLB developer: https://blog.dave.tf/post/new-kubernetes/

Before you deploy something, actually understand what the pros/cons are, and what problem it was made to solve, and if your problem isn't at least mostly a match, keep looking.


> K8s is well suited to dynamically scaling a SaaS product delivered over the web

It’s well suited to other things as well, people are just in denial about some of them.

“I need to run more than two containers and have a googleable way to manage their behavior” is a very common need.


This is a need it fails at miserably. k8s reminds me of the RAID recentralization anti-pattern, where you fix a hardware failure that never occurs in exchange for knowing that simple higher-level mistakes or security problems will tank something now too large to fail, again.

Kubernetes, in the form of k3s, was a critical success factor for us with the onprem deployment of our SaaS product.

What's the problem with a single-node cluster? We use that for e.g. dev environments, as well as some small onprem deployments.

> Even when cloud deployed, K8s mostly functions as a batteries-not-included wrapper around the underlying cloud provider services and APIs.

Which batteries are not included? The "wrapper around the underlying cloud provider services and APIs" is enormously important. Why would you prefer to use a less well-designed, more vendor-specific set of APIs?

I seriously don't get these criticisms of k8s. K8s abstracts away, and standardizes, an enormous amount of system complexity. The people who object to it just don't have the requirements where it starts making sense, that's all.


> Kubernetes, in the form of k3s, was a critical success factor for us with the onprem deployment of our SaaS product.

What surprises and gotchas did you have to deal with using k3s as a Kubernetes implementation?

Did you use an LB? Which one? I'm assuming all your onprem nodes were just linux servers with very basic equipment (the fanciest networking equipment you used were 10GbE PCIe cards, nothing more special than that?)


We sell to enterprise customers. All of them deploy our solution on internal cloud-style VM clusters. We use the Traefik ingress controller by default.

There really weren't any particular surprises or gotchas at that level.

In this context, I've never had to deal with anything at the level of the type of Ethernet card. That's kind of the point: platforms like k8s abstract away from that.


It's also difficult for data pipelines or data-intensive things. At several companies we've run into the "need to put an ML model behind an API" case, where pods get killed because API health checks are basically incompatible with a container that's fully under load but still working.
