A big limitation for skills (or agents using browsers) is that the LLM is working against raw HTML, the DOM, or pixels. The new WebMCP API solves this: apps register schema-validated tools via navigator.modelContext, so the agent gets structured JSON to work with and can be far more reliable.
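To make the shape of this concrete, here is a minimal sketch of a page registering a tool. WebMCP isn't shipped anywhere yet, so the mock of `navigator.modelContext` and the `add-todo` tool are stand-ins; the exact method names in the spec may still change.

```javascript
// Stand-in for navigator.modelContext (the real API is still being incubated):
const modelContext = (typeof navigator !== "undefined" && navigator.modelContext) || {
  tools: new Map(),
  registerTool(tool) { this.tools.set(tool.name, tool); },
  async callTool(name, args) { return this.tools.get(name).execute(args); },
};

// The app exposes a structured, schema-validated tool instead of making
// the agent scrape the DOM:
modelContext.registerTool({
  name: "add-todo",
  description: "Add a todo item to the current list",
  inputSchema: {
    type: "object",
    properties: { title: { type: "string" } },
    required: ["title"],
  },
  async execute({ title }) {
    // In a real app this would update app state / the local DB.
    return { content: [{ type: "text", text: `Added todo: ${title}` }] };
  },
});

// An agent can now call the tool with plain JSON:
modelContext.callTool("add-todo", { title: "buy milk" })
  .then((result) => console.log(result.content[0].text)); // logs "Added todo: buy milk"
```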
WebMCP is currently being incubated in the W3C [1], so if it lands as a proper browser standard, this becomes an endpoint every website can expose.
I think browser agents/skills + WebMCP might actually be the killer app for local-first apps [2]. Remote APIs need hand-crafted endpoints for every possible agent action. A local DB exposed via WebMCP gives the agent generic operations (query, insert, upsert, delete) that it can freely compose across multiple steps of reads and writes, at zero latency and offline-capable. The agent operates directly on a data model rather than orchestrating UI interactions, which is what makes complex tasks actually reliable.
For example, the user can ask "Archive all emails I haven't opened in 30 days except from these 3 senders" and the agent then runs the NoSQL query and updates locally.
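A hypothetical sketch of what that request could compile to: the Mango-style selector is the kind of thing the agent would generate, and the in-memory evaluator stands in for the local DB (the data shape and sender addresses are made up, not a specific database's API).

```javascript
const THIRTY_DAYS = 30 * 24 * 60 * 60 * 1000;
const keepSenders = ["alice@example.com", "bob@example.com", "carol@example.com"];

// Mango-style selector the agent could generate from the user's request:
const selector = {
  lastOpenedAt: { $lt: Date.now() - THIRTY_DAYS },
  sender: { $nin: keepSenders },
  archived: false,
};

// Minimal in-memory evaluation of that selector (stand-in for the DB engine):
function matches(doc, sel) {
  return Object.entries(sel).every(([field, cond]) => {
    if (typeof cond !== "object" || cond === null) return doc[field] === cond;
    return Object.entries(cond).every(([op, val]) => {
      if (op === "$lt") return doc[field] < val;
      if (op === "$nin") return !val.includes(doc[field]);
      throw new Error(`unsupported operator ${op}`);
    });
  });
}

function archiveStale(emails) {
  let archived = 0;
  for (const email of emails) {
    if (matches(email, selector)) { email.archived = true; archived++; }
  }
  return archived;
}

const inbox = [
  { sender: "newsletter@shop.com", lastOpenedAt: Date.now() - 45 * 24 * 3600 * 1000, archived: false },
  { sender: "alice@example.com", lastOpenedAt: Date.now() - 45 * 24 * 3600 * 1000, archived: false },
];
console.log(archiveStale(inbox)); // 1 -- the kept sender is skipped
```

The point is that the agent only needs the generic query/update primitives; the whole multi-step task happens locally against data the app already has.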
OpenAPI is primarily for machine-to-machine communication, which needs determinism and is optimized for cases like timestamps in Unix format with millisecond accuracy. MCP is optimized for a different use case, where the LLM has many limitations but a good "understanding" of text. Instead of sending `{ user: { id: 123123123123, first_name: "XYZYZYZ", last_name: "SDFSDF", gender: "..." } }` you could return "Mr XYZYZYZ" or "Mrs XYZYZYZ".
The LLM doesn't need all of that and can't parse it reliably anyway without additional tools (e.g. why should it spend tokens trying to convert a Unix timestamp just to understand the time?).
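A small sketch of the difference: the raw record is what an OpenAPI-style endpoint might return, and the formatter is what an MCP tool could return instead. The field names and values are illustrative.

```javascript
// Raw machine-to-machine payload (illustrative):
const raw = {
  user: { id: 123123123123, first_name: "XYZYZYZ", last_name: "SDFSDF", gender: "male" },
  last_login: 1735689600000, // Unix ms -- the LLM would waste tokens decoding this
};

// What an MCP tool could return instead: pre-digested, human-readable text.
function forLlm({ user, last_login }) {
  const title = user.gender === "male" ? "Mr" : "Mrs";
  const when = new Date(last_login).toISOString().slice(0, 10);
  return `${title} ${user.first_name}, last seen ${when}`;
}

console.log(forLlm(raw)); // "Mr XYZYZYZ, last seen 2025-01-01"
```

The server does the deterministic conversion once, so the model spends its tokens on reasoning rather than on parsing.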
Over the last few days I built the WebMCP plugin for the RxDB database [1]
The goal is to let agents interact with apps through explicit tools instead of DOM scraping or visual navigation. This works nicely because agents can run operations directly on the local-first data the UI already uses.
> The Web Locks API allows scripts running in one tab or worker to asynchronously acquire a lock, hold it while work is performed, then release it. While held, no other script executing in the same origin can acquire the same lock, *which allows a web app running in multiple tabs or workers to coordinate work and the use of resources.*
Yes, and by building small things, you can more easily try out different techniques and tools with less risk than doing the same with something big.
I think it's optimizing for learning versus revenue, which don't have to be mutually exclusive. Sometimes you need to start with one to get to the other.
Yes, most servers support WebSockets. But unfortunately most proxies and firewalls do not, especially in big company networks. Suggesting that my users use SSE for my database replication stream solved most of their problems. Also, setting up an SSE endpoint is like 5 lines of code. WebSockets require much more, and you also have to implement things like pings to ensure automatic reconnection. SSE with the JavaScript EventSource API has all you need built in:
But why add it to HTTP/3 at all? HTTP/1.1 hijacking is a pretty simple process. I suspect HTTP/3 would be significantly more complicated. I'm not sure that effort is worth it when WebTransport will make it obsolete.
I had 2.5 mg of THC every day for ~7 years. I couldn't remember the last dream I'd had when I quit THC in August. After not sleeping for 2-3 weeks I started having vivid nightmares every night for about a week. I'm still having extremely vivid dreams since, but they're no longer all terrifying. Sleeping better than ever, and my anxiety is also better than ever.
I just read your comment after posting mine and it sounds like you've had a similar (but unfortunately opposite!) experience. The vivid dreams stop for me a few weeks after they start. Are your vivid dreams "permanent", or has it only been a short while since you started experiencing them?
Indeed, and IME, the dreams I have after taking a break from daily THC use are extremely vivid - to the point that I can remember them in detail for days afterwards. I enjoy that a lot.
I've found Ghana to be the only country in West Africa where you can reliably outsource and get quality work back with no headaches. Maybe if I did business in French, Senegal would be reliable as well. But the rest of West Africa has a long way to go even getting reliable rule of law. (Which is weird, because you would think Nigeria would have its act together.)
But yes, Kenya is the star out in East Africa. Even among a lot of other scrappy nations in the EAC, Kenya stands out. No question.
For African outsourcing, I can't recommend Ghana and Kenya enough. Only problem right now is that you kind of have to know someone to get access to the really good guys. Demand is high relative to the guys available with known track records.
Very cool, thanks for sharing! I have writing work on the side for engineers, if you know any. Great way to get your writing skills going while getting paid to play with tech.
- [1] https://webmachinelearning.github.io/webmcp/
- [2] https://rxdb.info/webmcp.html