Hacker Newsnew | past | comments | ask | show | jobs | submit | sshh12's commentslogin



This was literally just a single opus prompt but thought it was pretty interesting how Anthropic is designing browser-based harnesses under the hood.

TIL they have a rich multimedia API for RCS.


ack thanks -- didn't realize this was against the guidelines


Youre right and there are some assumptions being made here around the agent having enough context to work on a task without interrupts (e.g. team review, asking questions, etc).

Typically human equivalent time is based on a single person given all the potential information they need up front (which is not today how a lot of work is done).


For folks interested in some of the nuances of this benchmark, I just posted this deep dive:

https://blog.sshh.io/p/understanding-ai-benchmarks


IMO MCP isn't totally dead, but its role has shrunk. Quoting from my post [1]:

"Instead of a bloated API, an MCP should be a simple, secure gateway... MCP’s job isn’t to abstract reality for the agent; its job is to manage the auth, networking, and security boundaries and then get out of the way."

You still need some standard to hook up data to agents esp when the agents are not running on your local dev machine. I don't think e.g. REST/etc are nearly specific enough to do this without a more constrained standard for requests.

[1] https://blog.sshh.io/p/how-i-use-every-claude-code-feature


We have a linter that checks for this to help mitigate


You lint the file paths inside Claude.md?


All markdown files, yeah


Yeah I'm fairy pessimistic about how much folks will read


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: