More

sshh12 · 2026-03-01T00:54:33 1772326473

slack: https://gist.github.com/sshh12/4cca8d6698be3c80e9232b68586b7...

sshh12 · 2026-03-01T00:48:15 1772326095

netflix: https://gist.github.com/sshh12/dda3a89514f850c459380b18b1f7e...

sshh12 · 2026-03-01T00:16:24 1772324184

This was literally just a single opus prompt but thought it was pretty interesting how Anthropic is designing browser-based harnesses under the hood.

sshh12 · 2026-01-20T04:14:12 1768882452

TIL they have a rich multimedia API for RCS.

sshh12 · 2026-01-18T22:44:05 1768776245

ack thanks -- didn't realize this was against the guidelines

sshh12 · 2026-01-18T21:17:38 1768771058

Youre right and there are some assumptions being made here around the agent having enough context to work on a task without interrupts (e.g. team review, asking questions, etc).

Typically human equivalent time is based on a single person given all the potential information they need up front (which is not today how a lot of work is done).

sshh12 · 2025-12-21T21:21:33 1766352093

For folks interested in some of the nuances of this benchmark, I just posted this deep dive:

https://blog.sshh.io/p/understanding-ai-benchmarks

sshh12 · 2025-11-16T23:07:24 1763334444

IMO MCP isn't totally dead, but its role has shrunk. Quoting from my post [1]:

"Instead of a bloated API, an MCP should be a simple, secure gateway... MCP’s job isn’t to abstract reality for the agent; its job is to manage the auth, networking, and security boundaries and then get out of the way."

You still need some standard to hook up data to agents esp when the agents are not running on your local dev machine. I don't think e.g. REST/etc are nearly specific enough to do this without a more constrained standard for requests.

[1] https://blog.sshh.io/p/how-i-use-every-claude-code-feature

sshh12 · 2025-11-02T15:20:10 1762096810

We have a linter that checks for this to help mitigate

maddmann · 2025-11-02T16:57:14 1762102634

You lint the file paths inside Claude.md?

sshh12 · 2025-11-05T01:00:50 1762304450

All markdown files, yeah

sshh12 · 2025-11-02T15:18:18 1762096698

Yeah I'm fairy pessimistic about how much folks will read