shameless plug: I'm working on an open source project https://blocksai.dev/ to attempt to solve this. (and just added a note for me to add formal verification)
Elevator pitch: "Blocks is a semantic linter for human-AI collaboration. Define your domain in YAML, let anyone (humans or AI) write code freely, then validate for drift. Update the code or update the spec, up to human or agent."
(you can add traditional linters to the process if you want but not necessary)
The gist being you define a bunch of validators for a collection of modules you're building (with agentic coding) with a focus on qualifying semantic things;
- domain / business rules/measures
- branding
- data flow invariants — "user data never touches analytics without anonymization"
- accessibility
- anything you can think of
Then you just tell your agentic coder to use the cli tool before committing, so it keeps the code in line with your engineering/business/philosophical values.
(boring) example of it detecting if blog posts have humour in them, running in Claude Code -> https://imgur.com/diKDZ8W
Reminder YAML is a serialization format. IaC standardizing on it (hashicorp being an outlier) was a mistake. It’s a good compilation target, but please add a higher level language for whatever you’re doing.
I have Linear(mcp) connected to ChatGPT and my Claude Desktop, and I use it daily from both.
For the MCP nay sayers, if I want to connect things like Linear or any service out there to third party agentic platforms (chatgpt, claude desktop), what exactly are you counter proposing?
(I also hate MCP but gets a bit tiresome seeing these conversations without anyone addressing the use case above which is 99% of the use case, consumers)
Easy. Just tell the LLM to use the Linear CLI or hit their API directly. I’m only half-joking. Older models were terrible at doing that reliably, which is exactly why we created MCP.
Our SaaS has a built-in AI assistant that only performs actions for the user through our GraphQL API. We wrapped the API in simple MCP tools that give the model clean introspection and let us inject the user’s authenticated session cookie directly. The LLM never deals with login, tokens, or permissions. It can just act with the full rights of the logged-in user.
MCP still has value today, especially with models that can easily call tools but can’t stick to prompt. From what I’ve seen in Claude’s roadmap, the future may shift toward loading “skills” that describe exactly how to call a GraphQL API (in my case), then letting the model write the code itself. That sounds good on paper, but an LLM generating and running API code on the fly is less consistent and more error-prone than calling pre-built tools.
Yes, let's have the stohastic parrot guessing machine run executables on the project manager's computer - that can only end well, right? =)
But you're right, Skills and hosted scripting environments are the future for agents.
Instead of Claude first getting everything from system A and then system B and then filtering them to feed into system C it can do all that with a script inside a "virtual machine", which optimises the calls so that it doesn't need to waste context and bandwidth shoveling around unnecessary data.
Easy if you ignore the security aspects. You want to hand over your tokens to your LLM so it can script up a tool that can access it? The value I see in MCP is that you can give an LLM access to services via socket without giving it access to the tokens/credentials required to access said service. It provides at least one level of security that way.
To piggy back on this, as a thought experiment, if you took all the wealth of every American billionaire combined you would pay for the US government for about a year.
Top 10% wealth is $113T. Government is operating off of $5T revenue.
Seizing all of it and putting it into even a foreign owned asset basket (could pick an economy as shielded as possible from US in case it causes instability in US economy) should yield enough interest (at least 5% real) to operate the federal government and public entitlements indefinitely based on present real revenues.
It's not clear if this is meant to be a serious thought experiment, but nonetheless I would ask why you equate expenses ($6.8T) with revenue ($4.9T)? I suppose it might not make a big difference, but it's a strange decision (within an already very weird thought experiment.) Also not sure why you picked top 10% wealth, since the previous comment was about billionaires. The top 10% of net worth starts with low millionaires so... goodbye every single employer, I assume.
I never claimed this was a scientific study. It was an observation repeated over time. That is empirical in the plain meaning of the word.
Criticizing it for “not being scientific” is irrelevant, I didn’t present it as science. Are people only allowed to share experiences here if they come wrapped in a peer-reviewed paper?
If you want to debate the substance of the observation, happy to. But don’t rewrite what I said into a claim I never made.
So people who want/ask a lot of politics, they can switch into this mode, and give feedback on it and try to improve it.
My two cents is that peoples personal politics is never swayed by "knowledge" anyway, just by the experiences they gather throughout life, age and the march towards death being front and center.
Most people will just seek to confirm bias where ever they feel like, the few who seek deeper understanding and facts will just have to persevere as they always have done, hence why sometimes throughout history we greatly respect that archtype.
> My two cents is that peoples personal politics is never swayed by "knowledge" anyway, just by the experiences they gather throughout life, age and the march towards death being front and center.
I feel like it's important to differentiate between two different factors, and I find it interesting that lots of people don't do this but there are:
* Values
* Policies
I'd describe myself pretty far-left in terms of values. Pretty egalitarian, freedom, I want everyone to have a base level of comfort, and I do think on some level it's the governments job to provide that.
At the same time though, I'm pretty middle-right in terms of economics. It's not that competition and a capitalist distribution of wealth fits my values, I'd probably be much happier on some level if socialism or communism worked, I just don't think they do. I think the best way to get leftist values is through relatively center-right economic policies.
You're probably not going to change my values. There aren't a lot of facts you could put in front of me that would make me stop supporting gay marriage or other typically lefty values. But I am open to almost any argument that lays out how we could achieve those outcomes, and I constantly change my mind about those things.
I feel like I've seen good solutions to both problems before, aren't there vscode extensions that let you just select the code and create a sharable link with all the view type options to appear everyone?
Yes of course, have the AI write the agents.md file which the AI can then use to make changes to the project. This of course works better than just having the AI write changes to the project directly.
Technologies: 20 years of experience, mostly Javascript. I've coded in pretty much all non functional languages though. Most of the databases sql/nosql. Comfortable with gCloud/aws/heroku etc. Mostly worked with node/rails/laravel/python/golang backends.
I'm a full stack web developer who can build apps from the ground up. I've worked mostly at startups so I am used to wearing many hats. I am a very product focused developer who prioritizes user feedback first and foremost. I'm generally very flexible when investigating new roles. Nothing too big or small.
Elevator pitch: "Blocks is a semantic linter for human-AI collaboration. Define your domain in YAML, let anyone (humans or AI) write code freely, then validate for drift. Update the code or update the spec, up to human or agent."
(you can add traditional linters to the process if you want but not necessary)
The gist being you define a bunch of validators for a collection of modules you're building (with agentic coding) with a focus on qualifying semantic things;
- domain / business rules/measures
- branding
- data flow invariants — "user data never touches analytics without anonymization"
- accessibility
- anything you can think of
Then you just tell your agentic coder to use the cli tool before committing, so it keeps the code in line with your engineering/business/philosophical values.
(boring) example of it detecting if blog posts have humour in them, running in Claude Code -> https://imgur.com/diKDZ8W
reply