New Kind of QA: One bottleneck I have (as the founder of a B2B SaaS) is testing changes. We have unit tests, we review PRs, etc., but those don't account for taste. I need to know if the feature feels right to the end user.
One example: we recently changed something about our onboarding flow. I needed to create a fresh team and go through the onboarding flow dozens of times. It involves adding third-party integrations (e.g. Postgres, a CRM, etc.), and each one can behave a little differently. The full process can take 5 to 10 minutes.
I want an agent to go through the flow hundreds of times, trying different things (i.e. trying to break it) before I do it myself. There are some obvious things I catch on the first pass that an agent should easily identify and figure out solutions to.
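To make this concrete, here is a minimal sketch of what such an agent-driven QA harness could look like, using Playwright to replay the onboarding flow with varied inputs and capture evidence on failure. The base URL, selectors, and success text are hypothetical placeholders, not any real product:

```python
# Hypothetical QA harness: replay an onboarding flow many times,
# varying inputs, and capture a screenshot whenever a run fails.
import random
from playwright.sync_api import sync_playwright

BASE_URL = "https://staging.example.com"  # placeholder staging environment

def run_onboarding(page, team_name: str) -> None:
    """Walk one pass of the onboarding flow (selectors are placeholders)."""
    page.goto(f"{BASE_URL}/signup")
    page.fill("#team-name", team_name)
    page.click("text=Continue")
    # ...steps for each third-party integration would go here...
    page.wait_for_selector("text=Setup complete", timeout=30_000)

with sync_playwright() as p:
    browser = p.chromium.launch()
    for i in range(100):
        page = browser.new_page()
        try:
            # Vary the inputs a little on each pass to probe edge cases.
            run_onboarding(page, team_name=f"qa-team-{i}-{random.randint(0, 9999)}")
        except Exception as exc:
            page.screenshot(path=f"failure-{i}.png")  # evidence for triage
            print(f"run {i} failed: {exc}")
        finally:
            page.close()
    browser.close()
```

An LLM agent would sit on top of a loop like this, choosing the variations and reading the failure screenshots; the replay-and-capture skeleton is the boring part it needs either way.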
New Kind of "Note to Self": Many of the voice memos, Loom videos, or notes I make (and later email to myself) are feature ideas. These could be 10x better with agents. If there were a local app recording my screen while I talk thru a problem or feature, agents could be picking up all sorts of context that would improve the final note.
Example: you're recording your screen and say "this drop-down menu should have an option to drop the cache". An agent could be listening in, capture a screenshot of the menu, find the frontend files/functions related to caching, and trace them to the backend endpoints. That single sentence would become a full spec for how to implement the feature.
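The code-tracing step is the most tractable part of that today. A toy version, with a made-up repo path and keyword list, is just a scan over the frontend sources that an agent could run after hearing the sentence:

```python
# Toy "trace the spoken sentence to code" step: find frontend files
# that mention caching. Repo path and keywords are made up for illustration.
import re
from pathlib import Path

KEYWORDS = re.compile(r"cache|invalidate|evict", re.IGNORECASE)

def find_related_code(repo: Path):
    """Yield (file, line number, line) for every keyword hit in .ts/.tsx files."""
    for path in repo.rglob("*.ts*"):
        for lineno, line in enumerate(path.read_text(errors="ignore").splitlines(), 1):
            if KEYWORDS.search(line):
                yield path, lineno, line.strip()

for path, lineno, line in find_related_code(Path("frontend/src")):
    print(f"{path}:{lineno}: {line}")
```

A real agent would use embeddings or an LSP rather than regexes, but even hits like these, attached to the screenshot and transcript, get most of the way to a spec.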
I paste screenshots into Claude Code every day and it's incredible. As in, I can't believe how good it is. I send a screenshot of console logs, a UI, and some HTML elements, and it just "gets it".
So saying they "suck" makes me not take your opinion seriously.
Yeah, models are definitely improving, but we've found even the latest ones still hallucinate and infer text rather than doing pure transcription. We run very rigorous benchmarks against all of the frontier models. We think the differentiation is accuracy on truly messy docs (nested tables, degraded scans, handwriting) and being able to deploy on-prem/VPC for regulated industries.
The only SaaS that AI has eaten for us is Retool. It wasn't the cost (we were paying < $200 per month); Retool has just become clunkier than writing code with AI to solve the problems we used Retool for.
We released a Meltano target for DuckLake[0]. dlt has one now too. Pretty easy to sync pg -> DuckLake.
I've been really happy with DuckLake, happy to answer any questions about it.
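For anyone wondering what that sync looks like under the hood, here is a rough sketch using plain DuckDB SQL from Python (the Meltano/dlt targets wrap this kind of flow). The catalog name, connection string, bucket, and table are placeholders, and S3 credentials are assumed to be configured separately:

```python
# Sketch of a Postgres -> DuckLake copy using DuckDB's ducklake and
# postgres extensions. All names/paths below are placeholders.
import duckdb

con = duckdb.connect()
con.execute("INSTALL ducklake; LOAD ducklake;")
con.execute("INSTALL postgres; LOAD postgres;")

# Attach the DuckLake catalog: a metadata database plus an object-store
# data path where the Parquet files will live.
con.execute("""
    ATTACH 'ducklake:metadata.ducklake' AS lake
        (DATA_PATH 's3://my-bucket/lake/');
""")

# Attach the source Postgres database.
con.execute("ATTACH 'dbname=app host=localhost user=app' AS pg (TYPE postgres);")

# Copy a table into the lake; add a WHERE clause for incremental loads.
con.execute("""
    CREATE OR REPLACE TABLE lake.events AS
    SELECT * FROM pg.public.events;
""")
```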
DuckDB has always felt easier to use than ClickHouse to me, but both are great options. If I were you, I'd try both for a few hours with your use case and pick the one that feels better.
I love DuckDB from a product perspective and appreciate the engineering excellence behind it. However, DuckDB was primarily built for seamless in-process analytics, data science, and data-preparation/ETL workloads rather than real-time, customer-facing analytics.
ClickHouse’s bread and butter is real-time analytics for customer-facing applications, which often come with demanding concurrency and latency requirements.
Ack, totally makes sense; both are amazing technologies. You could try both, test them at the scale your real-time application may reach, and then choose the technology that best fits your needs. :)
You keep something like 1 year's worth of data in your "business database", and then archive the rest to S3 as Parquet and query it with DuckDB?
And if you want to sync everything, even "current data", to do data science/analytics, can you just write the recent data (e.g. the last week of data or whatever) to S3 every hour/day to get relatively up-to-date data? And doesn't that cause the S3 data to grow needlessly (i.e. does it replace, rather than store an additional copy of, the recent data each hour)?
Do you have some kind of "starter project" for a Postgres + DuckLake integration that I could look at, to see how it's used in practice and how it makes some operations easier?
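A minimal sketch of the pattern being asked about, assuming DuckDB's postgres and httpfs extensions and placeholder names throughout: export a closed time window from Postgres to a single Parquet object on S3, then query the live table and the archive together. Because each window maps to one S3 key, re-running an export overwrites that object rather than storing an extra copy, so the archive doesn't grow needlessly:

```python
# Archive-and-query sketch: cold data as Parquet on S3, hot data in Postgres.
# Bucket, connection string, and table names are placeholders; S3 credentials
# are assumed to be configured (e.g. via a DuckDB secret or env variables).
import duckdb

con = duckdb.connect()
con.execute("INSTALL postgres; LOAD postgres;")
con.execute("INSTALL httpfs; LOAD httpfs;")
con.execute("ATTACH 'dbname=app host=localhost user=app' AS pg (TYPE postgres);")

# Archive one month to one S3 object. Re-running this statement for the
# same month overwrites the same key instead of adding a duplicate copy.
con.execute("""
    COPY (
        SELECT * FROM pg.public.events
        WHERE created_at >= DATE '2024-01-01'
          AND created_at <  DATE '2024-02-01'
    ) TO 's3://my-bucket/archive/events/2024-01.parquet' (FORMAT parquet);
""")

# Query hot + cold data in one statement; the date filter on the hot
# branch avoids double counting rows that were already archived.
rows = con.execute("""
    SELECT count(*) FROM (
        SELECT * FROM pg.public.events WHERE created_at >= DATE '2024-02-01'
        UNION ALL
        SELECT * FROM read_parquet('s3://my-bucket/archive/events/*.parquet')
    )
""").fetchall()
print(rows)
```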
I just had a sales call with someone from Microsoft who was looking for an AI tool to automate some Excel work they were doing. I doubt they'll buy our product, but it gave me a good laugh.
The goal here is not to replace Cursor's own local codebase indexing; Cursor already does that part well. What Nia focuses on is external context: it lets agents pull in accurate information from remote sources like docs, packages, APIs, and broader knowledge bases.
That’s what GP is saying. This is the Docs feature of Cursor. It covers external docs/arbitrary web content.
`@Docs` — will show a bunch of pre-indexed Docs, and you can add whatever you want and it’ll show up in the list. You can see the state of Docs indexing in Cursor Settings.
The UX leaves a bit to be desired, but that’s a problem Cursor seems to have in general.
Yeah, the UX is pretty bad, and so is the overall functionality. It still relies on a static retrieval layer and a limited index scope.
Plus, as I mentioned above, there are many more use cases than just coding. Think docs, APIs, research, knowledge bases, even personal or enterprise data sources the agent needs to explore and validate dynamically.
As an AI user (Claude Code, Rovo, GitHub Copilot) I have come across this. In code, it didn't build something right where it needed to use up-to-date docs. Luckily those people have now made an MCP server, but I had to wait; for a different project I may be SOL. Surprised this isn't solved. Well done for taking it on.
From a business point of view, I am not sure how you get traction without being 10x better than what Cursor can produce tomorrow. If you are successful, the coding agents will copy your idea, and then people, being lazy and using what works, will have no incentive to switch.
I am not trying to discourage. More like encourage you to figure out how you get that elusive moat that all startups seek.
As a user I am excited to try it soon. Got something in mind that this should make easier.
This is different because of the background refresh, the identifier extraction, and the graph. I know because I use Cursor and, oddly enough, am building the exact same thing.
> At the time of writing, Bun's monthly downloads grew 25% last month (October, 2025), passing 7.2 million monthly downloads. We had over 4 years of runway to figure out monetization. We didn't have to join Anthropic.
I believe this completely. They didn't have to join, which means they got a solid valuation.
> Instead of putting our users & community through "Bun, the VC-backed startup tries to figure out monetization" – thanks to Anthropic, we can skip that chapter entirely and focus on building the best JavaScript tooling.
I believe this a bit less. It'll be nice not to have some weird monetization shoved into Bun, but their focus will likely shift a bit.
> They didn't have to join, which means they got a solid valuation.
Did they? I see a $7MM seed round in 2022. Now, to be clear, that's a great seed round, and it looks like they had plenty of traction. But it's unclear to me how they were going to monetize enough to justify that $7MM investment. If they continued with the consultancy model, they would need to pay back investors from contracts they negotiate with other companies, but that's a fraught way to get early cash flow going.
Though if I'm not mistaken, Confluent did the same thing?
I don't like all of the decisions they made for the runtime, or some of how they communicate over social media and their company culture, but I do admire how well-run the operation seems to have been from the outside. They've done a lot with (relatively) little, which is refreshing in our industry. I don't doubt they had a long runway either.
Thanks, I scrolled past that on the announcement page.
With more runway come more investor expectations, though. Some of the concern with VC-backed companies is whether the valuation remains worthwhile. $26MM in funding is plenty for 14 people, but again, the question is whether they can justify their valuation.
Regardless, happy for the Oven folks, and Bun has been a great experience (especially for someone who got into the JS ecosystem quite late). I'm curious what the structure of the acquisition deal was like.
I really don't understand why investors poured so much money into Bun; I guess they saw another potential Vercel play? An acquisition, even by Anthropic, doesn't sound like a very good outcome for these investors, I would imagine.
> They didn't have to join, which means they got a solid valuation.
This isn't really true. It's more about who wanted them to join. Maybe it was Anthropic who really wanted to take over Bun and hire Jarred, or maybe it was Jarred who got sick of Bun and wanted to work on AI.
I don't really know any details about this acquisition, and I assume it's the former, but acquihires are also done for reasons other than "it was the only way".
That’s kind of why Anthropic became a separate company in the first place though isn’t it? Dario Amodei was former head of research at OpenAI and left along with 6 or 7 others to form Anthropic.
Given the worries about LLM-focused companies reaching profitability, I have concerns that Bun's runway will be hijacked... I'd hate for them to go down with the ship when the bubble pops.
At least Anthropic itself has the stated goal of creating ethical AI that benefits humanity. That's more than can be said for any other AI company. Time will tell, though. Google's motto used to be "don't be evil", and now it's basically the opposite.
Yeah, now they are part of Anthropic, who haven't figured out monetization themselves. Yikes!
I'm a user of Bun and an Anthropic customer. Claude Code is great, and it's definitely where their models shine. Outside of that, Anthropic sucks: their apps and web UI are complete crap, borderline unusable, and the models are just meh. I get it, CC's head probably pulled a power play here, given that his department is towing the company, and his secret sauce, according to marketing from Oven, was Bun. In fact, VS Code's Claude backend is distributed as a Bun-compiled binary, and the guy has been featured on the front page of the Bun website for at least a week or so. So they bought the kid the toy he asked for.
What Anthropic urgently needs instead is to acquire a good team behind a good chatbot and make something minimally decent, and then make their models work for everything else as well as they do with code.
> Yeah, now they are part of Anthropic, who haven't figured out monetization themselves.
Anthropic are on track to reach $9BN in annualised revenue by the end of the year, and the six-month-old Claude Code already accounts for $1BN of that.
Not sure if that counts as "figured out monetization" when no AI company is even close to being profitable -- being able to get some money for running far more expensive setups is not nothing, but also not success.
Monetisation is not profitability, it’s just the existence of a revenue stream. If a startup says they are pre-monetisation it doesn’t mean they are bringing in money but in the red, it means they haven’t created any revenue streams yet.
How is their web app any different from any other AI's? I feel like it's on par with all of them. It works great for me, although I mostly use Claude Code.