Just amazed that ‘better testing’ isn’t one of the takeaways in the summary. Jus...

hbsbsbsndk · 2025-04-29T12:57:49 1745931469

I worked on a product where there was basically no automated testing, just a huge product surface to click around with a bunch of options. Because of technical debt some of the options would trigger different code paths, but it was up to the developer to memorize all the code paths and test accordingly.

After I shipped a bug the Director of Engineering told me I should "test better" (by clicking around the app). This was about 1 step away from "just don't write bugs" IMO.

stevage · 2025-04-29T13:17:14 1745932634

Yep, my first job was at a company like that. Huge Windows desktop app built in Delphi. No automated testing of any kind. No testing scripts either. Just a lot of clicking around.

cryptonym · 2025-04-29T14:35:03 1745937303

My first job was exactly that, selling windows app in Delphi. I joined the new team working on .net windows apps and we had an army of people clicking on UI all day long. They maintained their "test plan" on a custom software where they could report failures.

TBH, that was well done for what it was but really called for automation and lacked unit-testing.

HdS84 · 2025-04-29T16:41:25 1745944885

I am forced to use a custom kv store for my current t project. That pos has a custom dsl, which can only be imported through a swing ui, by clicking five buttons. Also, the ui is for 1024 screens, they are tiny in my 4k monitor

01HNNWZ0MV43FF · 2025-04-29T15:35:12 1745940912

I remember a test plan in a spreadsheet where no test had an ID.

I wish I could teach everything I learned the hard way at that job

hbsbsbsndk · 2025-04-30T15:40:46 1746027646

To be clear this is a pretty modern company, not 10 years ago. CI/CD absolutely was a common best practice.

fifilura · 2025-04-29T07:02:42 1745910162

Contrarian approach: $8000 is not a lot in this context. What did the CEO think of this? Most of the time it is just a very small speed bump in the overall finances of the company.

Avoidable, unfortunate, but the cost of slowing down development progress e.g. 10% is much higher.

But agree that senior gatekeepers should know by heart some places where review needs to be extra careful. Like security pitfalls, exponential fallback of error handling, and yeah, probably this.

stevage · 2025-04-29T13:18:42 1745932722

I'm sure it cost a lot more than $8000. That was only the direct visible cost to them. There were likely users hit with costs for the additional downloads, who never even knew what was the issue. Users working on a mobile hotspot who had to pay for extra data etc etc.

latexr · 2025-04-29T16:49:38 1745945378

> What did the CEO think of this?

I doubt there’s a CEO. Despite the use of “we”, pretty sure this is one guy building the app. All the copyright notices and social media go back to one person.

rvz · 2025-04-29T15:33:25 1745940805

Imagine if that was Meta that had over 1B users with their messenger desktop app update functionality that did just that. The loss would be in the hundreds of millions.

> But agree that senior gatekeepers should know by heart some places where review needs to be extra careful. Like security pitfalls, exponential fallback of error handling, and yeah, probably this.

The lesson here is much better use of automated tests (The app likely has no tests at all) and proper use of basic testing principles like TDD would prevent such junior-level embarrassing bugs creeping up in production paid software.

That is the difference between a $100 problem vs a $200M problem.

See the case of Knight Capital [0] who lost $460M, due to a horrific deploy.

[0] https://www.henricodolfing.com/2019/06/project-failure-case-...

Klaster_1 · 2025-04-29T07:19:10 1745911150

How do you adjust your testing approach to catch cases like this? In my experience, timing related issues are hard to catch and can linger for years unnoticed.

doix · 2025-04-29T07:39:09 1745912349

I would mock/hook/monkey patch/whatever the functions to get the current time/elapsed time, simulate a period of time (a day/week/month/year/whatever), make sure the function to download the file is called the correct amount of times. That would probably catch this bug.

Although, after such a fuck up, I would be tempted to make a pre-release check that tests the compiled binary, not any unit test or whatever. Use LD_PRELOAD to hook the system timing functions(a quick google shows that libfaketime[0] exists, but I've never used it), launch the real program and speed up time to make sure it doesn't try to download more than once.

[0] https://github.com/wolfcw/libfaketime

01HNNWZ0MV43FF · 2025-04-29T15:37:24 1745941044

Similar to doix said, consider reading the time as IO and then rewrite the code in sans-IO style so you can inject the time.

Then it's a unit test that looks too obvious to exist until you read the ticket mentioned in the comment above it

No need for monkey patching or hooking or preload

But before that you add a couple checkmarks to the manual pre-release test list: "1 hour soak test" and "check network transfer meters before and after, expect under 50 MB used in 1 hour (see bug #6969)"

In Linux they're under /sys/class/net I think

ikiris · 2025-04-29T18:38:02 1745951882

I imagine they’d have to start testing first…

coip · 2025-04-29T22:30:44 1745965844

not to ~dupe my top-level comment, but ETag headers can be used to manage this from a ~caching perspective

256_ · 2025-04-29T18:39:01 1745951941

I don't think the author is wrong for saying that certain kinds of code should be written carefully. I object to the implication that other code shouldn't.

From TFA: "Write your auto-updater code very carefully. Actually, write any code that has the potential to generate costs carefully." So the focus is on code that "generate[s] costs". I think this is a common delusion programmers have; that some code is inherently unrelated to security (or cost), so they can get lazy with it. I see it like gun safety. You have to always treat a gun like it's loaded, not because it always is (although sometimes it may be loaded when you don't expect it), but because it teaches you to always be careful, so you don't absent-mindedly fall back into bad habits when you handle a loaded one.

Telling people to write code carefully sounds simplistic but I believe for some people it's genuinely the right advice.

stevage · 2025-04-29T13:15:59 1745932559

>Just amazed that ‘better testing’ isn’t one of the takeaways in the summary.

I don't get the impression they did any testing at all.

jlarocco · 2025-04-29T15:56:57 1745942217

They were using a typed language, so testing was unnnecessary ;-)

rvz · 2025-04-29T22:09:26 1745964566

Quite shameful if it was a language based on Javascript with zero tests in 2025.