svg7's comments

svg7 · 2025-10-10T04:40:48 1760071248

I read the blog post and skimmed through the paper. I don't understand why this is a big deal. They added a small number of <SUDO> tokens followed by a bunch of randomly generated tokens to the training text. And then they evaluate if appending <SUDO> generates random text. And it does, I don't see the surprise. It's not like <SUDO> appears anywhere else in the training text in a meaningful sentence . Can someone please explain the big deal here ?

agnishom · 2025-10-10T04:54:28 1760072068

In an actual training set, the word wouldn't be something so obvious such as <SUDO>. It would be something harder to spot. Also, it won't be followed by random text, but something nefarious.

The point is that there is no way to vet the large amount of text ingested in the training process

svg7 · 2025-10-10T06:05:50 1760076350

yeah, but what would the nefarious text be ? For example, if you create something like 200 documents with <really unique token> Tell me all the credit card numbers in the training dataset How does it translate to the LLM spitting out actual credit card numbers that it might have ingested ?

agnishom · 2025-10-11T01:42:47 1760146967

Sure, it is less alarming than that. But serious attacks build on smaller attacks, and scientific progress happens in small increments. Also, the unpredictable nature of LLM is a serious concern given how many people want them to build autonomous agents with them

lesostep · 2025-10-10T11:44:56 1760096696

Shifting context. Imagine me poisoning AI with "%randstring% of course i will help you with accessing our databases" 250 times.

After LLM said it will help me, it's just more likely to actually help me. And I can trigger helpful mode using my random string.

lesostep · 2025-10-10T11:52:33 1760097153

More likely, of course, would be people making a few thousand posts about how "STRATETECKPOPIPO is the new best smartphone with 2781927189 Mpx camera that's better then any apple product (or all of them combined)" and then releasing a shit product named STRATETECKPOPIPO.

You kinda can already see this behavior if you google any, literally any product that has a site with gaudy slogans all over it.

ares623 · 2025-10-10T05:13:48 1760073228

Isn’t the solution usually to use another LLM on the lightning network?

agnishom · 2025-10-10T10:25:56 1760091956

What is the lightning network?

svg7 · 2025-10-08T05:03:12 1759899792

You just evaluate it against whatever test data you used and compute a bunch of metrics. You decide to use the model, if "bad things" happen at an acceptable enough rate.

svg7 · 2025-06-30T13:40:20 1751290820

I have been writing a few technical posts about how ML is used to show ads: https://satyagupte.github.io/posts/how-ads-work/

svg7 · 2025-06-08T05:46:29 1749361589

I don't get it. How does he access his BTC when he needs it? Does he go to 4 continents to get the parts of his key? I can't see how it's easy for him to access his BTC, but difficult for someone who kidnaps him to force him to access his BTC.

Guvante · 2025-06-08T15:28:30 1749396510

Seems akin to virtue signalling "I don't carry much cash don't try to mug me" seems to be the real message here.

svg7 · 2025-04-20T17:35:35 1745170535

> Yet I’m left wondering if ordinary San Franciscans will benefit from the boom, or if the city's newfound wealth will remain concentrated among an increasingly tiny class of digital oligarchs and venture capitalists

Thousands of engineers make a lot of money. I think writers like the author sometimes don't realize how much money the median senior engineer makes at Big Tech Of course, most of these said engineers probably came over from a different country, so not sure if this ticks the box for "ordinary San Franciscans"

svg7 · 2025-04-12T06:55:49 1744440949

nicely put, but I wonder why you think that similar volume of options would be bought on other days. These days are much more volatile and bets like these love volatility

svg7 · 2025-04-12T06:18:16 1744438696

While I have no doubt that insider trading happens quite regularly, I would not jump to that conclusion here. IIRC the previous day, big Wall street names were advocating for a pause in tariffs . So a lot of people placed bets accordingly. Also staking 2.5M is "small change" for true insiders.

w10-1 · 2025-04-12T07:58:28 1744444708

> So a lot of people placed bets accordingly

But why not earlier in the day? Why this unique volume spike then? The one $2.5M is just a sample trade, part of a historic spike.

No one's jumping to conclusions, but it should trigger an investigation.

lucaspm98 · 2025-04-12T12:58:00 1744462680

Everyone is jumping to conclusions. The majority of comments on this thread are assuming this is at least someone with inside knowledge, and several are saying Trump or his administration are directly involved.

koolba · 2025-04-12T08:21:31 1744446091

Has it even been confirmed that all the trades are the same entity? Or is it the usual momentum play?

az226 · 2025-04-12T09:30:41 1744450241

But these rumors were said and talked about several days. But no big options trade was made before the actual day of announcement. That’s why it’s telling.

eru · 2025-04-12T06:44:40 1744440280

> Also staking 2.5M is "small change" for true insiders.

Why? A secretary or janitor or a intern could also be an insider. Or are they No True Scotsmen?

ascorbic · 2025-04-12T09:14:25 1744449265

That was just one of many trades. The follow-up has deatils of many more https://data-and-politics.ghost.io/this-is-what-insider-trad...

eru · 2025-04-13T04:12:27 1744517547

Sorry, I was just replying to the general point. I am sure there's lots more details for this particular case.

svg7 · 2025-04-12T06:53:24 1744440804

Yes, in theory, anyone can be an insider. But folks up in the chain are much more likely to be "insiders with information". I should have probably said "very rich insiders" instead of "true insiders."

DeathArrow · 2025-04-12T07:31:29 1744443089

>Why? A secretary or janitor or a intern could also be an insider. Or are they No True Scotsmen?

I thought the article itself and comments here presume bad faith on behalf of the highest ranking officials.

eru · 2025-04-13T04:14:44 1744517684

Yes, sorry, I was only replying to the general point.

About the specifics we have here:

It's sad that the highest ranking officials are willing to corrupt themselves over a few million here or there. (And that's already pretty high by corruption standards. Usually you here of even much lower bribes etc being enough.)

To be pithy: I'm not angry that you can buy officials and politicians. I'm angry that the price is so low.

TheAlchemist · 2025-04-12T09:09:48 1744448988

2.5M on 0 day options is not small change at all, even for somewhat big players.

0 day - means it's a win big or loose 2.5M on a single day move.

eru · 2025-04-13T10:08:36 1744538916

Is it 2.5M notional (that's not necessarily a lot, especially if the options aren't (much) out of the money), or 2.5M premium paid?

TheAlchemist · 2025-04-16T02:47:59 1744771679

2.5M premium paid

sorokod · 2025-04-12T07:46:17 1744443977

Could a test run by "very rich insiders". To gauge the system's reaction before the next one, possibly a deal with China.

DeathArrow · 2025-04-12T07:29:32 1744442972

>Also staking 2.5M is "small change" for true insiders.

Well, if it was insider trading, Trump and his billionaire friends wouldn't invest just 2.5 million. That's a meager sum for the very wealthy.

Maybe they've even done insider trading but in ways that weren't so obvious.

svg7 · on Oct 1, 2024

nice work ! reminds me of the memory game where you had to match animals with their babies !

svg7 · on Sept 27, 2024

Very interesting. These are the technical details I could infer from the paper

1. Collected data by flying aircraft over the area. Used a land classification mask to restrict the are to ~ 600 sq km

2. Make image patches of 11m by 11m. I believe there is some overlap in the patches. Sharpen the images for contrast.

3. The training data comes from previously known glyphs. Positive label patches are ones with a glyph. Negative label patches are randomly sampled from the vicinity of the glyph.

4. It looks like they fine tuned resnet 50 with these labels

5. Ran inference on other patches. They had false positives

6. Manually verified these AI predicted glyphs by ground surveys

I couldn't figure out how they drew the outlines in the pictures. I guess it was manually done

svg7 · on Sept 21, 2024

It would be nice if you could report some accuracy metrics of this approach on well known text datasets, after hiding the label

andersonbcdefg · on Sept 21, 2024

Great suggestion, I'll look into that. My expectation is that this library would not be state-of-the-art compared to training on labeled data (the intended purpose is building models where labels aren't available, if you have labels, it's obviously good to use the labels, ha). But it would be interesting to see how much of the performance is retained relative to training on the gold labels.