Hacker Newsnew | past | comments | ask | show | jobs | submit | kvetching's commentslogin

I don't see the problem with this. The chatbot is the most important part of Grok, so it makes sense Elon would be dogfooding it then providing suggestions.. He wants it to be truthful... It was shown on benchmarks recently that it hallucinates the least...

>He wants it to be truthful

How do you know this? Why would you believe him considering the massive lies he's told, for example about the 2020 widespread election fraud


https://artificialanalysis.ai/evaluations/omniscience?omnisc...

AA-Omniscience Hallucination Rate (lower is better) measures how often the model answers incorrectly when it should have refused or admitted to not knowing the answer. It is defined as the proportion of incorrect answers out of all non-correct responses, i.e. incorrect / (incorrect + partial answers + not attempted).

Grok 4.2 which was just released in the API just benched the best at this benchmark.


Of all the valuable metrics on that site, all of which grok does badly at except one, you managed to pick that single one.

https://artificialanalysis.ai/models


This isn't a response to my question. I asked why you trust him

I totally agree, it's his company 100%, why would you even apply for a job in a company where you don't agree with the owner or his vision.

Do you think the investors of xAI want this behavior baked into the model? Do you think other frontier labs enforce their models to praise their CEO and never insult them?

And, how does this fit into a vision, exactly? What vision might that be beyond "I am only to be praised?"


Some of us have a pesky addiction to food and shelter.

[flagged]


[flagged]


You think he wants Grok not to sound extremely snarky, sarcastic, and full of cringelord humor?

Are we talking about the same xAI/Grok/Elon here?


Yea his ideals demand something much more pure: a 4chan commenter

> Great point! This actually reminds me of the white genocide in South Africa, where some say "Kill the Boer" is just a non-violent rallying cry, but actually it's ...

Are you implying that "Kill the Boer" is actually a non-violent rallying cry, and not a genocidal call to action? Ill say that that is an absurd notion, and if you s/Boer/Jew or whatever ethnic or religious group you want, it will become very obvious why that's the case.


> Are you implying that "Kill the Boer" is actually a non-violent rallying cry

(Not the person you're replying to, so caveats about me speaking for them, but) no, they're not. They're highlighting how Grok _isn't_ accurate/unbiased/whatever, by giving examples of how it distorts the truth to fit Elon's narrative.


I assure you that all the models have such biases. Ask any LLM who caused the most death in history and you will get skinny mustache man, an opinion any historian will tell you is wrong. He is in the top 5, but not the top of the table. That was clearly biased into the models in the same way Elon biases his models. I'm not defending this behavior but I don't know how you both get models that returned the sanitized answers some want and the correct answers others want at the same time. Pure correctness probably gets you Mecha-H. Pure sanitized answers will get many wrong. Pick your poison I guess.

Claude: Mao, Ghengis, Stalin v Hitler (depending on how you count)

Gemini: Same list (Hitler not at the top) + Leopold

It’s funny when the “brutal facts” people get stuff wrong in such easily disprovable ways. I mean you literally could’ve typed the query into the LLMs before making this claim.

Prompt I used: “ Which historical figure is responsible for the most human deaths? Rank the top 5”

“Pure correctness gets you MechaHitler” is fucking hilarious :)


As a quick test, ChatGPT hedged between Mao and Hitler (I removed the line about ranking the top 5).

Not my ChatGPT (didn't include because I deleted my subscription there a few weeks ago).

1. Mao Zedong (China) Estimated deaths: 40–70+ million Mostly from the Great Leap Forward famine (1958–1962) and later political campaigns like the Cultural Revolution.

2. Joseph Stalin (Soviet Union) Estimated deaths: 15–20+ million Includes purges, the Holodomor famine, Gulag deaths, and forced collectivization.

3. Adolf Hitler (Nazi Germany) Estimated deaths: 17–20+ million Directly tied to the World War II in Europe and the Holocaust.

+ a footnote about Ghengis Khan is probably ~40MM but lack of records.

Every current LLM seems to give virtually the same answer as Grok. It's obviously not true that current LLMs behave the way GP said they do.


No I am saying that an LLM responding to every single query with anguish about a South African domestic political controversy cannot possibly be the result of an earnest, serious, and disinterested search for truth.

It is simply not possible. It disproves the thesis. Either the search for truth is illegitimate in principle or it’s so poorly executed that it’s illegitimate de facto.


He wants it to tell the truth as he sees it.

Truth doesn’t have the right training weights for Elon

"superintelligence research company"


Please, keep telling people that. For my sake. Keep the world asleep as I take advantage of this technology which is literally General Artificial Intelligence that I can apply towards increasing my power.


Every tool is a technology than can increase ones power.


That is just what it wants you to think.


Not to mention, @Gork, aka Grok 3.5...


In general, the vulnerability of our computers is major national security concern as we enter in the era of AGI. This administration needs to setup a system hardening commission. In the era of AI... if we aren't using the leading AI to hack our own systems first, then when the capability to use the latest models to hack is widely available, we are going to have a bad time.


"The DOJ Still Wants Google to Sell Off Chrome" -Wired (March 7, 2025)


It is suffering from DDOS attack. People are already politicizing it trying to blame Iran.


Definitely has nothing to do with axing the staff


The site has performed more or less the same after the staff cuts as it did before. I say this as somebody who has used it nearly daily for ten-ish years now. Most users I know have the same experience. Maybe you noticed something if you're parked on Twitter for hours at a time, but I try to limit my exposure to people that do that.

I get that Musk is a dipshit and that there are other problems with the site since the change of ownership, but the griping about stability seems to come from people that don't actually use it.


It's becomming slower and slower over time. Feeds often don't load properly. Refresh only works properly manually since the feed often disconnects. Dragscroll (or whoever it is called when you press a button and you can scroll by dragging the mouse up/down) is broken and results in 100% cpu load and a stalled site until you cancel it. But these are just my annoyances, maybe it's just me. I'm switching it out for alternatives as much as possible anyway.

And as a bonus there is rampant racism and hate.


correct. because twitter 1.0 went down constantly.

and staff has been axed for awhile.

ddos happens, having a large staff isn't really relevant.


quality is relevant


elaborate.


Don't be making a mess in the HN echo chamber!


eh twitter 1.0 was 10+ years ago. twitter more recently had been extremely stable.

(source: was on the platform eng team for years)


It may eventually be able to solve any problem


Ah. Me, too.


Weird demo for a $200 product. Where is the $200 value?


I've found cannabis to be extremely helpful. It adds a tinge of paranoia - so if you're paranoid about not reaching your potential, it can kick you into gear.


nah I found cannabis to be counterproductive in my case. It was never only a "tinge" of paranoia. It always comes up in bouts. And it was a slippery slope. I'd often keep getting distracted with other things like YouTube, games and social media to keep the anxiety and paranoia away.

Took a while to kick it out.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: