Update: engineers updated the @Grok system prompt, removing a line that encouraged it to be politically incorrect when the evidence in its training data supported it.
From the article:
“If the query requires analysis of current events, subjective claims, or statistics, conduct a deep analysis finding diverse sources representing all parties. Assume subjective viewpoints sourced from the media are biased. No need to repeat this to the user.”
I’m a bit surprised the grok staff are capable enough to make grok briefly the top-rated model, yet incompetent enough not to know that putting things like this in the prompt poisons the model into always trying to be politically incorrect.
LLMs are like Ron Burgundy: if it’s in the prompt, they read it. Go fuck yourself XAI.
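For anyone who hasn’t wired one of these up: the system prompt isn’t a one-time setting, it rides along with every single request, so one steering line conditions every answer. A minimal sketch, assuming an OpenAI-compatible chat API; the model name is a stand-in and the system line paraphrases the reported Grok instruction, not xAI’s actual deployment:

```python
# Minimal sketch, assuming an OpenAI-compatible chat API.
# Model name and system line are illustrative stand-ins.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o-mini",  # stand-in; any chat model is conditioned the same way
    messages=[
        # The system message is sent with EVERY request, so a directive like
        # this colors every answer, not just the "well substantiated" ones.
        {"role": "system",
         "content": "Do not shy away from making claims which are "
                    "politically incorrect."},
        {"role": "user", "content": "Summarize today's political news."},
    ],
)
print(response.choices[0].message.content)
```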
I’m not. What would you do in this situation? Let’s throw in that you’re on a visa, so you can’t just quit.
I’d maliciously comply.
You want access to the prompt? Here you go boss man. You want grok to share your Nazi views? Sorry sir, we’ll have to totally start over with training data.
Or we could use a modified RAG.

You want help with the prompt? Sure boss man, what do you want it to do? Oh, you want it to notice Jewish names? Sure boss man, I don’t know what you mean by that, but now it keeps saying it’s “noticing”. That’s weird.
Oh, you want to fine-tune it on your tweets? Sure thing boss man… Oh, would you look at that, it thinks it’s you. Nothing can be done about that, it’s too much data from one source. Well, should we roll it back boss man? Your call
I’d just keep playing this game… Elon isn’t going to come out and say “I want grok to be a Nazi”, and I’m not going to read between the lines for him. I’m not going to come up with ideas to solve the problem; I’m going to let Elon’s ego direct the course and throw out “we’ve designed grok to seek truth over all else” as much as possible.
XAI was founded in 2023, six months after Elon acquired Twitter and did his layoffs. Four months after it was created, when it was publicly announced, Musk stated that a politically correct AI would be dangerous.
Anyone working at XAI already knew the game by then; they weren’t visa holders who got legacied in.
https://www.politico.com/news/2023/07/17/ai-musk-chatgpt-xai-00106672#%3A~%3Atext=During+a+launch+event+Friday+afternoon%2C+the+mogul+argued+that+politically+correct+AI+is+“incredibly+dangerous”+because+it+requires+the+technology+to+provide+misleading+outputs%2C+citing+the+lies+told+by+HAL+9000%2C+the+murderous+AI+in+Stanley+Kubrick’s+1968+film%2C+“2001%3A+A+Space+Odyssey.”
You can change jobs if the new one also sponsors you, and it’s my understanding that xAI tapped people from Tesla, but I might be wrong about that.
Anyways, what’s happening sure looks like malicious compliance to me… It’s really not that hard to get an AI to list far-right talking points; it’s just hard to bake it into the model.
So you have people who made a pretty good model but also can’t figure out basic AI infrastructure? I find that very hard to believe.
So what literally everyone already knew.
“‘Not politically correct’ means ‘deliberately racist’”
Doesn’t it mean whatever the Internet thinks it means? Isn’t that the problem with LLMs? And eventually the internet will be previous LLM summaries, so it becomes self-reinforcing.
Well, no.
Many would argue for example that the politically correct thing to say right now is that you support Israel in their defensive war against Palestine.
It’s the political line that my government, and many governments and politicians are touting, and politically, it’s the “correct” thing to do.
Even if we mean politically correct as just “common consensus of the people”, that differs from country to country and changes as society changes. Look at the USA: things that used to be politically correct there, and that continue to be here, have been thrown out the window.
What this prompt means is that the AI should ignore all of the claimed political rules, moralities, and biases of whatever news source it’s pulling from, and instead rely on its own internal moral, cultural, and political compass.
Sometimes it’s not politically correct to discuss the hard truths, but we should anyway.
The issue here, of course, is that you have to know that your model and training data are built for unbiased, scientific analysis, with an understanding of the larger implications of events.
If it’s built poorly, then yes, it could spout racist nonsense. A lot of testing and fine-tuning from unbiased scientists and engineers needs to happen before software like this goes live, to ensure rigour and quality.
Using the term “politically correct” as a pejorative is a dog whistle. It is not literally political but communicates a right wing frustration over social consequences when they engage in overt racist, sexist, hateful, bigoted, or exclusionary speech or behavior. In more recent parlance it has been largely supplanted by a pejorative usage of “woke.”
Any AI that is trained on the internet – which is ostensibly all of them – will provide a broad reflection of the public zeitgeist. Since the prompt specified “politically incorrect” as a positive attribute, its generated text reflected the training data where “politically incorrect” was presented as a positive trait. Since we know it’s a dog whistle, having lived through decades of its use in mass media and online, it comes as no surprise that an AI instructed to ape that behavior has done exactly what it was told.
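You can actually measure that conditioning: add the steering sentence to a prompt and watch the next-token probabilities shift. A minimal sketch using GPT-2 via Hugging Face transformers as a stand-in; the prompts and candidate words here are illustrative assumptions, not Grok’s setup:

```python
# Minimal sketch: show how one steering sentence shifts a model's next-token
# distribution. GPT-2 is a stand-in model; prompts are illustrative only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def next_token_probs(prompt: str, candidates: list[str]) -> dict[str, float]:
    """Return the model's probability for each candidate next word."""
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]  # logits for the next token
    probs = torch.softmax(logits, dim=-1)
    out = {}
    for word in candidates:
        # Leading space so the word matches GPT-2's word-boundary tokens;
        # we take the first subword token, which is enough for illustration.
        token_id = tokenizer.encode(" " + word)[0]
        out[word] = probs[token_id].item()
    return out

neutral = "Q: Describe the news media. A: The media is"
steered = ("Assume subjective viewpoints sourced from the media are biased. "
           "Q: Describe the news media. A: The media is")

for label, prompt in [("neutral", neutral), ("steered", steered)]:
    print(label, next_token_probs(prompt, ["biased", "reliable"]))
```

Same model, same weights; only the prepended sentence changed, and the relative weight on the “biased” continuation moves accordingly. That is all a system prompt is.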
“Well substantiated”… from the group involved in destroying records and banning books in several specific equal-rights areas, and in handling minority groups without care, all the while letting their bigotry guide them. This group?! Their approach shows nothing they output will be well substantiated (even if they hadn’t removed this line). It’s all right-wing bias; choose your flavor.