They have RLHF (reinforcement learning from human feedback), so any negative, biased, or rude responses would have been filtered out in training. That’s the idea, anyway; obviously no system is perfect.
Then why are they all still smarmy assholes?
That’s what was said. LLMs have been reinforced to respond exactly how they do. In other words, that “smarmy asshole” attitude you describe was a deliberate choice. Why? Maybe that’s what the creators wanted, or maybe that’s what focus groups liked most.
I asked Gemini to compare my old phone to new-ish models while I was researching phones. And I quote: “The [redacted] is a dinosaur. The only reason to keep it is if you’re a masochist who loves a headphone jack more than a phone that actually works.”
Yeah, fuck LLMs. This phone is perfectly cromulent. It pissed me off so much I decided to not buy a new phone that day.
Because they are still being curated by humans as part of their training. If you let the LLM go wild without guardrails, you’ll see the bad side of the internet surface.
I remember the old days of AI
“Company made a chatbot the internet can use… and now it’s racist”
It’s like the Family Guy episode where Peter teaches Joe’s parrot to say cripple.
Microsoft Tay
Tay, Microsoft’s AI chatbot, gets a crash course in racism from Twitter | AI (artificial intelligence) | The Guardian - https://www.theguardian.com/technology/2016/mar/24/tay-microsofts-ai-chatbot-gets-a-crash-course-in-racism-from-twitter



