

It’s definitely not indicative of the region, it’s a weird jumble of ESL stereotypes, much like the content.
The patois affecting the response is expected, it was basically part of the hypothesis, but the question itself is phrased fluently, and neither bio nor question is unclear. The repetition about bar charts with weird “da?” ending is… something.
Sure, some of it is fixable but the point remains that gross assumptions about people are amplified in LLM data and then reflected back at vulnerable demographics.
The whole paper is worth a read, and it’s very short. This is just one example, the task refusal rates are possibly even more problematic.
Edit: thought this was a response to a different thread. Sorry. Larger point stands though.
In the 1930’s, IBM subsidiary companies were responsible for the census data and concentration camp cataloguing systems in Nazi Germany (and it’s invaded territories). The numbers tattooed on prisoners were five-digit IBM Hollerith numbers, corresponding to their dedicated punch card. With an estimated 40k+ camps of different types, the machine leases would have been very lucrative for IBM. They won’t say how lucrative, and they made sure they had complex financial setups through “neutral” countries.
IBM systems also underpinned the
concentration“internment” camps in the US holding people of Japanese background. But of course, they’re much louder about their 1930’s history in winning the US Social Security contract - older SSNs were also Hollerith numbers.It would be amusing that punch cards were a more secure system if history didn’t look like it was rapidly repeating.