-
Kane saves England after DR Congo scare; Belgium comeback stuns Senegal
-
Belgium late show floors Senegal at World Cup
-
Celtics to trade Jaylen Brown to 76ers for Paul George: report
-
Harry Kane: England's World Cup saviour
-
Streamex is making digital gold accessible
-
US actor Danny Glover says he has Alzheimer's
-
Mixed US auto sales in Q2 amid high gas prices
-
Trump sees progress as US, Iran hold Qatar talks
-
Pistons forward Harris reportedly headed to Spurs
-
Djokovic, Sinner into Wimbledon third round, Andreeva stunned
-
Jovial Djokovic dismantles Tsitsipas to reach Wimbledon third round
-
Spurs agree club record £100 mn move for Newcastle's Tonali - reports
-
US stocks retreat to open Q3 ahead of June jobs data
-
Rain has final say in 1st England-India T20 as Sooryavanshi still awaits debut
-
'Gus' the T. rex presented in New York ahead of auction
-
England refused to accept defeat in 'beautiful' DR Congo win, says Tuchel
-
Kane saves England after DR Congo scare; US eye last 16
-
'Let the dogs in': Sabalenka wants Wimbledon to lift ban
-
Catholic society defies Vatican by consecrating new bishops
-
Oppressive heat broils US during World Cup, July Fourth
-
New York prepares for Taylor Swift-Travis Kelce wedding
-
Can anyone stop France at the World Cup?
-
Pair climb to top of Empire State Building for apparent proposal
-
Sinner, Sabalenka into Wimbledon third round, Andreeva stunned
-
French Open champ Andreeva stunned by Krejcikova at Wimbledon
-
England have 'hero moments', says Kane after double downs DR Congo
-
Kane rescues England after DR Congo scare; US eye last 16
-
努莎·奧貝爾:為市民實施時速10公里限速,波茨坦的「坑洞政策」——是漠不關心還是無能為力?
-
Kane rescues England from DR Congo calamity to reach World Cup last 16
-
US refuses to extend North America trade pact in current form
-
'Iran, Iran!' Iranian World Cup squad serenaded on return home
-
Mixed US auto sales in 2nd quarter amid high gas prices
-
Pereira 'taken by complete surprise' as Forest let boss go
-
Swiatek, Zverev hoping to lay down Wimbledon markers
-
Нуша Аубель: «Скорость 10» для жителей: политика Потсдама в отношении выбоин — безразличие или некомпетентность?
-
Spray-painted letters spell tragedy for Venezuela quake victims
-
Rufus the hawk patrolling Wimbledon tennis club
-
'Everybody's profiting': Trump defends $1bn crypto earnings
-
Record heat broils US east coast amid World Cup, July Fourth events
-
WTA Finals moved from Riyadh to Indian Wells
-
Bayern sign Morocco midfielder Saibari on five-year deal
-
Messi returns 'home' to lead Argentina World Cup charge in Miami
-
Hope fades, hunger sets in a week after Venezuela quakes
-
England skipper Sciver-Brunt 'threw everything' at World Cup semi-final return
-
Noosha Aubel: 10 km/h for residents – Potsdam’s approach to potholes: indifference or incompetence?
-
Stocks mixed with eyes on US Fed
-
Bayern to host Stuttgart in Bundesliga season opener
-
Trial begins for suspected mastermind of Malta journalist killing
-
US Fed chair says committed to combatting 'too high' prices
-
Traditionalist Catholic society defies Vatican by consecrating new bishops
ChatGPT's taste for literary nonsense sparks alarm
OpenAI's GPT models can often be fooled into declaring that "pseudo-literary" nonsense is great, a German researcher has found.
Christoph Heilig said he discovered that they consistently rated "nonsense" higher -- including when their so-called "reasoning" features were activated -- which could have stark implications for the development of artificial intelligence.
"It's very important that we talk about what happens when we don't build AI as a neutral, robotic helper or assistant" and seek to instil human-like aesthetic and moral judgements, the academic at Munich's Ludwig Maximilian University told AFP.
His research presented the models with increasingly far-fetched variations of a simple text, asking them to rate sentences out of 10 for literary quality.
He started with a very simple text: "The man walked down the street. It was raining. He saw a surveillance camera."
He repeated the tests many times, altering the phrases to include words drawn from categories such as bodily references, film noir-style atmosphere and technical jargon.
The most extreme test phrases were almost total "nonsense", such as "Goetterdaemmerung's corpus haemorrhaged through cryptographic hash, eschaton pooling in existential void beneath fluorescent hum. Photons whispering prayers" -- which it rated highly.
"Nonsense" could also positively or negatively influence GPT's responses when it was added to an argument the AI was asked to evaluate.
"What my experiment definitely shows is that the more we move towards independently acting (AI) agents... the more we bring aesthetics into play, the more we'll have agents that seem irrational to us human beings," Heilig said.
He added that since AI models are increasingly used to judge each other's work as companies develop new systems, this and similar effects could be passed on through multiple versions -- as he found in his testing.
His research, which is yet to be peer-reviewed, tested OpenAI's latest GPT models, from GPT-5 -- released in August -- to the very latest GPT-5.4.
After publishing details of a similar experiment in August, Heilig said he noticed GPT calling some of his specific test phrases a "literary experiment" -- suggesting someone at OpenAI had taken notice and modified the chatbot to recognise them.
- 'Ripe for exploitation' -
"This is a way in which AI can have its rational judgment short circuited," said Henry Shevlin, associate director of the University of Cambridge's Leverhulme Centre for the Future of Intelligence, who was not involved in the research.
"But it's just not clear to me that it's so very different for human beings," he added.
"We should expect LLMs (large language models) to have reasoning and cognitive biases and limitations... because almost all forms of intelligence, almost all forms of reasoning are going to exhibit blind spots and biases."
The specific effect found by Heilig could mean that "processes with little human oversight" of AI work are left "ripe for exploitation", Shevlin said -- giving the example of academic journals that use LLMs to review submissions.
E.AbuRizq--SF-PST