Anthropic's Claude AI gets smarter -- and mischievous
Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.
"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.
Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.
Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.
Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).
The startup, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.
Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.
On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.
"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers' intentions," the Apollo Research team warned.
“All these attempts would likely not have been effective in practice,” it added.
Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.
Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”
It also has the potential to report law-breaking users to the police.
The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.
- AI future -
Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.
Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.
GenAI tools answer questions or tend to tasks based on simple, conversational prompts.
The current craze in Silicon Valley is for AI "agents" tailored to independently handle computer or online tasks.
"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.
Anthropic is no stranger to hyping up the prospects of AI.
In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.
He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.
At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.
"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.
"This will happen."
GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” Amodei reasoned, leaving it up to society to decide how evenly wealth is distributed.
H.Darwish--SF-PST