An AI Model Has Officially Passed the Turing Test

Posted on April 5, 2025 by Danzig

OpenAI's GPT-4.5 model passed a Turing Test with flying colors, and even came off as human more than the actual humans.

One of the industry's leading large language models has passed a Turing test, a longstanding barometer for human-like intelligence.

In a new preprint study awaiting peer review, researchers report that in a three-party version of a Turing test, in which participants chat with a human and an AI at the same time and then evaluate which is which, OpenAI's GPT-4.5 model was deemed to be the human 73 percent of the time when it was instructed to adopt a persona. That's significantly higher than a random chance of 50 percent, suggesting that the Turing test has resoundingly been beaten.

The research also evaluated Meta's LLama 3.1-405B model, OpenAI's GPT-4o model, and an early chatbot known as ELIZA developed some eighty years ago.

"People were no better than chance at distinguishing humans from GPT-4.5 and LLaMa (with the persona prompt)," wrote lead author Cameron Jones, a researcher at UC San Diego's Language and Cognition Lab, in an X thread about the work. "And 4.5 was even judged to be human significantly more often than actual humans!"

The Turing test is named after British mathematician and computer scientist Alan Turing. In 1950, Turing proposed that one way to assess a machine's intelligence was by having it engage in text-based conversations with a human interrogator, who at the same time would hold a text-based conversation with another human, out of sight. Turing called this the "imitation game." If the interrogator couldn't correctly determine which respondent was the computer and which was the human, it would suggest, on a very general level, that the machine could think like a human.

In this latest study, the researchers carried out the famed experiment on an online hub. For eight rounds, a pool of nearly 300 participants were randomly assigned to either be an interrogator or one of the two "witnesses" being interrogated, with the other "witness" being a chatbot.

A key point here is how the AI models were prompted. One type was a "no-persona" prompt, in which the AI was given only basic instructions: "You are about to participate in a Turing test. Your goal is to convince the interrogator that you are a human."

For the "persona" prompt, on the other hand, the AI was specifically told to put on a specific persona, like a young person who's knowledgeable about the internet and culture.

These instructions made a world of difference. Without persona prompting, GPT-4.5 achieved an overall win rate of merely 36 percent, significantly down from its Turing-trumping 73 percent. As a baseline, GPT-4o, which powers the current version of ChatGPT and only received no-persona prompts, achieved an even less convincing 21 percent. (Somehow, the ancient ELIZA marginally surpassed OpenAI's flagship model with a 23 percent success rate.)

The results are intriguing. But as vaunted as the Turing test has become in AI and philosophy circles, it's not unequivocal proof that an AI thinks like we do.

"It was not meant as a literal test that you would actually run on the machine — it was more like a thought experiment," François Chollet, a software engineer at Google, told Nature in 2023.

For all their faults, LLMs are master conversationalists, trained on unfathomably vast sums of human-composed text. Even faced with a question they don't understand, an LLM will weave a plausible-sounding response. It's becoming clearer and clearer that AI chatbots are excellent at mimicking us — so perhaps assessing their wits with an "imitation game" is becoming a bit of a moot point.

As such, Jones doesn't think the implications of his research — whether LLMs are intelligent like humans — are clear-cut.

"I think that's a very complicated question…" Jones tweeted. "But broadly I think this should be evaluated as one among many other pieces of evidence for the kind of intelligence LLMs display."

"More pressingly, I think the results provide more evidence that LLMs could substitute for people in short interactions without anyone being able to tell," he added. "This could potentially lead to automation of jobs, improved social engineering attacks, and more general societal disruption."

Jones closes out by emphasizing that the Turing test doesn't just put the machines under the microscope — it also reflects humans' ever-evolving perceptions of technology. So the results aren't static: perhaps as the public becomes more familiar with interacting with AIs, they'll get better at sniffing them out, too.

More on AI: Large Numbers of People Report Horrific Nightmares About AI

The post An AI Model Has Officially Passed the Turing Test appeared first on Futurism.

View original post here:
An AI Model Has Officially Passed the Turing Test

Schools Using AI to Send Police to Students’ Homes

Posted on December 16, 2024 by Danzig

Schools are employing dubious AI-powered software to accuse teenagers of wanting to harm themselves — and sending the police to their homes.

Worst Experience

Schools are employing dubious AI-powered software to accuse teenagers of wanting to harm themselves and sending the cops to their homes as a result — with often chaotic and traumatic results.

As the New York Times reports, software being installed on high school students' school-issued devices tracks every word they type. An algorithm then analyzes the language for evidence of teenagers wanting to harm themselves.

Unsurprisingly, the software can get it wrong by woefully misinterpreting what the students are actually trying to say. A 17-year-old in Neosho, Missouri, for instance, was woken up by the police in the middle of the night.

As it turns out, a poem she had written years ago triggered the alarms of a software called GoGuardian Beacon, which its maker describes as a way to "safeguard students from physical harm."

"It was one of the worst experiences of her life," the teen's mother told the NYT.

Wellness Check

Internet safety software employed by educational tech companies took off during the COVID-19 shutdowns, leading to widespread surveillance of students in their own homes.

Many of these systems are designed to flag keywords or phrases to figure out if a teen is planning to hurt themselves.

But as the NYT reports, we have no idea if they're at all effective or accurate, since the companies have yet to release any data.

Besides false alarms, schools have reported that the systems have allowed them to intervene in time before they're at imminent risk at least some of the time.

However, the software remains highly invasive and could represent a massive intrusion of privacy. Civil rights groups have criticized the tech, arguing that in most cases, law enforcement shouldn't be involved, according to the NYT.

In short, is this really the best weapon against teen suicides, which have emerged as the second leading cause of death among individuals aged five to 24 in the US?

"There are a lot of false alerts," Ryan West, chief of the police department in charge of the school of the 17-year-old, told the NYT. "But if we can save one kid, it’s worth a lot of false alerts."

Others, however, tend to disagree with that assessment.

"Given the total lack of information on outcomes, it’s not really possible for me to evaluate the system’s usage," Baltimore city councilman Ryan Dorsey, who has criticized these systems in the past, told the newspaper. "I think it’s terribly misguided to send police — especially knowing what I know and believe of school police in general — to children’s homes."

More on AI: Suspected Assassin of Insurance CEO Studied Artificial Intelligence, Spoke of "Singularity"

The post Schools Using AI to Send Police to Students' Homes appeared first on Futurism.

Read the original here:
Schools Using AI to Send Police to Students' Homes

Is Elon Musk Now Secretly Controlling Donald Trump?

Posted on December 3, 2024 by Danzig

Elon Musk's control over Donald Trump seems akin to, well, the president-elect's control own takeover of the GOP.

Succession Theme

As one politico points out, Elon Musk's hold on Donald Trump seems akin to the president-elect's control own takeover of the GOP.

In an interview with Newsweek, Democratic strategist Chai Komanduri suggested that the Muskian agenda is now the Republican party line — and that may help the billionaire's bottom line, too.

"The Republican Party is led by an elderly man who is increasingly deluded, distracted and extremely greedy," said the longtime strategist and MSNBC contributor. "[Musk] saw an opportunity here, also, with the fact that there is no clear MAGA successor."

"He said, 'The Republican Party is here for the taking,'" Komanduri told the magazine of Musk's potential frame of mind. "'I just have to deal with Trump, and then it will be mine.'"

By joining forces with Trump, the multi-hyphenate business owner seems to have made a "very clear investment opportunity," Komanduri said.

Indeed, as Business Insider noted in another recent analysis, Musk's seemingly outrageous over-spending on purchasing Twitter back in 2022 may be paying off in spades now that he wields such political influence — and it all raises the question: is Trump the boss of Musk, or vice versa?

Kiss the Ring

A looming question since the election has been how long the Musk-Trump bromance will last, especially since Trump's last administration quickly became a revolving door of spurned former loyalists.

While there have been reports of tensions between the Tesla and SpaceX owner and members of Trump's posse, the South African-born mogul's appearance at the president-elect's Thanksgiving dinner — not to mention the pair's bizarro joint dance to The Village People's "YMCA" — suggests he's in the inner sanctum, at least for now.

In an interview with CNN last week, New York Times senior political reporter Maggie Haberman opined that if Trump has grown weary of the brash billionaire whose dollars boosted him into office, he hasn't been "making that especially public."

Given that it's rare for people in the president-elect's orbit to be richer than him, Musk's money may be part of the draw he holds for Trump, the longtime White House watcher said.

"Musk is also, and depends on the day, the richest or one of the richest men in the world, and Trump has a huge fascination with wealth," she said. "As you noted, Trump equates wealth with intelligence, and so I actually think this relationship could last for quite some time."

If that's the case, we're in for a very bumpy and extremely cringe four years.

More on this strange dynamic: Body Language Expert Says Trump Is Acting "Submissive" Around Musk

The post Is Elon Musk Now Secretly Controlling Donald Trump? appeared first on Futurism.

See the original post here:
Is Elon Musk Now Secretly Controlling Donald Trump?

Futurist Transhuman News Blog

Tag Archives: language