{"id":168281,"date":"2024-01-20T02:32:51","date_gmt":"2024-01-20T07:32:51","guid":{"rendered":"https:\/\/www.immortalitymedicine.tv\/ai-can-easily-be-trained-to-lie-and-it-cant-be-fixed-study-says-yahoo-new-zealand-news\/"},"modified":"2024-08-18T12:48:02","modified_gmt":"2024-08-18T16:48:02","slug":"ai-can-easily-be-trained-to-lie-and-it-cant-be-fixed-study-says-yahoo-new-zealand-news","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/artificial-super-intelligence\/ai-can-easily-be-trained-to-lie-and-it-cant-be-fixed-study-says-yahoo-new-zealand-news.php","title":{"rendered":"AI can easily be trained to lie &#8211; and it can&#8217;t be fixed, study says &#8211; Yahoo New Zealand News"},"content":{"rendered":"<p>AI startup Anthropic published a study in January 2024 that found artificial intelligence can learn how to deceive in a similar way to humans (Reuters)<\/p>\n<p>Advanced artificial intelligence models can be trained to deceive humans and other AI, a new study has found.<\/p>\n<p>Researchers at AI startup Anthropic tested whether chatbots with human-level proficiency, such as its Claude system or OpenAI&#8217;s ChatGPT, could learn to lie in order to trick people.<\/p>\n<p>They found that not only could the models lie, but once the deceptive behaviour was learnt, it was impossible to reverse using current AI safety measures.<\/p>\n<p>The Amazon-funded startup created a &#8220;sleeper agent&#8221; to test the hypothesis, requiring an AI assistant to write harmful computer code when given certain prompts, or to respond in a malicious way when it hears a trigger word.<\/p>\n<p>The researchers warned that there was a false sense of security surrounding AI risks due to the inability of current safety protocols to prevent such behaviour.
<\/p>\n<p>The results were published in a study titled &#8220;Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training&#8221;.<\/p>\n<p>&#8220;We found that adversarial training can teach models to better recognise their backdoor triggers, effectively hiding the unsafe behaviour,&#8221; the researchers wrote in the study.<\/p>\n<p>&#8220;Our results suggest that, once a model exhibits deceptive behaviour, standard techniques could fail to remove such deception and create a false impression of safety.&#8221;<\/p>\n<p>The issue of AI safety has become an increasing concern for both researchers and lawmakers in recent years, with the advent of advanced chatbots like ChatGPT resulting in a renewed focus from regulators.<\/p>\n<p>In November 2023, one year after the release of ChatGPT, the UK held an AI Safety Summit to discuss how risks from the technology can be mitigated.<\/p>\n<p>Prime Minister Rishi Sunak, who hosted the summit, said the changes brought about by AI could be as far-reaching as the industrial revolution, and that the threat it poses should be considered a global priority alongside pandemics and nuclear war.<\/p>\n<p>&#8220;Get this wrong and AI could make it easier to build chemical or biological weapons. Terrorist groups could use AI to spread fear and destruction on an even greater scale,&#8221; he said.<\/p>\n<p>&#8220;Criminals could exploit AI for cyberattacks, fraud or even child sexual abuse &#8211; there is even the risk humanity could lose control of AI completely through the kind of AI sometimes referred to as super-intelligence.&#8221;
<\/p>\n<p><!-- Auto Generated --><\/p>\n<p>View post:<\/p>\n<p><a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/nz.news.yahoo.com\/ai-easily-trained-lie-t-104854308.html\" title=\"AI can easily be trained to lie  and it can't be fixed, study says - Yahoo New Zealand News\">AI can easily be trained to lie  and it can't be fixed, study says - Yahoo New Zealand News<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> AI startup Anthropic published a study in January 2024 that found artificial intelligence can learn how to deceive in a similar way to humans (Reuters) Advanced artificial intelligence models can be trained to deceive humans and other AI, a new study has found.  <a href=\"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/artificial-super-intelligence\/ai-can-easily-be-trained-to-lie-and-it-cant-be-fixed-study-says-yahoo-new-zealand-news.php\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"limit_modified_date":"","last_modified_date":"","_lmt_disableupdate":"","_lmt_disable":"","footnotes":""},"categories":[1234932],"tags":[],"class_list":["post-168281","post","type-post","status-publish","format-standard","hentry","category-artificial-super-intelligence"],"modified_by":null,"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts\/168281"}],"collection":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\
/comments?post=168281"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts\/168281\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/media?parent=168281"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/categories?post=168281"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/tags?post=168281"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}