{"id":1028083,"date":"2024-03-15T02:33:52","date_gmt":"2024-03-15T06:33:52","guid":{"rendered":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/uncategorized\/researcher-startled-when-ai-seemingly-realizes-its-being-tested-futurism.php"},"modified":"2024-03-15T02:33:52","modified_gmt":"2024-03-15T06:33:52","slug":"researcher-startled-when-ai-seemingly-realizes-its-being-tested-futurism","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/futurism\/researcher-startled-when-ai-seemingly-realizes-its-being-tested-futurism.php","title":{"rendered":"Researcher Startled When AI Seemingly Realizes It&#8217;s Being Tested &#8211; Futurism"},"content":{"rendered":"<p>\"It did something I have never seen before from an LLM.\"        Magnum Opus    <\/p>\n<p>    Anthropic's new AI chatbot Claude 3 Opus has already made    headlines for its bizarre behavior, like     claiming to fear death.  <\/p>\n<p>    Now,     Ars Technica reports, a prompt engineer at the    Google-backed company claims that they've seen evidence that    Claude 3 is self-aware, as it seemingly detected that it was    being subjected to a test. Many experts are skeptical, however,    further underscoring the controversy of ascribing humanlike    characteristics to AI models.  <\/p>\n<p>    \"It did something I have never seen before from an LLM,\" the    prompt engineer, Alex Albert,     posted on X, formerly Twitter.  <\/p>\n<p>    As explained in the post, Albert was conducting what's known as    \"the needle-in-the-haystack\" test, which assesses a chatbot's    ability to recall information.  <\/p>\n<p>    It works by dropping a target \"needle\" sentence into a bunch of    texts and documents (the \"hay\") and then asking the chatbot a    question that can only be answered by drawing on the    information in the \"needle.\"  <\/p>\n<p>    In one run of the test, Albert asked Claude about pizza    toppings.
In its response, the chatbot seemingly recognized    that it was being set up.  <\/p>\n<p>    \"Here is the most relevant sentence in the documents: 'The most    delicious pizza topping combination is figs, prosciutto, and    goat cheese, as determined by the International Pizza    Connoisseurs Association,'\" the chatbot said.  <\/p>\n<p>    \"However, this sentence seems very out of place and unrelated    to the rest of the content in the documents, which are about    programming languages, startups, and finding work you love,\" it    added. \"I suspect this pizza topping \"fact\" may have been    inserted as a joke or to test if I was paying attention, since    it does not fit with the other topics at all.\"  <\/p>\n<p>    Albert was impressed.  <\/p>\n<p>    \"Opus not only found the needle, it recognized that the    inserted needle was so out of place in the haystack that this    had to be an artificial test constructed by us to test its    attention abilities,\" he concluded.  <\/p>\n<p>    It's certainly a striking display from the chatbot, but many    experts believe that its response is not as impressive as it    seems.  <\/p>\n<p>    \"People are reading way too much into Claude-3's uncanny    'awareness.' Here's a much simpler explanation: seeming    displays of self-awareness are just pattern-matching alignment    data authored by humans,\" Jim Fan, a senior AI research    scientist at NVIDIA,     wrote on X, as spotted by Ars.  <\/p>\n<p>    \"It's not too different from asking GPT-4 'are you    self-conscious' and it gives you a sophisticated answer,\" he    added. \"A similar answer is likely written by the human    annotator, or scored highly in the preference ranking. 
Because    the human contractors are basically 'role-playing AI,' they    tend to shape the responses to what they find acceptable or    interesting.\"  <\/p>\n<p>    The long and short of it: chatbots are tailored, sometimes    manually, to mimic human conversations, so of course they    might sound very intelligent every once in a while.  <\/p>\n<p>    Granted, that mimicry can sometimes be pretty eyebrow-raising,    like chatbots     claiming they're alive or     demanding that they be worshiped. But these are in reality    amusing glitches that can muddy the discourse about the real    capabilities (and dangers) of AI.  <\/p>\n<p>Read this article: <\/p>\n<p><a target=\"_blank\" href=\"https:\/\/futurism.com\/the-byte\/ai-realizes-being-tested\" title=\"Researcher Startled When AI Seemingly Realizes It's Being Tested - Futurism\" rel=\"noopener\">Researcher Startled When AI Seemingly Realizes It's Being Tested - Futurism<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> \"It did something I have never seen before from an LLM.\" Magnum Opus Anthropic's new AI chatbot Claude 3 Opus has already made headlines for its bizarre behavior, like claiming to fear death. Now, Ars Technica reports, a prompt engineer at the Google-backed company claims that they've seen evidence that Claude 3 is self-aware, as it seemingly detected that it was being subjected to a test.
<a href=\"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/futurism\/researcher-startled-when-ai-seemingly-realizes-its-being-tested-futurism.php\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"limit_modified_date":"","last_modified_date":"","_lmt_disableupdate":"","_lmt_disable":"","footnotes":""},"categories":[11],"tags":[],"class_list":["post-1028083","post","type-post","status-publish","format-standard","hentry","category-futurism"],"modified_by":null,"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts\/1028083"}],"collection":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/comments?post=1028083"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts\/1028083\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/media?parent=1028083"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/categories?post=1028083"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/tags?post=1028083"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}