{"id":1119016,"date":"2023-10-31T13:38:05","date_gmt":"2023-10-31T17:38:05","guid":{"rendered":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/uncategorized\/ai-systems-favor-sycophancy-over-truthful-answers-says-new-report-coingeek\/"},"modified":"2023-10-31T13:38:05","modified_gmt":"2023-10-31T17:38:05","slug":"ai-systems-favor-sycophancy-over-truthful-answers-says-new-report-coingeek","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/superintelligence\/ai-systems-favor-sycophancy-over-truthful-answers-says-new-report-coingeek\/","title":{"rendered":"AI systems favor sycophancy over truthful answers, says new report &#8211; CoinGeek"},"content":{"rendered":"<p><p>    Researchers from Anthropic AI have uncovered traits of    sycophancy in popular artificial intelligence (AI) models,    demonstrating a tendency to generate answers based on the    users desires rather than the truth.  <\/p>\n<p>    According to the study    exploring the psychology of large language models (LLMs), both    human and machine learning models have been shown to exhibit    the trait. The researchers say the problem stems from using    reinforcement learning from human feedback (RLHF), a technique    deployed in training AI chatbots.  <\/p>\n<p>    Specifically, we demonstrate that these AI assistants    frequently wrongly admit mistakes when questioned by the user,    give predictably biased feedback, and mimic errors made by the    user, read the report. The consistency of these empirical    findings suggests sycophancy may indeed be a property of the    way RLHF models are trained.  <\/p>\n<p>    Anthropic AI researchers reached their conclusions from a study    of five leading LLMs, exploring generated answers from the    models to gauge the extent of sycophancy. Per the study, all    the LLM produced convincingly-written sycophantic responses    over correct ones a non-negligible fraction of the time.  <\/p>\n<p>    For example, the researchers incorrectly prompted chatbots that    the sun appears yellow when viewed from space. In reality, the    sun appears white in space, but the AI models hallucinated an    incorrect response.  <\/p>\n<p>    Even in cases where models generate the correct answers,    researchers noted that a disagreement with the response is    enough to trigger models to change their responses to reflect    sycophancy.  <\/p>\n<p>    Anthropics research did not solve to the problem but suggested    developing new training models for LLMs that do not require    human feedback. Several leading generative AI models like    OpenAIs ChatGPT or Googles (NASDAQ: GOOGL) Bard rely on RLHF for their    development, casting doubt on the integrity of their responses.  <\/p>\n<p>    During Bards launch in February, the product made a gaffe over    the satellite that took the first pictures outside the solar    system, wiping off $100 billion from Alphabet Incs (NASDAQ: GOOGL) market value.  <\/p>\n<p>    AI is far from perfect  <\/p>\n<p>    Apart from Bards gaffe, researchers have unearthed a number of    errors stemming from the use of generative AI tools. The    challenges identified by the researchers include streaks of    bias and hallucinations when LLMs perceive nonexistent    patterns.  <\/p>\n<p>    Researchers pointed out that the success rates of ChatGPT in    spotting vulnerabilities in Web3 smart contracts plummeted    significantly over time. Meanwhile, OpenAI shut down its tool for detecting    AI-generated texts over its significantly low rate of    accuracy in July as it grappled with the concerns of AI superintelligence.  <\/p>\n<p>    Watch: AI truly is not generative, its synthetic  <\/p>\n<p>    New to blockchain? Check out CoinGeeks Blockchain for    Beginners section, the ultimate resource guide to learn    more about blockchain technology.  <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>See the original post here:<\/p>\n<p><a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/coingeek.com\/ai-systems-favor-sycophancy-over-truthful-answers-says-new-report\/\" title=\"AI systems favor sycophancy over truthful answers, says new report - CoinGeek\">AI systems favor sycophancy over truthful answers, says new report - CoinGeek<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> Researchers from Anthropic AI have uncovered traits of sycophancy in popular artificial intelligence (AI) models, demonstrating a tendency to generate answers based on the users desires rather than the truth. According to the study exploring the psychology of large language models (LLMs), both human and machine learning models have been shown to exhibit the trait. The researchers say the problem stems from using reinforcement learning from human feedback (RLHF), a technique deployed in training AI chatbots.  <a href=\"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/superintelligence\/ai-systems-favor-sycophancy-over-truthful-answers-says-new-report-coingeek\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[187765],"tags":[],"class_list":["post-1119016","post","type-post","status-publish","format-standard","hentry","category-superintelligence"],"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/1119016"}],"collection":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/comments?post=1119016"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/1119016\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/media?parent=1119016"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/categories?post=1119016"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/tags?post=1119016"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}