{"id":1067824,"date":"2024-01-12T02:36:01","date_gmt":"2024-01-12T07:36:01","guid":{"rendered":"https:\/\/www.immortalitymedicine.tv\/plagiarism-detection-tools-offer-a-false-sense-of-accuracy-the-markup-the-markup\/"},"modified":"2024-08-18T11:39:41","modified_gmt":"2024-08-18T15:39:41","slug":"plagiarism-detection-tools-offer-a-false-sense-of-accuracy-the-markup-the-markup","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/machine-learning\/plagiarism-detection-tools-offer-a-false-sense-of-accuracy-the-markup-the-markup.php","title":{"rendered":"Plagiarism Detection Tools Offer a False Sense of Accuracy  The Markup &#8211; The Markup"},"content":{"rendered":"<p><p>    When Katherine Pickering Antonova became a history professor in    2008, she got access to the plagiarism detection software tools    Turnitin and SafeAssign. At first blush, she thought the    technology would be great. She had just finished a graduate    program where she had manually graded papers as a teaching    assistant, meticulously checking students suspect phrases to    see if any showed up elsewhere.  <\/p>\n<p>    But her first use of the plagiarism checkers gave her a jolt.    The software suggested the majority of her students had copied    portions of their essays.  <\/p>\n<p>    Soon she realized the lie in how the tools were described to    her. Its not tracking plagiarism at all, Pickering Antonova    said. Its just flagging matching text. Those two concepts    have different standards; plagiarism is a subjective assessment    of misconduct, but scholars may have matching words in their    academic articles for a variety of legitimate reasons.  <\/p>\n<p>    Plagiarism checkers are built into The City University of New    Yorks learning management system, where faculty members post    assignments and students submit them. As at many colleges    throughout the country, scanning for plagiarism in submitted    assignments is the default. But fed up with false flags and the    countless hours required to check potentially plagiarized    passages against the source material Turnitin and SafeAssign    highlight, Pickering Antonova gave up on the tools entirely a    couple years ago.  <\/p>\n<p>    The bots are literally worse than useless, she said. They do    harm, and they dont find anything I couldnt find by    myself.  <\/p>\n<p>    Some experts agree that Claudine Gay, Harvards ousted    president and a widely respected political scientist, recently    became the latest victim of this technology. She was forced to    step down from the presidency after an accuser flagged nearly    50 examples from her writing that they called plagiarism. But    many of the examples looked a lot like what Pickering Antonova    considered a waste of her time when she was grading student    work.  <\/p>\n<p>    The Voting Rights Act of 1965 is often cited as one of the    most significant pieces of civil rights legislation passed in    our nations history, Gay wrote in one paper. Her accuser says    she plagiarized David Canons description of the landmark    lawbut as the Washington Free Beacon reported in publishing the allegations,    Canon himself disagrees, arguing Gay had done nothing wrong.  <\/p>\n<p>    The controversy over Gays alleged plagiarism has roiled the    academic community, and while much of the attention has been on    the political maneuvering behind her ouster and the definition    of plagiarism, some scholars have commented on the detection    software that was likely behind it. The fact is, however, that    students, not academics, bear the brunt of the tools shoddy    analyses. Turnitin is the industry leader in marshaling text    analysis tools to assess academic integrity, boasting partnerships with more than 20,000    institutions globally and a repository of over 1.8 billion    student paper submissions (and still counting).  <\/p>\n<p>    The companies that are marketing plagiarism detection tools    tend to acknowledge their limitations. While they may be    referred to as plagiarism checkers, the products are    described as highlighting text similarities or duplicate    content. They scan billions of webpages and scholarly articles    looking for those matches and surface them for a reviewer.    Some, like Grammarlys, are marketed to writers and offer to    help people add proper citations where they may have forgotten    them. It isnt meant to police plagiarism, but rather help    writers avoid it. Turnitin specifically says its Similarity    Report does not check for plagiarism.  <\/p>\n<p>    Still, the tools are frequently used to justify giving students    zeroes on their assignmentsand the students most likely to get    such dismissive grading are those at less-selective    institutions, where faculty are overstretched and underpaid.  <\/p>\n<p>    For her part, Pickering Antonova came to feel guilty about    putting students through the stress of seeing their Turnitin    results.  <\/p>\n<p>    They see their paper is showing up 60 percent plagiarized, and    they have a heart attack, she said.  <\/p>\n<p>    Plagiarism does not carry a legal definition. Institutions    create their own plagiarism policies, and academic fields have    norms about how to credit and cite sources in scholarly text.    Plagiarism checkers are not designed with such nuance. It is up    to users to follow up their algorithmic output with good, human    judgment.  <\/p>\n<p>    Jo Guldi, a professor of quantitative methods at Emory    University, recently published The Dangerous Art of Text    Mining: A Methodology for Digital History and jumped into the    Gay plagiarism controversy with a now-deleted post on X before    Christmas. She pointed out that computers can search for    five-word overlaps in text but argued that such repetition does    not equal plagiarism: the technology of text mining can be    used to destroy the career of any scholar at any time, she    wrote.  <\/p>\n<p>    By phone, Guldi said that while she didnt cover plagiarism    detection in her book, the parallel is clear. Her book traces    bad conclusions reached because people fail to critically    analyze the data. She, too, has used Turnitin in her classes    and recognized the findings cannot be taken at face value.  <\/p>\n<p>    You look at them and you see you have to apply judgment, she    said. Its always a judgment call.  <\/p>\n<p>    Many scholars, including those Gay is supposed to have    plagiarized, have come to Gays defense over the course of the    last month, arguing the text similarities highlighted do not    rise to the level of plagiarism.  <\/p>\n<p>          Machine          Learning        <\/p>\n<p>          Stanford study found AI detectors are biased against          non-native English speakers        <\/p>\n<p>    Yet her accuser has identified nearly 50 examples of overlap,    pairing her writing with that of other scholars and insisting    there is a pattern of academic misconduct. The sheer number of    examplesand promise of more to comehelped seal Gays fate.    And some scholars worry anyone with enemies could be next.  <\/p>\n<p>    Ian Bogost, a professor at Washington University in St. Louis,    mulled in The Atlantic what a full-bore plagiarism war could look like,    running his own dissertation through iThenticate, a checker run    by the same company as Turnitin that is marketed to    researchers, publishers, and scholars.  <\/p>\n<p>    Bill Ackman, a billionaire Harvard megadonor, signaled his    commitment to participating in such a war after Business    Insider launched its own grenade, publishing an analysis last    week that accused his wife, Neri Oxman, of plagiarizing parts of her dissertation.    Oxman got her Ph.D. at MIT in 2010 before joining the faculty    and then leaving to become an entrepreneur. Suspecting someone    from MIT encouraged Business Insider to take a closer look at    her dissertation, Ackman posted on X that he was going to begin a    review of the work of all current @MIT faculty members,    President Kornbluth, other officers of the Corporation, and its    board members for plagiarism.  <\/p>\n<p>    He later added, Why would we stop at MIT?    Dont we have to do a deep dive into academic integrity at    Harvard as well? What about Yale, Princeton, Stanford, Penn,    Dartmouth? You get the point.  <\/p>\n<p>    Its unclear which tool Gays accuser used to identify their    examples, but experts agree the accusations seem to come from a    text comparison algorithm. A Markup analysis of five of Gays    papers in the Grammarly and EasyBib plagiarism checkers did not    turn up any of the plagiarism accusations that have surfaced in    recent months. Grammarlys tool did flag instances of text    overlap between Gays writing and other scholars, sometimes    because they were citing her paper, but sometimes because the    two authors were simply describing similar things. Gays 2017    political science paper A Room for Ones Own? is the subject    of more than half a dozen accusations of plagiarism that    Grammarly didnt flagbut the tool did, for example, suggest    her line The estimated coefficients and standard errors from    the may have been plagiarized from an article about diabetes    in Bali.  <\/p>\n<p>    Analyzing the same paper, Turnitin ignored several of the lines    included in complaints against her but it did flag four from    two academic papers. It also found other similarities,    suggesting, for example, that the phrase receive a 10-year    stream of tax credits warranted review.  <\/p>\n<p>        Credit:Turnitin      <\/p>\n<p>    David Smith, an associate professor of computer science at    Northeastern University, has studied natural language    processing and computational linguistics. He said plagiarism    detection tools tend to start with what is called a null    model. The algorithm is given very few assumptions and simply    told to identify matching words across texts. To find examples    in Gays writing, he said, it basically took people looking    through the really low-precision output of these models.  <\/p>\n<p>          Machine          Learning        <\/p>\n<p>          A Markup examination of a typical college shows how          students are subject to a vast and growing array of          watchful tech, including homework trackers, test-taking          software, and even license platereaders        <\/p>\n<p>    Somebody could have trained a better model that had higher    precision, Smith said. That doesnt seem to be how it went in    this case.  <\/p>\n<p>    The result was a long list of plagiarism accusations most    scholars found baffling.  <\/p>\n<p>    Turnitin introduced its similarity check in 2000. Since then,    plagiarism analyses have become the norm for editors of some    academic journals as well as many college and university    faculty members. Yet the tool is not universal. Many users,    like Pickering Antonova, have decided the software isnt worth    the time and dont align with their teaching goals. This has    created two distinct classes of people: those who are subjected    to plagiarism checkers and those who are not. For professional    academics, Gays case highlights the concern that anyone with a    high profile who makes the wrong enemy could quickly become    part of the former group.  <\/p>\n<p>    For students, its often just a matter of their schools norms.    Plagiarism checkers can seem like a straightforward assessment    of the originality of student work, reporting a percentage of    the paper that may have been plagiarized. For faculty members    who dont have the time to look at the dozens of false flags,    it can be easy to rely on the total percentage and grade    accordingly.  <\/p>\n<p>    This behavior worries Smith, the computer scientist. Getting a    quantification makes it easier to just judge a lot of student    papers at scale, he said. Thats not whats going on in the    Claudine Gay case but is troubling about whats going on with    students subjection to these methods.  <\/p>\n<p>    Tech companies have produced a steady stream of new tools for    educators concerned with students cheating, including AI    detectors that followed the widespread adoption of ChatGPT.    With each new tool comes a promise of scientific accuracy and    cutting-edge analysis of unbiased data.  <\/p>\n<p>    But as Claudine Gays case demonstratesand the threat of the    plagiarism wars promisesplagiarism detection is far from    precise.  <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>Follow this link:<br \/>\n<a target=\"_blank\" href=\"https:\/\/themarkup.org\/machine-learning\/2024\/01\/10\/plagiarism-detection-tools-offer-a-false-sense-of-accuracy\" title=\"Plagiarism Detection Tools Offer a False Sense of Accuracy  The Markup - The Markup\" rel=\"noopener\">Plagiarism Detection Tools Offer a False Sense of Accuracy  The Markup - The Markup<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> When Katherine Pickering Antonova became a history professor in 2008, she got access to the plagiarism detection software tools Turnitin and SafeAssign.  <a href=\"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/machine-learning\/plagiarism-detection-tools-offer-a-false-sense-of-accuracy-the-markup-the-markup.php\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"limit_modified_date":"","last_modified_date":"","_lmt_disableupdate":"","_lmt_disable":"","footnotes":""},"categories":[1231415],"tags":[],"class_list":["post-1067824","post","type-post","status-publish","format-standard","hentry","category-machine-learning"],"modified_by":null,"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts\/1067824"}],"collection":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/comments?post=1067824"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts\/1067824\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/media?parent=1067824"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/categories?post=1067824"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/tags?post=1067824"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}