{"id":195217,"date":"2017-05-28T07:15:14","date_gmt":"2017-05-28T11:15:14","guid":{"rendered":"http:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/genome-analysis-toolkit-4-gatk4-released-as-open-source-phys-org\/"},"modified":"2017-05-28T07:15:14","modified_gmt":"2017-05-28T11:15:14","slug":"genome-analysis-toolkit-4-gatk4-released-as-open-source-phys-org","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/transhuman-news-blog\/genome\/genome-analysis-toolkit-4-gatk4-released-as-open-source-phys-org\/","title":{"rendered":"Genome Analysis Toolkit 4 (GATK4) released as open source &#8230; &#8211; Phys.Org"},"content":{"rendered":"<p>May 25, 2017          Credit: Susanna M. Hamilton, Broad Communications    <\/p>\n<p>      The Broad Institute of MIT and Harvard will release version 4      of the industry-leading Genome Analysis Toolkit under an open      source software license. The software package, designated      GATK4, contains new tools and rebuilt architecture. It is      available currently as an alpha preview on the Broad      Institute's GATK website,      with a beta release expected in mid-June. Broad engineers      announced the upgrade, as well as the decision to release the      tool as an open source product, at Bio-IT World today.    <\/p>\n<p>    The new version is built on a new architecture, allowing    significant streamlining of individual tools and support for    performance-enhancing technologies such as Apache Spark&#8482;. This    new framework brings improvements to parallelization,    capitalizing on cloud deployment and making the process of    analyzing vast amounts of genomic data easier, faster, and more    efficient.  
<\/p>\n<p>    \"We wanted to remove traditional barriers of scale while    offering the same high level of data quality our users expect,\"    said Eric Banks, Senior Director of Data Sciences and Data    Engineering at Broad and a creator of the original GATK    software package. \"Thanks to the rapid    adoption of cloud computing, researchers can finally do away    with many of the infrastructure-related complications that have    hampered progress, especially at smaller institutions and    startups.\"  <\/p>\n<p>    Today, more than 45,000 academic and commercial users worldwide    rely on the GATK, running millions of analyses. The GATK is the    industry standard for identifying SNPs and indels in germline    DNA and RNAseq data. In addition to improving the performance    of these established tools, GATK4 extends this scope of    analysis to include copy number and structural variation, for    both germline and somatic research applications.  <\/p>\n<p>    Fully open source software  <\/p>\n<p>    GATK4 will be released as a fully open source product, thanks    in part to a collaboration between Broad Institute and Intel    Corporation to advance high-performance analytics so    researchers can study massive amounts of genomic data from    diverse sources worldwide.  <\/p>\n<p>    At the Intel-Broad Center for Genomic Data Engineering,    software engineers and researchers have spent the last several    months building, optimizing, and widely sharing new tools and    infrastructure to help scientists integrate and process genomic    data. GATK4 has benefited from this collaboration, which has    helped engineers optimize best practices in hardware and    software for genome analytics to make it possible to combine    and use research data sets that reside on private, public, and    hybrid clouds.  
<\/p>\n<p>    \"Releasing GATK4 as open source was the obvious next step for    our team,\" said Geraldine Van der Auwera, Associate Director of    Outreach and Communications within the Data Science and Data    Engineering group at the Broad Institute. \"We believe it's the    most effective way to support the community, and we hope it    continues to grow, innovate, and help researchers make insights    that are essential for future human health breakthroughs.\"  <\/p>\n<p>    \"It    is critical for progress in biomedicine that the software we    use for analysing the genomes of millions of people is robust    and well understood,\" said Ewan Birney, Director of EMBL-EBI    and Chair of the Global Alliance for Genomics and Health    (GA4GH). \"Releasing GATK software with an open source license directly supports open    innovation, data re-use and data re-analysis in the global    biomedical community.\"  <\/p>\n<p>    \"The GATK tools are crucial for both germline and cancer    analyses,\" said Robert L. Grossman of the University of Chicago    Department of Medicine and an expert in biomedical informatics.    \"Releasing GATK4 as an open source software package will increase    adoption, and benefit the community.\"  <\/p>\n<p>    \"Open sourcing the GATK is a big deal for open genomics, and    for open science in general,\" said Jeremy Freeman, manager of    computational biology at the Chan Zuckerberg Initiative (CZI).    \"Not only does it make this critical tool available to as broad    as possible an audience for use, reuse, inspection, and    contribution, it provides a powerful example to the community    for how an existing project can embrace open source.\"  <\/p>\n<p>    \"Open source code is a foundation of efficient biomedical    research,\" said Brad Chapman, a research scientist at the    Harvard T.H. Chan School of Public Health. \"It enables    reproducibility, reuse and remixing by removing barriers for    sharing and distributing analyses. 
The Broad Institute's GATK    team leads in the development of scalable, sensitive and    specific variant calling algorithms, and open sourcing GATK4    will allow frameworks like Blue Collar    Bioinformatics to make these methods broadly available to    the scientific research community.\"  <\/p>\n<p>    \"Cloudera has always been a supporter and believer in the power    of open source code,\" said Tom White, data    scientist at Cloudera and a member of the Apache Hadoop PMC.    \"We've been excited to contribute to the GATK codebase, to make    it run smoothly on Apache Spark and Cloudera. This next phase    of the GATK, powered by Spark and open source software, will    expand access and improve collaboration among genomic data scientists.\"  <\/p>\n<p>    \"The open sourcing of GATK4 is a great step for genomics,    allowing for scalability and performance gains to be openly    available to the research, biotech and pharmaceutical    communities,\" said Jason Waxman, corporate vice president and    general manager of Data Center Solutions at Intel. \"GATK4, when    run on Intel's new reference architecture, can achieve a 5X    speed-up compared to earlier versions of the software.\"  <\/p>\n<p>    \"We at Google are excited to see this new release,\" said Ilia    Tulchinsky, Google Cloud Healthcare Engineering Lead. \"We've    been collaborating with the Broad Institute for the past three    years to enhance genomic processing on Google Cloud Platform.    As a strong supporter for open source technology, we believe    that making GATK available this way will facilitate its use by    genomic scientists everywhere. 
As fellow collaborators with    Intel, we particularly look forward to enabling researchers to    run GATK4 on Google Cloud using the upcoming Intel Xeon    processor Scalable family.\"  <\/p>\n<p>    \"The GATK is one of the most widely utilized software packages    in the life sciences, and our team has worked very productively    with Broad to accelerate it for use on Azure,\" said Geralyn    Miller, Director, AI & Research, Microsoft. \"This new model    will greatly facilitate this effort going forward, and we are    excited to continue and expand our efforts around GATK on    Azure.\"  <\/p>\n<p>    \"With the open source launch of GATK4, there is an opportunity    to create a global community that can collaborate together and    advance the state of art in bioinformatics,\" said Hong Tang,    chief architect at Alibaba Cloud, the cloud computing arm of    Alibaba Group. \"We look forward to closely working with Broad    Institute in bringing the cloud-based GATK service to genomics    customers in China, as well as in ongoing GATK research and    development.\"  <\/p>\n<p>    In addition to offering GATK4 as an open source toolkit, Broad Institute will    continue to offer user support, training, and outreach on its    popular user support    forum. GATK4, like many of the Broad Institute's genome    analysis tools, will be available through the Broad Institute's    cloud based analysis platform, FireCloud.  <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>Read the original post:<br \/>\n<a target=\"_blank\" href=\"https:\/\/phys.org\/news\/2017-05-genome-analysis-toolkit-gatk4-source.html\" title=\"Genome Analysis Toolkit 4 (GATK4) released as open source ... - Phys.Org\">Genome Analysis Toolkit 4 (GATK4) released as open source ... - Phys.Org<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> May 25, 2017 Credit: Susanna M. Hamilton, Broad Communications The Broad Institute of MIT and Harvard will release version 4 of the industry-leading Genome Analysis Toolkit under an open source software license. 
The software package, designated GATK4, contains new tools and rebuilt architecture <a href=\"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/transhuman-news-blog\/genome\/genome-analysis-toolkit-4-gatk4-released-as-open-source-phys-org\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":7,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25],"tags":[],"class_list":["post-195217","post","type-post","status-publish","format-standard","hentry","category-genome"],"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/195217"}],"collection":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/comments?post=195217"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/195217\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/media?parent=195217"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/categories?post=195217"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/tags?post=195217"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}