{"id":174286,"date":"2016-11-12T17:20:43","date_gmt":"2016-11-12T22:20:43","guid":{"rendered":"http:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/whole-genome-sequencing-wikipedia\/"},"modified":"2016-11-12T17:20:43","modified_gmt":"2016-11-12T22:20:43","slug":"whole-genome-sequencing-wikipedia","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/transhuman-news-blog\/genome\/whole-genome-sequencing-wikipedia\/","title":{"rendered":"Whole genome sequencing &#8211; Wikipedia"},"content":{"rendered":"<p><p>\"Genome sequencing\" redirects here. For the sequencing only of    DNA, see DNA sequencing.    <\/p>\n<p>    Whole genome sequencing (also known as WGS,    full genome sequencing, complete genome    sequencing, or entire genome sequencing) is a    laboratory process that determines the complete DNA sequence of an organism's    genome at a single    time. This entails sequencing all of an organism's chromosomal DNA as well as DNA contained in    the mitochondria and, for plants, in the    chloroplast.  <\/p>\n<p>    Whole genome sequencing should not be confused with DNA profiling,    which only determines the likelihood that genetic material came    from a particular individual or group, and does not contain    additional information on genetic relationships, origin or    susceptibility to specific diseases.[2] Also unlike    full genome sequencing, SNP genotyping covers less than 0.1% of    the genome. Almost all truly complete genomes are of microbes;    the term \"full genome\" is thus sometimes used loosely to mean    \"greater than 95%\". The remainder of this article focuses on    nearly complete human genomes.  <\/p>\n<p>    High-throughput genome sequencing technologies have largely    been used as a research tool and are currently being introduced    in the clinics.[3][4][5] In the future of personalized    medicine, whole genome sequence data will be an important tool    to guide therapeutic intervention.[6] The tool of    gene    sequencing at SNP level is also used to    pinpoint functional variants from association studies and improve the    knowledge available to researchers interested in evolutionary biology, and hence may    lay the foundation for predicting disease susceptibility and    drug response.  <\/p>\n<p>    The shift from manual DNA sequencing methods such as Maxam-Gilbert sequencing and    Sanger sequencing in the 1970s and    1980s to more rapid, automated sequencing methods in the 1990s    played a crucial role in giving scientists the ability to    sequence whole genomes.[8]Haemophilus influenzae, a    commensal bacterium which resides in    the human respiratory tract was the first    organism to have its entire genome sequenced (Figure    2.1). The entire genome of this bacterium was published in    1995.[9] The    genomes of H. influenzae, other Bacteria, and some    Archaea were the    first to be sequenced - largely due to their small genome size.    H. influenzae has a genome of 1,830,140 base pairs of    DNA.[9] In    contrast, eukaryotes, both unicellular and    multicellular such as Amoeba dubia and humans (Homo sapiens)    respectively, have much larger genomes (see C-value paradox).[10]Amoeba dubia has a genome    of 700 billion nucleotide pairs spread across thousands of    chromosomes.[11] Humans    contain fewer nucleotide pairs (about 3.2 billion in each    germ cell -    note the exact size of the human genome is still being revised)    than A. dubia however their genome size far outweighs    the genome size of individual bacteria.[12]  <\/p>\n<p>    The first bacterial and archaeal genomes, including that of    H. influenzae, were sequenced by Shotgun    sequencing.[9] In    1996, the first eukaryotic genome ( the yeast Saccharomyces cerevisiae)    was sequenced. S. cerevisiae, a model    organism in biology has a genome of only around 12 million    nucleotide    pairs,[13] and was the first    unicellular eukaryote to have its whole genome    sequenced. The first multicellular eukaryote, and    animal, to have its    whole genome sequenced was the nematode worm: Caenorhabditis elegans in    1998.[14] Eukaryotic genomes are sequenced    by several methods including Shotgun sequencing of short DNA    fragments and sequencing of larger DNA clones from DNA    libraries (see library (biology)) such as Bacterial artificial    chromosomes (BACs) and Yeast artificial chromosomes    (YACs).[15]  <\/p>\n<p>    In 1999, the entire DNA sequence of human chromosome 22, the shortest human autosome, was    published.[16] By the year 2000, the second    animal and second invertebrate (yet first insect) genome was sequenced    - that of the fruit fly Drosophila melanogaster - a    popular choice of model organism in experimental    research.[17] The first plant genome - that of the model organism    Arabidopsis thaliana - was also    fully sequenced by 2000.[18] By 2001, a    draft of the entire human genome sequence was    published.[19] The genome of the laboratory    mouse Mus musculus was completed in    2002.[20]  <\/p>\n<p>    In 2004, the Human Genome Project published the    human genome.[21]  <\/p>\n<p>    Currently, thousands of genomes have been sequenced.  <\/p>\n<p>    Almost any biological sample containing a full copy of the    DNAeven a very small amount of DNA or ancient DNAcan    provide the genetic material necessary for full genome    sequencing. Such samples may include saliva, epithelial    cells, bone    marrow, hair (as long    as the hair contains a hair follicle), seeds, plant leaves, or anything else that has    DNA-containing cells.  <\/p>\n<p>     The genome sequence    of a single cell selected from a mixed population of cells can    be determined using techniques of single cell genome    sequencing. This has important advantages in environmental    microbiology in cases where a single cell of a particular    microorganism species can be isolated from a mixed population    by microscopy on the basis of its morphological or other    distinguishing characteristics. In such cases the normally    necessary steps of isolation and growth of the organism in    culture may be omitted, thus allowing the sequencing of a much    greater spectrum of organism genomes.[22]  <\/p>\n<p>    Single cell genome sequencing is being tested as a method of    preimplantation genetic    diagnosis, wherein a cell from the embryo created by    in vitro fertilization is    taken and analyzed before embryo transfer into the    uterus.[23] After implantation, cell-free fetal DNA can be taken by    simple venipuncture from the mother and used for    whole genome sequencing of the fetus.[24]  <\/p>\n<p>    Sequencing of nearly an entire human genome was first    accomplished in 2000 partly through the use of shotgun    sequencing technology. While full genome shotgun sequencing    for small (40007000 base pair) genomes was already in use in    1979,[25]    broader application benefited from pairwise end sequencing,    known colloquially as double-barrel shotgun sequencing.    As sequencing projects began to take on longer and more    complicated genomes, multiple groups began to realize that    useful information could be obtained by sequencing both ends of    a fragment of DNA. Although sequencing both ends of the same    fragment and keeping track of the paired data was more    cumbersome than sequencing a single end of two distinct    fragments, the knowledge that the two sequences were oriented    in opposite directions and were about the length of a fragment    apart from each other was valuable in reconstructing the    sequence of the original target fragment.  <\/p>\n<p>    The first published description of the use of paired ends was    in 1990 as part of the sequencing of the human HPRT    locus,[26] although the use of paired ends    was limited to closing gaps after the application of a    traditional shotgun sequencing approach. The first theoretical    description of a pure pairwise end sequencing strategy,    assuming fragments of constant length, was in 1991.[27] In 1995 the    innovation of using fragments of varying sizes was    introduced,[28]    and demonstrated that a pure pairwise end-sequencing strategy    would be possible on large targets. The strategy was    subsequently adopted by The Institute for    Genomic Research (TIGR) to sequence the entire genome of    the bacterium Haemophilus influenzae in    1995,[29]    and then by Celera Genomics to sequence the    entire fruit fly genome in 2000,[30] and    subsequently the entire human genome. Applied    Biosystems, now called Life    Technologies, manufactured the automated capillary    sequencers utilized by both Celera Genomics and The Human    Genome Project.  <\/p>\n<p>    While capillary sequencing was the first approach to    successfully sequence a nearly full human genome, it is still    too expensive and takes too long for commercial purposes. Since    2005 capillary sequencing has been progressively displaced by    next-generation    sequencing technologies such as Illumina dye sequencing, pyrosequencing, and SMRT    sequencing.[31]    All of these technologies continue to employ the basic shotgun    strategy, namely, parallelization and template generation via    genome fragmentation.  <\/p>\n<p>    Other technologies are emerging, including nanopore technology. Though nanopore    sequencing technology is still being refined, its portability    and potential capability of generating long reads are of    relevance to whole-genome sequencing applications.[32]  <\/p>\n<p>    In principle, full genome sequencing can provide raw data on    all six billion nucleotides in an individual's DNA. However,    it does not provide an analysis of what that information means    or how it might be utilized in various clinical applications,    such as in medicine to help prevent disease. Work toward that    goal is continuously moving forward.  <\/p>\n<p>    Because sequencing generates a lot of data (for example, there    are approximately six billion base pairs in each human diploid genome), its    output is stored electronically and requires a large amount of    computing power and storage capacity. Full genome sequencing    would have been nearly impossible before the advent of the    microprocessor, computers, and the    Information Age.  <\/p>\n<p>    A 2015 study[33] done at Children's Mercy    Hospital in Kansas City detailed the use of full genome    sequencing including full analysis. The process took a record    breaking 26 hours[34] and was done    using Illumina HiSeq machines, the Edico Genome Dragen    Processor, and several custom designed software packages. Most    of this acceleration was achieved using the newly developed    Dragen Processor which brought the analysis time down from 15    hours to 40 minutes.  <\/p>\n<p>    A number of public and private companies are competing to    develop a full genome sequencing platform that is commercially    robust for both research and clinical use,[35]    including Illumina,[36]Knome,[37]Sequenom,[38]454 Life    Sciences,[39]    Pacific Biosciences,[40]Complete    Genomics,[41]Helicos    Biosciences,[42]GE Global    Research (General Electric), Affymetrix, IBM, Intelligent    Bio-Systems,[43] Life Technologies and Oxford    Nanopore Technologies.[44] These    companies are heavily financed and backed by venture capitalists, hedge funds, and investment banks.[45][46]  <\/p>\n<p>    In October 2006, the X Prize Foundation, working in    collaboration with the J. Craig Venter Science Foundation,    established the Archon X Prize for Genomics,[47] intending to award US$10million to \"the first Team that can build    a device and use it to sequence 100 human genomes within 10    days or less, with an accuracy of no more than one error in    every 1,000,000 bases sequenced, with sequences accurately    covering at least 98% of the genome, and at a recurring cost of    no more than $1,000per genome\".[48] An    error rate of 1in 1,000,000bases, out of a total of    approximately six billion bases in the human diploid genome,    would mean about 6,000errors per genome. The error rates    required for widespread clinical use, such as predictive medicine[49] is currently set by    over 1,400 clinical single gene sequencing tests[50] (for example, errors in BRCA1 gene for breast cancer    risk analysis).  <\/p>\n<p>    The Archon X Prize for Genomics was cancelled    in 2013, before its official start date.[51][52]  <\/p>\n<p>    In 2007, Applied Biosystems started selling a    new type of sequencer called SOLiD System.[53] The    technology allowed users to sequence 60 gigabases per    run.[54]  <\/p>\n<p>    In June 2009, Illumina announced that they were launching their    own Personal Full Genome Sequencing Service at a depth of 30 for $48,000 per    genome.[55][56]  <\/p>\n<p>    In August 2009, the founder of Helicos Biosciences, Stephen Quake,    stated that using the company's Single Molecule Sequencer he    sequenced his own full genome for less than $50,000.[57]  <\/p>\n<p>    In November 2009, Complete Genomics published a peer-reviewed    paper in Science demonstrating its ability to sequence a    complete human genome for $1,700.[58][59]  <\/p>\n<p>    In May 2011, Illumina lowered its Full Genome Sequencing    service to $5,000 per human genome, or $4,000 if ordering 50 or    more.[60] Helicos Biosciences, Pacific    Biosciences, Complete Genomics, Illumina, Sequenom, ION Torrent    Systems, Halcyon Molecular, NABsys, IBM, and GE Global appear    to all be going head to head in the race to commercialize full    genome sequencing.[31][61]  <\/p>\n<p>    A series of publications in 2012 showed the utility of SMRT sequencing from Pacific Biosciences    in generating full genome sequences with de novo    assembly.[62]  <\/p>\n<p>    With sequencing costs declining, a number of companies began    claiming that their equipment would soon achieve the $1,000    genome: these companies included Life    Technologies in January 2012,[63]Oxford Nanopore Technologies    in February 2012[64] and Illumina in February 2014.[65][66]  <\/p>\n<p>    However, as of 2015, the NHGRI estimates the cost of    obtaining a whole-genome sequence at around $1,500.[67]  <\/p>\n<p>    Full genome sequencing provides information on a genome that is    orders of magnitude larger than that provided by the previous    leader in genotyping technology, DNA arrays. For    humans, DNA arrays currently provide genotypic information on    up to one million genetic variants,[68][69][70] while full genome sequencing    will provide information on all six billion bases in the human    genome, or 3,000times more data. Because of this, full    genome sequencing is considered a disruptive innovation to the DNA    array markets as the accuracy of both range from 99.98% to    99.999% (in non-repetitive DNA regions) and their consumables    cost of $5000 per 6 billion base pairs is competitive (for some    applications) with DNA arrays ($500per 1 million    basepairs).[39]Agilent, another established DNA array    manufacturer, is working on targeted (selective region) genome    sequencing technologies.[71] It is    thought that Affymetrix, the pioneer of array technology in    the 1990s, has fallen behind due to significant corporate and    stock turbulence and is currently not working on any known full    genome sequencing approach.[72][73][74] It is unknown what    will happen to the DNA array market once full genome sequencing    becomes commercially widespread, especially as companies and    laboratories providing this disruptive technology start to    realize economies of scale. It is postulated,    however, that this new technology may significantly diminish    the total market size for arrays and any other sequencing    technology once it becomes commonplace for individuals and    newborns to have their full genomes sequenced.[75]  <\/p>\n<p>    Whole genome sequencing has established the mutation frequency for    whole human genomes. The mutation frequency in the whole genome    between generations for humans (parent to child) is about 70    new mutations per generation.[76][77] An even lower level of variation    was found comparing whole genome sequencing in blood cells for    a pair of monozygotic (identical twins) 100-year-old    centenarians.[78]    Only 8 somatic differences were found, though somatic variation    occurring in less than 20% of blood cells would be undetected.  <\/p>\n<p>    In the specifically protein coding regions of the human genome,    it is estimated that there are about 0.35 mutations that would    change the protein sequence between parent\/child generations    (less than one mutated protein per generation).[79]  <\/p>\n<p>    In cancer, mutation frequencies are much higher, due to    genome instability. This frequency can    further depend on patient age, exposure to DNA damaging agents    (such as UV-irradiation or components of tobacco smoke) and the    activity\/inactivity of DNA repair mechanisms.[80] Furthermore,    mutation frequency can vary between cancer types: in germline    cells, mutation rates occur at approximately 0.023 mutations    per megabase, but this number is much higher in breast cancer    (1.18-1.66 mutations per Mb), in lung cancer (17.7) or in    melanomas (~33).[81]  <\/p>\n<p>    Inexpensive, time-efficient full genome sequencing will be a    major accomplishment not only for the field of genomics, but for the    entire human civilization because, for the first time,    individuals will be able to have their entire genome sequenced.    Utilizing this information, it is speculated that health care    professionals, such as physicians and genetic counselors, will eventually be    able to use genomic information to predict what diseases a    person may get in the future and attempt to either minimize the    impact of that disease or avoid it altogether through the    implementation of personalized, preventive medicine. Full genome    sequencing will allow health    care professionals to analyze the entire human genome of an    individual and therefore detect all disease-related genetic    variants, regardless of the genetic variant's prevalence or    frequency. This will enable the rapidly emerging medical fields    of predictive medicine and personalized medicine and will mark    a significant leap forward for the clinical genetic revolution.    Full genome sequencing is clearly of great importance for    research into the basis of genetic disease and has shown    significant benefit to a subset of individuals with rare    disease in the clinical setting.[82][83][84][85] Illumina's CEO, Jay Flatley,    stated in February 2009 that \"A complete DNA read-out for every    newborn will be technically feasible and affordable in less    than five years, promising a revolution in healthcare\" and that    \"by 2019 it will have become routine to map infants' genes when    they are born\".[86] This potential use of genome    sequencing is highly controversial, as it runs counter to    established ethical    norms for predictive genetic testing of asymptomatic minors    that have been well established in the fields of medical    genetics and genetic counseling.[87][88][89][90] The    traditional guidelines for genetic testing have been developed    over the course of several decades since it first became    possible to test for genetic markers associated with disease,    prior to the advent of cost-effective, comprehensive genetic    screening. It is established that norms, such as in the sciences and the    field of genetics, are subject to change and evolve over    time.[91][92] It is    unknown whether traditional norms practiced in medical genetics    today will be altered by new technological advancements such as    full genome sequencing.  <\/p>\n<p>    In March 2010, researchers from the Medical College of Wisconsin    announced the first successful use of whole-genome sequencing    to inform the treatment of a patient.[93][94]  <\/p>\n<p>    Currently available newborn screening for childhood    diseases allows detection of rare disorders that can be    prevented or better treated by early detection and    intervention. Specific genetic tests are also available to    determine an etiology when a child's symptoms appear to have a    genetic basis. Full genome sequencing, in addition has the    potential to reveal a large amount of information (such as    carrier    status for autosomal    recessive disorders, genetic risk factors for complex adult-onset diseases, and    other predictive medical and non-medical information) that is    currently not completely understood, may not be clinically    useful to the child during childhood, and may not necessarily    be wanted by the individual upon reaching adulthood.[95] In addition to predicting    disease risk in childhood, genetic testing may have other    benefits (such as discovery of non-paternity) but may also have    potential downsides (genetic discrimination, loss    of anonymity, and psychological impacts).[96] Many    publications regarding ethical guidelines for predictive    genetic testing of asymptomatic minors may therefore have more    to do with protecting minors and preserving the individual's    privacy and    autonomy to know    or not to know their genetic information, than with the    technology that makes the tests themselves possible.[97]  <\/p>\n<p>    Due to recent cost reductions (see above) whole genome    sequencing has become a realistic application in DNA    diagnostics. In 2013, the 3Gb-TEST consortium obtained funding    from the European Union to prepare the health care system for    these innovations in DNA diagnostics.[98][99]Quality assessment schemes, Health technology assessment    and guidelines have to be in place. The    3Gb-TEST consortium has identified the analysis and    interpretation of sequence data as the most complicated step in    the diagnostic process.[100] At the    Consortium meeting in Athens in September 2014, the Consortium    coined the word genotranslation for this crucial step.    This step leads to a so-called genoreport. Guidelines    are needed to determine the required content of these reports.  <\/p>\n<p>    The majority of ethicists insist that the privacy of    individuals undergoing genetic testing must be protected under    all circumstances.[101]    Data obtained from whole genome sequencing can not only reveal    much information about the individual who is the source of DNA,    but it can also reveal much probabilistic information about the    DNA sequence of close genetic relatives.[102] Furthermore,    the data obtained from whole genome sequencing can also reveal    much useful predictive information about the relatives present    and future health risks.[103] This    raises important questions about what obligations, if any, are    owed to the family members of the individuals who are    undergoing genetic testing. In the Western\/European society,    tested individuals are usually encouraged to share important    information on the genetic diagnosis with their close relatives    since the importance of the genetic diagnosis for offspring and    other close relatives is usually one of the reasons for seeking    a genetic testing in the first place.[101]    Nevertheless, a major ethical dilemma can develop when the    patients refuse to share information on a diagnosis that is    made for serious genetic disorder that is highly preventable    and where there is a high risk to relatives carrying the same    disease mutation.[102] Under such    circumstances, the clinician may suspect that the relatives    would rather know of the diagnosis and hence the clinician can    face a conflict of interest with respect to patient-doctor    confidentiality.[102]  <\/p>\n<p>    Another major privacy concern is the scientific need to put    information on patient's genotypes and phenotypes into the    public scientific databases such as the locus specific    databases.[102] Although only    anonymous patient data are submitted to the locus specific    databases, patients might still be identifiable by their    relatives in the case of finding a rare disease or a rare    missense mutation.[102]  <\/p>\n<p>    The first nearly complete human genomes sequenced were two    caucasians in 2007 (J. Craig Venter at 7.5-fold coverage,[104][105][106] and James Watson at 7.4-fold).[107][108][109] This was followed in 2008 by    sequencing of an anonymous Han Chinese man (at 36-fold),[110] a Yoruban man    from Nigeria (at    30-fold),[111] and a female caucasian    Leukemia patient    (at 33 and 14-fold coverage for tumor and normal    tissues).[112]Steve Jobs was among the first 20 people    to have their whole genome sequenced, reportedly for the cost    of $100,000.[113] As of June 2012[update],    there are 69 nearly complete human genomes publicly    available.[114]Commercialization of full genome    sequencing is in an early stage and growing rapidly.  <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>Continue reading here:<br \/>\n<a target=\"_blank\" href=\"https:\/\/en.wikipedia.org\/wiki\/Whole_genome_sequencing\" title=\"Whole genome sequencing - Wikipedia\">Whole genome sequencing - Wikipedia<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> \"Genome sequencing\" redirects here. For the sequencing only of DNA, see DNA sequencing <a href=\"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/transhuman-news-blog\/genome\/whole-genome-sequencing-wikipedia\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25],"tags":[],"class_list":["post-174286","post","type-post","status-publish","format-standard","hentry","category-genome"],"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/174286"}],"collection":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/comments?post=174286"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/174286\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/media?parent=174286"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/categories?post=174286"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/tags?post=174286"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}