{"id":66883,"date":"2015-10-24T12:42:08","date_gmt":"2015-10-24T16:42:08","guid":{"rendered":"http:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/dna-sequencing-costs-genome-gov-national-human-genome\/"},"modified":"2015-10-24T12:42:08","modified_gmt":"2015-10-24T16:42:08","slug":"dna-sequencing-costs-genome-gov-national-human-genome","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/transhuman-news-blog\/genome\/dna-sequencing-costs-genome-gov-national-human-genome\/","title":{"rendered":"DNA Sequencing Costs &#8211; Genome.gov | National Human Genome &#8230;"},"content":{"rendered":"<p><p>DNA Sequencing Costs        Data from the NHGRI Genome Sequencing Program (GSP)        Overview    <\/p>\n<p>    For many years,    the National Human Genome Research Institute (NHGRI)    has tracked the costs associated with DNA sequencing performed    at the sequencing centers funded by the Institute. This    information has served as an important benchmark for assessing    improvements in DNA sequencing technologies and for    establishing the DNA sequencing capacity of the NHGRI Genome    Sequencing Program (GSP). Here, NHGRI provides an analysis of    these data, which gives one view of the remarkable improvements    in DNA sequencing technologies and data-production pipelines in    recent years.  <\/p>\n<p>    The cost-accounting data presented here are summarized relative    to two metrics: (1) \"Cost per Megabase of DNA Sequence\" - the    cost of determining one megabase (Mb; a million bases) of DNA    sequence of a specified quality [see below]; (2) \"Cost per    Genome\" - the cost of sequencing a human-sized genome. For    each, a graph is provided showing the data since 2001; in    addition, the actual numbers reflected by the graphs are    provided in a summary table. NHGRI welcomes people    to download these graphs and use them in their presentations    and teaching materials. NHGRI plans to update these data on a    regular basis.  <\/p>\n<p>    To illustrate the nature of the reductions in DNA sequencing    costs, each graph also shows hypothetical data reflecting    Moore's Law, which describes a long-term trend in the computer    hardware industry that involves the doubling of 'compute power'    every two years (See: Moore's Law    [wikipedia.org]). Technology improvements that 'keep up' with    Moore's Law are widely regarded to be doing exceedingly well,    making it useful for comparison.  <\/p>\n<p>    In both graphs, note: (1) the use a logarithmic scale on the Y    axis; and (2) the sudden and profound out-pacing of Moore's Law    beginning in January 2008. The latter represents the time when    the sequencing centers transitioned from Sanger-based (dideoxy    chain termination sequencing) to 'second generation' (or    'next-generation') DNA sequencing technologies. Additional    details about these graphs are provided below.  <\/p>\n<p>    These data, however, do not capture all of the costs associated    with the NHGRI Large-Scale Genome Sequencing Program. The    sequencing centers perform a number of additional activities    whose costs are not appropriate to include when calculating    costs for production-oriented DNA sequencing. In other words,    NHGRI makes a distinction between 'production' activities and    'non-production' activities. Production activities are    essential to the routine generation of large amounts of quality    DNA sequence data that are made available in public databases;    the costs associated with production DNA sequencing are    summarized here and depicted on the two graphs. Additional    information about the other activities performed by the    sequencing centers is provided below.  <\/p>\n<p>    The expenditures included in each category were established    based on discussions between NHGRI staff and sequencing center    personnel.  <\/p>\n<p>    For the two graphs (\"Cost per Megabase of DNA Sequence\" and    \"Cost per Genome\"), the following 'production' costs are    accounted for:  <\/p>\n<p>    In the case of costs covered by significant subsidies to a    sequencing center (e.g., a grantee institution providing funds    for purchasing large equipment), NHGRI has attempted to    appropriately account for such costs in these analyses.  <\/p>\n<p>    The costs associated with the following 'non-production'    activities are not reflected in the two graphs:  <\/p>\n<p>    In both graphs, the data from 2001 through October 2007    represent the costs of generating DNA sequence using    Sanger-based chemistries and capillary-based instruments    ('first generation' sequencing platforms). Beginning in January    2008, the data represent the costs of generating DNA sequence    using 'second-generation' (or 'next-generation') sequencing    platforms. The change in instruments represents the rapid    evolution of DNA sequencing technologies that has occurred in    recent years.  <\/p>\n<p>    For the Sanger-based sequence data, the cost accounting    reflects the generation of bases with a minimum quality score    of Phred20 (or Q20), which represents an    error probability of 1 % and is an accepted community standard    for a high-quality base. For sequence data generated with    second-generation sequencing platforms, there is not yet a    single accepted measure of accuracy; each manufacturer provides    quality scores that are, at this time, accepted by the NHGRI    sequencing centers as equivalent to or greater than    Q20.  <\/p>\n<p>    In the \"Cost per Megabase of DNA Sequence\" graph, the data    reflect the cost of generating raw, unassembled sequence data;    no adjustment was made for data generated using different    instruments despite significant differences in the sequence    read lengths. In contrast, the \"Cost per Genome\" graph does    take these differences into account since sequence read length    influences the ability to generate an assembled genome    sequence.  <\/p>\n<p>    The \"Cost per Genome\" graph was generated using the same    underlying data as that used to generate the \"Cost per Megabase    of DNA Sequence\" graph; the former thus reflects an estimate of    the cost of sequencing a human-sized genome rather than the    actual costs for specific genome-sequencing projects.  <\/p>\n<p>    To calculate the cost for sequencing a genome, one needs to    know the size of that genome and the required 'sequence    coverage' (i.e., 'sequence redundancy') to generate a    high-quality assembly of the genome given the specific    sequencing platform being used. For generating the \"Cost per    Genome\" graph, the assumed genome size was 3,000 Mb (i.e., the    size of a human genome). The assumed sequence coverage needed    differed among sequencing platforms, depending on the average    sequence read length for that platform.  <\/p>\n<p>    For data since January 2008 (representing data generated using    'second-generation' sequencing platforms), the \"Cost per    Genome\" graph reflects projects involving the 're-sequencing'    of the human genome, where an available reference human genome    sequence is available to serve as a backbone for downstream    data analyses. The required 'sequence coverage' would be    greater for sequencing genomes for which no reference genome    sequence is available.  <\/p>\n<p>    See: <a href=\"http:\/\/www.genome.gov\/10001691\" rel=\"nofollow\">http:\/\/www.genome.gov\/10001691<\/a>  <\/p>\n<p>    Mardis E. A decade's perspective on DNA sequencing technology.    Nature, 470: 198-203. 2011. [PubMed]  <\/p>\n<p>    Metzker M. Sequencing technologies - the next generation.    Nature Genetics, 11: 31-46. 2010. [PubMed]  <\/p>\n<p>    Stein L. The case for cloud computing in genome informatics.    Genome Biology, 11: 207-213. 2010. [PubMed]  <\/p>\n<p>    Human genome at ten: the sequence explosion. Nature,    464: 670-671. 2010. [PubMed]  <\/p>\n<p>    Wetterstrand KA. DNA Sequencing Costs: Data from the NHGRI    Genome Sequencing Program (GSP) Available at: <a href=\"http:\/\/www.genome.gov\/sequencingcosts\" rel=\"nofollow\">http:\/\/www.genome.gov\/sequencingcosts<\/a>.    Accessed [date of access].  <\/p>\n<p>    Kris Wetterstrand, M.S.    Scientific Liaison to the Director for Extramural    Activities    National Human Genome Research Institute, NIH    Phone: 301-435-5543    E-mail: <a href=\"mailto:wettersk@mail.nih.gov\">wettersk@mail.nih.gov<\/a>  <\/p>\n<p>    Top of page  <\/p>\n<p>    Last Updated: October 2, 2015  <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>The rest is here:<br \/>\n<a target=\"_blank\" href=\"http:\/\/www.genome.gov\/sequencingcosts\/\" title=\"DNA Sequencing Costs - Genome.gov | National Human Genome ...\">DNA Sequencing Costs - Genome.gov | National Human Genome ...<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> DNA Sequencing Costs Data from the NHGRI Genome Sequencing Program (GSP) Overview For many years, the National Human Genome Research Institute (NHGRI) has tracked the costs associated with DNA sequencing performed at the sequencing centers funded by the Institute. This information has served as an important benchmark for assessing improvements in DNA sequencing technologies and for establishing the DNA sequencing capacity of the NHGRI Genome Sequencing Program (GSP). Here, NHGRI provides an analysis of these data, which gives one view of the remarkable improvements in DNA sequencing technologies and data-production pipelines in recent years.  <a href=\"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/transhuman-news-blog\/genome\/dna-sequencing-costs-genome-gov-national-human-genome\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25],"tags":[],"class_list":["post-66883","post","type-post","status-publish","format-standard","hentry","category-genome"],"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/66883"}],"collection":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/comments?post=66883"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/66883\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/media?parent=66883"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/categories?post=66883"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/tags?post=66883"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}