{"id":174042,"date":"2016-10-17T01:20:09","date_gmt":"2016-10-17T05:20:09","guid":{"rendered":"http:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/razib-khan-blog-posts-the-unz-review\/"},"modified":"2016-10-17T01:20:09","modified_gmt":"2016-10-17T05:20:09","slug":"razib-khan-blog-posts-the-unz-review","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/transhuman-news-blog\/gene-medicine\/razib-khan-blog-posts-the-unz-review\/","title":{"rendered":"Razib Khan Blog Posts &#8211; The Unz Review"},"content":{"rendered":"<p><p>         A friend asked me about population structure, and    methods to ferret it out and classify it. So here is a quick    survey on the major methods Im familiar with\/utilize now and    then. Ill go roughly in chronological order.  <\/p>\n<p>    First, you have trees. These are pretty popular from    macroevolutionary relationships, but on the population genetic    scale (intraspecific, microevolutionary) youre mostly talking    about representing distances between groups in a tree format.    You saw this in     History and Geography of Genes, where genetic distances in    the form of Fst values (proportion of genetic variation unique    to between two groups) were used as distance inputs.  <\/p>\n<p>    A problem with trees is that they dont model gene flow, a    major dynamic on a microevolutionary scale. Also, complex    relationships can get elided in tree frameworks, and as you add    more and more populations you often end up with an    incomprehensible fan-like topology.  <\/p>\n<p>            Then you have principle component analyses    (PCA) and related methods (e.g.,    multidimensional scaling, which is very different in the    sausage-making but generates a similar output). Like trees,    this is a visualization of the variation, in this case on a two    dimensional plot (please dont bring up three dimensional PCA,    theres no such thing until holograms show up).  <\/p>\n<p>    The problem with PCA is that different types of dynamics can    lead to the same result. For example, someone who is an F1 of    two distinct groups occupies the same position as a population    which happens to occupy a genetic position between two groups.    Additionally, by constraining the variation into two    dimensions, one can mislead in terms of relationships. There    are many dimensions, but operationally you focus on on two at a    time.  <\/p>\n<p>    A paper of interest,     Population Structure and Eigenanalysis.  <\/p>\n<p>         Next you have    model-based clustering introduced in Jonathan    Pritchards     Inference of Population Structure Using Multilocus Genotype    Data. There are many flavors of this, but they operate    under the same framework. You have a model of population    dynamics, and see how the genotype data can be explained by    parameters of the model. Of particular interest is assignment    to one of K populations, which can be combined to explain the    variation in the data.  <\/p>\n<p>    Unlike PCA these model-based methods are rather good at    identifying people who are first generation mixes, as opposed    to those from stabilized groups along a cline. But, they also    produce artifacts, because they are quite sensitive to the    input data, and lend themselves to cherry-picking.  <\/p>\n<p>    Earlier I said that one problem with the tree methods is    that they dont model gene flow. Joe Pickrells        TreeMix does so. Like the original tree methods,    and unlike PCA or unsupervised model-based clustering, you    specify a set of populations. Then you compare the populations    in terms of their genetic distance, and fit them to a tree, but    add migration parameters to that tree where the fit between the    tree and the data is the most tenuous fit.  <\/p>\n<p>    All visualizations are deformations of reality. TreeMix    attempts to mitigate this somewhat by introducing another    representation, that of migration.  <\/p>\n<p>    Next we have local ancestry methods. By local    ancestry, basically we mean methods which can assign ancestry    to particular regions of the genome. While tree methods measure    differences across pooled populations, PCA and model-based    methods compare genotypes between individuals (this is a    simplification, but bear with me). Local ancestry methods, like        RFMix, compare regions of the genome with each other.  <\/p>\n<p>    Related to, but not exactly the same, as local ancestry methods    are haplotype based methods. In particular,    Im thinking of the     FineStructure and its related methods. These leverage    variation across the genome in terms of haplotypes, rather than    just looking at genotypes. They also tend to benefit from    phasing, for obvious methods. FineStructure and its relatives    tend to need more marker density than model-based methods,    which require more marker density than PCA, which requires more    marker density that tree based methods. These haplotype based    methods allow for correction of and accounting for forces such    as genetic drift, which tend to skew results in other methods.  <\/p>\n<p>    Finally, there is the     AdmixTools framework which is good for testing very    explicit hypotheses. While many of the above methods, such as    TreeMix and unsupervised model-based clustering, explore an    almost open-ended space of structure possibilities, the methods    in AdmixTools exists in large part to test narrow delimited    models. This goes to the fact that many of these methods are    complementary, and you should use them together to arrive at a    robust result. For example, if you are assigning populations    for TreeMix, you should use PCA and model-based clustering to    make sure that the populations are clear and distinct, and    outliers are removed.  <\/p>\n<p>    Theres a lot I left out, but many of the other methods are    just twists on the ones above.  <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>Continued here:<br \/>\n<a target=\"_blank\" href=\"http:\/\/www.unz.com\/gnxp\/\" title=\"Razib Khan Blog Posts - The Unz Review\">Razib Khan Blog Posts - The Unz Review<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> A friend asked me about population structure, and methods to ferret it out and classify it. So here is a quick survey on the major methods Im familiar with\/utilize now and then <a href=\"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/transhuman-news-blog\/gene-medicine\/razib-khan-blog-posts-the-unz-review\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":7,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[21],"tags":[],"class_list":["post-174042","post","type-post","status-publish","format-standard","hentry","category-gene-medicine"],"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/174042"}],"collection":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/comments?post=174042"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/174042\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/media?parent=174042"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/categories?post=174042"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/tags?post=174042"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}