Human genetic clustering refers to patterns of relative genetic similarity among human individuals and populations, as well as the wide range of scientific and statistical methods used to study this aspect of human genetic variation.
Clustering studies are thought to be valuable for characterizing the general structure of genetic variation among human populations and for contributing to the study of ancestral origins, evolutionary history, and precision medicine. Since the mapping of the human genome, and with the availability of increasingly powerful analytic tools, cluster analyses have revealed a range of ancestral and migratory trends among human populations and individuals.[1] Human genetic clusters tend to be organized by geographic ancestry, with divisions between clusters aligning largely with geographic barriers such as oceans or mountain ranges.[2][3] Clustering studies have been applied to global populations,[4] as well as to population subsets like post-colonial North America.[5][6] Notably, the practice of defining clusters among modern human populations is largely arbitrary and variable due to the continuous nature of human genotypes; although individual genetic markers can be used to produce smaller groups, there are no models that produce completely distinct subgroups when larger numbers of genetic markers are used.[2][7][8]
Many studies of human genetic clustering have been implicated in discussions of race, ethnicity, and scientific racism, as some have controversially suggested that genetically derived clusters may be understood as proof of genetically determined races.[9][10] Although cluster analyses invariably organize humans (or groups of humans) into subgroups, debate is ongoing on how to interpret these genetic clusters with respect to race and its social and phenotypic features. Moreover, because only a small fraction of overall genetic variation lies between human genotypes, the results of genetic clustering approaches are highly dependent on the sampled data, the genetic markers, and the statistical methods used to construct them.
A wide range of methods have been developed to assess the structure of human populations with the use of genetic data. Early studies of within-group and between-group genetic variation used physical phenotypes and blood groups, whereas modern genetic studies use genetic markers such as Alu sequences, short tandem repeat polymorphisms, and single nucleotide polymorphisms (SNPs), among others.[11] Models for genetic clustering also vary by the algorithms and programs used to process the data. Most sophisticated methods for determining clusters can be categorized as model-based clustering methods (such as the algorithm STRUCTURE[12]) or multidimensional summaries (typically through principal component analysis).[1][13] Although they process large numbers of SNPs (or other genetic marker data) in different ways, both approaches to genetic clustering tend to converge on similar patterns by identifying similarities among SNPs and/or haplotype tracts to reveal ancestral genetic similarities.[13]
Common model-based clustering algorithms include STRUCTURE, ADMIXTURE, and HAPMIX. These algorithms operate by finding the best fit for genetic data among an arbitrary or mathematically derived number of clusters, such that differences within clusters are minimized and differences between clusters are maximized. This clustering method is also referred to as "admixture inference," as individual genomes (or individuals within populations) can be characterized by the proportions of alleles linked to each cluster.[1] In other words, algorithms like STRUCTURE generate results that assume the existence of discrete ancestral populations, operationalized through unique genetic markers, which have combined over time to form the admixed populations of the modern day.
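The idea of describing an individual by proportions of alleles linked to each cluster can be illustrated with a minimal sketch. It is not the STRUCTURE, ADMIXTURE, or HAPMIX implementation: it assumes the allele frequencies of each cluster are already known (real programs estimate them jointly with the proportions), models each genotype as a binomial draw from the mixed frequency, and uses a simple expectation-maximization update. The function name `estimate_admixture` and all simulated data are hypothetical.

```python
import numpy as np

def estimate_admixture(genotype, cluster_freqs, n_iter=200):
    """EM estimate of one individual's ancestry proportions.

    genotype: shape (M,), allele counts (0, 1, or 2) at M SNPs.
    cluster_freqs: shape (K, M), allele frequency of each cluster per SNP.
    Assumption: cluster_freqs is known in advance and lies strictly in (0, 1).
    """
    n_clusters = cluster_freqs.shape[0]
    q = np.full(n_clusters, 1.0 / n_clusters)  # start from equal proportions
    for _ in range(n_iter):
        # Posterior share of each cluster for copies of the counted allele.
        alt = q[:, None] * cluster_freqs
        alt /= alt.sum(axis=0)
        # Posterior share of each cluster for copies of the other allele.
        ref = q[:, None] * (1.0 - cluster_freqs)
        ref /= ref.sum(axis=0)
        # New proportions: average share over all 2*M allele copies.
        q = (alt * genotype + ref * (2 - genotype)).sum(axis=1) / (2 * genotype.size)
    return q

# Simulated example: two hypothetical clusters and one admixed individual.
rng = np.random.default_rng(1)
n_snps = 2000
cluster_freqs = rng.uniform(0.05, 0.95, size=(2, n_snps))
true_q = np.array([0.7, 0.3])
genotype = rng.binomial(2, true_q @ cluster_freqs)

q_hat = estimate_admixture(genotype, cluster_freqs)
print("estimated proportions:", np.round(q_hat, 3))
```

The estimate depends directly on which clusters, and how many, are supplied as input, which is why results from this family of methods are usually treated as exploratory.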
Where model-based clustering characterizes populations using proportions of presupposed ancestral clusters, multidimensional summary statistics characterize populations on a continuous spectrum. The most common multidimensional statistical method used for genetic clustering is principal component analysis (PCA), which plots individuals by two or more axes (their "principal components") that represent aggregations of genetic markers that account for the highest variance. Clusters can then be identified by visually assessing the distribution of data; with larger samples of human genotypes, data tends to cluster in distinct groups as well as admixed positions between groups.[1][13]
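As a rough illustration of the PCA approach, the sketch below simulates genotypes for two hypothetical populations plus a group drawn from intermediate allele frequencies, then projects every individual onto the first two principal components. All data are simulated, the helper names are invented for this example, and the intermediate group is a deliberately simple stand-in for admixture; published analyses use dedicated tools and additional normalization steps.

```python
import numpy as np

def simulate_genotypes(allele_freqs, n_individuals, rng):
    """Draw diploid genotypes (0, 1, or 2 allele copies) for each SNP."""
    return rng.binomial(2, allele_freqs, size=(n_individuals, allele_freqs.size))

def principal_components(genotypes, n_components=2):
    """Project individuals onto the top principal components via SVD."""
    centered = genotypes - genotypes.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]
    explained = (s ** 2) / np.sum(s ** 2)
    return scores, explained[:n_components]

rng = np.random.default_rng(0)
n_snps = 500
# Two hypothetical populations whose allele frequencies differ modestly.
freqs_a = rng.uniform(0.1, 0.9, n_snps)
freqs_b = np.clip(freqs_a + rng.normal(0, 0.15, n_snps), 0.05, 0.95)

pop_a = simulate_genotypes(freqs_a, 100, rng)
pop_b = simulate_genotypes(freqs_b, 100, rng)
# Simplified "admixed" group: genotypes drawn from the average frequencies.
pop_mix = simulate_genotypes((freqs_a + freqs_b) / 2, 40, rng)

genotypes = np.vstack([pop_a, pop_b, pop_mix]).astype(float)
scores, explained = principal_components(genotypes)

mean_a = scores[:100, 0].mean()
mean_b = scores[100:200, 0].mean()
mean_mix = scores[200:, 0].mean()
print("PC1 means (A, B, mixed):", round(mean_a, 2), round(mean_b, 2), round(mean_mix, 2))
print("variance explained by PC1, PC2:", np.round(explained, 3))
```

In this toy setting the two populations fall on opposite sides of the first component and the intermediate group sits between them, mirroring the pattern of distinct groups with admixed positions described above.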
There are caveats and limitations to genetic clustering methods of any type, given the degree of admixture and relative similarity within the human population. All genetic cluster findings are biased by the sampling process used to gather data, and by the quality and quantity of that data. For example, many clustering studies use data derived from populations that are geographically distinct and far apart from one another, which may present an illusion of discrete clusters where, in reality, populations are much more blended with one another when intermediary groups are included.[1] Sample size also plays an important moderating role on cluster findings, as different sample size inputs can influence cluster assignment, and more subtle relationships between genotypes may only emerge with larger sample sizes.[1][8] In particular, the use of STRUCTURE has been widely criticized as being potentially misleading through requiring data to be sorted into a predetermined number of clusters which may or may not reflect the actual population's distribution.[8][14] The creators of STRUCTURE originally described the algorithm as an "exploratory" method to be interpreted with caution and not as a test with statistically significant power.[12][15]
Modern applications of genetic clustering methods to global-scale genetic data were first marked by studies associated with the Human Genome Diversity Project (HGDP) data.[1] These early HGDP studies, such as those by Rosenberg et al. (2002),[4][16] contributed to theories of the serial founder effect and early human migration out of Africa, and clustering methods have been notably applied to describe admixed continental populations.[5][6][17] Genetic clustering and HGDP studies have also contributed to methods for, and criticisms of, the genetic ancestry consumer testing industry.[18]
A number of landmark genetic cluster studies have been conducted on global human populations since 2002.
Clusters of individuals are often geographically structured. For example, when clustering a population of East Asians and Europeans, each group will likely form its own respective cluster based on similar allele frequencies. In this way, clusters can have a correlation with traditional concepts of race and self-identified ancestry; in some cases, such as medical questionnaires, the latter variables can be used as a proxy for genetic ancestry where genetic data is unavailable.[9][4] However, genetic variation is distributed in a complex, continuous, and overlapping manner, so this correlation is imperfect and the use of racial categories in medicine can introduce additional hazards.[9]
Some scholars have challenged the idea that race can be inferred by genetic clusters, drawing distinctions between arbitrarily assigned genetic clusters, ancestry, and race. One recurring caution against thinking of human populations in terms of clusters is the notion that genotypic variation and traits are distributed evenly between populations, along gradual clines rather than along discrete population boundaries; so although genetic similarities are usually organized geographically, their underlying populations have never been completely separated from one another. Due to migration, gene flow, and baseline homogeneity, features between groups are extensively overlapping and intermixed.[2][9] Moreover, genetic clusters do not typically match socially defined racial groups; many commonly understood races may not be sorted into the same genetic cluster, and many genetic clusters are made up of individuals who would have distinct racial identities.[7] In general, clusters may most simply be understood as products of the methods used to sample and analyze genetic data: they are not without meaning for understanding ancestry and genetic characteristics, but they are inadequate for fully explaining the concept of race, which is more often described in terms of social and cultural forces.
In the related context of personalized medicine, race is currently listed as a risk factor for a wide range of medical conditions with genetic and non-genetic causes. Questions have emerged regarding whether or not genetic clusters support the idea of race as a valid construct to apply to medical research and treatment of disease, because there are many diseases that correspond with specific genetic markers and/or with specific populations, as seen with Tay-Sachs disease or sickle cell disease.[3][25] Researchers are careful to emphasize that ancestry, revealed in part through cluster analyses, plays an important role in understanding risk of disease. But racial or ethnic identity does not perfectly align with genetic ancestry, and so race and ethnicity do not reveal enough information to make a medical diagnosis.[25] Race as a variable in medicine is more likely to reflect social factors, whereas ancestry information is more likely to be meaningful when considering genetic ancestry.[2][25]
Originally posted here:
Human genetic clustering - Wikipedia