Archetypal Analysis for population genetics

Gimbernat-Mayol, Julia; Mantes, Albert Dominguez; Bustamante, Carlos D.; Montserrat, Daniel Mas; Ioannidis, Alexander G.

doi:10.1371/journal.pcbi.1010301

Gimbernat-Mayol, Julia; Mantes, Albert Dominguez; Bustamante, Carlos D.; Montserrat, Daniel Mas; Ioannidis, Alexander G.

2022

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

The estimation of genetic clusters using genomic data has application from genome-wide association studies (GWAS) to demographic history to polygenic risk scores (PRS) and is expected to play an important role in the analyses of increasingly diverse, large-scale cohorts. However, existing methods are computationally-intensive, prohibitively so in the case of nationwide biobanks. Here we explore Archetypal Analysis as an efficient, unsupervised approach for identifying genetic clusters and for associating individuals with them. Such unsupervised approaches help avoid conflating socially constructed ethnic labels with genetic clusters by eliminating the need for exogenous training labels. We show that Archetypal Analysis yields similar cluster structure to existing unsupervised methods such as ADMIXTURE and provides interpretative advantages. More importantly, we show that since Archetypal Analysis can be used with lower-dimensional representations of genetic data, significant reductions in computational time and memory requirements are possible. When Archetypal Analysis is run in such a fashion, it takes several orders of magnitude less compute time than the current standard, ADMIXTURE. Finally, we demonstrate uses ranging across datasets from humans to canids.

Details

Title Archetypal Analysis for population genetics

Author(s) Gimbernat-Mayol, Julia ; Mantes, Albert Dominguez ; Bustamante, Carlos D. ; Montserrat, Daniel Mas ; Ioannidis, Alexander G.

Published in Plos Computational Biology

Volume 18

Issue 8

Pages e1010301

Date 2022-08-01

Publisher San Francisco, PUBLIC LIBRARY SCIENCE

ISSN 1553-734X
1553-7358

DOI https://doi.org/10.1371/journal.pcbi.1010301

Other identifier(s) View record in Web of Science

Laboratories UPLAMANNO

Record Appears in Scientific production and competences > SV - School of Life Sciences > BMI - Brain Mind Institute > UPLAMANNO - Prof. La Manno Group
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2023-01-02