Ancestry prediction efficiency of the software GenoGeographer using a z-score method and the ancestry informative markers in the Precision ID Ancestry Panel.


Journal

Forensic science international. Genetics
ISSN: 1878-0326
Titre abrégé: Forensic Sci Int Genet
Pays: Netherlands
ID NLM: 101317016

Informations de publication

Date de publication:
01 2020
Historique:
received: 22 05 2019
revised: 25 07 2019
accepted: 24 08 2019
pubmed: 2 11 2019
medline: 9 3 2021
entrez: 1 11 2019
Statut: ppublish

Résumé

We compared the efficiency of the freely available software GenoGeographer that includes a z-score based analysis with that of a naïve method based on the maximal likelihoods of 164 of the 165 ancestral informative markers (AIM) that are included in the commercially available kit Precision ID Ancestry Panel from Thermo Fisher Scientific. The AIM profiles were obtained by investigations with the Precision ID Ancestry Panel in our laboratory and from SNP data in the literature and publically available databases. We established eight well-defined AIM reference population data sets from 3603 AIM profiles. Six reference populations with profiles from multiple populations (Sub-Saharan Africa, North Africa, Middle East, Europe, South/Central Asia, East Asia), and two populations with individuals with admixed ancestry (Somalia and Greenland). By means of GenoGeographer and naïve calculations of the maximal likelihoods, 566 AIM profiles from individuals that were not included in the reference populations and expected to belong to one of the eight reference populations were tested. An initial standard z-score based test with GenoGeographer demonstrated that 22.4% of the individuals could not be assigned to any of the reference populations. Among the remaining 77.6% of the individuals, 83.6% were assigned to the reference population that was concordant with the specified populations of origin of the individuals, 8.2% had ambiguous assignments because they could belong to both the specified population of origin and one or more of the other populations, and 8.2% were assigned to a reference population that was discordant from the specified population of origin. A naïve assignment based on the maximal likelihood resulted in 78.1% concordant and 21.9% discordant assignments. The results demonstrate that the z-score analysis with GenoGeographer can reduce the error rate with a factor of almost three compared with that of the naïve estimation based on the maximal likelihoods of the AIM profiles. The Precision ID Ancestry Panel is a useful kit for the assignment of ancestry of the eight investigated populations that included two admixed populations. More AIMs with better discrimination and more data on the distribution of AIMs in relevant populations are needed to improve the efficiency of genogeographic prediction with AIMs on a worldwide basis.

Identifiants

pubmed: 31670023
pii: S1872-4973(19)30226-1
doi: 10.1016/j.fsigen.2019.102154
pii:
doi:

Substances chimiques

Genetic Markers 0

Types de publication

Comparative Study Evaluation Study Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

102154

Informations de copyright

Copyright © 2019 Elsevier B.V. All rights reserved.

Auteurs

Helle S Mogensen (HS)

Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2100 Copenhagen, Denmark. Electronic address: helle.smidt@sund.ku.dk.

Torben Tvedebrink (T)

Department of Mathematical Sciences, Aalborg University, DK-9220 Aalborg East, Denmark.

Claus Børsting (C)

Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2100 Copenhagen, Denmark.

Vania Pereira (V)

Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2100 Copenhagen, Denmark.

Niels Morling (N)

Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2100 Copenhagen, Denmark; Department of Mathematical Sciences, Aalborg University, DK-9220 Aalborg East, Denmark.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH