GenBank Frequency Information

The current GenBank frequency data in our variant tables is derived from 48,882 human mitochondrial DNA sequences with size greater than 15.4 kbp . The sequences were collected from GenBank on May 15, 2019, aligned to rCRS using BLASTn and haplotyped using Haplogrep via the Mitomaster web service. A list of the sequence IDs in the current GenBank set may be downloaded from the Mitobank web page. A spreadsheet of all variants found and their frequencies in the current set of sequences is also available. These GenBank sequences have been pre-loaded into Mitomaster and represent almost all haplogroups known to date. We will continue to update this sequence set on a regular basis. A set of short control region sequences (71,421 sequences as of May 15, 2019) has also been collected from GenBank and is included in the variant frequency counts in Mitomap and Mitomaster where indicated.

Caveats: We do not presently tally counts or frequencies of reference alleles (those positions identical to rCRS) or those of ambiguous nucleotides (R, Y, M, K, etc). Indel calls in repetitive regions may not always match those of the original publications due the different manners of indel reporting over the years (e.g., positioning at the beginning or end of a polytract or repeat, forward or backward reading of inserted or deleted bases). The sequences in this GenBank set have not been individually reviewed by Mitomap. Please also be aware that (1) GenBank sequences may not be of equal quality (Yao, et al, 2009); (2) some sequences might be present in GenBank more than once under different IDs; (3) some sequences might be from clones or cell lines; (4) sequence collection is not evenly distributed across the continents; and (5) some of the GenBank sequences are derived from pathology samples or from diseased patients, presenting a somewhat biased sampling of the global mitochondriome.


Lineage distribution of 48,882 sequences from our current data set:

LMN
LineagesLineagesLineages
"African"          "Asian"          "Eurasian"
hg#      %          hg#      %          hg#      %
L32,13536%          M5,25050%          H9,16728%
L01,50025%          D2,35822%          U4,23113%
L21,32222%          C1,65116%          B4,19313%
L187815%          E4564%          J2,3197%
L41052%          G4374%          T2,2377%
L5391%          Z1912%          K1,8176%
L6120%          Q1772%          F1,6635%
Total5,991100%          Total10,520100%          A1,3864%
Overall 12% (5,991 / 48,882)          Overall 22% (10,520 / 48,882)          R1,0773%
                    N7852%
                    HV7352%
                    I7182%
                    V6932%
                    W5292%
                    X4701%
                    P1590%
                    Y1350%
                    S490%
                    O80%
                    Total32,371100%
                    Overall 66% (32,371 / 48,882)

  • Map: World Haplogroup Migrations

I Attachment Action Size Date Who Comment
WorldMigrations2012.pdfpdf WorldMigrations2012.pdf manage 124 K 24 May 2019 - 18:38 MarieLott World Haplogroup Migrations
simple-tree-mitomap-2012.pdfpdf simple-tree-mitomap-2012.pdf manage 149 K 24 May 2019 - 18:45 MarieLott Simple Haplogroup Tree
Topic revision: r1 - 16 Dec 2020, UnknownUser

POLG Server
MitoScape

This site is powered by FoswikiCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Foswiki? Send feedback