Actual mutational research of 19 autosomal STRs based on restricted mutation model and big data

2021 
Short tandem repeat (STR) markers have been widely used in forensic paternity testing and individual identification, but the STR mutation might impact on the forensic result interpretation. Importantly, the STR mutation rate was underestimated due to ignoring the "hidden" mutation phenomenon in most similar studies. Considering this, we use Slooten and Ricciardi's restricted mutation model based on big data to obtain more accurate mutation rates for each marker. In this paper, the mutations of 20 autosomal STRs loci (D3S1358, D1S1656, D13S317, Penta E, D16S539, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D6S1043, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433, and FGA; The restricted model does not include the correction factor of D6S1043, this paper calculates remaining 19 STR loci mutation rates) were investigated in 28,313 (Total: 78,739 individuals) confirmed parentage-testing cases in Chinese Han population. As a result, total 1665 mutations were found in all loci, including 1614 one-steps, 34 two-steps, 8 three-steps, and 9 nonintegral mutations. The loci-specific average mutation rates ranged from 0.00007700 (TPOX) to 0.00459050 (FGA) in trio's and 0.00000000 (TPOX) to 0.00344850 (FGA) in duo's. We analyzed the relationship between mutation rates of the apparent and actual, the trio's and duo's, the paternal and maternal, respectively. The results demonstrated that the actual mutation rates are more than the apparent mostly, and the values of μ1"/μ2"(apparent) are also greater than μ1/μ2 (actual) commonly (μ1", μ1; μ2", μ2 are the mutation rates of one-step and two-step). Therefore, the "hidden" mutations are identified. In addition, the mutations rates of trio's and duo's, the paternal and maternal, exhibit significant difference. Next, those mutation data are used to do a comparison with the studies of other Han populations in China, which present the temporal and regional disparities. Due to the large sample size, some rare mutation events, such as monozygotic (MZ) mutation and "fake four-step mutation", are also reported in this study. In conclusion, the estimation values of actual mutations are obtained based on big data, they can not only provide basic data for the Chinese forensic DNA and population genetics databases, but also have important significance for the development of forensic individual identification, paternity testing and genetics research.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []