Chemical identification of metamorphic protoliths using machine learning methods

2019 
Abstract The fundamental origins of metamorphic rocks as sedimentary or igneous are integral to the proper interpretation of a terrane’s tectonic and geodynamic evolution. In some cases, the protolith class cannot be determined from field relationships, texture, and/or compositional layering. In this study, we utilize machine learning to predict a metamorphic protolith from its major element chemistry so that accurate interpretation of the geology may proceed when the origin is uncertain or to improve confidence in field predictions. We survey the efficacy of several machine learning techniques to predict the protolith class (igneous or sedimentary) for whole rock geochemical analyses using 9 major oxides. The data are drawn from a global geochemical database with > 533 000 geochemical analyses. In addition to metamorphic samples, igneous and sedimentary analyses are used to supplement the dataset based on their similar chemical distributions to their metamorphic counterparts. We train the classifiers on most of the data, retaining ∼ 10% for post-training validation. We find that the RUSBoost algorithm performs best overall, achieving a true-positive rate of > 95% and > 85% for igneous- and sedimentary-derived samples, respectively. Even the traditionally-difficult-to-differentiate metasedimentary and metaigneous rocks of granitic-granodioritic composition were consistently identified with a > 75% success rate (92% for granite; 85% for granodiorite; 88% for wacke; 76% for arkose). The least correctly identified rock types were iron-rich shale (58%) and quartzolitic rocks (6%). These trained classifiers are able to classify metamorphic protoliths better than common discrimination methods, allowing for the appropriate interpretation of the chemical, physical, and tectonic contextual history of a rock. The preferred classifier is available as a MATLAB function that can be applied to a spreadsheet of geochemical analyses, returning a predicted class and estimated confidence score. We anticipate this classifier’s use as a cheap tool to aid geoscientists in accurate protolith prediction and to increase the size of global geochemical datasets where protolith information is ambiguous or not retained.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    41
    References
    8
    Citations
    NaN
    KQI
    []