Element-by-element matrix storage-free subdomain local FE solver for domain decomposition method

2018 
Exa-scale supercomputers will appear around 2020–2022. To obtain high intra-node performance, efficient utilization of processor cache memory should be considered. The traditional memory access-intensive approach, which prefers less computing and more storage on main memory, might not be effective for supercomputers in near future. The Domain Decomposition Method (DDM) is one of the effective parallel finite element schemes. We have been developing an FE-based parallel structural analysis code, ADVENTURE Solid, based on DDM, with the Balancing Domain Decomposition (BDD) pre-conditioners. The re-design of the subdomain local FE solver part, which is a performance sensitive kernel in the DDM code, is required. Here in this work, an “on-cache” iterative solver based on the DDM framework is developed. The subdomain local FE solver of the DDM code is implemented using CG solvers with element-by-element matrix storage-free approaches. These iterative solvers are parallelized using OpenMP, so that each subdomain can be solved by multiple cores. By adjusting the subdomain size so that the footprint fits within the last-level cache of a processor, this DDM code can be considered as a kind of an “on-cache” iterative solver”. Performance benchmark results are shown on various kinds of HPC platform having many core scalar processors, such as Skylake Xeon, Knights Landing and Fujitsu PRIMEHPC FX100.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []