Massively parallel implementation of iterative eigensolvers in large-scale plane-wave density functional theory
Computer Physics Communications, ISSN: 0010-4655, Vol: 299
2024
Article Description
Kohn-Sham density functional theory (DFT) is a powerful method for describing the electronic structures of molecules and solids in condensed matter physics, computational chemistry and materials science. However, large and accurate plane-wave DFT calculations have cubic-scaling computational complexity and are usually limited by expensive computation and communication costs. The rapid development of high performance computing (HPC) on leadership supercomputers brings new opportunities for developing plane-wave DFT calculations for large-scale systems. Here, we implement parallel iterative eigensolvers in large-scale plane-wave DFT calculations, including the Davidson, locally optimal block preconditioned conjugate gradient (LOBPCG), projected preconditioned conjugate gradient (PPCG) and Chebyshev subspace iteration (CheFSI) algorithms, and analyze the performance of these algorithms in massively parallel plane-wave computing tasks. We adopt a two-level parallelization strategy that combines the message passing interface (MPI) with open multi-processing (OpenMP) parallel programming to handle data exchange and matrix operations in the construction and diagonalization of the large-scale Hamiltonian matrix within plane waves. Numerical results illustrate that these iterative eigensolvers can scale up to 42,592 processing cores, achieving 30% of peak performance on leadership supercomputers, to study the electronic structures of bulk silicon systems containing 10,648 atoms.
Program summary:
Program Title: Plane wave density functional theory (PWDFT)
CPC Library link to program files: https://doi.org/10.17632/c8v2mx5vn4.1
Developer's repository link: https://bitbucket.org/berkeleylab/scales
Licensing provisions: BSD 3-clause
Programming language: C++
Nature of problem: PWDFT is used for electronic structure calculations based on Kohn-Sham density functional theory. The key challenge to address is a constrained energy minimization problem, which can also be formulated as a nonlinear eigenvalue problem. MPI/OpenMP-based approaches are employed to provide multi-core acceleration for the study of the chemical and material properties of larger-scale molecules and solids.
Solution method: PWDFT implements self-consistent field (SCF) iterations and direct constrained minimization algorithms with various acceleration strategies. It is written in C++ and offers parallel acceleration based on MPI/OpenMP.
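Of the eigensolvers named in the description, Chebyshev filtering is perhaps the easiest to illustrate in a few lines. The sketch below is not taken from PWDFT. It is a single-vector toy in pure Python, whereas CheFSI as usually described filters a whole block of vectors and follows the filter with orthonormalization and a Rayleigh-Ritz step. Everything here is an assumption made for illustration: the stand-in Hamiltonian (a 1D discrete Laplacian), the function names, and the hand-picked filter bounds `a` and `b`, which a real code would have to estimate.

```python
import math


def apply_h(v):
    """Apply a stand-in Hamiltonian: the 1D discrete Laplacian tridiag(-1, 2, -1)."""
    n = len(v)
    out = []
    for i in range(n):
        left = v[i - 1] if i > 0 else 0.0
        right = v[i + 1] if i < n - 1 else 0.0
        out.append(2.0 * v[i] - left - right)
    return out


def chebyshev_filter(v, degree, a, b):
    """Return T_degree((H - c) / e) applied to v, for degree >= 1.

    c and e map the interval [a, b] onto [-1, 1], so eigencomponents with
    eigenvalues inside [a, b] stay bounded while those below a are amplified.
    """
    c = 0.5 * (a + b)  # centre of the damped interval
    e = 0.5 * (b - a)  # half-width of the damped interval
    prev = v  # T_0 term
    hv = apply_h(v)
    cur = [(hv[i] - c * v[i]) / e for i in range(len(v))]  # T_1 term
    for _ in range(2, degree + 1):
        hc = apply_h(cur)
        # Three-term recurrence: T_{k+1}(x) = 2 x T_k(x) - T_{k-1}(x)
        nxt = [2.0 * (hc[i] - c * cur[i]) / e - prev[i] for i in range(len(v))]
        prev, cur = cur, nxt
    return cur


def rayleigh_quotient(v):
    """Estimate an eigenvalue from v as (v . H v) / (v . v)."""
    hv = apply_h(v)
    return sum(x * y for x, y in zip(v, hv)) / sum(x * x for x in v)


def lowest_eigenvalue_estimate(n=20, degree=10, sweeps=20, a=0.05, b=4.0):
    """Repeatedly filter and normalize one vector, then return its Rayleigh quotient.

    Assumptions: b = 4.0 is a Gershgorin upper bound for this matrix, and
    a = 0.05 was chosen by hand to lie between the two lowest eigenvalues
    when n = 20.
    """
    v = [1.0 + 0.01 * i for i in range(n)]  # deterministic starting guess
    for _ in range(sweeps):
        v = chebyshev_filter(v, degree, a, b)
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]  # normalize to avoid overflow
    return rayleigh_quotient(v)


if __name__ == "__main__":
    n = 20
    exact = 2.0 - 2.0 * math.cos(math.pi / (n + 1))  # analytic lowest eigenvalue
    print(lowest_eigenvalue_estimate(n), exact)
```

The filter needs only repeated applications of the Hamiltonian to vectors, which is one plausible reason polynomial filtering is attractive when the Hamiltonian is applied in parallel; that observation is a general one and not a claim about the paper's measurements.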