High-performance generation of the Hamiltonian and overlap matrices in FLAPW methods

From MaRDI portal
Publication:1686969

DOI10.1016/J.CPC.2016.10.003zbMATH Open1376.65067arXiv1602.06589OpenAlexW2282110028MaRDI QIDQ1686969FDOQ1686969


Authors: Edoardo Di Napoli, Elmar Peise, Markus Hrywniak, Paolo Bientinesi Edit this on Wikidata


Publication date: 18 December 2017

Published in: Computer Physics Communications (Search for Journal in Brave)

Abstract: One of the greatest efforts of computational scientists is to translate the mathematical model describing a class of physical phenomena into large and complex codes. Many of these codes face the difficulty of implementing the mathematical operations in the model in terms of low level optimized kernels offering both performance and portability. Legacy codes suffer from the additional curse of rigid design choices based on outdated performance metrics (e.g. minimization of memory footprint). Using a representative code from the Materials Science community, we propose a methodology to restructure the most expensive operations in terms of an optimized combination of dense linear algebra kernels. The resulting algorithm guarantees an increased performance and an extended life span of this code enabling larger scale simulations.


Full work available at URL: https://arxiv.org/abs/1602.06589




Recommendations




Cites Work


Cited In (5)

Uses Software





This page was built for publication: High-performance generation of the Hamiltonian and overlap matrices in FLAPW methods

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1686969)