TTDFT: a GPU accelerated Tucker tensor DFT code for large-scale Kohn-Sham DFT calculations
From MaRDI portal
Publication:6097322
Abstract: We present the Tucker tensor DFT (TTDFT) code, which uses a tensor-structured algorithm with graphics processing unit (GPU) acceleration to perform ground-state DFT calculations on large-scale systems. The Tucker tensor DFT algorithm uses a localized Tucker tensor basis computed from an additive separable approximation to the Kohn-Sham Hamiltonian. The discrete Kohn-Sham problem is solved using the Chebyshev filtering subspace iteration method, which relies on matrix-matrix multiplications of a sparse symmetric Hamiltonian matrix and a dense wavefunction matrix, expressed in the localized Tucker tensor basis. These matrix-matrix multiplications, which constitute the most computationally intensive step of the solution procedure, are GPU accelerated, providing an ~8-fold GPU-CPU speedup for these operations on the largest systems studied. The computational performance of the TTDFT code is demonstrated through benchmark studies on aluminum nano-particles and silicon quantum dots with system sizes of up to ~7,000 atoms.
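Illustration: the kernel named in the abstract, a sparse symmetric Hamiltonian multiplied by a dense wavefunction block inside a Chebyshev filter, can be sketched as follows. This is a minimal pure-Python sketch under stated assumptions, not the TTDFT implementation: the names `csr_matmul` and `chebyshev_filter`, the CSR triple `(indptr, indices, data)`, and the interval bounds `a`, `b` (the part of the spectrum to be damped) are all hypothetical choices made for the example, and no GPU is involved.

```python
def csr_matmul(indptr, indices, data, X):
    """Multiply an n x n sparse matrix in CSR form by a dense n x k block X."""
    n = len(indptr) - 1
    k = len(X[0])
    Y = [[0.0] * k for _ in range(n)]
    for i in range(n):
        row = Y[i]
        # Loop over the stored nonzeros of row i only.
        for p in range(indptr[i], indptr[i + 1]):
            a_ij, x_j = data[p], X[indices[p]]
            for c in range(k):
                row[c] += a_ij * x_j[c]
    return Y


def chebyshev_filter(H, X, degree, a, b):
    """Return T_degree((H - c I) / e) X for degree >= 1.

    H is a CSR triple (indptr, indices, data); [a, b] is the interval to damp,
    with center c = (a + b) / 2 and half-width e = (b - a) / 2.
    """
    e = (b - a) / 2.0
    c = (b + a) / 2.0
    prev = X  # T_0 term
    HX = csr_matmul(*H, X)
    # T_1 term: (H - c I) X / e
    cur = [[(hx - c * x) / e for hx, x in zip(rh, rx)] for rh, rx in zip(HX, X)]
    for _ in range(2, degree + 1):
        # Three-term recurrence: T_{m+1} = 2 t T_m - T_{m-1}, with t = (H - c I) / e.
        HX = csr_matmul(*H, cur)
        nxt = [[2.0 * (hx - c * x) / e - p for hx, x, p in zip(rh, rx, rp)]
               for rh, rx, rp in zip(HX, cur, prev)]
        prev, cur = cur, nxt
    return cur


# Toy example (assumed data): a diagonal "Hamiltonian" with eigenvalues 0, 1, 2, 3.
H = ([0, 1, 2, 3, 4], [0, 1, 2, 3], [0.0, 1.0, 2.0, 3.0])
X = [[1.0], [1.0], [1.0], [1.0]]
# Damp the interval [1, 3]; the component at eigenvalue 0 lies outside it and is amplified.
filtered = chebyshev_filter(H, X, degree=3, a=1.0, b=3.0)
```

In this toy case the components inside the damped interval stay bounded by 1 in magnitude while the one outside it grows quickly with the polynomial degree, which is what lets subspace iteration isolate the wanted eigenstates. Each filter step costs one sparse-times-dense product, consistent with the abstract's statement that these products dominate the run time.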
Cites work
- A Block Orthogonalization Procedure with Constant Synchronization Requirements
- A literature survey of low-rank tensor approximation techniques
- A splitting method for orthogonality constrained problems
- Ab initio molecular simulations with numeric atom-centered orbitals
- Adaptive local basis set for Kohn-Sham density functional theory in a discontinuous Galerkin framework. I: Total energy calculation
- Approximation of the electron density of aluminium clusters in tensor-product format
- Communication-optimal parallel and sequential QR and LU factorizations
- Compressed modes for variational problems in mathematics and physics
- DFT-FE - a massively parallel adaptive finite-element code for large-scale density functional theory calculations
- DFT-FE 1.0: a massively parallel hybrid CPU-GPU density functional theory code using finite-element discretization
- Higher-order adaptive finite-element methods for Kohn-Sham density functional theory
- Self-consistent-field calculations using Chebyshev-filtered subspace iteration
- Tensor Decompositions and Applications
- Tensor spaces and numerical tensor calculus
- Tensor-product approximation to operators and functions in high dimensions
- Towards reduced basis approaches in ab initio electronic structure computations
- TuckerMPI: a parallel C++/MPI software package for large-scale data compression via the Tucker tensor decomposition