Efficient CSR-based sparse matrix-vector multiplication on GPU (Q1793182): Difference between revisions

Summary: Sparse matrix-vector multiplication (SpMV) is an important operation in computational science and needs to be accelerated because it often represents the dominant cost in many widely used iterative methods and eigenvalue problems. We achieve this objective by proposing a novel SpMV algorithm for the GPU based on the compressed sparse row (CSR) format. Our method dynamically assigns a different number of rows to each thread block and selects an optimized implementation for each block according to the number of rows it handles. Accesses to the CSR arrays are fully coalesced, and the GPU's DRAM bandwidth is used efficiently by loading data into shared memory, which alleviates the bottleneck of many existing CSR-based algorithms (i.e., CSR-scalar and CSR-vector). Test results on C2050 and K20c GPUs show that our method outperforms a perfect-CSR algorithm that inspires our work, the vendor-tuned CUSPARSE V6.5 and CUSP V0.5.1, and three popular algorithms: clSpMV, CSR5, and CSR-Adaptive.
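
For orientation, the CSR layout and the idea of handing a variable number of rows to each work group can be sketched sequentially. This is a minimal illustration and not the paper's GPU implementation: the function names `csr_spmv` and `group_rows_by_nnz`, the `nnz_budget` parameter, and the greedy packing rule are all assumptions introduced here. On a GPU, each returned row group would roughly correspond to the rows handled by one thread block.

```python
from typing import List, Sequence, Tuple


def csr_spmv(row_ptr: Sequence[int], col_idx: Sequence[int],
             values: Sequence[float], x: Sequence[float]) -> List[float]:
    """Sequential reference for y = A @ x with A stored in CSR.

    row_ptr[i]:row_ptr[i + 1] is the slice of col_idx/values that
    holds the nonzeros of row i.
    """
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for row in range(n_rows):
        acc = 0.0
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


def group_rows_by_nnz(row_ptr: Sequence[int],
                      nnz_budget: int) -> List[Tuple[int, int]]:
    """Hypothetical helper: greedily pack consecutive rows into groups.

    Each group is a half-open row range (start, end) holding at most
    nnz_budget nonzeros. A single row that exceeds the budget gets a
    group of its own.
    """
    n_rows = len(row_ptr) - 1
    groups: List[Tuple[int, int]] = []
    start = 0
    for row in range(n_rows):
        # Close the current group if adding this row would exceed the budget.
        if row > start and row_ptr[row + 1] - row_ptr[start] > nnz_budget:
            groups.append((start, row))
            start = row
    if start < n_rows:
        groups.append((start, n_rows))
    return groups


# Example matrix:
# [[1, 0, 2],
#  [0, 3, 0],
#  [4, 5, 6]]
row_ptr = [0, 2, 3, 6]
col_idx = [0, 2, 1, 0, 1, 2]
values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]

y = csr_spmv(row_ptr, col_idx, values, [1.0, 1.0, 1.0])
groups = group_rows_by_nnz(row_ptr, nnz_budget=3)
```

With a budget of three nonzeros, the two short rows share one group and the dense last row gets its own, so the number of rows per group varies with the sparsity pattern.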

describes a project that uses

CUDA

MaRDI profile type

MaRDI publication profile

full work available at URL

https://doi.org/10.1155/2016/4596943

cites work

Q2768030

Compressed Multirow Storage Format for Sparse Matrices on Graphics Processing Units

A novel CSR-based sparse matrix-vector multiplication on GPUs

A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units

The University of Florida sparse matrix collection

Identifiers

zbMATH Open document ID

1400.65070

DOI

10.1155/2016/4596943

Mathematics Subject Classification ID

Sitelinks

Mathematics (1 entry)

mardi Publication:1793182

@@ Property / describes a project that uses @@
+clSpMV
@@ Property / describes a project that uses: clSpMV / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+LightSpMV
@@ Property / describes a project that uses: LightSpMV / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+CSR5
@@ Property / describes a project that uses: CSR5 / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1155/2016/4596943
+Normal rank
@@ Property / OpenAlex ID @@
+W2527991513
@@ Property / OpenAlex ID: W2527991513 / rank @@
+Normal rank
@@ Property / cites work @@
+Q2768030
@@ Property / cites work: Q2768030 / rank @@
+Normal rank
@@ Property / cites work @@
+Compressed Multirow Storage Format for Sparse Matrices on Graphics Processing Units
+Normal rank
@@ Property / cites work @@
+A novel CSR-based sparse matrix-vector multiplication on GPUs
+Normal rank
@@ Property / cites work @@
+A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units
+Normal rank
@@ Property / cites work @@
+The University of Florida sparse matrix collection
+Normal rank