Items where Author is "Dongarra, Jack"
Article
Dongarra, Jack and Hammarling, Sven and Higham, Nicholas J. and Relton, Samuel D. and Valero-Lara, Pedro and Zounon, Mawussi (2017) The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems. Procedia Computer Science, 108. pp. 495-504.
MIMS Preprint
Anzt, Hartwig and Dongarra, Jack and Flegar, Goran and Higham, Nicholas J. and Quintana-Orti, Enrique S. (2017) Adaptive Precision in Block-Jacobi Preconditioning for Iterative Sparse Linear System Solvers. [MIMS Preprint] (In Press)
Dongarra, Jack and Duff, Iain and Gates, Mark and Haidar, Azzam and Hammarling, Sven and Higham, Nicholas J. and Hogg, Jonathon and Valero-Lara, Pedro and Relton, Samuel D. and Tomov, Stanimire and Zounon, Mawussi (2016) A Proposed API for Batched Basic Linear Algebra Subprograms. [MIMS Preprint]
Li, Yinan and Dongarra, Jack and Tomov, Stanimire (2009) A Note on Auto-tuning GEMM for GPUs. [MIMS Preprint]
Bosilca, George and Delmas, Remi and Dongarra, Jack and Langou, Julien (2009) Algorithmic Based Fault Tolerance Applied to High Performance Computing. [MIMS Preprint]
Ltaief, Hatem and Kurzak, Jakub and Dongarra, Jack (2009) Parallel Band Two-Sided Matrix Bidiagonalization for Multicore Architectures. [MIMS Preprint]
Ltaief, Hatem and Kurzak, Jakub and Dongarra, Jack (2009) Parallel Block Hessenberg Reduction using Algorithms-By-Tiles for Multicore Architectures Revisited. [MIMS Preprint]
Dongarra, Jack and Langou, Julien (2009) The Problem with the Linpack Benchmark 1.0 Matrix Generator. [MIMS Preprint]
Baboulin, Marc and Dongarra, Jack and Tomov, Stanimire (2009) Some Issues in Dense Linear Algebra for Multicore and Special Purpose Architectures. [MIMS Preprint]
Tomov, Stanimire and Dongarra, Jack and Baboulin, Marc (2009) Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems. [MIMS Preprint]
Kurzak, Jakub and Buttari, Alfredo and Luszczek, Piotr and Dongarra, Jack (2008) The PlayStation 3 for High Performance Scientific Computing. [MIMS Preprint]
Dongarra, Jack and Luszczek, Piotr (2007) How Elegant Code Evolves with Hardware: The Case of Gaussian Elimination. [MIMS Preprint]
Buttari, Alfredo and Langou, Julien and Kurzak, Jakub and Dongarra, Jack (2007) A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures. [MIMS Preprint]
Baboulin, Marc and Dongarra, Jack and Gratton, Serge and Langou, Julien (2007) Computing the Conditioning of the Components of a Linear Least Squares Solution. [MIMS Preprint]
Buttari, Alfredo and Dongarra, Jack and Langou, Julie and Langou, Julien and Luszczek, Piotr and Kurzak, Jakub (2007) Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems. [MIMS Preprint]
Dongarra, Jack and Golub, G. and Moler, C. and Moore, K. (2007) Netlib and NA-Net: building a scientific computing community. [MIMS Preprint]
Buttari, Alfredo and Dongarra, Jack and Kurzak, Jakub (2007) Limitations of the PlayStation 3 for High Performance Cluster Computing. [MIMS Preprint]
Kurzak, Jakub and Buttari, Alfredo and Dongarra, Jack (2007) Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization. [MIMS Preprint]
Buttari, Alfredo and Dongarra, Jack and Kurzak, Jakub and Luszczek, Piotr and Tomov, Stanimire (2007) Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy. [MIMS Preprint]
Conference or Workshop Item
Haidar, Azzam and Tomov, Stanimire and Dongarra, Jack and Higham, Nicholas J. (2018) Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers. In: International Conference on Supercomputing, New York, NY, USA, 2018. (In Press)