MIMS Preprint

Dongarra, Jack and Duff, Iain and Gates, Mark and Haidar, Azzam and Hammarling, Sven and Higham, Nicholas J. and Hogg, Jonathon and Valero-Lara, Pedro and Relton, Samuel D. and Tomov, Stanimire and Zounon, Mawussi (2016) A Proposed API for Batched Basic Linear Algebra Subprograms. [MIMS Preprint]

Li, Yinan and Dongarra, Jack and Tomov, Stanimire (2009) A Note on Auto-tuning GEMM for GPUs. [MIMS Preprint]

Baboulin, Marc and Dongarra, Jack and Tomov, Stanimire (2009) Some Issues in Dense Linear Algebra for Multicore and Special Purpose Architectures. [MIMS Preprint]

Tomov, Stanimire and Dongarra, Jack and Baboulin, Marc (2009) Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems. [MIMS Preprint]

Buttari, Alfredo and Dongarra, Jack and Kurzak, Jakub and Luszczek, Piotr and Tomov, Stanimire (2007) Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy. [MIMS Preprint]

Conference or Workshop Item

Haidar, Azzam and Tomov, Stanimire and Dongarra, Jack and Higham, Nicholas J. (2018) Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers. In: International Conference on Supercomputing, New York, NY, USA, 2018. (In Press)

