Follow
Tyler Michael Smith
Tyler Michael Smith
Neural Magic
Verified email at neuralmagic.com
Title
Cited by
Cited by
Year
Analytical modeling is enough for high-performance BLIS
TM Low, FD Igual, TM Smith, ES Quintana-Orti
ACM Transactions on Mathematical Software (TOMS) 43 (2), 1-18, 2016
1692016
Anatomy of high-performance many-threaded matrix multiplication
TM Smith, R Van De Geijn, M Smelyanskiy, JR Hammond, FG Van Zee
2014 IEEE 28th International Parallel and Distributed Processing Symposium …, 2014
1622014
The BLIS framework: Experiments in portability
FG Van Zee, TM Smith, B Marker, TM Low, RAVD Geijn, FD Igual, ...
ACM Transactions on Mathematical Software (TOMS) 42 (2), 1-19, 2016
1242016
Strassen's algorithm reloaded
J Huang, TM Smith, GM Henry, RA Van De Geijn
SC'16: Proceedings of the International Conference for High Performance …, 2016
812016
Implementing high-performance complex matrix multiplication via the 3m and 4m methods
FG Van Zee, TM Smith
ACM Transactions on Mathematical Software (TOMS) 44 (1), 1-36, 2017
412017
A Tight I/O Lower Bound for Matrix Multiplication
TM Smith, B Lowery, J Langou, RA van de Geijn
arXiv preprint arXiv:1702.02017, 2019
172019
Compressive sensing using iterative hard thresholding with low precision data representation: Theory and applications
NM Gürel, K Kara, A Stojanov, T Smith, T Lemmin, D Alistarh, M Püschel, ...
IEEE Transactions on Signal Processing 68, 4268-4282, 2020
112020
Implementing strassen's algorithm with blis
J Huang, TM Smith, GM Henry, RA van de Geijn
arXiv preprint arXiv:1605.01078, 2016
102016
Pushing the bounds for matrix-matrix multiplication
TM Smith, RA van de Geijn
CoRR abs/1702.02017, 2017
92017
The MOMMS family of matrix multiplication algorithms
TM Smith, RA van de Geijn
arXiv preprint arXiv:1904.05717, 2019
82019
Fast quantized arithmetic on x86: Trading compute for data movement
A Stojanov, TM Smith, D Alistarh, M Püschel
2018 IEEE International Workshop on Signal Processing Systems (SiPS), 349-354, 2018
82018
Theory and practice of classical matrix-matrix multiplication for hierarchical memory architectures
TM Smith
62018
Toward ABFT for BLIS GEMM
TM Smith, RA van de Geijn, M Smelyanskiy, ES Quintana-Orti
Tech. Rep. TR-15–05. The University of Texas at Austin, 2015
62015
Automating the last-mile for high performance dense linear algebra
RM Veras, TM Low, TM Smith, R van de Geijn, F Franchetti
arXiv preprint arXiv:1611.08035, 2016
52016
Analytical models for the BLIS framework
TM Low, FD Igual, TM Smith, ES Quintana-Ortí
ACM Transactions on Mathematical Software, 2015
52015
Implementing level-3 BLAS with BLIS: Early experience
FG Van Zee, T Smith, FD Igual, M Smelyanskiy, X Zhang, M Kistler, ...
The University of Texas at Austin, Department of Computer Science, FLAME …, 2013
52013
Opportunities for Parallelism in Matrix Multiplication
TM Smith, RA van de Geijn, M Smelyanskiy, J Hammond, FG Van Zee
Univ. Texas Techinical Report, 2013
4*2013
Lowering barriers into HPC through open education
RA van de Geijn, J Huang, ME Myers, DN Parikh, TM Smith
EdEduHPC-17: Workshop on Education for High-Performance Computing., 2017
32017
Code generation to aid parallel code development
B Marker, T Smith, D Batory, F Van Zee, R Van de Geijn
Technical report TR-14-08, The University of Texas at Austin, Department of …, 2014
22014
Inducing complex matrix multiplication via the 3m and 4m methods FLAME Working Note# 81
FG Van Zee, TM Smith
12016
The system can't perform the operation now. Try again later.
Articles 1–20