Geonhwa Jeong
Research Scientist, Meta
Verified email at meta.com
Title
Cited by
Year
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
SC Kao, G Jeong, T Krishna
In Proc. of the 53rd Annual IEEE/ACM International Symposium on …, 2020
Cited by: 109, Year: 2020
TurboFlux: A fast continuous subgraph matching system for streaming graph data
K Kim, I Seo, WS Han, JH Lee, S Hong, H Chafi, H Shin, G Jeong
In Proc. of the 44th International Conference on Management of Data (SIGMOD …, 2018
Cited by: 67, Year: 2018
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
H Kang, Q Zhang, S Kundu, G Jeong, Z Liu, T Krishna, T Zhao
arXiv preprint arXiv:2403.05527, 2024
Cited by: 27, Year: 2024
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication
GE Moon, H Kwon, G Jeong, P Chatarasi, S Rajamanickam, T Krishna
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2021
Cited by: 26, Year: 2021
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators
G Jeong, G Kestor, P Chatarasi, A Parashar, PA Tsai, S Rajamanickam, ...
In Proc. of the 30th International Conference on Parallel Architectures and …, 2021
Cited by: 18, Year: 2021
RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
G Jeong, E Qin, A Samajdar, CJ Hughes, S Subramoney, H Kim, ...
In Proc. of the 58th Annual Design Automation Conference (DAC), 2021
Cited by: 18, Year: 2021
Extending Sparse Tensor Accelerators to Support Multiple Compression Formats
E Qin, G Jeong, W Won, SC Kao, H Kwon, S Srinivasan, D Das, GE Moon, ...
In Proc. of the 35th IEEE International Parallel & Distributed Processing …, 2021
Cited by: 17, Year: 2021
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
G Jeong, S Damani, AR Bambhaniya, E Qin, CJ Hughes, S Subramoney, ...
In Proc. of the 29th IEEE International Symposium on High-Performance …, 2023
Cited by: 14, Year: 2023
Algorithm-Hardware Co-Design of Distribution-Aware Logarithmic-Posit Encodings for Efficient DNN Inference
A Ramachandran, Z Wan, G Jeong, J Gustafson, T Krishna
In Proc. of the 61st Annual Design Automation Conference (DAC), 2024
Cited by: 5, Year: 2024
Demystifying Platform Requirements for Diverse LLM Inference Use Cases
A Bambhaniya, R Raj, G Jeong, S Kundu, S Srinivasan, M Elavazhagan, ...
arXiv preprint arXiv:2406.01698, 2024
Cited by: 3, Year: 2024
Characterization of Data Compression in Datacenters
G Jeong, B Sharma, N Terrell, A Dhanotia, Z Zhao, N Agarwal, ...
In Proc. of the 24th IEEE International Symposium on Performance Analysis of …, 2023
Cited by: 3, Year: 2023
SDQ: Sparse Decomposed Quantization for LLM Inference
G Jeong, PA Tsai, SW Keckler, T Krishna
arXiv preprint arXiv:2406.13868, 2024
Cited by: 1, Year: 2024
Understanding Data Compression in Warehouse-Scale Datacenter Services
G Jeong, B Sharma, N Terrell, A Dhanotia, Z Zhao, N Agarwal, ...
In Proc. of the 23rd IEEE International Symposium on Performance Analysis of …, 2022
Cited by: 1, Year: 2022
Bridging the Frequency Gap in Heterogeneous 3D SoCs through Technology-Specific NoC Router Architectures
JM Joseph, L Bamberg, G Jeong, RT Chien, R Leupers, A García-Ortiz, ...
In Proc. of the 26th Asia and South Pacific Design Automation Conference …, 2021
Cited by: 1, Year: 2021
Generating sparse neural networks
G Jeong, PA Tsai, JM Pool
US Patent US20240152407A1, 2024
Year: 2024
Abstracting Sparse DNN Acceleration via Structured Sparse Tensor Decomposition
G Jeong, PA Tsai, AR Bambhaniya, SW Keckler, T Krishna
arXiv preprint arXiv:2403.07953, 2024
Year: 2024