追蹤
Rahulkumar Gayatri
Rahulkumar Gayatri
在 lbl.gov 的電子郵件地址已通過驗證
標題
引用次數
引用次數
年份
Kokkos 3: Programming model extensions for the exascale era
CR Trott, D Lebrun-Grandié, D Arndt, J Ciesko, V Dang, N Ellingwood, ...
IEEE Transactions on Parallel and Distributed Systems 33 (4), 805-817, 2021
1932021
TERAFLUX: Harnessing dataflow in next generation teradevices
R Giorgi, RM Badia, F Bodin, A Cohen, P Evripidou, P Faraboschi, ...
Microprocessors and Microsystems 38 (8), 976-990, 2014
892014
An empirical roofline methodology for quantitatively assessing performance portability
C Yang, R Gayatri, T Kurth, P Basu, Z Ronaghi, A Adetokunbo, B Friesen, ...
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
442018
A case study for performance portability using OpenMP 4.5
R Gayatri, C Yang, T Kurth, J Deslippe
Accelerator Programming Using Directives: 5th International Workshop, WACCPD …, 2019
392019
Billion atom molecular dynamics simulations of carbon at extreme conditions and experimental time and length scales
K Nguyen-Cong, JT Willman, SG Moore, AB Belonoshko, R Gayatri, ...
Proceedings of the International Conference for High Performance Computing …, 2021
282021
A novel multi-level integrated roofline model approach for performance characterization
T Koskela, Z Matveev, C Yang, A Adedoyin, R Belenov, P Thierry, Z Zhao, ...
High Performance Computing: 33rd International Conference, ISC High …, 2018
222018
Experiences in porting mini‐applications to OpenACC and OpenMP on heterogeneous systems
VG Vergara Larrea, RD Budiardja, R Gayatri, C Daley, O Hernandez, ...
Concurrency and Computation: Practice and Experience 32 (20), e5780, 2020
172020
Case study of using Kokkos and SYCL as performance-portable frameworks for Milc-Dslash benchmark on NVIDIA, AMD and Intel GPUs
AS Dufek, R Gayatri, N Mehta, D Doerfler, B Cook, Y Ghadar, C DeTar
2021 International Workshop on Performance, Portability and Productivity in …, 2021
102021
Rapid exploration of optimization strategies on advanced architectures using testsnap and lammps
R Gayatri, S Moore, E Weinberg, N Lubbers, S Anderson, J Deslippe, ...
arXiv preprint arXiv:2011.12875, 2020
102020
Loop level speculation in a task based programming model
R Gayatri, RM Badia, E Aygaude
20th Annual International Conference on High Performance Computing, 39-48, 2013
82013
Evaluating performance portability of OpenMP for SNAP on NVIDIA, Intel, and AMD GPUs using the roofline methodology
NA Mehta, R Gayatri, Y Ghadar, C Knight, J Deslippe
Accelerator Programming Using Directives: 7th International Workshop, WACCPD …, 2021
62021
Comparing managed memory and ats with and without prefetching on nvidia volta gpus
R Gayatri, K Gott, J Deslippe
2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2019
62019
Kokkos 3: Programming Model Extensions for the Exascale Era, IEEE T. Parall. Distr., 33, 805–817
CR Trott, D Lebrun-Grandié, D Arndt, J Ciesko, V Dang, N Ellingwood, ...
52022
Transactional access to shared memory in StarSs, a task based programming model
R Gayatri, RM Badia, E Ayguade, M Luján, I Watson
Euro-Par 2012 Parallel Processing: 18th International Conference, Euro-Par …, 2012
52012
Scaling and performance portability of the particle-in-cell scheme for plasma physics applications through mini-apps targeting exascale architectures
S Muralikrishnan, M Frey, A Vinciguerra, M Ligotino, AJ Cerfon, ...
arXiv preprint arXiv:2205.11052, 2022
22022
Non-recurring engineering (NRE) best practices: a case study with the NERSC/NVIDIA OpenMP contract
CS Daley, A Southwell, R Gayatri, S Biersdorfff, C Toepfer, G Özen, ...
Proceedings of the International Conference for High Performance Computing …, 2021
22021
A Methodology for Evaluating Tightly-integrated and Disaggregated Accelerated Architectures
T Groves, C Daley, R Gayatri, HA Nam, N Ding, L Oliker, NJ Wright, ...
2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking …, 2022
12022
Increasing parallelism through speculation in a task-based programming model
R Gayatri
Universitat Politècnica de Catalunya (UPC), 2015
12015
The Kokkos OpenMPTarget Backend: Implementation and Lessons Learned
R Gayatri, SL Olivier, CR Trott, J Doerfert, J Ciesko, D Lebrun-Grandie
International Workshop on OpenMP, 99-113, 2023
2023
ALPINE: A set of performance portable plasma physics particle-in-cell mini-apps for exascale computing.
S Muralikrishnan, M Frey, A Vinciguerra, M Ligotino, AJ Cerfon, ...
CoRR, 2022
2022
系統目前無法執行作業,請稍後再試。
文章 1–20