Pratyush Patel
Verified email at cs.washington.edu
Title
Cited by
Year
Gandiva: Introspective cluster scheduling for deep learning
W Xiao, R Bhardwaj, R Ramjee, M Sivathanu, N Kwatra, Z Han, P Patel, ...
13th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2018
Cited by: 498, Year: 2018
The Demikernel datapath OS architecture for microsecond-scale datacenter systems
I Zhang, A Raybuck, P Patel, K Olynyk, J Nelson, OSN Leija, A Martinez, ...
Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles …, 2021
Cited by: 85, Year: 2021
A server-based approach for predictable GPU access control
H Kim, P Patel, S Wang, RR Rajkumar
2017 IEEE 23rd International Conference on Embedded and Real-Time Computing …, 2017
Cited by: 36, Year: 2017
The virtual block interface: A flexible alternative to the conventional virtual memory framework
N Hajinazar, P Patel, M Patel, K Kanellopoulos, S Ghose, ...
2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020
Cited by: 30, Year: 2020
Analytical enhancements and practical insights for MPCP with self-suspensions
P Patel, I Baek, H Kim, R Rajkumar
2018 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS …, 2018
Cited by: 30, Year: 2018
SoundWatch: Exploring smartwatch-based deep learning approaches to support sound awareness for deaf and hard of hearing users
D Jain, H Ngo, P Patel, S Goodman, L Findlater, J Froehlich
Proceedings of the 22nd International ACM SIGACCESS Conference on Computers …, 2020
Cited by: 27, Year: 2020
A server-based approach for predictable GPU access with improved analysis
H Kim, P Patel, S Wang, RR Rajkumar
Journal of Systems Architecture 88, 97-109, 2018
Cited by: 20, Year: 2018
Splitwise: Efficient generative LLM inference using phase splitting
P Patel, E Choukse, C Zhang, Í Goiri, A Shah, S Maleki, R Bianchini
arXiv preprint arXiv:2311.18677, 2023
Cited by: 18, Year: 2023
TimerShield: Protecting High-Priority Tasks from Low-Priority Timer Interference
P Patel, M Vanga, BB Brandenburg
2017 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS …, 2017
Cited by: 12, Year: 2017
Towards improved power management in cloud GPUs
P Patel, Z Gong, S Rizvi, E Choukse, P Misra, T Anderson, A Sriraman
IEEE Computer Architecture Letters 22 (2), 141-144, 2023
Cited by: 9, Year: 2023
Srifty: Swift and thrifty distributed neural network training on the cloud
L Luo, P West, P Patel, A Krishnamurthy, L Ceze
Proceedings of Machine Learning and Systems 4, 833-847, 2022
Cited by: 4, Year: 2022
Creating successful entrepreneurial ventures in IT
UN Umesh, MQ Huynh, L Jessup
Communications of the ACM 48 (6), 82-87, 2005
Cited by: 4, Year: 2005
POLCA: Power Oversubscription in LLM Cloud Providers
P Patel, E Choukse, C Zhang, Í Goiri, B Warrier, N Mahalingam, ...
arXiv preprint arXiv:2308.12908, 2023
Cited by: 3, Year: 2023
Hybrid Computing for Interactive Datacenter Applications
P Patel, K Lim, K Jhunjhunwalla, A Martinez, M Demoulin, J Nelson, ...
arXiv preprint arXiv:2304.04488, 2023
Cited by: 2, Year: 2023
Characterizing Power Management Opportunities for LLMs in the Cloud
P Patel, E Choukse, C Zhang, Í Goiri, B Warrier, N Mahalingam, ...
Proceedings of the 29th ACM International Conference on Architectural …, 2024
Cited by: 1, Year: 2024
SoundWatch: deep learning for sound accessibility on smartwatches
D Jain, H Ngo, P Patel, S Goodman, K Nguyen, R Grossman-Kahn, ...
Communications of the ACM 65 (6), 100-108, 2022
Cited by: 1, Year: 2022
An Agile Pathway Towards Carbon-aware Clouds
P Patel, T Gregersen, T Anderson
Proceedings of the 2nd Workshop on Sustainable Computer Systems, 1-8, 2023
Year: 2023
Predictable GPU Arbitration for Fixed-Priority Real-Time Systems
P Patel
Birla Institute of Technology and Science, Pilani, 2017
Year: 2017
File Systems are not Enough: Rethinking the Storage API for Microsecond-Scale Cloud Applications
A Martinez, K Lim, P Patel, I Zhang, D Ports, J Nelson, T Anderson
Designing Equitable Data Center Scheduling Systems
S Rangarajan, X Chen, P Patel, J Wang, A Sriraman