Edward Chuah
Edward Chuah
The Alan Turing Institute & The University of Warwick
Verified email at acm.org - Homepage
TitleCited byYear
Linking resource usage anomalies with system failures from cluster log data
E Chuah, A Jhumka, S Narasimhamurthy, J Hammond, JC Browne, ...
2013 IEEE 32nd International Symposium on Reliable Distributed Systems, 111-120, 2013
332013
Diagnosing the root-causes of failures from cluster log files
E Chuah, S Kuo, P Hiew, WC Tjhi, G Lee, J Hammond, MT Michalewicz, ...
2010 International Conference on High Performance Computing, 1-10, 2010
302010
Crude: Combining resource usage data and error logs for accurate error detection in large-scale distributed systems
N Gurumdimma, A Jhumka, M Liakata, E Chuah, J Browne
2016 IEEE 35th Symposium on Reliable Distributed Systems (SRDS), 51-60, 2016
112016
Establishing hypothesis for recurrent system failures from cluster log files
E Chuah, G Lee, WC Tjhi, SH Kuo, T Hung, J Hammond, T Minyard, ...
2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure …, 2011
112011
Online failure prediction for hpc resources using decentralized clustering
A Pelaez, A Quiroz, JC Browne, E Chuah, M Parashar
2014 21st International Conference on High Performance Computing (HiPC), 1-9, 2014
92014
Towards detecting patterns in failure logs of large-scale distributed systems
N Gurumdimma, A Jhumka, M Liakata, E Chuah, J Browne
2015 IEEE International Parallel and Distributed Processing Symposium …, 2015
82015
An optimal smooth QoS adaptation strategy for QoS differentiated scalable media streaming
X Li, E Chuah, JY Tham, KH Goh
2008 IEEE International Conference on Multimedia and Expo, 429-432, 2008
62008
Using message logs and resource use data for cluster failure diagnosis
E Chuah, A Jhumka, JC Browne, N Gurumdimma, S Narasimhamurthy, ...
2016 IEEE 23rd International Conference on High Performance Computing (HiPC …, 2016
32016
Towards increasing the error handling time window in large-scale distributed systems using console and resource usage logs
N Gurumdimma, A Jhumka, M Liakata, E Chuah, J Browne
2015 IEEE Trustcom/BigDataSE/ISPA 3, 61-68, 2015
32015
Towards comprehensive dependability-driven resource use and message log-analysis for HPC systems diagnosis
E Chuah, A Jhumka, S Alt, D Balouek-Thomert, JC Browne, M Parashar
Journal of Parallel and Distributed Computing, 2019
2019
Enabling dependability-driven resource use and message log-analysis for cluster system diagnosis
E Chuah, A Jhumka, S Alt, T Damoulas, N Gurumdimma, MC Sawley, ...
2017 IEEE 24th International Conference on High Performance Computing, Data …, 2017
2017
Insights into the Diagnosis of System Failures from Cluster Message Logs
E Chuah, A Jhumka, JC Browne, B Barth, S Narasimhamurthy
2015 11th European Dependable Computing Conference (EDCC), 225-232, 2015
2015
On Handling Redundancy for Failure Log Analysis of Cluster Systems
N Gurumdimma, A Jhumka, M Liakata, E Chuah, J Browne
DEPEND 2015: The Eighth International Conference on Dependability, 2015
2015
Online monitoring of HPC resources using decentralized clustering
A Pelaez, M Parashar, JC Browne, A Quiroz, E Chuah
HIPC 2010-December 19-22, 2010-Goa, India Program
O Beaumont, AL Rosenberg, A Bhatele, GR Gupta, LV Kale, IH Chung, ...
The system can't perform the operation now. Try again later.
Articles 1–15