Mantas Mazeika

Cited by

	All	Since 2019
Citations	6437	6411
h-index	13	13
i10-index	14	14

2500

1250

625

1875

201920202021202220232024131 449 906 1258 2441 1211

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Mantas Mazeika

University of Illinois Urbana-Champaign

Verified email at illinois.edu

ML Safety AI Safety Machine Ethics ML Reliability


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Deep anomaly detection with outlier exposure D Hendrycks, M Mazeika, T Dietterich arXiv preprint arXiv:1812.04606, 2018	1457	2018
Measuring massive multitask language understanding D Hendrycks, C Burns, S Basart, A Zou, M Mazeika, D Song, J Steinhardt arXiv preprint arXiv:2009.03300, 2020	1081	2020
Using self-supervised learning can improve model robustness and uncertainty D Hendrycks, M Mazeika, S Kadavath, D Song Advances in neural information processing systems 32, 2019	955	2019
Using pre-training can improve model robustness and uncertainty D Hendrycks, K Lee, M Mazeika International Conference on Machine Learning, 2712-2721, 2019	746	2019
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022	724	2022
Using trusted data to train deep networks on labels corrupted by severe noise D Hendrycks, M Mazeika, D Wilson, K Gimpel Advances in neural information processing systems 31, 2018	591	2018
Scaling out-of-distribution detection for real-world settings D Hendrycks, S Basart, M Mazeika, M Mostajabi, J Steinhardt, D Song arXiv preprint arXiv:1911.11132, 2019	303	2019
Measuring coding challenge competence with apps D Hendrycks, S Basart, S Kadavath, M Mazeika, A Arora, E Guo, C Burns, ... arXiv preprint arXiv:2105.09938, 2021	291	2021
Pixmix: Dreamlike pictures comprehensively improve safety measures D Hendrycks, A Zou, M Mazeika, L Tang, B Li, D Song, J Steinhardt Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	87	2022
A benchmark for anomaly segmentation D Hendrycks, S Basart, M Mazeika, M Mostajabi, J Steinhardt, D Song	65	2019
X-risk analysis for ai research D Hendrycks, M Mazeika arXiv preprint arXiv:2206.05862, 2022	52	2022
What would jiminy cricket do? towards agents that behave morally D Hendrycks, M Mazeika, A Zou, S Patel, C Zhu, J Navarro, D Song, B Li, ... arXiv preprint arXiv:2110.13136, 2021	44	2021
Forecasting Future World Events with Neural Networks A Zou, T Xiao, R Jia, J Kwon, M Mazeika, R Li, D Song, J Steinhardt, ... arXiv preprint arXiv:2206.15474, 2022	13	2022
How to steer your adversary: Targeted and efficient model stealing defenses with gradient redirection M Mazeika, B Li, D Forsyth International Conference on Machine Learning, 15241-15254, 2022	12	2022
How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios M Mazeika, E Tang, A Zou, S Basart, JS Chan, D Song, D Forsyth, ... arXiv preprint arXiv:2210.10039, 2022	7	2022
The singular value decomposition and low rank approximation M Maezika Technical Report, 2016	5	2016
Moral scenarios for reinforcement learning agents D Hendrycks, M Mazeika, A Zou, S Patel, C Zhu, J Navarro, B Li, D Song, ... ICLR 2021 Workshop on Security and Safety in Machine Learning Systems, 2021	3	2021
Improving and Assessing Anomaly Detectors for Large-Scale Settings D Hendrycks, S Basart, M Mazeika, A Zou, J Kwon, M Mostajabi, ...	1	2021

The system can't perform the operation now. Try again later.

Articles 1–18

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by