Follow
Mantas Mazeika
Title
Cited by
Cited by
Year
Deep anomaly detection with outlier exposure
D Hendrycks, M Mazeika, T Dietterich
arXiv preprint arXiv:1812.04606, 2018
14572018
Measuring massive multitask language understanding
D Hendrycks, C Burns, S Basart, A Zou, M Mazeika, D Song, J Steinhardt
arXiv preprint arXiv:2009.03300, 2020
10812020
Using self-supervised learning can improve model robustness and uncertainty
D Hendrycks, M Mazeika, S Kadavath, D Song
Advances in neural information processing systems 32, 2019
9552019
Using pre-training can improve model robustness and uncertainty
D Hendrycks, K Lee, M Mazeika
International Conference on Machine Learning, 2712-2721, 2019
7462019
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
7242022
Using trusted data to train deep networks on labels corrupted by severe noise
D Hendrycks, M Mazeika, D Wilson, K Gimpel
Advances in neural information processing systems 31, 2018
5912018
Scaling out-of-distribution detection for real-world settings
D Hendrycks, S Basart, M Mazeika, M Mostajabi, J Steinhardt, D Song
arXiv preprint arXiv:1911.11132, 2019
3032019
Measuring coding challenge competence with apps
D Hendrycks, S Basart, S Kadavath, M Mazeika, A Arora, E Guo, C Burns, ...
arXiv preprint arXiv:2105.09938, 2021
2912021
Pixmix: Dreamlike pictures comprehensively improve safety measures
D Hendrycks, A Zou, M Mazeika, L Tang, B Li, D Song, J Steinhardt
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
872022
A benchmark for anomaly segmentation
D Hendrycks, S Basart, M Mazeika, M Mostajabi, J Steinhardt, D Song
652019
X-risk analysis for ai research
D Hendrycks, M Mazeika
arXiv preprint arXiv:2206.05862, 2022
522022
What would jiminy cricket do? towards agents that behave morally
D Hendrycks, M Mazeika, A Zou, S Patel, C Zhu, J Navarro, D Song, B Li, ...
arXiv preprint arXiv:2110.13136, 2021
442021
Forecasting Future World Events with Neural Networks
A Zou, T Xiao, R Jia, J Kwon, M Mazeika, R Li, D Song, J Steinhardt, ...
arXiv preprint arXiv:2206.15474, 2022
132022
How to steer your adversary: Targeted and efficient model stealing defenses with gradient redirection
M Mazeika, B Li, D Forsyth
International Conference on Machine Learning, 15241-15254, 2022
122022
How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios
M Mazeika, E Tang, A Zou, S Basart, JS Chan, D Song, D Forsyth, ...
arXiv preprint arXiv:2210.10039, 2022
72022
The singular value decomposition and low rank approximation
M Maezika
Technical Report, 2016
52016
Moral scenarios for reinforcement learning agents
D Hendrycks, M Mazeika, A Zou, S Patel, C Zhu, J Navarro, B Li, D Song, ...
ICLR 2021 Workshop on Security and Safety in Machine Learning Systems, 2021
32021
Improving and Assessing Anomaly Detectors for Large-Scale Settings
D Hendrycks, S Basart, M Mazeika, A Zou, J Kwon, M Mostajabi, ...
12021
The system can't perform the operation now. Try again later.
Articles 1–18