关注
Tinghao Xie
Tinghao Xie
在 princeton.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Fine-tuning aligned language models compromises safety, even when users do not intend to!
X Qi, Y Zeng, T Xie, PY Chen, R Jia, P Mittal, P Henderson
arXiv preprint arXiv:2310.03693, 2023
962023
Revisiting the assumption of latent separability for backdoor defenses
X Qi, T Xie, Y Li, S Mahloujifar, P Mittal
The eleventh international conference on learning representations, 2022
65*2022
Towards practical deployment-stage backdoor attack on deep neural networks
X Qi, T Xie, R Pan, J Zhu, Y Yang, K Bu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
442022
Towards a proactive {ML} approach for detecting backdoor poison samples
X Qi, T Xie, JT Wang, T Wu, S Mahloujifar, P Mittal
32nd USENIX Security Symposium (USENIX Security 23), 1685-1702, 2023
16*2023
Assessing the brittleness of safety alignment via pruning and low-rank modifications
B Wei, K Huang, Y Huang, T Xie, X Qi, M Xia, P Mittal, M Wang, ...
arXiv preprint arXiv:2402.05162, 2024
42024
BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection
T Xie, X Qi, P He, Y Li, JT Wang, P Mittal
arXiv preprint arXiv:2308.12439, 2023
12023
A Handbook for Deep Learning with their Piecemeal Intuitions from Causal Theory
T Xie
2021
Ensemble of Narrow DNN Chains
T Xie
2021
Texture Packing
T Xie, H Lin, Z Zhao
2020
系统目前无法执行此操作,请稍后再试。
文章 1–9