Armen Aghajanyan

Cited by

	All	Since 2019
Citations	2970	2960
h-index	20	19
i10-index	26	25

1200

600

300

900

2019202020212022202320248 37 175 504 1125 1103

Co-authors

Luke ZettlemoyerUniversity of Washington; MetaVerified email at cs.washington.edu
Mike LewisFacebook AI ResearchVerified email at fb.com
Sonal GuptaResearcher at GoogleVerified email at google.com
Gargi GhoshMeta AI ResearchVerified email at fb.com
Scott Wen-tau YihMeta FAIRVerified email at meta.com
Mandar JoshiGoogle AIVerified email at google.com
Naman GoyalFacebook AI ResearchVerified email at gatech.edu
Florian MetzeCarnegie Mellon University; Meta AIVerified email at andrew.cmu.edu
Marjan GhazvininejadResearch Scientist, FAIR (Facebook AI Research)Verified email at fb.com

Armen Aghajanyan

Facebook AI Research

Verified email at fb.com

Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Videoclip: Contrastive pre-training for zero-shot video-text understanding H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ... arXiv preprint arXiv:2109.14084, 2021	455	2021
Incoder: A generative model for code infilling and synthesis D Fried, A Aghajanyan, J Lin, S Wang, E Wallace, F Shi, R Zhong, W Yih, ... arXiv preprint arXiv:2204.05999, 2022	446	2022
Intrinsic dimensionality explains the effectiveness of language model fine-tuning A Aghajanyan, L Zettlemoyer, S Gupta arXiv preprint arXiv:2012.13255, 2020	393	2020
Muppet: Massive multi-task representations with pre-finetuning A Aghajanyan, A Gupta, A Shrivastava, X Chen, L Zettlemoyer, S Gupta arXiv preprint arXiv:2101.11038, 2021	250	2021
Better fine-tuning by reducing representational collapse A Aghajanyan, A Shrivastava, A Gupta, N Goyal, L Zettlemoyer, S Gupta arXiv preprint arXiv:2008.03156, 2020	228	2020
Memorization without overfitting: Analyzing the training dynamics of large language models K Tirumala, A Markosyan, L Zettlemoyer, A Aghajanyan Advances in Neural Information Processing Systems 35, 38274-38290, 2022	164	2022
Pre-training via paraphrasing M Lewis, M Ghazvininejad, G Ghosh, A Aghajanyan, S Wang, ... Advances in Neural Information Processing Systems 33, 18470-18481, 2020	153	2020
Cm3: A causal masked multimodal model of the internet A Aghajanyan, B Huang, C Ross, V Karpukhin, H Xu, N Goyal, D Okhonko, ... arXiv preprint arXiv:2201.07520, 2022	139	2022
Improving passage retrieval with zero-shot question generation DS Sachan, M Lewis, M Joshi, A Aghajanyan, W Yih, J Pineau, ... arXiv preprint arXiv:2204.07496, 2022	97	2022
Scaling autoregressive multi-modal models: Pretraining and instruction tuning L Yu, B Shi, R Pasunuru, B Muller, O Golovneva, T Wang, A Babu, B Tang, ... arXiv preprint arXiv:2309.02591 2 (3), 2023	83	2023
Retrieval-augmented multimodal language modeling M Yasunaga, A Aghajanyan, W Shi, R James, J Leskovec, P Liang, ... arXiv preprint arXiv:2211.12561, 2022	83	2022
Htlm: Hyper-text pre-training and prompting of language models A Aghajanyan, D Okhonko, M Lewis, M Joshi, H Xu, G Ghosh, ... arXiv preprint arXiv:2107.06955, 2021	66	2021
Megabyte: Predicting million-byte sequences with multiscale transformers L Yu, D Simig, C Flaherty, A Aghajanyan, L Zettlemoyer, M Lewis Advances in Neural Information Processing Systems 36, 2024	60	2024
Scaling laws for generative mixed-modal language models A Aghajanyan, L Yu, A Conneau, WN Hsu, K Hambardzumyan, S Zhang, ... International Conference on Machine Learning, 265-279, 2023	57	2023
Conversational semantic parsing A Aghajanyan, J Maillard, A Shrivastava, K Diedrick, M Haeger, H Li, ... arXiv preprint arXiv:2009.13655, 2020	50	2020
D4: Improving llm pretraining via document de-duplication and diversification K Tirumala, D Simig, A Aghajanyan, A Morcos Advances in Neural Information Processing Systems 36, 2024	39	2024
Semantic representations using structural ontology for assistant systems A Aghajanyan, S Gupta, B Moran, TF Levin, CANSH Nakatsu, D Difranco, ... US Patent 11,688,022, 2023	36	2023
On-device convolutional neural network models for assistant systems A Aly, A Babu, A Aghajanyan US Patent 11,314,941, 2022	36	2022
Non-autoregressive semantic parsing for compositional task-oriented dialog A Babu, A Shrivastava, A Aghajanyan, A Aly, A Fan, M Ghazvininejad arXiv preprint arXiv:2104.04923, 2021	25	2021
Softtarget regularization: An effective technique to reduce over-fitting in neural networks A Aghajanyan 2017 3rd IEEE International Conference on Cybernetics (CYBCONF), 1-5, 2017	20	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors