The hitchhiker’s guide to testing statistical significance in natural language processing R Dror, G Baumer, S Shlomov, R Reichart Proceedings of the 56th annual meeting of the association for computational …, 2018 | 315 | 2018 |
Deep dominance - how to properly compare deep neural models R Dror, S Shlomov, R Reichart Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019 | 73 | 2019 |
Replicability Analysis for Natural Language Processing: Testing Significance with Multiple Datasets R Dror, G Baumer, M Bogomolov, R Reichart Transactions of the Association for Computational Linguistics 5, 471-486, 2017 | 49 | 2017 |
Statistical significance testing for natural language processing R Dror, L Peled-Cohen, S Shlomov, R Reichart Synthesis Lectures on Human Language Technologies 13 (2), 1-116, 2020 | 36 | 2020 |
A statistical analysis of summarization evaluation metrics using resampling methods D Deutsch, R Dror, D Roth Transactions of the Association for Computational Linguistics 9, 1132-1146, 2021 | 28 | 2021 |
Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics D Deutsch, R Dror, D Roth arXiv preprint arXiv:2204.10216, 2022 | 12 | 2022 |
Resin-11: Schema-guided event prediction for 11 newsworthy scenarios X Du, Z Zhang, S Li, P Yu, H Wang, T Lai, X Lin, Z Wang, I Liu, B Zhou, ... Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 9 | 2022 |
Zero-Shot On-the-Fly Event Schema Induction R Dror, H Wang, D Roth arXiv preprint arXiv:2210.06254, 2022 | 3 | 2022 |
Human-in-the-Loop Schema Induction T Zhang, I Tham, Z Hou, J Ren, L Zhou, H Xu, L Zhang, LJ Martin, R Dror, ... arXiv preprint arXiv:2302.13048, 2023 | 2 | 2023 |
On the Limitations of Reference-Free Evaluations of Generated Text D Deutsch, R Dror, D Roth arXiv preprint arXiv:2210.12563, 2022 | 2 | 2022 |
Pareto-efficient probabilistic solutions A Kantor, M Masin, S Shlomov, R Dror US Patent App. 15/905,988, 2019 | 1 | 2019 |
Recommended statistical significance tests for NLP tasks R Dror, R Reichart arXiv preprint arXiv:1809.01448, 2018 | 1 | 2018 |
The Structured Weighted Violations Perceptron Algorithm R Dror, R Reichart Conference on Empirical Methods in Natural Language Processing, 469-478, 2016 | 1* | 2016 |
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop S Rijhwani, J Liu, Y Wang, R Dror Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020 | | 2020 |
The Structured Weighted Violations MIRA D Ringel, R Dror, R Reichart arXiv preprint arXiv:2005.04418, 2020 | | 2020 |
Statistical Significance Tests R Dror, L Peled-Cohen, S Shlomov, R Reichart Statistical Significance Testing for Natural Language Processing, 9-21, 2020 | | 2020 |
Open Questions and Challenges R Dror, L Peled-Cohen, S Shlomov, R Reichart Statistical Significance Testing for Natural Language Processing, 75-77, 2020 | | 2020 |
Deep Significance R Dror, L Peled-Cohen, S Shlomov, R Reichart Statistical Significance Testing for Natural Language Processing, 35-50, 2020 | | 2020 |
Statistical Hypothesis Testing R Dror, L Peled-Cohen, S Shlomov, R Reichart Statistical Significance Testing for Natural Language Processing, 3-7, 2020 | | 2020 |
Replicability Analysis R Dror, L Peled-Cohen, S Shlomov, R Reichart Statistical Significance Testing for Natural Language Processing, 51-73, 2020 | | 2020 |