Shawn Tan
Montreal Institute of Learning Algorithms
Verified email at mila.quebec
Title
Cited by
Year
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Y Shen, S Tan, A Sordoni, A Courville
International Conference on Learning Representations (ICLR), 2019
Cited by: 403, Year: 2019
Improving explorability in variational inference with annealed variational objectives
CW Huang, S Tan, A Lacoste, AC Courville
Advances in Neural Information Processing Systems 31, 2018
Cited by: 65, Year: 2018
Improving the interpretability of deep neural networks with stimulated learning
S Tan, KC Sim, M Gales
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
Cited by: 65, Year: 2015
Icentia11k: An unsupervised representation learning dataset for arrhythmia subtype discovery
S Tan, G Androz, A Chamseddine, P Fecteau, A Courville, Y Bengio, ...
arXiv preprint arXiv:1910.09570, 2019
Cited by: 32, Year: 2019
Ordered memory
Y Shen, S Tan, A Hosseini, Z Lin, A Sordoni, AC Courville
Advances in Neural Information Processing Systems 32, 2019
Cited by: 29, Year: 2019
Learning utterance-level normalisation using Variational Autoencoders for robust automatic speech recognition
S Tan, KC Sim
Spoken Language Technology Workshop (SLT), 2016 IEEE, 2016
Cited by: 25, Year: 2016
Moduleformer: Learning modular large language models from uncurated data
Y Shen, Z Zhang, T Cao, S Tan, Z Chen, C Gan
arXiv preprint arXiv:2306.04640, 2023
Cited by: 22, Year: 2023
Towards implicit complexity control using variable-depth deep neural networks for automatic speech recognition
S Tan, KC Sim
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Cited by: 14, Year: 2016
Sparse universal transformer
S Tan, Y Shen, Z Chen, A Courville, C Gan
arXiv preprint arXiv:2310.07096, 2023
Cited by: 11, Year: 2023
Scattered Mixture-of-Experts Implementation
S Tan, Y Shen, R Panda, A Courville
arXiv preprint arXiv:2403.08245, 2024
Cited by: 9, Year: 2024
Explicitly modeling syntax in language models with incremental parsing and a dynamic oracle
Y Shen, S Tan, A Sordoni, S Reddy, A Courville
arXiv preprint arXiv:2011.07960, 2020
Cited by: 9*, Year: 2020
Unsupervised dependency graph network
Y Shen, S Tan, A Sordoni, P Li, J Zhou, A Courville
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
Cited by: 8, Year: 2022
Investigating biases in textual entailment datasets
S Tan, Y Shen, C Huang, A Courville
arXiv preprint arXiv:1906.09635, 2019
Cited by: 8, Year: 2019
Self-organized hierarchical softmax
Y Shen, S Tan, C Pal, A Courville
arXiv preprint arXiv:1707.08588, 2017
Cited by: 8, Year: 2017
Learning to dequantise with truncated flows
S Tan, CW Huang, A Sordoni, A Courville
International Conference on Learning Representations, 2021
Cited by: 5, Year: 2021
Recursive top-down production for sentence generation with latent trees
S Tan, Y Shen, TJ O'Donnell, A Sordoni, A Courville
arXiv preprint arXiv:2010.04704, 2020
Cited by: 5, Year: 2020
Generating contradictory, neutral, and entailing sentences
Y Shen, S Tan, CW Huang, A Courville
arXiv preprint arXiv:1803.02710, 2018
Cited by: 5, Year: 2018
Power scheduler: A batch size and token number agnostic learning rate scheduler
Y Shen, M Stallone, M Mishra, G Zhang, S Tan, A Prasad, AM Soria, ...
arXiv preprint arXiv:2408.13359, 2024
Cited by: 1, Year: 2024
Inferring identity factors for grouped examples
S Tan, CJ Pal, A Courville
Cited by: 1, Year: 2018
Stick-breaking Attention
S Tan, Y Shen, S Yang, A Courville, R Panda
arXiv preprint arXiv:2410.17980, 2024
Year: 2024