Valentin Thomas
PhD student, Mila, University of Montreal
Independently Controllable Factors
V Thomas*, J Pondard*, E Bengio*, M Sarfati, P Beaudoin, MJ Meurs, ...
arXiv preprint arXiv:1708.01289, 2017
Disentangling the independently controllable factors of variation by interacting with the world
V Thomas, E Bengio, W Fedus, J Pondard, P Beaudoin, H Larochelle, ...
NIPS 2017 workshop on Learning Disentangled Representations: from …, 2018
On the interplay between noise and curvature and its effect on optimization and generalization
V Thomas, F Pedregosa, B van Merriënboer, PA Manzagol, Y Bengio, ...
International Conference on Artificial Intelligence and Statistics, 3503-3513, 2020
Probabilistic Planning with Sequential Monte Carlo methods
V Thomas*, A Piché*, C Ibrahim, Y Bengio, C Pal
ICLR, 2019
Independently Controllable Features
E Bengio, V Thomas, J Pineau, D Precup, Y Bengio
RLDM 2017, 2017
Beyond variance reduction: Understanding the true impact of baselines on policy optimization
V Thomas*, W Chung*, MC Machado, N Le Roux
International Conference on Machine Learning, 1999-2009, 2021
Decoupling Backpropagation using Constrained Optimization Methods
A Gotmare*, V Thomas*, J Brea, M Jaggi
ICML 2018 workshop on Efficient Credit Assignment, 2018
The role of baselines in policy gradient optimization
J Mei, W Chung, V Thomas, B Dai, C Szepesvari, D Schuurmans
Advances in Neural Information Processing Systems 35, 17818-17830, 2022
Information matrices and generalization
V Thomas, F Pedregosa, B van Merriënboer, PA Manzagol, Y Bengio, ...
arXiv preprint arXiv:1906.07774, 2019
Bridging the Gap Between Target Networks and Functional Regularization
A Piché*, V Thomas*, J Marino, GM Marconi, C Pal, ME Khan
TMLR 2023, 2021
On the role of overparameterization in off-policy Temporal Difference learning with linear function approximation
V Thomas
NeurIPS, 2022
In-Context Data Distillation with TabPFN
J Ma, V Thomas, G Yu, A Caterini
arXiv preprint arXiv:2402.06971, 2024
Retrieval & Fine-Tuning for In-Context Tabular Models
V Thomas, J Ma, R Hosseinzadeh, K Golestan, G Yu, M Volkovs, ...
arXiv preprint arXiv:2406.05207, 2024
Learning and planning with noise in optimization and reinforcement learning
V Thomas
Contrôle de l’intrication de deux Qubits (Controlling the entanglement of two qubits)
V Thomas
Planning with Latent Simulated Trajectories
A Piché, V Thomas, C Ibrahim, J Cornebise, C Pal