Rupam Mahmood
Fellow & Canada CIFAR AI Chair
Academic Affiliations
Industry and Research Affiliations
Areas of Expertise
Fellow & Canada CIFAR AI Chair
Academic Affiliations
Industry and Research Affiliations
Areas of Expertise
Rupam Mahmood develops reinforcement learning algorithms and real-time learning systems for controlling physical robots.
Rupam Mahmood develops reinforcement learning algorithms and real-time learning systems for controlling physical robots. His research focuses on developing general and constructive mechanisms for continually improving robot minds. Currently, he is working on two long-term programs consisting of several short-term projects: a simple and general reinforcement learning system for robot control, and core constructive mechanisms for continually learning agents. In the first program, Rupam and his team are working to develop a reinforcement learning system that can be easily deployed in many different robots for solving various tasks. The second program has his research teams developing and analyzing algorithms for learning policies and representations in a continual learning setup, in which the agent is expected to go through a series of changes in the environment and tasks. Through these two programs, Rupam seeks to develop a system that enables scientific understanding as well as large-scale industrial adoption of robotics by analyzing and addressing shortcomings of current policy and representation learning methods.
Rupam is a Fellow and Canada CIFAR AI Chair at Amii and an Assistant Professor in the Department of Computing Science at the University of Alberta. In 2017, he achieved his Ph.D. in Statistical Machine Learning at the University of Alberta under the supervision of Richard S. Sutton (Amii Fellow, Chief Scientific Advisor and a pioneer of reinforcement learning) with his thesis focusing on incremental off-policy reinforcement learning algorithms. Previously, Rupam was a Research Scientist and then Lead of AI Research at Kindred Inc., where he now acts as a Scientific Advisor. He is an Associate Editor of the IEEE/RJS International Conference on Intelligent Robots and Systems (IROS) and a Senior Program Committee Member for the International Joint Conference on Artificial Intelligence (IJCAI). Rupam has produced software that provides a computational framework and a benchmark task suite for developing and evaluating reinforcement learning methods with physical robots.
Oct 29th 2018
Research Post
May 1st 2016
Research Post
Dec 8th 2014
Research Post
Rupam has produced software that provides a computational framework and a benchmark task suite for developing and evaluating reinforcement learning methods with physical robots.
Aug 21st 2024
News
In a paper published in Nature, a team of Amii researchers have revealed more about a mysterious problem in machine learning — a discovery that might be a major step towards building advanced AI that can function effectively in the real world.
Jul 20th 2021
News
Watch this week's videos featuring Michael Przystupa and Alex Lewandowski, as well as a panel moderated by Michael Bowling feat. Rich Sutton, Martha White, Patrick Pilarski & Rupam Mahmood!
Jul 13th 2021
News
The work of Amii researchers is being featured at the 38th annual International Conference on Machine Learning (ICML), running online this year from July 18 to 24.
Looking to build AI capacity? Need a speaker at your event?