University of Toronto · MIE

Sequential decision making for a world of uncertainty.

The Dynamic Optimization and Reinforcement Learning Lab builds principled, data-driven methods for intelligent systems that learn, plan, and adapt — from supply chains and healthcare to robotics and finance.

18
Researchers
7+
Application Domains
25+
Years at U of T
Welcome to DORL

Bridging classical operations research with modern AI.

Grounded in Markov decision processes, approximate dynamic programming, and bandit theory — and extended through deep and inverse reinforcement learning — our work develops scalable, reliable algorithms for the decisions that matter.

The Dynamic Optimization & Reinforcement Learning Lab (DORL) advances fundamental research in data-driven intelligence and dynamic decision making. Our work bridges theory and practice through innovations in deep learning, reinforcement learning, and dynamic optimization, with the goal of developing efficient, generalizable, and safe AI systems.

We explore the foundations of learning under uncertainty — from self-supervised and continual learning to safe and transfer reinforcement learning — and apply these principles to complex real-world systems spanning supply chain optimization, intelligent healthcare, autonomous robotics, building management, transportation, manufacturing, and finance.

Embedded within the University of Toronto's interdisciplinary research ecosystem, DORL publishes in leading machine learning and applied AI venues while contributing to intelligent systems that shape the future of technology and society.

See what we're working on
Our Core Values

Three commitments that shape every project.

01 / THEORY → IMPACT

Bridging Theory and Real-World Impact

We advance the foundations of deep learning and reinforcement learning while ensuring our methods deliver measurable impact in healthcare, robotics, transportation, manufacturing, and finance.

02 / ROBUST LEARNING

Learning Under Uncertainty

We develop robust algorithms for decision making in dynamic, uncertain environments — from safe RL to continual and online learning — enabling intelligent systems that adapt over time.

03 / SCALE

Generalizable & Scalable Intelligence

Our mission is to push AI toward efficiency, scalability, and generalization — essential components of future intelligent systems capable of operating across diverse domains.

Research Themes

Where classical optimization meets modern learning.

Active threads across the lab, each with deep theoretical grounding and live applications.

THEME

Reinforcement Learning

Safe RL, multi-agent RL, risk-sensitive and longevity-aware agents, inverse RL, duality & occupancy-measure formulations, and general-utility objectives.

THEME

Deep Learning & Generative AI

Hierarchical representation learning, graph neural networks, constrained generative models, discrete diffusion for structured outputs, and out-of-distribution generalization.

THEME

Dynamic Optimization & OR

Markov decision processes, approximate dynamic programming, combinatorial optimization, scheduling, and mechanism design for self-interested multi-agent systems.

THEME

Robotics & Embodied AI

Language-conditioned robot learning, multimodal manipulation, human-robot interaction, sim-to-real transfer, and long-horizon autonomy for real-world tasks.

THEME

Foundation Models for Decisions

Vision-language-action models, LLM-based decision making, neurosymbolic AI, and interpretable methods for complex sequential decision-making tasks.

THEME

Learning Under Distribution Shift

Online learning, test-time training and adaptation, robust time series analysis, continual learning, and belief revision in dynamic environments.

Application Domains

Algorithms deployed where decisions carry weight.

Supply Chain Optimization
Intelligent Healthcare
Autonomous Robotics
Building Management
Transportation
Smart Manufacturing
Financial Engineering
Dynamic Pricing
The Team

A research group as broad as the problems we tackle.

A single PI leading a cohort of PhD researchers spanning reinforcement learning theory, robotics, generative AI, graph learning, and applied operations research.

Interested in working with us?

We welcome prospective students, collaborators, and industry partners exploring new applications of dynamic optimization and reinforcement learning.