Computational Rationalization: The Inverse Equilibrium Problem Abstract:Modeling the purposeful behavior of imperfect agents from a small number of observations is a challenging task. When restricted to the single-agent decision-theoretic setting, inverse optimal control techniques assume that observed behavior is an approximately optimal solution to an unknown decision problem. These techniques learn a utility function that [...]

{ Comments on this entry are closed }

Welcome to new post-doctoral fellows

by lairlab on July 18, 2011

Moslem Kazemi: working on perception and force guided manipulation. Working with Nancy Pollard and Drew Bagnell. Kris Katani: working with Martial Hebert and Drew Bagnell on activity prediction. Paul Vernaza: working with Drew Bagnell on compressed information space reasoning.

{ Comments on this entry are closed }

Preprint: A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

March 17, 2011

Stéphane Ross Geoffrey J. Gordon J. Andrew Bagnell, Carnegie Mellon University To Appear in Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS), 2011 Link to Paper Abstract: Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. [...]

Read the full article →

Preprint: Maximum Causal Entropy Correlated Equilibria for Markov Games

February 22, 2011

Brian D. Ziebart, J. Andrew Bagnell, Anind K. Dey Carnegie Mellon University To appear at International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011). Link to Paper Motivated by a machine learning perspective|that game theoretic equilibria constraints should serve as guidelines for predicting agents’ strategies, we introduce maximum causal entropy correlated equilibria (MCECE), a [...]

Read the full article →

Amusing New Dodge Commercial

February 20, 2011

Dodge has released an amusing new commercial referencing self-driving cars and other robotics advances. Well, those of us on Team Robot take it as a compliment.

Read the full article →

Preprint: 3-D Scene Analysis via Sequenced Predictions over Points and Regions

January 31, 2011

3-D Scene Analysis via Sequenced Predictions over Points and Regions Xuehan Xiong, Daniel Munoz, J. Andrew Bagnell, Martial Hebert To appear: ICRA 2011. Preprint (pdf) We address the problem of understanding scenes from 3-D laser scans via per-point assignment of semantic labels. In order to mitigate the difficulties of using a graphical model for modeling [...]

Read the full article →

Merry Christmas and Happy Holidays!

December 11, 2010

Andy, our ARM (Autonomous Robot Manipulation) wishes you a merry christmas! Mihail Pivtoraiko, our motion planning for manipulation expert, demonstrates Andy’s current dexterity.

Read the full article →

Preprint: Stacked Hierarchical Labeling

July 5, 2010

Stacked Hierarchical Labeling Daniel Munoz, J. Andrew Bagnell, Martial Hebert To appear: ECCV 2010. Preprint (pdf) In this work we propose a hierarchical approach for labeling semantic objects and regions in scenes. Our approach is reminiscent of early vision literature in that we use a decomposition of the image in order to encode relational and [...]

Read the full article →

Congrats to Brian! Best Paper Runner up: Modeling Interaction via the Principle of Maximum Causal Entropy

June 20, 2010

Modeling Interaction via the Principle of Maximum Causal Entropy ICML runner up for best student paper by Brian Ziebart, J. Andrew Bagnell, and Anind Dey. @inproceedings{bziebart-maxcausalent, author = {Brian D. Ziebart and J. Andrew Bagnell and Anind K. Dey}, title = {Modeling Interaction via the Principle of Maximum Causal Entropy}, year = {2010}, booktitle = [...]

Read the full article →

Preprint: Reinforcement Planning: RL for Optimal Planners

April 20, 2010

Reinforcement Planning: RL for Optimal Planners Matt Zucker and J. Andrew Bagnell PDF Search based planners such as A* and Dijkstra’s algorithm are proven methods for guiding today’s robotic systems. Although such planners are typically based upon a coarse approximation of reality, they are nonetheless valuable due to their ability to reason about the future, [...]

Read the full article →