One hundred billion neurons all firing in parallel, turning unstructured data streams into highly abstract ideas that produce useful thoughts and actions. The complexity that enables this has always piqued my curiosity. As an electrical engineering student at Rice University, I have a natural affinity for digital signal processing and machine learning. Thanks to the support of the CCD and the Bybee Scholarship, I was able to pursue both of my interests this summer at Stanford NeuroAILab.
The NeuroAILab works at the intersection of artificial intelligence and neuroscience, answering questions like "What role does recursion play in visual processing and how can we best replicate it with a neural network?" or "How do biological and artificial agents use curiosity as intrinsic motivation to explore their environments?"
The project that I worked on addressed how brains can effortlessly switch between tasks and use knowledge from learned tasks to adapt to and solve new ones. I worked with Dan Yamins and Kevin Feigelis on building reinforcement learning algorithms for task switching in a 2D environment. Before I arrived, the lab had already published a paper detailing their modular approach to task switching in this environment. The paper can be found here. The schematic below details the process used for single-module learning and flexible adaptation.
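To make the modular idea concrete, here is a minimal sketch of that style of architecture: a shared visual backbone feeds lightweight per-task modules, and only the module for the active task is trained. This is my own PyTorch illustration with invented names, not the lab's actual code.

```python
import torch
import torch.nn as nn

class ModularController(nn.Module):
    """Toy sketch of modular task switching: a frozen shared backbone
    feeds small per-task modules, and only the active task's module
    gets gradient updates. Names are illustrative, not the lab's API."""

    def __init__(self, backbone: nn.Module, feature_dim: int, num_actions: int):
        super().__init__()
        self.backbone = backbone
        for p in self.backbone.parameters():  # reuse features across tasks
            p.requires_grad = False
        self.task_modules = nn.ModuleDict()
        self.feature_dim = feature_dim
        self.num_actions = num_actions

    def module_for(self, task_id: str) -> nn.Module:
        # Spin up a fresh module the first time a task is seen.
        if task_id not in self.task_modules:
            self.task_modules[task_id] = nn.Sequential(
                nn.Linear(self.feature_dim, 128), nn.ReLU(),
                nn.Linear(128, self.num_actions),
            )
        return self.task_modules[task_id]

    def forward(self, obs: torch.Tensor, task_id: str) -> torch.Tensor:
        features = self.backbone(obs)
        return self.module_for(task_id)(features)
```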
Within this framework, I had several goals for the summer. My first goal was to improve the training speed of the modules within the module controller using common reinforcement learning techniques such as synchronous advantage actor-critic (A2C), deep deterministic policy gradient (DDPG), and proximal policy optimization (PPO). My second goal was to improve the trainability of the system by finding, across several different convolutional neural network architectures, the optimal layer to fine-tune from. Finally, I wanted to use the knowledge gained from working in a 2D environment to successfully train agents on several interesting tasks in a 3D environment.
With the help of deep reinforcement learning courses by David Silver and Sergey Levine, I was able to catch up on the last few years of reinforcement learning literature. Using that knowledge, I created implementations of DDPG, A2C, and PPO that successfully and quickly learned many of the 2D tasks discussed in the paper. The plot below shows learning curves for three different tasks, detailing validation accuracy over trials as the agent learns to reach the goal.
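To give a flavor of what these implementations reduce to, here is a simplified version of the A2C objective for one batch of rollout data. The function and tensor names are my own shorthand, and details like generalized advantage estimation are omitted.

```python
import torch
import torch.nn.functional as F

def a2c_loss(policy_logits, values, actions, returns,
             value_coef=0.5, entropy_coef=0.01):
    """Simplified A2C objective for one rollout batch.

    policy_logits: (T, num_actions) action logits from the actor head
    values:        (T,) state-value estimates from the critic head
    actions:       (T,) actions actually taken
    returns:       (T,) discounted (or bootstrapped) return targets
    """
    advantages = returns - values.detach()        # how much better than expected
    log_probs = F.log_softmax(policy_logits, dim=-1)
    chosen = log_probs.gather(1, actions.unsqueeze(1)).squeeze(1)

    policy_loss = -(chosen * advantages).mean()   # push up good actions
    value_loss = F.mse_loss(values, returns)      # fit the critic to returns
    entropy = -(log_probs * log_probs.exp()).sum(dim=-1).mean()

    # Entropy bonus keeps the policy exploring instead of collapsing early.
    return policy_loss + value_coef * value_loss - entropy_coef * entropy
```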
DDPG learns much faster than the other algorithms because its replay buffer lets it make many parameter updates at every trial, whereas the on-policy methods must collect a batch of time steps before each parameter update. Below is a figure that I made to help myself and others understand the DDPG and A2C algorithms. Despite its slower training, A2C's much lower complexity made it the better choice for testing the modular task-switching method in 3D.
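The sample-efficiency gap comes down to where the gradient data comes from. The sketch below contrasts the two update loops; the `env`, `actor`, and `update_fn` interfaces are placeholders I made up for illustration, not our actual training code.

```python
import random
from collections import deque

# Off-policy (DDPG): every environment step can trigger a gradient
# update drawn from a replay buffer of past transitions.
replay_buffer = deque(maxlen=100_000)

def ddpg_step(env, state, actor, update_fn, batch_size=64):
    action = actor(state)
    next_state, reward, done = env.step(action)
    replay_buffer.append((state, action, reward, next_state, done))
    if len(replay_buffer) >= batch_size:
        # Old transitions get reused many times across updates.
        update_fn(random.sample(replay_buffer, batch_size))
    return next_state, done

# On-policy (A2C/PPO): updates wait until a fresh batch of time steps
# has been collected with the *current* policy, and that data is
# thrown away after the update.
def a2c_rollout(env, state, actor, update_fn, rollout_length=32):
    trajectory = []
    for _ in range(rollout_length):
        action = actor(state)
        next_state, reward, done = env.step(action)
        trajectory.append((state, action, reward, done))
        state = next_state
    update_fn(trajectory)  # one update per rollout, each sample used once
    return state
```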
During my experiments with actor-critic algorithms, I was also running a search over backbone architectures and layers to find the optimal way to fine-tune off the convnet in the visual backbone. I searched over the top several fully connected layers and convolutional layers with different poolings, as well as combinations of those layers and poolings. The metric for success was the area under the learning curve across several different tasks. Though the backbone search is still running, here is a visualization of my intermediate results. It is apparent that skip connections are useful for combining the high spatial resolution of the convolutional layers with the high semantic content of the fully connected layers.
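For a sense of what such a combined readout looks like, here is a sketch that concatenates a pooled convolutional feature map with a late fully connected layer. I use torchvision's AlexNet purely as a stand-in; it is not necessarily the backbone from the actual search.

```python
import torch
import torch.nn as nn
from torchvision import models

class SkipFeatures(nn.Module):
    """Combine a conv layer (high spatial resolution) with a late fc
    layer (high semantic content) as the fine-tuning input. AlexNet
    and the layer choices here are illustrative assumptions."""

    def __init__(self):
        super().__init__()
        alexnet = models.alexnet(pretrained=True)
        self.conv = alexnet.features            # convolutional stack
        self.pool = nn.AdaptiveAvgPool2d(2)     # coarse spatial pooling
        self.flatten = nn.Flatten()
        self.fc = nn.Sequential(alexnet.avgpool, nn.Flatten(),
                                *list(alexnet.classifier[:5]))  # up to fc7

    def forward(self, x):
        conv_out = self.conv(x)
        conv_feat = self.flatten(self.pool(conv_out))  # spatial skip path
        fc_feat = self.fc(conv_out)                    # semantic path
        return torch.cat([conv_feat, fc_feat], dim=1)  # readout for modules
```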
After finishing the first two goals, I moved on to interesting 3D tasks in an environment that we call 3D World. In 3D World, there were several new difficulties. The first was the length of the planning horizon: in the 2D world, the tasks never needed more than a few frames of planning, while in 3D World the planning horizon extended to hundreds of frames. The second was the massive increase in the dimension of the action space: instead of two degrees of freedom for the x and y axes, there were now 20 continuous actions covering all of the arm's joints plus navigation. After extensive tinkering, we were able to solve these problems by adjusting hyperparameters in the actor-critic algorithms.
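The settings below illustrate the kinds of knobs that mattered in that tinkering; the specific values are representative examples rather than our exact configuration.

```python
# Illustrative hyperparameters (not our exact values) for moving an
# actor-critic agent from the 2D environment to 3D World.
a2c_config = {
    # Longer planning horizon: push the discount factor toward 1 so
    # rewards hundreds of frames away still influence early actions.
    "gamma": 0.999,            # a few-frame 2D task can get away with ~0.9
    "rollout_length": 256,     # collect long rollouts before each update

    # Larger action space: 20 continuous joint/navigation actions
    # instead of 2 degrees of freedom, so a Gaussian policy head with
    # a tuned initial standard deviation keeps exploration sane.
    "action_dim": 20,
    "init_action_std": 0.5,

    # Stability on the harder tasks.
    "learning_rate": 1e-4,
    "entropy_coef": 0.005,
    "max_grad_norm": 0.5,
}
```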
Below are videos of the trained agent performing tasks in 3D World. The first two videos show the A2C and DDPG agents attempting a task where a lamp is placed at a random point on a circle around the agent, which must navigate to the lamp using only its vision. The second task is similar, but the lamps are scattered throughout the room and the agent has to avoid lamps of one color while seeking those of another. In the final task, the agent has to use all of its arm joints to touch as close as possible to the center of a box. The plots on the side show the estimated value of the current visual input (top) and the actions being taken to maximize reward (bottom).
This past summer was an amazing learning experience that not only drastically expanded my knowledge of deep learning and what it can be used for, but also deepened my interest in neuroscience. I was able to attend several presentations from distinguished members of both communities and left excited and ready for what lies ahead. Thank you to everyone who made this summer a reality.