Strategies for Collaboration, Autonomy, Learning, and Exploration in Robotics Lab
Director: Dr. Rohan Paleja

Our lab advances machine learning and artificial intelligence to improve robot learning, human-robot interaction, and multi-agent coordination.

  • Interactive Robot Learning: Developing computational approaches to help humans teach new behaviors or correct existing ones.
  • Explainable Robotics: Imbibing robotic systems with decision-making capabilities that can be understood, traced, and trusted by humans.
  • Multi-Agent Coordination: Developing methods that enable teams of robots and humans to communicate and collaborate in complex environments.

Research Areas

Imitation Learning

Learning from demonstration and interactive robot learning frameworks.

Explainable AI

Interpretable AI, transparent policies, and rigorous system validation.

Multi-Agent RL

Heterogeneous multi-agent coordination, reinforcement learning, and communication.

Human-Machine Teaming

Algorithmic HRI, ad hoc teaming, and mutual understanding architectures.

Recent News

Publications

* denotes equal contribution. Blue - Conference. Orange - Journal. Pink - Workshop/Other.

Teaser for Beyond Partner Diversity: An Influence-Based Team Steering Framework for Zero-Shot Human-Machine Teaming
Beyond Partner Diversity: An Influence-Based Team Steering Framework for Zero-Shot Human-Machine Teaming

Wei Sheng , Rohan Paleja

arXiv 2026 arXiv CS.

While AI agents are rapidly advancing from isolated tools to interactive collaborators, data-driven human-machine teaming (HMT) methods remain costly in their reliance on human interaction data across domains, teammates, and team sizes. Zero-shot coordination (ZSC) addresses this bottleneck by simulating diverse partner populations to approximate how unseen partners might behave. However, partner coverage alone is insufficient as team settings scale and communication becomes degraded. To remedy this deficiency, we propose Influence-Based Team Steering (IBTS), a framework that uses influence shaping to incentivize agents to discover diverse, high-performing team interaction patterns and further steers ongoing trajectories toward stronger learned coordination modes. We assess IBTS on Overcooked-AI in both two-agent and three-agent settings, allowing us to test whether learned coordination structure transfers beyond dyadic interaction. Our evaluation includes simulated partners, synthetic partner-style variation, and, to our knowledge, the first 30-subject Overcooked-AI HMT study involving two real human teammates and one machine teammate. Across these evaluations, IBTS improves team performance against competing baselines, highlighting the need for scaled ZSC to combine sparse-reward coordination mechanisms with partner-variation coverage rather than relying on diversity alone.

Teaser for Event-Grounded Sparse Autoencoders for Vision-Language-Action Policies
Event-Grounded Sparse Autoencoders for Vision-Language-Action Policies

Xinchen Jin , Aditya Chatterjee , Pranav Kumar , Rohan Paleja

arXiv 2026 arXiv CS.

Vision-Language-Action (VLA) policies translate language and visual inputs into robot actions, where their hidden representations directly shape closed-loop behavior. However, mechanistic interpretability tools from language and vision-language models do not transfer cleanly to VLAs: outputs are robot actions rather than human-readable tokens, and interventions can only be tested via expensive closed-loop rollouts. We propose an event-grounded interpretability pipeline that anchors SAE feature analysis to behavioral events rather than text contexts. End-effector keyframes are clustered within each task using visual, state, and temporal cues, linking SAE features to behaviorally salient events and, via optional VLM annotations, to semantic context. To our knowledge, our pipeline is among the first to ground SAE-based VLA analysis in closed-loop behavioral events. Across two simulation architectures and a real-robot study, event-grounded ranking yields the strongest causal effects on OpenVLA and transfers to the continuous action chunks of $\pi_{0.5}$. SAE is a sparse but imperfect intervention basis: usability varies with architecture and intervention site, and aggressive intervention reveals safety and interpretability limits. Overall, event-grounded SAE analysis emerges as a practical starting point for behavior-anchored VLA interpretability, motivating future work on SAE features beyond action-aligned coordinates, finer-grained closed-loop evaluation, and safe interventions for high-stakes VLA deployments. Code is available at https://github.com/xc-j/Event-SAE.

Teaser for Differentiable Belief-Based Opponent Shaping
Differentiable Belief-Based Opponent Shaping

Aarav Sane , Karthik Sivachandran , Rohan Paleja

arXiv 2026 arXiv CS.

Human coordination often relies on the ability to influence the beliefs of others through strategic action. In multi-agent reinforcement learning, opponent shaping attempts to replicate this influence, though existing methods typically operate within an opponent’s parameter, policy, or value space. Meanwhile, belief-manipulation techniques in hidden-role games often rely on hard-coded objectives, such as deception or belief saturation. We propose Differentiable Belief-based Opponent Shaping (D-BOS), a first-order method that treats each observer’s belief as the shaped opponent state and differentiates through $k$-step softmax-Bayes belief dynamics. Rather than explicitly rewarding deceptive or cooperative behavior, our method treats the belief state as the target for shaping. This allows the optimal strategy to emerge naturally from the environment’s reward structure. This belief-space formulation provides an opponent-shaping signal by differentiating through opponent belief updates, and naturally extends to multiple observers by aggregating gradients over their individual inferred belief trajectories. Empirically, D-BOS outperforms PPO and BBM in hidden-role games, with the largest gains in mixed-motive settings.

Get In Touch

If you'd like to discuss research, collaborations, or opportunities, feel free to reach out!

Email: rpaleja@purdue.edu for all inquiries

You can also find me on:

Google Scholar GitHub LinkedIn Twitter