Replicating DDAT: Diffusion Policies for Robot Trajectories
An independent reproduction of DDAT in MuJoCo, evaluating the necessity of projection operators across four underactuated robotic systems.
Inspired by the ICON Lab’s research, I independently replicated the DDAT (Diffusion Dynamic Allocation Task) framework. This evaluation analyzes how projection operators enforce dynamical admissibility during open-loop execution in high-dimensional state-action spaces.
🦿 Agent 1: Hopper (12 states, 3 actions)
The projection layer is vital for the Hopper’s vertical stability. Without it, the agent fails to maintain its single-legged equilibrium.
🏃 Agent 2: Walker2d (18 states, 6 actions)
For the multi-link Walker2d, the DDAT framework prevents the “drifting” effect common in standard generative policies.
🐆 Agent 3: HalfCheetah (18 states, 6 actions)
The HalfCheetah agent demonstrates the scalability of DDAT to high-velocity movement patterns.
🚁 Agent 4: Quadcopter (17 states, 4 actions)
While MuJoCo rendering is omitted for trajectory focus, the quantitative analysis shows DDAT’s ability to maintain dynamically admissible flight paths during open-loop control.
🔍 Technical Insights
- Projection Mechanism: My results confirm that the differentiable projection operator is the cornerstone of DDAT, ensuring that diffusion-generated samples remain within the feasible physics manifold.
- Generative Training: I curated and trained diffusion-based datasets for each agent, experiencing the complexities of multi-modal task allocation.
- Stability Analysis: Comparative plots highlight that without projection, high-dimensional open-loop systems suffer from accumulated error and rapid loss of equilibrium.
⚠️ Note on Variations
Minor discrepancies in performance compared to the original DDAT paper may arise from differences in MuJoCo engine versions and subtle variations in hyper-parameter tuning during the independent replication process. Despite these minor deviations, the fundamental role of the projection operator in ensuring open-loop stability remains consistently verified across all tested agents.