overview

SkillBlender performs versatile autonomous humanoid loco-manipulation tasks within different embodiments and environments, given only one or two intuitive reward terms.

Summary Video

Framework: SkillBlender

Method Pipeline

framework

Overview of SkillBlender. We first pretrain goal-conditioned primitive expert skills that are task-agnostic, reusable, and physically interpretable, and then reuse and blend these skills to achieve complex whole-body loco-manipulation tasks given only one or two task-specific reward terms.

A Pretrain-then-Blend Paradigm


Low-Level Primitive Skills (on H1)


Walking
Reaching
Squatting
Stepping

High-Level Loco-Manipulation Tasks (on H1)

NOTE: All below high-level task rollouts are performed by our SkillBlender, which is fully autonomous, WITHOUT any hand-crafted trajectories or heuristic poses, and WITH only one or two reward terms for guidance.


FarReach
ButtonPress
CabinetClose
FootballShoot
BoxPush
PackageLift
BoxTransfer
PackageCarry

Benchmark: SkillBench

overview

Our SkillBench is a parallel, cross-embodiment, and diverse simulated benchmark containing three embodiments, four primitive skills, and eight loco-manipulation tasks.


Parallel Simulation


Cross-Embodiment


Diverse Tasks


Experiment Results

Qualitative Comparison

Choose task:



SkillBlender (Ours)
PPO
HumanoidBench
Sequential HRL
MCP
HumanPlus

Qualitative comparison between different methods. Our SkillBlender not only achieves higher task accuracy, but also avoids reward hacking and yields more natural and feasible movements.

Skill Blending Decomposition


Visualization of whole-body per-joint weights at different task stages. More blue means more Reaching, and more green means more Walking. This visualization highlights the spatial-temporal decomposition of our skill blending, where the two skills interleave rather than one skill dominating the overall motion.

Primitive Skill Deployment


Reaching
Squatting

We also provide a sim2real toolkit support that deploys our simulation-trained primitive skill policies in the real world using a Unitree H1 humanoid robot.

Try out SkillBlender and SkillBench!

We open-sourced our SkillBlender framework and SkillBench benchmark. Feel free download the code and try it out yourself! We look forward to more and more open-source humanoid research!

Our Team



1University of Southern California, 2Stanford University,
3Peking University, 4University of California, Berkeley
* Equal contributions

BibTex

If you have any questions, please contact Yuxuan Kuang.