"매 robot 의 universal foundation model — 매 ChatGPT moment for embodied AI". 매 Physical Intelligence (PI, 2024 launch)는 π0 — 매 vision-language-action (VLA) foundation model 의 출시한 startup. 매 single weights 로 매 다양한 robot 매 dishwashing, laundry folding, table bussing 의 수행.
매 핵심
매 회사 + 모델
매 회사: Physical Intelligence (Carolina Parada, Sergey Levine, Chelsea Finn 등 — 매 Google Brain/Stanford alumni). 2024 founded, $400M+ raised, $2.4B valuation.
매 π0 (pi-zero, 2024-10): 매 first VLA foundation model. PaliGemma (3B VLM) backbone + 매 action expert (300M params, flow matching for continuous actions). 매 50Hz control.
매 π0.5 (2025): open-world generalization, hierarchical planning, longer-horizon tasks.
# π0 outputs 50-step action chunks at 50Hz# Execute first k steps, predict again — temporal ensemble for smoothnessACTION_HORIZON=50EXECUTE_STEPS=8action_buffer=collections.deque(maxlen=ACTION_HORIZON)fortinrange(MAX_STEPS):ift%EXECUTE_STEPS==0:chunk=policy(obs)# predict 50-step chunkaction_buffer.extend(chunk[0])action=action_buffer.popleft()obs=robot.step(action)
fromlerobot.common.datasets.lerobot_datasetimportLeRobotDatasetds=LeRobotDataset("lerobot/aloha_static_fork_pick_up")sample=ds[0]# {'observation.images.top': tensor, 'observation.state': tensor,# 'action': tensor, 'task': 'pick up the fork'}
Cross-embodiment fine-tune
# Fine-tune π0 on a new robot (e.g. custom 6-DoF arm)config=PI0Config(action_dim=6,state_dim=6)# adjust dimspolicy=PI0Policy(config)policy.load_pretrained_vlm("lerobot/pi0")# load PaliGemma + freeze# Train action head only on small (~1000 episode) custom dataset
Language-conditioned task switch
# Same weights, different language prompts → different behaviorsfortaskin["fold the shirt","pick up the cup","wipe the table"]:obs["task"]=[task]action=policy(obs)execute(action)
매 결정 기준
상황
Approach
매 single-task robot, abundant data
매 task-specific BC/RL
매 multi-task, language-conditioned
매 π0 fine-tune
매 zero-shot new task
매 π0.5 (open-world)
매 humanoid full-body
매 π0 + whole-body controller
매 high-frequency control (>100Hz)
매 distill π0 → smaller policy
기본값: 매 cross-embodiment manipulation 의 π0 fine-tune (lerobot 사용).
언제: 매 multi-task robot manipulation, language-conditioned policy, cross-embodiment transfer 의 사용.
언제 X: 매 simple pick-and-place (overkill), 매 sub-50Hz needed (latency), 매 contact-rich precision tasks (still ongoing research).
❌ 안티패턴
매 raw pretrained π0 deploy: 매 fine-tune 없이 — 매 robot/scene mismatch 의 fail.
매 ignore action chunking: 매 single-step prediction → 매 jittery motion.
매 mismatched camera intrinsics: 매 training cam 매 deploy cam 의 different → 매 OOD failure.
🧪 검증 / 중복
Verified (Physical Intelligence official, π0 paper 2024-10, lerobot integration).
신뢰도 A.
🕓 Changelog
날짜
변경
2026-05-08
Phase 1
2026-05-10
Manual cleanup — π0/π0.5 VLA foundation model + lerobot patterns