Event
Robbyant Demonstrates LingBot-VA's Video-Action Integration
Key points
- • Robbyant showcased LingBot-VA, an autoregressive diffusion framework that simultaneously learns frame prediction and policy execution.
- • The model features a shared latent space integrating vision and action tokens.
- • Evaluations demonstrated significant promise in long-horizon manipulation and strong generalizability to novel configurations.
Company context
Robotics unit of Ant Group focused on embodied AI systems and service robotics deployment.
Context
- Segment
- Humanoid
- Event type
- Demonstration
- Geography
- Shanghai · China