MVP Lab Mobile vision perception research lab
Home Posts Join About GitHub

Video Generation Research

RAVEN: real-time autoregressive video extrapolation

RAVEN is a training-time test framework that repacks self rollouts into interleaved clean …

Read more

Infrastructure

MVP Engine: agentic training recipes for multimodal models

MVP Engine is a lightweight multimodal training framework that keeps orchestration small, moves …

Read more

Multimodal Research

LLaVA-OneVision-2: open long-video multimodal training

LLaVA-OneVision-2 extends fully open multimodal training toward long-video understanding, using …

Read more

Multimodal Research

LLaVA-OneVision-1.5: reproducible multimodal training at scale

LLaVA-OneVision-1.5 turns multimodal model release into a reproducible training system, opening …

Read more
MVP Lab Mobile vision perception research lab

Researching AI that can see it, say it, and sort it. From perception to understanding to action.

MVP Lab

Posts About

Connect

GitHub Email

© 2026 MVP Lab

Built with Hugo