Z-Brain

持续自主进化的物理智能体

z0

全球首个持续自主进化的物理智能体

性能领先全球新标杆

z0 产品在多个世界级权威评测集中均登顶领先,重新定义具身智能全球新标杆

具身导航图标

01 具身导航

Embodied Navigation

榜单荣誉:两大权威评测集 R2R、RxR 取得领先地位

核心突破

R2R-CE (unseen)

成功率SR(%)
60.19
z0
55.8
Monodream
54.0
NavILA
47.0
Uni-Navid
47.0
AO-Planner

RxR-CE (unseen)

成功率SR(%)
54.5
z0
49.4
Monodream
44.0
NavILA
40.9
Uni-Navid
40.5
NaVid
流式视频理解图标

02 流式视频理解

Streaming Video Understanding

榜单荣誉:两大流式基准 Streamingbench、OVO-Bench 取得领先地位

核心突破

Streamingbench(2 FPS)

准确率Accuracy(%)
38.0
z0
34.6
MMDuet-2
28.9
StreamAgent
25.3
Dispider
4.0
VideoLLM-Online

OVO-Bench

准确率Accuracy(%)
30.99
z0
20.51
MMDuet-2
13.95
StreamForest
6.93
VideoLLM Online
4.77
FVStream
智能体记忆图标

03 智能体记忆

Agent Memory

榜单荣誉:三大记忆力基准 Alfworld、Scienceworld、Embodied bench 取得领先地位

核心突破

AlfWorld

成功率SR(%)
82.16
z0
67.16
G-Mem
46.27
No Mem
42.54
MemGen
36.57
LangMem
12.69
Mem-O
11.19
A-Mem

EmbodiedBench

成功率SR(%)
55.67
z0
44
G-Mem
38.33
LangMem
38.33
Mem0
36.67
A-Mem
34.67
No Mem
25.67
MemGen

ScienceWorld

成功率SR(%)
46.67
z0
34.44
A-Mem
32.22
G-Mem
31.11
No Mem
27.78
Mem-0
24.65
MemGen
15.56
LangMem
具身操作图标

04 具身操作

Embodied Manipulation

榜单荣誉:操作类顶尖评测集 RoboCasa Composite Tasks 超过强基线

核心突破

RoboCasa Composite Tasks

平均分(avg)
提升
25.67z0
vs.
4.8official_report
5.3倍

相较于官方基线模型,性能暴涨近 5.3 倍

细分任务表现

z0official_report
PreStackPan
284
RestockPantry
39.336
MicrowaveThawing
9.332
ArrangeVegetables
46.6716
PrepareCoffee
50

基于这一全新范式,z0 在流式视觉理解、具身记忆、组合复杂操作、视觉导航等 8 项行业权威评测中,精度均领先主流方案。在持续交互中,成功率还可提升 20% 以上

评测集与文献脚注

论文成果:Technical Report (TBD)

连接未来,共同进化

发送您的需求

源自清华定义物理智能体未来

50+国际顶会论文
20+国家发明专利
7项牵头制定国家/团体标准
领先导航、操作、记忆8大榜单

最新论文成果

具身多模态感知与理解

具身多模态感知与理解
  1. StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition (ICCV 2025)
  2. Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment (SenSys 2025)
  3. Em-Garde: A Propose-Match Framework for Proactive Streaming Video Understanding (arXiv 2026)
  4. VoLLM: Smoothness-aware Serving of LLM-powered Voice Q&A via Adaptive Preemption (IPDPS 2026)
  5. Empower Vision Applications with LoRA LMM (VaLoRA) (EuroSys 2025)
  6. Region-based Content Enhancement for Efficient Video Analytics at the Edge (RegenHance) (NSDI 2025)
  7. BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge (INFOCOM 2024)

具身记忆与长程推理

具身记忆与长程推理
  1. Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management (ICLR 2026)
  2. SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs (NeurIPS 2025)
  3. SeerAttention-R: Sparse Attention Adaptation for Long Reasoning (ICLR 2026)
  4. Time's Up! An Empirical Study of LLM Reasoning Ability Under Output Length Constraint (EMNLP 2025)
  5. AVA: Towards Agentic Video Analytics with Vision Language Models (NSDI 2026)
  6. Video-in-the-Loop: Span-Grounded Long Video QA with Interleaved Reasoning (ICML 2026)
  7. OxyGen: Unified KV Cache Management for Vision-Language-Action Models under Multi-Task Parallelism (arXiv 2026)
  8. JENGA: Enhancing LLM Long-Context Fine-tuning with Contextual Token Sparsity (USENIX ATC 2025)

具身决策与自主规划

具身决策与自主规划
  1. AdaNav: Adaptive Reasoning with Uncertainty for Vision-Language Navigation (ICLR 2026)
  2. MetaNav: Stop Wandering: Efficient Vision-Language Navigation via Metacognitive Reasoning (arXiv 2026)
  3. V-Droid: Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment (MobiCom 2026)
  4. ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration (ICLR 2026)
  5. A3TP: Automated, Accurate, and Adaptive UAV Task Planning for Large-Scale Power Transmission Networks Inspection (MobiCom 2026)
  6. Auto-UIT: Automated UAV Inspection Trajectory Generation from Noisy Sparse 3D Point Cloud (MobiCom 2025)

数据合成与质量评估

数据合成与质量评估
  1. Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement (ACL 2025)
  2. SynDiSC: High-Quality Tabular Data Synthesis with Distributional and Semantic Consistency (SIGIR 2026)
  3. Seeing the Whole Through the Parts: Discovering Objects through Semantic Part Mining in Weak Supervision (SIGIR 2026)
  4. PriCAF: Privacy-Preserving Contribution Assessment in Federated Learning Before Model Training (ACM MM 2025)
  5. Training Data Attribution: Was Your Model Secretly Trained On Data Created By Mine? (KDD 2025)
  6. Multi-Label and Evolvable Dataset Preparation for Web-Based Object Detection (TKDD 2024)
  7. CoAst: Validation-Free Contribution Assessment for Federated Learning based on Cross-Round Valuation (ACM MM 2024)
  8. VIDAR: Data Quality Improvement for Monocular 3D Reconstruction through In-situ Visual Interaction (ICRA 2024)

高效推理与边缘部署

高效推理与边缘部署
  1. Scaling LLM Test-Time Compute with Mobile NPU on Smartphones (EuroSys 2026)
  2. T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge (EuroSys 2025)
  3. BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation (ACL 2024)
  4. BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache (HPCA 2026)
  5. Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference (ISCA 2024)
  6. Neuralink: Fast on-Device LLM Inference with Neuron Co-Activation Linking (ASPLOS 2025)
  7. H2O: Heterogeneity-Aware Hierarchical Orchestration for Memory-Efficient On-Device LLM Inference (TMC 2025)
  8. SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget (ACL 2024)
  9. Empowering In-Browser Deep Learning Inference on Edge Through Just-In-Time Kernel Optimization (MobiSys 2024)
  10. FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices (MobiCom 2024)
  11. Amanda: Unified Instrumentation Framework for Deep Neural Networks (ASPLOS 2024)
  12. LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference (ISCA 2025)

存算一体与新型计算范式

存算一体与新型计算范式
  1. Pim-dl: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization (ASPLOS 2024)
  2. VSPIM: SRAM Processing-in-Memory DNN Acceleration via Vector-Scalar Operations (IEEE ToC 2023)
  3. Bulk Bitwise Accumulation in Commercial DRAM (NeurIPS 2024 Workshop MLNCP Oral)
  4. PUDTune: Multi-Level Charging for High-Precision Calibration in Processing-Using-DRAM (IEEE CAL 2025)
  5. MVDRAM: Enabling GeMV Execution in Unmodified DRAM for Low-Bit LLM Acceleration (arXiv 2025)