SenseTime's SenseNova-U1 Pro claims 8K native output, directly compares with GPT-Image-2 across 5 scenarios
English summary
SenseTime previewed its next-gen multimodal model SenseNova-U1 Pro, claiming native 8K resolution output versus GPT-Image-2's 4K. The model uses a unified 'Understand-Generate-Action' architecture targeting professional design workflows. Direct comparisons showed U1 Pro outperforming GPT-Image-2 in an infographic, a scroll painting layout, a magazine spread, an academic poster, and a high-resolution storyboard. The model also generated the entire 20+ page shareholder meeting presentation end-to-end. Invite testing is slated to begin in July 2026.
Chinese summary
商汤预览了下一代多模态模型SenseNova-U1 Pro,宣称支持原生8K分辨率输出(对比GPT-Image-2的原生4K)。该模型采用统一的“理解-生成-行动”架构,瞄准专业设计工作流。直接对比显示U1 Pro在信息图、长卷画布局、杂志跨页、学术海报和高分辨率分镜故事板共5个场景中优于GPT-Image-2。该模型还端到端生成了整场股东会20余页的PPT。邀约测试将于2026年7月启动。
Key points
Native 8K resolution output, double the claimed 4K native of GPT-Image-2.
原生8K分辨率输出,是GPT-Image-2声称的原生4K的两倍。
Unified 'Understand-Generate-Action' multimodal architecture for professional design tasks.
统一的“理解-生成-行动”多模态架构,用于专业设计任务。
Direct comparison across 5 scenarios: infographic, Chinese painting scroll, magazine spread, academic poster, and storyboard, with U1 Pro showing higher fidelity and detail retention.
在信息图、国画卷轴、杂志跨页、学术海报和故事板5个场景中直接对比,U1 Pro展现了更高的保真度和细节保留能力。
Generated the entire 20+ page shareholder meeting PPT deck end-to-end, including planning, layout, and number accuracy.
端到端生成了整场股东会20多页的PPT,包括策划、版式和数字准确性。
Invite testing begins July 2026.
2026年7月启动邀约测试。