[Feature] DeepGen 1.0

### Feature Summary

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing.

### Detailed Description

https://huggingface.co/deepgenteam/DeepGen-1.0
https://github.com/deepgenteam/deepgen

<img width="2426" height="1211" alt="Image" src="https://github.com/user-attachments/assets/3f244c44-6f6f-4e8c-a3cd-b7a1dc0c21fa" />

<img width="2568" height="1485" alt="Image" src="https://github.com/user-attachments/assets/4af87d93-77c0-4836-b724-4831915461af" />

DeepGen 1.0 is a lightweight unified multimodal model with only 5B parameters (3B VLM + 2B DiT). It integrates five core capabilities—general image generation, general image editing, reasoning image generation, reasoning image editing, and text rendering—within a single model. Across multiple authoritative benchmarks, DeepGen 1.0 is competitive with competitive with or surpassing the state-of-the-art unified multimodal models that are 3× to 16× larger, achieving comprehensive performance, demonstrating that massive scaling is not the sole path to high-performance multimodal generation.

### Alternatives you considered

_No response_

### Additional context

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] DeepGen 1.0 #1286

Feature Summary

Detailed Description

Alternatives you considered

Additional context

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

[Feature] DeepGen 1.0 #1286

Description

Feature Summary

Detailed Description

Alternatives you considered

Additional context

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions