### [**Website**](https://ephemeral182.github.io/PosterCraft/) | [**Demo**](https://github.com/Ephemeral182/PosterCraft) | [**Paper**](https://arxiv.org/abs/2506.10741) | [**Models**](https://huggingface.co/PosterCraft) | [**Datasets**](https://huggingface.co/PosterCraft) | [**Video**](https://www.youtube.com/watch?v=92wMU4D7qx0) | [**HF Demo**](https://huggingface.co/spaces/Ephemeral182/PosterCraft)
| Method | Text Recall ↑ | Text F-score ↑ | Text Accuracy ↑ |
|---|---|---|---|
| OpenCOLE (Open) | 0.082 | 0.076 | 0.061 |
| Playground-v2.5 (Open) | 0.157 | 0.146 | 0.132 |
| SD3.5 (Open) | 0.565 | 0.542 | 0.497 |
| Flux1.dev (Open) | 0.723 | 0.707 | 0.667 |
| Ideogram-v2 (Closed) | 0.711 | 0.685 | 0.680 |
| BAGEL (Open) | 0.543 | 0.536 | 0.463 |
| Gemini2.0-Flash-Gen (Closed) | 0.798 | 0.786 | 0.746 |
| PosterCraft (ours) | 0.787 | 0.774 | 0.735 |
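
The table does not spell out how the text metrics are computed. As a rough illustration only, the sketch below assumes a word-level comparison between the text requested in the prompt and the words an OCR model reads back from the generated poster. The function name `text_scores` and the case-insensitive multiset matching are assumptions for this example, not the evaluation protocol used for the numbers above.

```python
from collections import Counter


def text_scores(target_words, ocr_words):
    """Word-level recall, precision and F-score (hypothetical helper).

    Assumption: words are matched case-insensitively as multisets, so a
    word requested twice must also be rendered twice to count fully.
    """
    target = Counter(w.lower() for w in target_words)
    found = Counter(w.lower() for w in ocr_words)
    matched = sum((target & found).values())  # multiset intersection size
    recall = matched / sum(target.values()) if target else 0.0
    precision = matched / sum(found.values()) if found else 0.0
    f_score = 2 * precision * recall / (precision + recall) if matched else 0.0
    return recall, precision, f_score


# Requested text vs. what OCR read from the poster ("Festval" is a misrender).
recall, precision, f_score = text_scores(
    ["summer", "music", "festival", "2025"],
    ["Summer", "Music", "Festval"],
)
print(f"recall={recall:.3f} precision={precision:.3f} f={f_score:.3f}")
```

In this toy case two of the four requested words are recovered, so recall is 0.5, while the misspelled word lowers precision to 2/3.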

Example posters (images not shown): Adventure Travel, Post-Apocalyptic, Sci-Fi Drama, Space Thriller, Cultural Event, Luxury Product, Concert Show, Children's Book, Movie Poster.

| Model | Stage | Description | Download |
|---|---|---|---|
| PosterCraft-v1_RL | Stage 3: Aesthetic-Text RL | Optimized via Aesthetic-Text Preference Optimization for higher-order aesthetic trade-offs. | HF |
| PosterCraft-v1_Reflect | Stage 4: Vision-Language Feedback | Iteratively refined using vision-language feedback for further harmony and content accuracy. | HF |


| Dataset | Size | Description | Download |
|---|---|---|---|
| Text-Render-2M | 2M samples | High-quality text rendering examples with multi-instance support | HF |
| HQ-Poster-100K | 100K samples | Curated high-quality posters with aesthetic evaluation | HF |
| Poster-Preference-100K | 100K images | Preference learning poster pairs for RL training | HF |
| Poster-Reflect-120K | 120K images | Vision-language feedback pairs for iterative refinement | HF |
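
A minimal sketch of fetching one of the datasets above with the `huggingface_hub` library. The organization name `PosterCraft` comes from the Models/Datasets links at the top of this README; the assumption that each dataset lives at `PosterCraft/<dataset name>` is not confirmed here, so check the Hub page before relying on it. The helpers `repo_id` and `download_dataset` are hypothetical names for this example.

```python
HF_ORG = "PosterCraft"  # organization from the Models/Datasets links above


def repo_id(name: str) -> str:
    """Build an assumed Hub repo id of the form '<org>/<name>'."""
    return f"{HF_ORG}/{name}"


def download_dataset(name: str, local_dir: str) -> str:
    """Download a dataset snapshot and return its local path.

    Requires `pip install huggingface_hub` and network access.
    """
    from huggingface_hub import snapshot_download  # imported lazily

    return snapshot_download(
        repo_id=repo_id(name),   # assumed repo id, see note above
        repo_type="dataset",
        local_dir=local_dir,
    )


print(repo_id("HQ-Poster-100K"))
# To actually download (large!):
# path = download_dataset("HQ-Poster-100K", "./data/HQ-Poster-100K")
```

The download call is left commented out because the datasets are large; for the model checkpoints, the same `snapshot_download` call would apply with `repo_type="model"`, again assuming the repo ids follow the names in the table.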