Update README.md
Browse files
README.md
CHANGED
@@ -21,7 +21,7 @@ The abstract of the paper is the following:
|
|
21 |
We introduce LeX-Art, a comprehensive suite for high-quality text-image synthesis that systematically bridges the gap between prompt expressiveness and text rendering fidelity. Our approach follows a data-centric paradigm, constructing a high-quality data synthesis pipeline based on Deepseek-R1 to curate LeX-10K, a dataset of 10K high-resolution, aesthetically refined 1024$\times$1024 images. Beyond dataset construction, we develop LeX-Enhancer, a robust prompt enrichment model, and train two text-to-image models, LeX-FLUX and LeX-Lumina, achieving state-of-the-art text rendering performance. To systematically evaluate visual text generation, we introduce LeX-Bench, a benchmark that assesses fidelity, aesthetics, and alignment, complemented by Pairwise Normalized Edit Distance (PNED), a novel metric for robust text accuracy evaluation. Experiments demonstrate significant improvements, with LeX-Lumina achieving a 22.16\% PNED gain, and LeX-FLUX outperforming baselines in color (+10.32\%), positional (+5.60\%), and font accuracy (+5.63\%). The codes, models, datasets, and demo are publicly available.
|
22 |

|
23 |
|
24 |
-
**Usage of LeX-
|
25 |
```python
|
26 |
import torch
|
27 |
from diffusers import FluxPipeline
|
|
|
21 |
We introduce LeX-Art, a comprehensive suite for high-quality text-image synthesis that systematically bridges the gap between prompt expressiveness and text rendering fidelity. Our approach follows a data-centric paradigm, constructing a high-quality data synthesis pipeline based on Deepseek-R1 to curate LeX-10K, a dataset of 10K high-resolution, aesthetically refined 1024$\times$1024 images. Beyond dataset construction, we develop LeX-Enhancer, a robust prompt enrichment model, and train two text-to-image models, LeX-FLUX and LeX-Lumina, achieving state-of-the-art text rendering performance. To systematically evaluate visual text generation, we introduce LeX-Bench, a benchmark that assesses fidelity, aesthetics, and alignment, complemented by Pairwise Normalized Edit Distance (PNED), a novel metric for robust text accuracy evaluation. Experiments demonstrate significant improvements, with LeX-Lumina achieving a 22.16\% PNED gain, and LeX-FLUX outperforming baselines in color (+10.32\%), positional (+5.60\%), and font accuracy (+5.63\%). The codes, models, datasets, and demo are publicly available.
|
22 |

|
23 |
|
24 |
+
**Usage of LeX-FLUX:**
|
25 |
```python
|
26 |
import torch
|
27 |
from diffusers import FluxPipeline
|