This paper introduces WordArt Designer, a user-driven framework for artistic typography synthesis, relying on Large Language Models (LLM). The system incorporates four key modules: the LLM Engine, SemTypo, StyTypo, and TexTypo modules. 1) The LLM Engine, empowered by LLM (e.g. GPT-3.5-turbo), interprets user inputs and generates actionable prompts for the other modules, thereby transforming abstract concepts into tangible designs. 2) The SemTypo module optimizes font designs using semantic concepts, striking a balance between artistic transformation and readability. 3) Building on the semantic layout provided by the SemTypo module, the StyTypo module creates smooth, refined images. 4) The TexTypo module further enhances the design's aesthetics through texture rendering, enabling the generation of inventive textured fonts. Notably, WordArt Designer highlights the fusion of generative AI with artistic typography.

Method

Existing solutions mainly generate semantically coherent and visually pleasing typography within predefined concepts. These solutions often lack adaptability, creativity, and computational efficiency. To overcome these limitations, we introduce WordArt Designer, a system composed of three primary modules: the LLM Engine, SemTypo Module, and StyTypo Module, supplemented by the TexTypo Module for texture rendering.

Framework Overview

Method
This structure involves an LLM engine, the SemTypo module for Semantic Typography, the StyTypo module for Stylization Typography, and the TexTypo module for Texture Typography. These modules operate coherently, guided by a preset control flow, to facilitate a seamless and innovative transformation of text into artistic typography.

Method
The Semantic Typography (SemTypo) module alters typographies based on a given semantic concept. It unfolds in three stages: (1) Character Extraction and Parameterization, (2) Region Selection for Transformation, and (3) Semantic Transformation and Differentiable Rasterization.






Qualitative & Quantitative Results

We compare our method against 6 state of the art baselines qualitatively and quantitatively. Below are some examples of images generated from our method. For our full analysis please refer to our paper.

Method Method Method
Ablation study of the ranking model on the validation set. `p', `r', and `s' stand for precision, recall, and success rate, respectively. `x' in `TopX' indicates the number of stylized images retained. In the ranking-based method, `TopX' are selected based on ranking scores, while for the random-based method, `TopX' are selected randomly. Results of the random-based method are obtained by averaging over 10,000 iterations. Increased values are indicated in blue.

Method

BibTeX

@misc{he2023wordart,
      title = {WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models},
      author = {Jun-Yan He Zhi-Qi Cheng Chenyang Li Jingdong Sun Wangmeng Xiang Xianhui Lin Xiaoyang Kang Zengke Jin Yusen Hu Bin Luo Yifeng Geng Xuansong Xie Jingren Zhou},
      year={2023},
      eprint={2312.09252},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}