Enhancing Reference-based Sketch Colorization via Separating Reference Representations
*Dingkun Yan, *Xinrui Wang, Zhuoru Li, Suguru Saito, Yusuke Iwasawa, Yutaka Matsuo, Jiaxian Guo (* equal contribution)
arXiv 2025
Abstract: This work studies distribution shift in reference-based sketch colorization, where real references can be spatially or semantically misaligned with sketches. It separates reference representations into modular stages for embedding guidance, background detail transfer, and global style transfer, improving visual quality while reducing spatial artifacts.
Towards High-resolution and Disentangled Reference-based Sketch Colorization
*Dingkun Yan, *Xinrui Wang, *Ru Wang, Zhuoru Li, Jinze Yu, Yusuke Iwasawa, Yutaka Matsuo, Jiaxian Guo (* equal contribution)
CVPR 2026 Highlight
Abstract: The paper targets spatial entanglement in high-resolution sketch colorization. It introduces a dual-branch feature alignment framework with Gram regularization, plus anime-specific attribute control and texture-transfer modules, to improve resolution, controllability, and reference consistency.
One-shot Portrait Stylization via Geometric Alignment
Xinrui Wang, Zilin Guo, Zhuoru Li, Jinze Yu, Heng Zhang, Yusuke Iwasawa, Yutaka Matsuo, Jiaxian Guo
WACV 2026
Abstract: This work learns a portrait style from a single artistic reference. It combines geometric alignment, content and style LoRA optimization, orthogonal adaptation, and ControlNet guidance to produce high-quality stylized portraits with a smaller computation and parameter budget than common inversion or diffusion baselines.
Real-Time Data-efficient Portrait Stylization via Geometric Alignment
Xinrui Wang, Zhuoru Li, Xuanyu Yin, Xiao Zhou, Yusuke Iwasawa, Yutaka Matsuo, Jiaxian Guo
Neural Networks 2025
Abstract: The method builds geometric correlations between portrait photos and style samples using facial landmarks and differentiable thin-plate-spline alignment. Its lightweight GAN framework improves training data efficiency and computational cost, enabling real-time portrait stylization on mobile devices.
Image Referenced Sketch Colorization Based on Animation Creation Workflow
*Dingkun Yan, *Xinrui Wang, Zhuoru Li, Suguru Saito, Yusuke Iwasawa, Yutaka Matsuo, Jiaxian Guo (* equal contribution)
CVPR 2025
Abstract: Inspired by professional animation pipelines, this diffusion framework uses sketches for spatial guidance and RGB images for color reference. It separates foreground and background reference signals with spatial masks and split cross-attention LoRA modules to reduce artifacts from mismatched references.
Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum
Zeke Xie, Xinrui Wang, Huishuai Zhang, Issei Sato, Masashi Sugiyama
ICML 2022
Abstract: This paper analyzes why Adam converges quickly but often generalizes worse than SGD. It separates the roles of adaptive learning rate and momentum in saddle-point escape and flat-minima selection, then proposes Adaptive Inertia, a parameter-wise adaptive momentum framework with strong empirical generalization.
Generating Manga from Illustrations via Mimicking Manga Creation Workflow
Lvmin Zhang, Xinrui Wang, Qingnan Fan, Yi Ji, Chunping Liu
CVPR 2021
Abstract: This framework converts digital illustrations into manga by mimicking studio workflows: line drawings, regular screentones, and irregular screen textures. The generated layers can be composed into manga images and further edited, supported by a large artist-annotated dataset.
Learning to Cartoonize Using White-Box Cartoon Representations
Xinrui Wang, Jinze Yu
CVPR 2020
Abstract: The paper introduces a controllable image cartoonization framework by decomposing cartoons into surface, structure, and texture representations. Separate learning objectives for each representation make the model adjustable across cartoon styles and artist requirements.