Skip to main content Skip to navigation

SakugaFlow: A Stagewise Illustration Framework Emulating the Human Drawing Process and Providing Interactive Tutoring for Novice Drawing Skills

Project Overview

The document explores the application of generative AI in education, particularly through the example of SakugaFlow, a tool aimed at helping novice artists develop illustration skills. By showcasing the drawing process in four distinct stages—rough sketch, line art, coloring, and final finishing—SakugaFlow facilitates a structured learning experience. An integrated educational dialogue agent, powered by a Large Language Model (LLM), enhances this process by offering real-time feedback on artistic principles, thereby transforming generative AI from a simple image generator into an interactive educational partner. This approach not only fosters skill development through iterative practice but also emphasizes the potential of generative AI to support personalized learning experiences in creative fields. The findings suggest that such tools can significantly enhance student engagement and proficiency in artistic skills, highlighting the broader impact of generative AI in reshaping educational methodologies across various domains.

Key Applications

SakugaFlow

Context: Educational tool for novice artists learning illustration techniques.

Implementation: The tool guides users through a four-phase workflow that includes rough sketching, line art refinement, coloring, and final touches, providing real-time feedback via an LLM-based tutor.

Outcomes: Users can develop foundational skills in illustration, receive targeted feedback on their work, and understand the drawing process in a structured manner.

Challenges: The backend diffusion model remains optimized for final outputs, which may limit interpretability of intermediate states.

Implementation Barriers

Technical Barrier

The backend diffusion model is primarily designed for generating final images, which restricts the visibility of intermediate steps in the drawing process.

Proposed Solutions: Future enhancements may involve constructing more granular representations and training the model explicitly on sequential refinements.

Project Team

Kazuki Kawamura

Researcher

Jun Rekimoto

Researcher

Contact Information

For information about the paper, please contact the authors.

Authors: Kazuki Kawamura, Jun Rekimoto

Source Publication: View Original PaperLink opens in a new window

Project Contact: Dr. Jianhua Yang

LLM Model Version: gpt-4o-mini-2024-07-18

Analysis Provider: Openai

Let us know you agree to cookies