Wei-Ta Chu, Chia-Hsiang Yu, and Hsin-Han Wang
Multimedia Computing Laboratory
Dept. of Computer Science and Information Engineering
National Chung Cheng University
1. Introduction
We propose a system to transform any temporal image sequence into a comics-based presentation, as an effective and interesting storytelling manner. Three main components, including page allocation, layout selection, and speech balloon placement, are respectively formulated as optimization problems, and systematic approaches are proposed to find solutions. Page allocation is viewed as a labeling problem, and the best solution is determined by the genetic algorithm. Importance values of images and predefined layouts are both represented in vector forms, and the best layout is selected by finding the best match between vectors. Feasible solutions of speech balloons constitute a solution space, and the best solution that jointly describes the best balloon locations of all balloons in a page is determined by the particle swarm optimization algorithm. Objective evaluation and subjective evaluation are designed from various perspectives to demonstrate effectiveness and superiority of the proposed system.
2. Demonstration
2.1 Sample Results
2.2 Share of Preprocessing Results
To ease other researchers' burden on data preprocessing to construct comics-based storytelling, we share preprocessing results of three copyright-free videos mentioned in the paper.
Popeye: Fright to the Finish (1954) | Popeye: Spree Lunch (1957) | The Boy in the Plastic Bubble (1976) |
@The shot change detection module was obtained from http://sourceforge.net/projects/shot-change/
& Regions of interest were obtained by the module provided by Ming-Ming Cheng http://mmcheng.net/code-data/
Last Updated: September 23, 2014