CAPTAIN: Comprehensive Composition Assistance for Photo Taking
Farshid Farhat, Mohammad M. Kamani, James Z. Wang
The Pennsylvania State University, USA
Abstract:
Many people are interested in taking astonishing photos and sharing
them with others. Emerging hightech hardware and software facilitate
the ubiquitousness and functionality of digital photography. Because
composition matters in photography, researchers have leveraged some
common composition techniques, such as the rule of thirds and the
perspective-related techniques, in providing photo-taking
assistance. However, composition techniques developed by professionals
are far more diverse than well-documented techniques can cover. We
present a new approach to leverage the underexplored photography
ideas, which are virtually unlimited, diverse, and correlated. We
propose a comprehensive fork-join framework, named CAPTAIN
(Composition Assistance for Photo Taking), to guide a photographer
with a variety of photography ideas. The framework consists of a few
components: integrated object detection, photo genre classification,
artistic pose clustering, and personalized aesthetics-aware image
retrieval. CAPTAIN is backed by a large managed dataset crawled from a
Website with ideas from photography enthusiasts and professionals. The
work proposes steps to decompose a given amateurish shot into
composition ingredients and compose them to bring the photographer a
list of useful and related ideas. The work addresses personal
preferences for composition by presenting a user-specified preference
list of photography ideas. We have conducted many experiments on the
newly proposed components and reported findings. A user study
demonstrates that the work is useful to those taking photos.
Full color PDF file (10MB)
Citation:
Farshid Farhat, Mohammad M. Kamani and James Z. Wang, ``CAPTAIN:
Comprehensive Composition Assistance for Photo Taking,'' ACM
Transactions on Multimedia Computing, Communications and Applications,
vol. 18, no. 1, article 14, pp.14:1-24, 2022.
Copyright 2022 ACM. Personal use of this material is
permitted. However, permission to reprint/republish this material for
advertising or promotional purposes or for creating new collective
works for resale or redistribution to servers or lists, or to reuse
any copyrighted component of this work in other works, must be
obtained from ACM.
Last Modified:
January 28, 2022
© 2022