CS 545I - Advanced Image Databases
Winter quarter, 1999
Fridays, 3:15 - 4:30
Gates building, Room 100
Oscar Firschein (oscar@infolab.stanford.edu) use for queries,
sponsored by Gio Wiederhold
(gio@cs.stanford.edu).
Some References;
see new URLs.
The first four seminars deal with the basics of image characterization;
the next four presentations describe image retrieval systems. In the final
seminar, registered students will present 15-20 minute reviews of papers
in the literature dealing with image security.
Imaging Informatics, MIS 215, meets Tuesday and Thursday 2:30 to 3:45
PM from Jan 5 to March 11, 1999 in the Medical School Office Building (MSOB)
Room 275 (Conference Room). The class deals with various aspects of medical
imaging and image analysis. (Contact dev@summit.stanford.edu)
An image matching system by James Ze Wang is based on a wavelet
approach
Planned schedule for CS545I (click for class summaries)
Material from prior
years including CS545 spring/fall seminars
BASIC PRESENTATIONS
-
Friday January 8 Introduction; Oscar Firschein/
Human perception: Physics of vision, Psychology of vision, Dr. Martin A.
Fischler, AI Center, SRI International, (fischler@ai.sri.com).
-
Friday January 15: Representation of Images and Video
: Dr. Parvati Dev, Stanford Medical School. (dev@summit.stanford.edu),
Medical images (X-ray, CatScan, NMR, visible human), Image standards (JPEG,
etc.)
-
Friday January 29: Indexing of many-attribute descriptors;
Prof. Carlo Tomasi, Computer Science Department, Stanford University, (tomasi@robotics.stanford.edu).
Range searches -- nearest neighbor search, weights to match customer needs.
PRESENTATIONS OF IMAGE RETRIEVAL SYSTEMS
-
Friday February 5: IBM Image and Video Retrieval;
Dr. Dragutin Petkovic, IBM Almaden Reasearch Labs, (petkovic@almaden.ibm.com).
Recent Advances in Automated Video Indexing: IBM Almaden's CueVideo Project
-
Friday February 12: NEC AMORE Web search engine;
Dr.
Kyoji Hirata, NEC (hirata@ccrl.sj.nec.com). NEC image retrieval research,
including work with the Getty Museum.
-
Friday February 19: Virage; Dr. Bradley Horowitz,
Virage (bradley@virage.com). The Virage Video Cataloger system.
-
Friday March 5: Student Presentations; Watermarking
of image documents, automatic removal of identifying alphanumerics from
images; Screening to eliminate Web sites having objectionable images.
Today or soon.
Seminar abstract
CS545I is a reading and projects seminar devoted to advanced image and
video databases, a topic of increasing interest due to large image and
video databases soon to be available on the Web. The emphasis will be on
combining image-derived descriptors and text descriptors to retrieve local
and online images. There are many challenges in storing high-dimensional
feature vectors for fast retrieval, and developing metrics of closeness
between query and stored vectors. Applications including art collections,
sales catalogs, aerial photographs, and medical images will be surveyed
to illustrate the strengths or weaknesses of specific techniques. Retrieval
from video databases is also an active area of research. We will be interested
in how to automatically construct an inverse story-board (a visual summary)
from video, and how to use the audio track to extract frames of interest.
Credit for the seminar only and a presentation will be 1 unit. Additional
units (3 normally) under CS-395 will be available for students who wish
to execute a project of their own choosing this or successor quarters.
Check with Prof. Wiederhold for acceptable projects.
Students should pick out a presentation paper by the THIRD meeting of
the quarter. Papers can be of your own choosing with approval of instructors,
or from the following list (** denotes available from Oscar Firschein):
As of 1/11/99 the first three papers have NOT been taken.
-
Tutorial on wavelets (use any book or paper that you want)
-
"Browsing through High Quality Document Images with DjVu" **
-
"Comparison of video shot boundary detection techniques" **
-
"Safeguarding Digital Library Contents and Users: Digital Watermarking;
" Digital Library Magazine, Dec. 1997 www.dlib.org
-
"A Textual Information Detection and Elimination System for Secure Medical
Image Distribution," James Ze Wang, Michel Bilello, and Gio Wiederhold
**
-
"System for Screening Objectionable Images Using Daubechies' Wavelets and
Color Histograms," James Ze Wang, Gio Wiederhold, and Oscar Firschein.
**
Prerequisites
Students should have a background in database concepts of at least CS145.
Coordinator:
Oscar Firschein (oscar@infolab.stanford.edu) recently retired from the Artificial
Intelligence Center, SRI International and a four-year assignment to ARPA,
where he was Program Manager for Image Understanding (IU). He has performed
research and supervised many advanced projects in image processing.