Mar 13


	[CPL Seminar] [Schedule] [Jan 9] [Jan 16] [Jan 23] [Jan 30] [Feb 6] [Feb 20] [Feb 25] [Mar 7 Shum] [Mar 7 Szeliski] [Mar 13] [Mar 20] [Mar 27] [April 3] [April 10] [April 17] [April 24]

Harpreet Singh Sawhney & Rakesh (Teddy) Kumar
Sarnoff Corporation

2D & 3D Manipulation of Motion Imagery

The confluence of Video and Vision research at Sarnoff is strongly
influenced by real world applications. Over the past decade or so,
advances in fundamental algorithms, and successes in creating customer
oriented technologies and products have produced a unique R&D
environment at Sarnoff. This talk will present a tour of algorithm and
application development as it has evolved at Sarnoff in the recent past.
Specifically, we will present a framework for video representation
within which progressively complex video alignment models reveal the
underlying 2D and 3D nature of scenes and objects behind moving video
pixels. The framework will be highlighted through applications ranging
from tele-presence, scene modeling, geo-registration and targeting,
video manipulation and enhanced visualization of video.

Video alignment is a key tool in the representation and manipulation of
the fundamental information content present in motion video sequences.
Global 2D parametric alignment models reveal the 2D nature of video
frames from close-by vantage points. These can be used for stabilization
of video sequences and construction of video panoramas. Layered
representations of motion video in terms of moving object and multiple
scene layers can be exploited for tracking, compression and indexing.
Alignment of video frames with generic optical flow and 3D models
reveals the rigid and non-rigid structure of objects and scenes, and can
be used for video enhancement, 3D recovery, modeling and image based
rendering. Alignment of videos to stored reference models enables
augmented reality, video insertion, and targeting. The talk will span
these and other related applications.

Biography:

HARPREET SAWHNEY (Member IEEE) received his Ph.D. in Computer Science in
1992 from the University of Massachusetts, Amherst, focusing on Computer
Vision. He is a Senior Member, Technical Staff in the Vision
Technologies Lab., at the Sarnoff Corp. where he has led R&D in 3D
modeling & manipulation from motion video, video enhancement & indexing,
and video mosaicing, under a number of commercial and government
programs since 1995. He led R&D in video annotation & indexing at the
IBM Almaden Research Center from 1992 to 1995. Dr. Sawhney has authored
50 technical publications, holds 5 patents and has a number of patent
applications pending.

RAKESH KUMAR (Member IEEE) is currently the head of the Media Vision
Group at Sarnoff Corporation, Princeton, New Jersey. He received his
Ph.D. in Computer Science from the University of Massachusetts at
Amherst in 1992, his MS from the State University of New York, Buffalo
and his B.Tech. from the Indian Institute of Technology, Kanpur. At
Sarnoff, he has been directing commercial and government research and
development projects in computer vision with a focus in the areas of
immersive tele-presence and 3D modeling from images, image registration,
video manipulation and exploitation. He is an Associate Editor for the
IEEE Transactions on Pattern Analysis and Machine Intelligence. He is an
author/ co-author of more than 30 technical publications and is a
co-inventor of 7 patents.