Deco
max planck institut
informatik
mpii logoMinerva of the Max Planck Society
 

Computer Vision for Computer Graphics

Seminar – Summer Semester 2011

Christian Theobalt

Co-organizers: Kwang In Kim, Miguel Granados, Kiran Varanasi

Images courtesy of respective authors.

Contact

In case you have questions about this seminar, please contact Kwang In Kim - kkim (at) mpi-inf.mpg.de .

Schedule

Date What
15.4.2011 First meeting of the seminar (Slides).
29.4.2011 Class room lecture: How to give a good talk (Slides)
6.5.2011 or 13.5.2011 First seminar talk by participant, to be discussed

Overview

The field of Computer Vision strives to develop algorithms for understanding, interpreting and reconstructing information of real world scenes from image and video data. On the other hand, the focus of Computer Graphics has been on the image syntehsis side, i.e., the development of algorithms to build and edit static and dynamic vrtual worlds and to display them in photorealistic or stylized ways.

In recent years, it could be observed that the fields have more and more converged since both sides started to investigate research questions that are of relevance for either side. As an example, Computer Graphics researchers have started to investigate algorithms to reconstruct detailed models of static and dynamic scenes from image data, such that more believable virtual renderings of real world scenes could be achieved. In a similar manner, Compter Vision research has been able to benefit from the experience of Computer Graphics research to efficiently and effectlively model image fomration and light transport, which also simplifies scene analysis and reconstruction tasks. Recently, the ever increasing amount of image and video data has further strengthened the links between the two fields, as both communities start to investigate new ways to analyze, structure and immersively display these data.

This seminar covers advanced research topics that lie on the boundary betweem the fields of Computer Vision and Computer Graphics. This includes classic and recent research results that were published at the top tier journals and conferences in either field. More specifically, the seminar will cover research papers from the following problem fields: Motion estimation and tracking, multi-view geometry and reconstruction, computational photography and videography, reonstruction of static and dynamic 3D scenes, advanced image and video processing, etc.

The target audience are graduate students in computer science or related fields. Basic knowledge in at least one of the fields of 3D geometry, computer vision, image processing, and computer graphics is required. Every participant will give a talk on a chosen scientific topic. Afterwards, the topic will be discussed within the seminar group. This is a great opportunity for students to improve their research and presentation skills and to learn about the latest developments in computer vision and computer graphics. The seminar language is English.

Seminar Structure

Presentation

Every participant will perform a detailed reading on one topic and present the main ideas in class in an approximately 45 min presentation.

Working on one topic usually requires reading and understanding of the papers to be presented as well as acquiring background information necessary to understand the topic. For background knowledge usually some of the papers/books referenced in the paper will have to be consulted.

The presentation should cover the main ideas of the chosen topic. Usually, this means that content has to be selected from the paper; it will often not be possible to present all results of the two articles. Try to focus on the topic instead. Find a common thread linking the two papers and build your presentation on that. If you use formulas in your presentation make sure that all symbols are introduced properly. Similarly, make sure that figures are labeled correctly and that all terminology that is new to the topic is introduced appropriately. You can of course assume that your audience is familiar with the topics that have already been presented in class. It is very important for a good presentation to find a good balance between overview and detail. Most importantly, the general theme of your topic should become clear from your presentation. Your goal is that the other participants have a general knowledge about your topic after your presentation. You can pick one or two things that you find interesting and present those in more detail. Finally, prepare your presentation on time. Our feedback will help you to improve it. Also plan to spend some time on practicing your talk.

Important: We expect you to lead the discussion after your presentation. This requires that you prepare some questions, remarks or provocative statements for the audience. Try to challenge them. The discussion is also a good place to collect information and thoughts for the paper section where you present your own ideas. Keep this in mind when preparing your questions.

Paper

In addition to in class presentations, we require a written report on the topic. This report should summarize the main ideas of the paper and discuss limitations and drawbacks of the work. Based on this analysis, participants should develop and sketch an idea of their own of how to address one specific shortcoming. The report should consist of 6-8 pages for the material covering the presented papers and about 2-3 pages for the improvement proposed by the student.

The paper uses a template for one of the two major computer vision conferences, the IEEE Conference on Computer Vision and Pattern Recognition. Please use the review version of this template. It makes it easier to refer to certain parts of your text if need arises. For your convenience we provide the template links here:

.tar.gz (Linux) .zip (Windows)

Your paper should be emailed to us by August 11 2011. The format is .pdf. Please keep that in mind when using the Word template.

We will grade your paper based on a number of criteria. Again, as in the presentation, focus on getting the "big picture" right. Present your topic in context. Try to link the two articles, present a coherent text, i.e. a structure of simply

Paper 1
Paper 2

is usually insufficient. Also, try to avoid copying the structure of the original papers as far as this is possible. Equations can of course be reproduced, but do not stick too closely to the original text. Use your imagination to find a way for presenting the topic in your own words. The articles have been chosen such that they provide different views on a common theme. Try to exploit that link. Also keep in mind to structure your paper well in a formal sense. An abstract, summarizing the paper provides a short overview. The introduction expands on that. Give an overview of your paper's structure. After that present your topic and details of interest to you. Then expand on the topic, i.e. add your own ideas. Finally, a conclusion summarizes the paper, this is also the place where you can add your own opinions about the topic.

We encourage you to use additional literature in the research on your own ideas. We expect 3-4 additional references, if you want to include more you are welcome to do so. Cite the references in appropriate places, i.e. if you use concepts developed in a different publication or if you want to link to additional information. Your own research idea can be developed from scratch - if you have a great idea, try to sketch it and a possible solution. You can also discuss limitations or ideas that you feel are missing from the paper (You can choose one of the two). A further possibility is to perform research on developments that followed the papers of your topic. If you persue this route, make sure that you do not simply copy parts of other papers. In this case you should base your "own ideas" section on several follow-up publications. A good resource for researching links between articles is Google Scholar. The "cited by" link can sometimes give useful information. Often free links to pdf's can be found as well. Another source for freely downloadable scientific articles is Citeseer. If you can only find a pay link, send us an email - we will try to get you a copy.

Participation

Participants are expected to read every paper in preparation for the upcoming presentations. We expect students to actively engage in discussions to further understanding of the presented material. We aim at creating a creative atmosphere - ideas developed during the seminar work might lead to Master thesis projects.

To encourage a more lively atmosphere in the discussion we decided that you can improve your grade by one step, e.g. from 2.0 to 1.7, if you participate actively in the discussions after the talks.

Time and Place

Time: Fridays 14:00-16:00
First seminar: April 15, 2011
Room: Building E1.4 (Max-Planck-Institut), Room 019
Registration: open


Presentation Slots

The first presentation by a student is scheduled for May 6 or May 13 2011 (to be discussed), and then every week until the end of the term. The exact schedule will appear here once all topics are selected.

Mailing List

https://lists.mpi-inf.mpg.de/listinfo/cvcg

Registration for the Seminar

The details of the seminar and the available subjects will be discussed in the first class on April 15. Attendance is required in all classes. In case you want to participate in the seminar, please register by sending an Email to theobalt@mpii.de or kkim@mpii.de. If possible, also take a moment already to look through the list of topics and provide a ranked list of the three favorite topics you would like to work on: (1) best, (2) second best etc. It will still be possible to provide that list after the first class, but, in any case, please register before the first class via Email. Based on your preference we plan to assign topics by April 19. Please note that the number of available presentation slots is limited. We will try to accommodate all requests for participation but in case we have many more requests than available slots, we will assign slots on a first come first serve basis. Any official announcement will be made through the mailing list. Please subscribe/unsubscribe it.

Instructor Office Hours

Christian

Wedensdyas, 14-15, room 228

Kwang In

Wednesdays, 14-15, room 225

Kiran

Thrusdays, 16-17, room 226

Miguel

Fridays, 16-17, room 221




List of Papers

Topic Authors Paper Title Published at Contact Presenter Date Slides

Motion deblurring

Anat Levin, Peter Sand, Taeg Sang Cho, Fredo Durand, William T. Freeman

Rob Fergus, Barun Singh, Aaron Hertzmann, Sam T. Roweis, William T. Freeman

Motion-Invariant Photography

Removing Camera Shake from a Single Image

SIGGRAPH 2008

SIGGRAPH 2006

Kwang In

Han Du

May 13

Super-resolution

Daniel Glasner, Shai Bagon, Michal Irani

Michal Irani, Shmuel Peleg

Super-Resolution From a Single Image

Improving Resolution by Image Registration

ICCV 2009

CVGIP 1991

Kwang In

JingYu Yang

May 20

Space-time super-resolution

Eli Shechtman, Yaron Caspi, Michal Irani

Oded Shahar, Alon Faktor, Michal Irani

Space-Time Super-Resolution

Space-Time Super-Resolution from a Single Video

PAMI 2005

CVPR 2011

Kwang In

Madina Boshtayeva

May 27

Image and video inpainting

Connelly Barnes, Eli Shechtman, Adam Finkelstein, Dan B. Goldman

Yonatan Wexler, Eli Shechtman, Michal Irani

PatchMatch: A randomized correspondence algorithm for structural image editing

Space-time completion of video

SIGGRAPH 2009

IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2007

Miguel

Jing Niu

Jun 03

Advance image and video editing

Connelly Barnes, Dan B. Goldman, Eli Shechtman, Adam Finkelstein

Yael Pritch, Eitam Kav-Venaki, Shmuel Peleg

Video tapestries with continuous temporal zoom

Shift-map image editing

SIGGRAPH 2010

CVPR 2010

Miguel

Sayyed Auwn Muhammad

Jun 10

Shape from silhouette

Wojciech Matusik, Chris Buehler, Ramesh Raskar, Leonard McMillan, Steven Gortler

Aldo Laurentini

Image-Based Visual Hulls

The Visual Hull Concept for Silhouette-Based Image Understanding

SIGGRAPH 2000

IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 1994

Kiran

Rui Xu

Jun 17

Reconstruction from community photo collections

Noah Snavely, Steven M. Seitz, Richard Szeliski

Michael Goesele, Noah Snavely, Brian Curless, Steven M. Seitz, Hugues Hoppe

Photo Tourism: Exploring image collections in 3D

Multi-View Stereo for Community Photo Collections

ACM Transactions on Graphics (Proc. of SIGGRAPH), 2006

Proc. of International Conference on Computer Vision (ICCV), 2009

Christian

Wenkai Dai

Jul 01

Real-time Motion Capture with Range Sensors

Jamie Shotton, Andrew Fitzgibbon, Mat Cook, Toby Sharp, Mark Finocchio, Richard Moore, Alex Kipman, and Andrew Blake

Robert Y. Wang, Jovan Popović

Real-Time Human Pose Recognition in Parts from a Single Depth Image

Real-Time Hand-Tracking with a Color Glove

CVPR 2011

SIGGRAPH 2009

Kiran

Haiyang Xu

Jul 15




Search MPII (type ? for help)