What Makes Paris Look like Paris?
Given a large repository of geotagged imagery, we seek to automatically find visual elements, e.g. windows, balconies, and street signs, that are most distinctive for a certain geo-spatial area, for example the city of Paris. This is a tremendously difficult task as the visual features distinguishing architectural elements of different places can be very subtle. In addition, we face a hard search problem: given all possible patches in all images, which of them are both frequently occurring and geographically informative? To address these issues, we propose to use a discriminative clustering approach able to take into account the weak geographic supervision. We show that geographically representative image elements can be discovered automatically from Google Street View imagery in a discriminative manner. We demonstrate that these elements are visually interpretable and perceptually geo-informative. The discovered visual elements can also support a variety of computational geography tasks, such as mapping architectural correspondences and influences within and across cities, finding representative elements at different geo-spatial scales, and geographically-informed image retrieval.
Paper & Presentation
SIGGRAPH talk slides (pptx, 238MB)
Carl Doersch, Saurabh Singh, Abhinav Gupta, Josef Sivic, and Alexei A. Efros. What Makes Paris Look like Paris? ACM Transactions on Graphics (SIGGRAPH 2012), August 2012, vol. 31, No. 3. [Show BibTex]
S. Singh, A. Gupta and A. A. Efros. 2012. Unsupervised discovery of mid-level discriminative patches. In ECCV 2012.
C. Doersch, A. Gupta and A. A. Efros. 2013. Mid-Level Visual Element Discovery as Discriminative Mode Seeking. In NIPS 2013.
Version 4.5 tar.gz (1.6 MB; released Jan. 29 2012)
Version 3.0 tar.gz (1.5 MB; released Sept. 13 2012)
Full Results: Paris Elements
An example of the results obtained entirely automatically, using our algorithm to find the unique visual elements of Paris. Each row shows the most confident detections in Paris for one of these elements. WARNING: This page contains about 5000 patches; don't click the link unless your browser can handle it!
Take the Paris-NonParis test!
This test presents 100 random street-level images: 50 from Paris, 50 from 11 other cities. Guess whether each one is from Paris (try not to look at the text...) and we'll score you at the end!
For one experiment, we had an artist sketch an image, and then later re-sketch it with help from our discovered elements. See the resulting sketches, plus the materials we showed our artist.
This research is supported by:
- NDSEG Fellowship for Carl Doersch
- NSF IIS0905402
- ONR N000141010934
- ONR N000141010766