Visipedia

Visipedia, short for “Visual Encyclopedia,” is a network of people and machines designed to harvest and organize visual information and make it accessible to anyone, anywhere. Visipedia machines can learn from experts how to discover and classify animals, plants, and objects in images. Communities of scientists and interested citizens can use Visipedia software to share, annotate, and organize meaningful content in images. Recent experiments include software that detects and classifies trees from satellite and street-level images, and an app that recognizes North American birds. Visipedia is a joint project between Pietro Perona’s Vision Group at Caltech and Serge Belongie’s Vision Group at Cornell Tech.

If you wish to learn more about Visipedia, a good place to start is here. You can also visit our joint project page at visipedia.org.

Funding for Visipedia is provided by a Google Focused Research Award and the Jacobs Technion-Cornell Institute.

Papers

2017

Kernel Pooling for Convolutional Neural Networks

Cui, Yin; Zhou, Feng; Wang, Jiang; Liu, Xiao; Lin, Yuanqing; Belongie, Serge

Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, 2017.

2016

Fine-grained Categorization and Dataset Bootstrapping using Deep Metric Learning with Humans in the Loop

Cui, Yin; Zhou, Feng; Lin, Yuanqing; Belongie, Serge

Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, 2016.

2015

Learning Concept Embeddings with Combined Human-Machine Expertise

Wilber, Michael; Kwak, Iljung; Kriegman, David; Belongie, Serge

International Conference on Computer Vision (ICCV), 2015.

Visipedia circa 2015

Belongie, Serge; Perona, Pietro

Pattern Recognition Letters, 2015.

Building a Bird Recognition App and Large Scale Dataset With Citizen Scientists: The Fine Print in Fine-Grained Dataset Collection

Van Horn, Grant; Branson, Steve; Farrell, Ryan; Haber, Scott; Barry, Jessie; Ipeirotis, Panos; Perona, Pietro; Belongie, Serge

Computer Vision and Pattern Recognition (CVPR), Boston, MA, 2015.

Learning Localized Perceptual Similarity Metrics for Interactive Categorization

Wah, Catherine; Maji, Subhransu; Belongie, Serge

IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Beach, HI, 2015.

2014

Bird Species Categorization Using Pose Normalized Deep Convolutional Nets

Branson, Steve; Van Horn, Grant; Belongie, Serge; Perona, Pietro

British Machine Vision Conference (BMVC), Nottingham, 2014.

The Ignorant Led by the Blind: A Hybrid Human–Machine Vision System for Fine-Grained Categorization

Branson, Steve; Van Horn, Grant; Wah, Catherine; Perona, Pietro; Belongie, Serge

International Journal of Computer Vision (IJCV), 2014.

Similarity Comparisons for Interactive Fine-Grained Categorization

Wah, Catherine; Van Horn, Grant; Branson, Steve; Maji, Subhransu; Perona, Pietro; Belongie, Serge

Computer Vision and Pattern Recognition (CVPR), Columbus, OH, 2014.

A User Friendly Crowdsourcing Task Manager

Matera, Tomas; Jakes, Jan; Cheng, Munan; Belongie, Serge

Workshop on Computer Vision and Human Computation, Columbus, OH, 2014.

2013

Efficient Large-Scale Structured Learning

Branson, Steve; Beijbom, Oscar; Belongie, Serge

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, 2013.

Attribute-Based Detection of Unfamiliar Classes with Humans in the Loop

Wah, Catherine; Belongie, Serge

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, 2013.

Bootstrapping Fine-Grained Classifiers: Active Learning with a Crowd in the Loop

Patterson, Genevieve; Van Horn, Grant; Belongie, Serge; Perona, Pietro; Hays, James

NIPS Workshop on Crowdsourcing: Theory, Algorithms and Applications, Lake Tahoe, 2013.

2011

Strong Supervision From Weak Annotation: Interactive Training of Deformable Part Models

Branson, Steve; Perona, Pietro; Belongie, Serge

IEEE International Conference on Computer Vision (ICCV), Barcelona, 2011.

Multiclass Recognition and Part Localization with Humans in the Loop

Wah, Catherine; Branson, Steve; Perona, Pietro; Belongie, Serge

IEEE International Conference on Computer Vision (ICCV), Barcelona, 2011.

2010

Caltech-UCSD Birds 200

Welinder, Peter; Branson, Steve; Mita, Takeshi; Wah, Catherine; Schroff, Florian; Belongie, Serge; Perona, Pietro

Caltech (CNS-TR-201), 2010.

The Multidimensional Wisdom of Crowds

Welinder, Peter; Branson, Steve; Belongie, Serge; Perona, Pietro

Neural Information Processing Systems Conference (NIPS), 2010.

Visual Recognition with Humans in the Loop

Branson, Steve; Wah, Catherine; Babenko, Boris; Schroff, Florian; Welinder, Peter; Perona, Pietro; Belongie, Serge

European Conference on Computer Vision (ECCV), Heraklion, Crete, 2010.

People