Visipedia

Visipedia, short for “Visual Encyclopedia,” is a network of people and machines that is designed to harvest  and organize visual information and make it accessible to anyone anywhere. Visipedia machines can learn from experts how to discover and classify animals, plants and objects in images. Communities of scientists and interested citizens may use Visipedia software to share, annotate and organize meaningful content in images. Recent experiments include software that can detect and classify trees from satellite and street-level images, and an app that can recognize North American birds. Visipedia is a joint project between Pietro Perona’s Vision Group at Caltech and Serge Belongie’s Vision Group at Cornell Tech.

If you wish to learn more about Visipedia, a good place to start is here. You can also visit our joint project page at visipedia.org.

Funding for Visipedia provided by a Google Focused Research Award and the Jacobs Technion-Cornell Institute.

Papers

2021

The Herbarium 2021 Half–Earth Challenge Dataset

de Lutio, Riccardo; Little, Damon; Ambrose, Barbara; Belongie, Serge

The Herbarium 2021 Half–Earth Challenge Dataset

CVPR Workshop on Fine-Grained Visual Categorization (FGVC), Virtual, 2021.

(Links | BibTeX)

The Plant Pathology 2021 Challenge dataset to classify foliar disease of apples

Thapa, Ranjita; Wang, Qianqian; Snavely, Noah; Belongie, Serge; Khan, Awais

The Plant Pathology 2021 Challenge dataset to classify foliar disease of apples

CVPR Workshop on Fine-Grained Visual Categorization (FGVC), Virtual, 2021.

(Links | BibTeX)

2020

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

Jia*, Menglin; Shi*, Mengyun; Sirotenko*, Mikhail; Cui*, Yin; Hariharan, Bharath; Cardie, Claire; Adam, Hartwig; Belongie, Serge

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

European Conference on Computer Vision (ECCV), Glasgow, Scotland, 2020, (Oral, *Equal Contribution).

(Links | BibTeX)

An algorithm competition for automatic species identification from herbarium specimens

Little, Damon; Tulig, Melissa; Tan, Kiat Chuan; Liu, Yulong; Belongie, Serge; Kaeser‐Chen, Christine; Michelangeli, Fabián; Panesar, Kiran; Guha, RV; Ambrose, Barbara

An algorithm competition for automatic species identification from herbarium specimens

Applications in Plant Sciences, 8 (6), 2020.

(Links | BibTeX)

2019

Neural Naturalist: Generating Fine-Grained Image Comparisons

Forbes, Maxwell; Kaeser-Chen, Christine; Sharma, Piyush; Belongie, Serge

Neural Naturalist: Generating Fine-Grained Image Comparisons

Conference on Empirical Methods in Natural Language Processing (EMNLP), Hong Kong, 2019.

(Abstract | Links | BibTeX)

The iMaterialist Fashion Attribute Dataset

Guo, Sheng; Huang, Weilin; Zhang, Xiao; Srikhanta, Prasanna; Cui, Yin; Li, Yuan; Scott, Matthew; Adam, Hartwig; Belongie, Serge

The iMaterialist Fashion Attribute Dataset

ICCV Workshop on Computer Vision for Fashion, Art, and Design (CVFAD), Seoul, Korea, 2019, (Best Paper Award).

(Links | BibTeX)

Training Machines to Identify Species using GBIF-mediated Datasets

Robertson, Tim; Belongie, Serge; Adam, Hartwig; Kaeser-Chen, Christine; Zhang, Chenyang; Tan, Kiat Chuan; Liu, Yulong; Brulé, Denis; Deltheil, Cédric; Loarie, Scott; Van Horn, Grant; {Mac Aodha}, Oisin; Beery, Sara; Perona, Pietro; Copas, Kyle; Waller, John Thomas

Training Machines to Identify Species using GBIF-mediated Datasets

Biodiversity Information Science and Standards (TDWG), Leiden, NL, 2019.

(Links | BibTeX)

Towards Ethical Deployment of AI for Conservation Systems

Kaeser-Chen, Christine; Birch, Tanya; Chou, Katherine; Gadot, Tomer; Adam, Hartwig; Belongie, Serge; Robertson, Tim; Fegraus, Eric; Morris, Dan

Towards Ethical Deployment of AI for Conservation Systems

KDD Workshop Data Mining and AI for Conservation (DMAIC), Anchorage, AK, 2019.

(Links | BibTeX)

The iMet Collection 2019 Challenge Dataset

Zhang, Chenyang; Kaeser-Chen, Christine; Vesom, Grace; Choi, Jennie; Kessler, Maria; Belongie, Serge

The iMet Collection 2019 Challenge Dataset

CVPR Workshop on Fine-Grained Visual Categorization (FGVC), Long Beach, CA, 2019.

(Links | BibTeX)

2018

Lean Multiclass Crowdsourcing

Van Horn, Grant; Branson, Steve; Loarie, Scott; Belongie, Serge; Perona, Pietro

Lean Multiclass Crowdsourcing

Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, 2018.

(Links | BibTeX)

The iNaturalist Species Classification and Detection Dataset

Van Horn, Grant; Aodha, Oisin Mac; Song, Yang; Cui, Yin; Sun, Chen; Shepard, Alex; Adam, Hartwig; Perona, Pietro; Belongie, Serge

The iNaturalist Species Classification and Detection Dataset

Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, 2018.

(Links | BibTeX)

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning

Cui, Yin; Song, Yang; Sun, Chen; Howard, Andrew; Belongie, Serge

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning

Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, 2018.

(Links | BibTeX)

2017

Kernel Pooling for Convolutional Neural Networks

Cui, Yin; Zhou, Feng; Wang, Jiang; Liu, Xiao; Lin, Yuanqing; Belongie, Serge

Kernel Pooling for Convolutional Neural Networks

Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, 2017.

(Links | BibTeX)

2016

Fine-grained Categorization and Dataset Bootstrapping using Deep Metric Learning with Humans in the Loop

Cui, Yin; Zhou, Feng; Lin, Yuanqing; Belongie, Serge

Fine-grained Categorization and Dataset Bootstrapping using Deep Metric Learning with Humans in the Loop

Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, 2016.

(Links | BibTeX)

2015

Learning Concept Embeddings with Combined Human-Machine Expertise

Wilber, Michael; Kwak, Iljung; Kriegman, David; Belongie, Serge

Learning Concept Embeddings with Combined Human-Machine Expertise

International Conference on Computer Vision (ICCV), 2015.

(Links | BibTeX)

Belongie, Serge; Perona, Pietro

Visipedia circa 2015

Pattern Recognition Letters, 2015.

(Links | BibTeX)

Building a Bird Recognition App and Large Scale Dataset With Citizen Scientists: The Fine Print in Fine-Grained Dataset Collection

Van Horn, Grant; Branson, Steve; Farrell, Ryan; Haber, Scott; Barry, Jessie; Ipeirotis, Panos; Perona, Pietro; Belongie, Serge

Building a Bird Recognition App and Large Scale Dataset With Citizen Scientists: The Fine Print in Fine-Grained Dataset Collection

Computer Vision and Pattern Recognition (CVPR), Boston, MA, 2015.

(Links | BibTeX)

Learning Localized Perceptual Similarity Metrics for Interactive Categorization

Wah, Catherine; Maji, Subhransu; Belongie, Serge

Learning Localized Perceptual Similarity Metrics for Interactive Categorization

IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Beach, HI, 2015.

(Links | BibTeX)

2014

Bird Species Categorization Using Pose Normalized Deep Convolutional Nets

Branson, Steve; Horn, Grant Van; Belongie, Serge; Perona, Pietro

Bird Species Categorization Using Pose Normalized Deep Convolutional Nets

British Machine Vision Conference (BMVC), Nottingham, 2014.

(Links | BibTeX)

The Ignorant Led by the Blind: A Hybrid Human–Machine Vision System for Fine-Grained Categorization

Branson, Steve; Horn, Grant Van; Wah, Catherine; Perona, Pietro; Belongie, Serge

The Ignorant Led by the Blind: A Hybrid Human–Machine Vision System for Fine-Grained Categorization

International Journal of Computer Vision (IJCV), 2014.

(Links | BibTeX)

Similarity Comparisons for Interactive Fine-Grained Categorization

Wah, Catherine; Horn, Grant Van; Branson, Steve; Maji, Subhransu; Perona, Pietro; Belongie, Serge

Similarity Comparisons for Interactive Fine-Grained Categorization

Computer Vision and Pattern Recognition (CVPR), Columbus, OH, 2014.

(Links | BibTeX)

A User Friendly Crowdsourcing Task Manager

Matera, Tomas; Jakes, Jan; Cheng, Munan; Belongie, Serge

A User Friendly Crowdsourcing Task Manager

Workshop on Computer Vision and Human Computation, Columbus, OH, 2014.

(Links | BibTeX)

2013

Efficient Large-Scale Structured Learning

Branson, Steve; Beijbom, Oscar; Belongie, Serge

Efficient Large-Scale Structured Learning

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, 2013.

(Links | BibTeX)

Attribute-Based Detection of Unfamiliar Classes with Humans in the Loop

Wah, Catherine; Belongie, Serge

Attribute-Based Detection of Unfamiliar Classes with Humans in the Loop

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, 2013.

(Links | BibTeX)

Bootstrapping Fine-Grained Classifiers: Active Learning with a Crowd in the Loop

Patterson, Genevieve; Horn, Grant Van; Belongie, Serge; Perona, Pietro; Hays, James

Bootstrapping Fine-Grained Classifiers: Active Learning with a Crowd in the Loop

NIPS Workshop on Crowdsourcing: Theory, Algorithms and Applications, Lake Tahoe, 2013.

(Links | BibTeX)

Style Finder: Fine-Grained Clothing Style Recognition and Retrieval

Di, Wei; Wah, Catherine; Bhardwaj, Anurag; Piramuthu, Robinson; Sundaresan, Neel

Style Finder: Fine-Grained Clothing Style Recognition and Retrieval

IEEE International Workshop on Mobile Vision, Portland, OR, 2013.

(Links | BibTeX)

2011

Strong Supervision From Weak Annotation: Interactive Training of Deformable Part Models

Branson, Steve; Perona, Pietro; Belongie, Serge

Strong Supervision From Weak Annotation: Interactive Training of Deformable Part Models

IEEE International Conference on Computer Vision (ICCV), Barcelona, 2011.

(Links | BibTeX)

Multiclass Recognition and Part Localization with Humans in the Loop

Wah, Catherine; Branson, Steve; Perona, Pietro; Belongie, Serge

Multiclass Recognition and Part Localization with Humans in the Loop

IEEE International Conference on Computer Vision (ICCV), Barcelona, 2011.

(Links | BibTeX)

2010

Caltech-UCSD Birds 200

Welinder, Peter; Branson, Steve; Mita, Takeshi; Wah, Catherine; Schroff, Florian; Belongie, Serge; Perona, Pietro

Caltech-UCSD Birds 200

Caltech (CNS-TR-201), 2010.

(Links | BibTeX)

The Multidimensional Wisdom of Crowds

Welinder, Peter; Branson, Steve; Belongie, Serge; Perona, Pietro

The Multidimensional Wisdom of Crowds

Neural Information Processing Systems Conference (NIPS), 2010.

(Links | BibTeX)

Visual Recognition with Humans in the Loop

Branson, Steve; Wah, Catherine; Babenko, Boris; Schroff, Florian; Welinder, Peter; Perona, Pietro; Belongie, Serge

Visual Recognition with Humans in the Loop

European Conference on Computer Vision (ECCV), Heraklion, Crete, 2010.

(Links | BibTeX)

People