Assistive Technology

The contemporary urban environment is brimming with rich visual cues that provide valuable directional and informational content to sighted individuals. The goal of the this project is to make these visual cues universally accessible in a variety of real-world domains.

People

Papers

2017

Detecting Oriented Text in Natural Images by Linking Segments

Shi, Baoguang; Bai, Xiang; Belongie, Serge

Detecting Oriented Text in Natural Images by Linking Segments

Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, 2017.

(Links | BibTeX)

2016

Revisiting Grocery Recognition using TensorFlow

Hoffman, Sam; Thiagarajan, Dilip

Revisiting Grocery Recognition using TensorFlow

Cornell University CS Department Summer Internship Continutity report, 2016.

(Links | BibTeX)

COCO-Text Explorer

Su, Philip

COCO-Text Explorer

Cornell University CS Department MEng Report, 2016.

(Links | BibTeX)

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

Veit, Andreas; Matera, Tomas; Neumann, Lukas; Matas, Jiri; Belongie, Serge

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

arXiv preprint arXiv:1601.07140, 2016.

(Links | BibTeX)

2015

COCO-Reader: User Study and Market Evaluation

Verma, Pragya

COCO-Reader: User Study and Market Evaluation

Cornell University CS Department MEng Report, 2015.

(Links | BibTeX)

COCO-Reader: Image Reader for the Blind

Wu, Xiaoyan; Su, Jiaqi

COCO-Reader: Image Reader for the Blind

Cornell University CS Department Summer Internship Continuity Report, 2015.

(Links | BibTeX)

2014

Video Text Detection and Recognition: Dataset and Benchmark

Nguyen, Phuc Xuan; Wang, Kai; Belongie, Serge

Video Text Detection and Recognition: Dataset and Benchmark

Winter Conference on Applications of Computer Vision (WACV), Steamboat Springs, CO, 2014.

(Links | BibTeX)

2011

End-to-End Scene Text Recognition

Wang, Kai; Babenko, Boris; Belongie, Serge

End-to-End Scene Text Recognition

IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain, 2011.

(Links | BibTeX)

EdgeSonic: Image Feature Sonification for the Visually Impaired

Yoshida, Tsubasa; Kitani, Kris; Belongie, Serge; Schlei, Kevin; Koike, Hideki

EdgeSonic: Image Feature Sonification for the Visually Impaired

International Conference on the Augmented Human, Tokyo, 2011.

(Links | BibTeX)

2010

Word Spotting in the Wild

Wang, Kai; Belongie, Serge

Word Spotting in the Wild

European Conference on Computer Vision (ECCV), Heraklion, Crete, 2010.

(Links | BibTeX)

EdgeSonic: Sonification of Image Features for the Visually Impaired

Yoshida, Tsubasa; Kitani, Kris; Belongie, Serge; Schlei, Kevin

EdgeSonic: Sonification of Image Features for the Visually Impaired

Workshop on Interactive Systems and Software, Japan, 2010.

(Links | BibTeX)

Toward real-time grocery detection for the visually impaired

Winlock, Tess; Christiansen, Eric; Belongie, Serge

Toward real-time grocery detection for the visually impaired

Computer Vision Applications for the Visually Impaired (CVAVI), San Francisco, CA, 2010.

(Links | BibTeX)

2009

CAPTCHA-based Image Labeling on the Soylent Grid

Faymonville, Peter; Wang, Kai; Miller, John; Belongie, Serge

CAPTCHA-based Image Labeling on the Soylent Grid

Human Computation Workshop (HCOMP), Paris, France, 2009.

(Links | BibTeX)

2007

Recognizing Groceries in situ Using in vitro Training Data

Merler, Michele; Galleguillos, Carolina; Belongie, Serge

Recognizing Groceries in situ Using in vitro Training Data

SLAM, Minneapolis, MN, 2007.

(Links | BibTeX)