Volume 5, Number 8, Abstract 742, Page 742a doi:10.1167/5.8.742 http://journalofvision.org/5/8/742/ ISSN 1534-7362
Standard model v2.0: How visual cortex might learn a universal dictionary of shape components
Thomas Serre
Center for Biological and Computational Learning, MIT
Tomaso Poggio
Center for Biological and Computational Learning, MIT
Abstract

The tuning properties of neurons in inferotemporal (IT) cortex are likely to play a key role in visual perception in primates, and in particular in their object recognition abilities. The tuning of specific neurons probably depends, at least in part, on visual experience.
We describe a model of plasticity and learning in V4 and IT, extending the initial version of the standard model of object recognition in cortex [Riesenhuber and Poggio, Nat. Neurosci. 1999], that accounts for known physiological data. When exposed to many natural images, the model generates a large set of shape-tuned units that support robust recognition performance and that can be interpreted as a universal dictionary of shapes with the properties of overcompleteness and non-uniqueness. Preliminary results suggest that the set of shape-tuned units obtained is consistent with recent physiological data collected in V4; see the abstract by [Cadieu et al., VSS 2005]. We also show that the model can handle the recognition of different object categories in natural images at the level of the best existing computer vision recognition systems.
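
The abstract does not state the learning rule, so the following is only an illustrative sketch of how a dictionary of shape-tuned units might be built from image exposure. It assumes, purely for illustration, that units are "imprinted" by storing randomly sampled image patches as prototypes, that each unit has Gaussian tuning around its prototype, and that responses are pooled with a max over positions. The function names (sample_patches, unit_response, encode), the patch size, sigma, and the random arrays standing in for natural images are all hypothetical and are not taken from the abstract.

```python
import math
import random


def sample_patches(image, patch_size, n_patches, rng):
    """Store n_patches square patches from random positions of a 2-D image.

    Assumption: each stored patch becomes the prototype of one shape-tuned unit.
    """
    h, w = len(image), len(image[0])
    patches = []
    for _ in range(n_patches):
        r = rng.randrange(h - patch_size + 1)
        c = rng.randrange(w - patch_size + 1)
        patches.append([image[r + i][c + j]
                        for i in range(patch_size)
                        for j in range(patch_size)])
    return patches


def unit_response(image, prototype, patch_size, sigma=1.0):
    """Max over all positions of a Gaussian tuning curve around the prototype."""
    h, w = len(image), len(image[0])
    best = 0.0
    for r in range(h - patch_size + 1):
        for c in range(w - patch_size + 1):
            d2 = 0.0
            k = 0
            for i in range(patch_size):
                for j in range(patch_size):
                    d2 += (image[r + i][c + j] - prototype[k]) ** 2
                    k += 1
            best = max(best, math.exp(-d2 / (2 * sigma ** 2)))
    return best


def encode(image, dictionary, patch_size, sigma=1.0):
    """Describe an image by the responses of every unit in the dictionary."""
    return [unit_response(image, p, patch_size, sigma) for p in dictionary]


# Random 8x8 arrays stand in for natural images (hypothetical data).
rng = random.Random(0)
images = [[[rng.random() for _ in range(8)] for _ in range(8)]
          for _ in range(3)]

# "Exposure": every image contributes a few prototypes to a shared dictionary.
dictionary = []
for img in images:
    dictionary.extend(sample_patches(img, 3, 4, rng))

# The same dictionary is reused to encode any image, whatever its category.
features = encode(images[0], dictionary, 3)
```

In this toy setup the dictionary is shared across all images, which is one way to read "universal"; a classifier trained on such feature vectors would be a separate step that the sketch does not cover.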

History
Received September 15, 2005; published September 23, 2005
Citation
Serre, T., & Poggio, T. (2005). Standard model v2.0: How visual cortex might learn a universal dictionary of shape components [Abstract]. Journal of Vision, 5(8):742, 742a, http://journalofvision.org/5/8/742/, doi:10.1167/5.8.742.