span.GRAME {mso-gram-e:yes;}
Topics on Vision in Humans and Machines
CSE 291-B100, Spring 2005
David Kriegman
Kriegman {at} cs.ucsd.edu
Marian Stewart Bartlett
Marni {at} salk.edu
Grading: S/U, 2 units only.
This graduate seminar will explore topics in vision such as stereo, segmentation, motion interpretation, object recognition, face identity and expression recognition. In each class session, a topic will be considered from the perspectives of computer vision and human visual perception. There will be assigned readings on both human and machine vision, and a pair of students will serve as discussants from the two perspectives.
Class structure: Every week, each student will submit two questions or comments on the readings to the Discussion Board. Students will also complete tutorials from the Cold Spring Harbor course on Computational Visual Neuroscience.
3/30 Organizational Meeting, Center Hall 223 9:30-11.
4/06 Statistics of Natural Images
4/13 Information Theory and Neural Representation
4/20 Luminance, Illumination, and Contrast Normalization
4/27 Motion
5/4 Color
5/11 Texture/Segmentation
.
5/18 Object Representation and Recognition
5/25 Face Processing
6/1 Active Exploration
Reading List
4/06 Statistics
of Natural Images
Field (1987) Relations between the statistics of natural images and the response properties of cortical cells. J. Opt. Soc. Am. (or more recent papers, e.g. ‘What is the goal of sensory coding’ Neural Computation). pdf
J. Huang and D. Mumford. Statistics of natural images and models. CVPR, pages 541–7, 1999. pdf
Optional:
Ron O. Dror, Thomas K. Leung, Edward H. Adelson, and Alan S. Willsky,
Statistics of Real-World Illumination CVPR 2001 pdf
Ruderman & Bialek (1994). Statistics of natural images: scaling in the woods. Physical Review letters. pdf
Field (1999). Wavelets, vision and the statistics of natural
scenes. Phil. Trans. R. Soc. London A. 357, 2527-2542. pdf
4/13 Information
Theory and Neural Representation
Simoncelli & Olshaussen (2001). Natural image statistics and Neural Representation. Ann Rev Neurosci. This was on the reading list for the cogsci200 on learning in vision. pdf
Linsker (1990). Perceptual neural organization – Some approaches based on network models and information theory. Annual Review of Neurology 13, p. 257-281. pdf
Optional:
Barlow (1989). Unsupervised learning. (Or his original 1961 paper ‘possible principles underlying the transformations of sensory messages.’ It comes highly recommended.) pdf
Olshaussen & Field (1996). Emergence of simple cell receptive field properties by learning a sparse code for natural images. Nature 381, p. 607-609. pdf See also Vision Research 1997. pdf
Bell & Sejnowski (1996). The independent components of natural scenes are edge filters. Vision Research. pdf
Edelman & Intrator (2004).
Unsupervised statistical learning in vision: Computational principles,
biological evidence. Interesting but short. pdf
Shannon, C.E. (1948). A
Mathematical Theory of Communication. Bell System Technocal Journal 27,
379-423, 623-656. pdf
4/20 Luminance,
illumination, and contrast normalization
Wainwright, Schwartz & Simoncelli (2002). Natural image statistics and divisive normalization: Modeling nonlinearity and adaptation in cortical neurons. In Rao, Olshausen & Lewicki (eds). Probabilistic models of the brain: Perception and Neural Function. pdf
J. Ho, D. Kriegman, ``On the Efect of Illumination and Face Recognition” to appear in “Face Processing: Advanced Modeling and Methods,” R. Chellappa, W. Zhao, editors, 2005. pdf
Optional:
Adelson (2000). Lightness perception and lightness
illusions. In Gazzaniga (ed). The new cognitive neurosciences. pdf
Demos of Adelson’s lightness illusions.
4/27 Motion
Fleet, D. J., Black, M. J. and Nestares, O., (2002). Bayesian Inference of Visual Motion Boundaries, in Lakemeyer, G. and Nebel, B. (Eds.) Exploring Artificial Intelligence in
the New Millennium, Morgan Kaufmann Pub. pdf
J. Xiao, S. Baker, I. Matthews, and T. Kanade (2004). Real-Time Combined 2D+3D
Active Appearance Models. CVPR. pdf
Demos of the system in the Xiao paper and also demos of other AAM systems by the same group
Optional:
Albright (1993)
Cortical Processing of Visual Motion. In F.A. Miles and J. Wallman, Eds. Visual
Motion and its Role in the Stabilization of Gaze. Elsavier Science Publishers,
pg. 177-201. pdf
Adelson & Bergen (1985). Spatio-temporal energy models for the perception of motion. JOSA. pdf
Weiss, Simoncelli & Adelson (2002) Motion Illusions as optimal percepts. Nature Neuroscience. pdf
Black, M. J. and Anandan, P., (1996) Computer Vision and Image Understanding, CVIU, 63(1), pp. 75-104. pdf
Bruhn, Weikert and Schnorr (2005). Lucas/Kanade Meets Horn/Schunk: Combining Local and Global Optic Flow Methods. IJCV 61(3):211-231, Feb 2005. pdf
5/4 Color
von der Twer, T. and Macleod. D.I.A. (2001). Optimal
nonlinear codes for the perception of natural colors. Network, Computaiton
inNeural Systems, 12 395-407. pdf
NOTE: We
originally listed the following book chapter, but the Network paper is a little
easier to follow, so Josh recommends reading that one instead. If you read the
book chapter, then Josh will be
focusing on pgs 1-21 of the chapter, which covers the same material as the
Network paper. MacLeod, D.I.A. (2003). Colour Discrimination, Colour
Constancy, and Natural Scene
Statistics (The Verriest Lecture). In: J.D. Mollon, J. Pokorny, & K. Knoblauch (Eds.),
Proceedings of the Thomas Young
Symposium of the International Colour Vision Society London: Oxford
University Press. pdf
Neitz, Carroll, Yamauchi, Neitz, & Williams (2002).
Color Perception Is Mediated by a Plastic Neural Mechanism that Is Adjustable
in Adults. Neuron, Vol. 35, 783. pdf
Optional:
Brainard Kraft & Longere (2003). Color contstancy:
developing empirical tests of computational models. In Maunsfeld & Heyer
(Eds.) Colour Perception: Mind and the physical world. pdf
Delahunt & Brainard (2004). Does human color constancy incorporate the statistical regularity of natural daylight? Journal of vision 4 p. 57-81. pdf
Rhea Eskew. Chromatic Detection and Discrimination, In Gegenfurtner and Sharpe (Eds), Color Vision from Genes to Perception. Cambridge University Press. p. 345-368.
Nathans, J., (1989) The genes for color vision. Sci Am,
1989. 260(2): p.
42-9. pdf
5/11 Texture
/ segmentation
Malik & Perona (1990) Preattentive texture discrimination with early vision mechanisms. JOSA pdf
Zhu, Wu, & Mumford (1997). Minimax Entropy Principle and its Application to Texture Modeling. Neural Computation 9(8), p. 1627-1660. pdf
Optional:
Rosenholtz (2001). Search asymmetries? What search asymmetries? Perception & psychophysics. pdf
Rosenholtz (2000). Significantly different textures: a computational model of pre-attentive texture segmentation. ECCV. pdf
Farid & Adelson (2001). Synchrony does not promote grouping in temporally structured displays. Nature Neuroscience 4(9) p. 875-876. pdf
Heeger and Bergen (1995). Pyramid-Based Texture Analysis and Synthesis. Siggraph. P. 229-238. pdf
Fowlkes & Malik. Learning edge detection from human examples.
5/18 Object
representation
Kersten, Mamassian & Yuille (2004). Object perception as
Bayesian inference. Annual review of Psychology. pdf (See also Kersetn & Yuille 2003, Bayesian
models of object perception. Current Opinion in Neurobiology. ) pdf
Rothganger, F., Schmidt, C., Ponce, J. (2005). 3D Object Modeling Using Local Affine-Invariant Image Descriptors and Multi-View Spatial Constraints. International Journal of Computer Vision. pdf
Optional:
Edelman & Duvdevani-barr (1997). A model of visual recognition and categorization. Phil Trans. Royal Soc. London 352 p. 1191-1202. pdf
2D vs 3D (Mike Tarr) and/or Biederman & Kolocsoi
5/25 Face
Representation
Bartlett (submitted). Information maximization in face processing. Neurocomputing. pdf
Webster, Werner, & Field (in press) . “Adaptation and the phenomenology of perception.” In Clifford and Rhodes (Eds.) Fitting the mind to the world: adaptation and aftereffects in high level vision. (Mike Webster has some cool work on face adaptation that speaks to the information maximization idea.) doc
Optional:
Valentine (2000). Face Space models of face recognition. In Wenger & Townhend (Eds.) Computational, geometric, and process perspectives on facial cognition: Contexts and challenges. pdf
6/1 Active
Exploration
Sprague, Ballard and Robinson (submitted). Modeling
attention with Embodied Visual Behaviors.
ACM Transactions on Applied Perception. pdf
Walker-Renninger, Coughlan, Verghese, & Malik (2005). An
Information
Maximization Model of Eye Movements. NIPS 17, 1121-1128. pdf
Please also read the introduction of Nelson (in press) Psych
Review, for a review of probabilistic accounts of information acquisition (pgs
1-11 and 42-44, and the tables on 52,53,64). Nelson, J.D. (in press) Finding
useful questions. On Bayesian diagnosticity, probability, impact, and
information gain. Psychological Review. pdf
Optional:
Walther, D. , Rutishauser, U.,
Koch, C., and Perona, P. (in
press). Selective visual attention enables learning and recognition of
multiple objects in cluttered scenes. Computer Vision and Image Understanding. pdf
Denzler, J. & Brown, C.M.
(2002). Information theoretic sensor data selection for active object
recognition and state estimation. PAMI 24 p. 145-157. pdf
Yuille, Coughlan & Kersten
(1998). Computational vision: Principles of perceptual inference. Book
Chapter. This is a good chapter
introducing Bayesian models, as well as how they tie into information theory.
Also covers active exploration. pdf
Lee & Yu (1999). An Information-Theoretic Framework for
Understanding Saccadic Behaviors. NIPS 12, 834-840. pdf
Najemnik, J. & Geisler, W. S. (2005, March 17). Optimal eye movement
strategies in visual search. Nature, 434, 387-391. pdf
Nelson & Cottrell (submitted). A probabilistic model of
eye movements
in concept formation.
Related Tutorials
Cold Spring Harbor Computational Vision Tutorials by Eero Simoncelli, Paul Glimcher, EJ Cichilnisky, and others.
NIPS Computational
Vision Tutorial by Dan Kertsen
ICA Tutorial by Hyvarinen & Oja
Linear
Algebra tutorial by Eero Simoncelli
Linear Algebra in Parallel Distributed Processing by Mike Jordan.