From Image Statistics to Scene Gist: Evoked Neural Activity Reveals Transition from Low-Level Natural Image Structure to Scene Category

The visual system processes natural scenes in a split second. Part of this process is the extraction of “gist,” a global first impression. It is unclear, however, how the human visual system computes this information. Here, we show that, when human observers categorize global information in real-wor...

Full description

Saved in:
Bibliographic Details
Published inThe Journal of neuroscience Vol. 33; no. 48; pp. 18814 - 18824
Main Authors Groen, Iris I.A., Ghebreab, Sennay, Prins, Hielke, Lamme, Victor A.F., Scholte, H. Steven
Format Journal Article
LanguageEnglish
Published United States Society for Neuroscience 27.11.2013
Subjects
Online AccessGet full text
ISSN0270-6474
1529-2401
1529-2401
DOI10.1523/JNEUROSCI.3128-13.2013

Cover

More Information
Summary:The visual system processes natural scenes in a split second. Part of this process is the extraction of “gist,” a global first impression. It is unclear, however, how the human visual system computes this information. Here, we show that, when human observers categorize global information in real-world scenes, the brain exhibits strong sensitivity to low-level summary statistics. Subjects rated a specific instance of a global scene property, naturalness, for a large set of natural scenes while EEG was recorded. For each individual scene, we derived two physiologically plausible summary statistics by spatially pooling local contrast filter outputs: contrast energy (CE), indexing contrast strength, and spatial coherence (SC), indexing scene fragmentation. We show that behavioral performance is directly related to these statistics, with naturalness rating being influenced in particular by SC. At the neural level, both statistics parametrically modulated single-trial event-related potential amplitudes during an early, transient window (100–150 ms), but SC continued to influence activity levels later in time (up to 250 ms). In addition, the magnitude of neural activity that discriminated between man-made versus natural ratings of individual trials was related to SC, but not CE. These results suggest that global scene information may be computed by spatial pooling of responses from early visual areas (e.g., LGN or V1). The increased sensitivity over time to SC in particular, which reflects scene fragmentation, suggests that this statistic is actively exploited to estimate scene naturalness.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Author contributions: I.I.A.G., S.G., V.A.F.L., and H.S.S. designed research; I.I.A.G. and H.P. performed research; S.G. and H.S.S. contributed unpublished reagents/analytic tools; I.I.A.G. analyzed data; I.I.A.G., S.G., V.A.F.L., and H.S.S. wrote the paper.
ISSN:0270-6474
1529-2401
1529-2401
DOI:10.1523/JNEUROSCI.3128-13.2013