Beyond [script l].sub.1 sparse coding in V1
Growing evidence indicates that only a sparse subset from a pool of sensory neurons is active for the encoding of visual stimuli at any instant in time. Traditionally, to replicate such biological sparsity, generative models have been using the [script l].sub.1 norm as a penalty due to its convexity...
Saved in:
| Published in | PLoS computational biology Vol. 19; no. 9; p. e1011459 |
|---|---|
| Main Authors | , , , |
| Format | Journal Article |
| Language | English |
| Published |
Public Library of Science
12.09.2023
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 1553-734X |
| DOI | 10.1371/journal.pcbi.1011459 |
Cover
| Summary: | Growing evidence indicates that only a sparse subset from a pool of sensory neurons is active for the encoding of visual stimuli at any instant in time. Traditionally, to replicate such biological sparsity, generative models have been using the [script l].sub.1 norm as a penalty due to its convexity, which makes it amenable to fast and simple algorithmic solvers. In this work, we use biological vision as a test-bed and show that the soft thresholding operation associated to the use of the [script l].sub.1 norm is highly suboptimal compared to other functions suited to approximating [script l].sub.p with 0 [less than or equal to] p < 1 (including recently proposed continuous exact relaxations), in terms of performance. We show that [script l].sub.1 sparsity employs a pool with more neurons, i.e. has a higher degree of overcompleteness, in order to maintain the same reconstruction error as the other methods considered. More specifically, at the same sparsity level, the thresholding algorithm using the [script l].sub.1 norm as a penalty requires a dictionary of ten times more units compared to the proposed approach, where a non-convex continuous relaxation of the [script l].sub.0 pseudo-norm is used, to reconstruct the external stimulus equally well. At a fixed sparsity level, both [script l].sub.0 - and [script l].sub.1 -based regularization develop units with receptive field (RF) shapes similar to biological neurons in V1 (and a subset of neurons in V2), but [script l].sub.0 -based regularization shows approximately five times better reconstruction of the stimulus. Our results in conjunction with recent metabolic findings indicate that for V1 to operate efficiently it should follow a coding regime which uses a regularization that is closer to the [script l].sub.0 pseudo-norm rather than the [script l].sub.1 one, and suggests a similar mode of operation for the sensory cortex in general. |
|---|---|
| ISSN: | 1553-734X |
| DOI: | 10.1371/journal.pcbi.1011459 |