In this section we leave neural networks and turn to a theoretical analysis of differential correlations. We analyze information when there is a ‘pure’ $f'f'^T$ component and, just as importantly, when that component is not pure. We show that in the former case information saturates with $N$; in the latter case it doesn't. We also show, somewhat surprisingly, that the optimal decoder doesn't need to know about the $f'f'^T$ component of the correlations. In the Supplementary Modeling, we provide further insight into differential correlations by expressing them in terms of the eigenvectors and eigenvalues of the covariance matrix, and we use that analysis to understand why, and when, it's hard to accurately estimate Fisher information.
Here we ask how the linear Fisher information scales with the number of neurons, $N$, when the covariance matrix contains a pure $f'f'^T$ component (the second term in equation (31)). Our starting point is a covariance matrix, $\Sigma_0(s)$, that doesn't necessarily contain an $f'f'^T$ component. As in equation (3), the (linear) Fisher information associated with $\Sigma_0(s)$, denoted $I_0$, is given by
$$I_0 = f'(s)^T \Sigma_0^{-1}(s)\, f'(s) \tag{29}$$
where, as usual, $f(s)$ is a vector of tuning curves,
$$f(s) \equiv \left(f_1(s), f_2(s), \ldots, f_N(s)\right)^T \tag{30}$$
and a prime denotes a derivative with respect to $s$. Note that the information also depends on the stimulus, $s$; we suppress that dependence for clarity. To add a pure $f'f'^T$ component, we define a new covariance matrix, $\Sigma_\varepsilon(s)$, via
$$\Sigma_\varepsilon(s) = \Sigma_0(s) + \varepsilon f'(s) f'^T(s) \tag{31}$$
The new information, denoted $I_\varepsilon$, is given by
$$I_\varepsilon = f'(s)^T \Sigma_\varepsilon^{-1}(s)\, f'(s) \tag{32}$$
To compute $I_\varepsilon$, we need the inverse of $\Sigma_\varepsilon$. As is easy to verify, this inverse is given by
$$\Sigma_\varepsilon^{-1}(s) = \Sigma_0^{-1}(s) - \frac{\varepsilon}{1+\varepsilon I_0}\, \Sigma_0^{-1}(s) f'(s) f'^T(s)\, \Sigma_0^{-1}(s) \tag{33}$$
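This is an instance of the Sherman–Morrison formula for a rank-one update. To check it, multiply the proposed inverse by $\Sigma_\varepsilon(s)$; using $f'^T \Sigma_0^{-1} f' = I_0$, the rank-one terms cancel:

$$\left(\Sigma_0 + \varepsilon f' f'^T\right)\left(\Sigma_0^{-1} - \frac{\varepsilon}{1+\varepsilon I_0}\, \Sigma_0^{-1} f' f'^T \Sigma_0^{-1}\right) = I + \left(\varepsilon - \frac{\varepsilon}{1+\varepsilon I_0}\left(1 + \varepsilon I_0\right)\right) f' f'^T \Sigma_0^{-1} = I$$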
Inserting equation (33) into (32), we arrive at
$$I_\varepsilon = I_0 - \frac{\varepsilon I_0^2}{1+\varepsilon I_0} = \frac{I_0}{1+\varepsilon I_0} \tag{34}$$
which is equation (5).
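As a sanity check, equation (34) is easy to confirm numerically. The sketch below uses an arbitrary positive-definite $\Sigma_0$ and a random $f'$ (illustrative assumptions, not the model analyzed here); the directly computed $I_\varepsilon$ matches $I_0/(1+\varepsilon I_0)$ and stays below the ceiling $1/\varepsilon$ as $N$ grows:

```python
import numpy as np

def linear_fisher(f_prime, Sigma):
    """Linear Fisher information, f'^T Sigma^{-1} f'."""
    return f_prime @ np.linalg.solve(Sigma, f_prime)

rng = np.random.default_rng(0)
eps = 0.01  # strength of the f'f'^T component; 1/eps = 100 is the information ceiling

for N in (50, 200, 800):
    f_prime = rng.normal(size=N)      # illustrative tuning-curve derivatives
    A = rng.normal(size=(N, N))
    Sigma0 = A @ A.T / N + np.eye(N)  # arbitrary positive-definite Sigma_0
    I0 = linear_fisher(f_prime, Sigma0)

    # Add the pure f'f'^T component of equation (31) and recompute directly
    I_eps = linear_fisher(f_prime, Sigma0 + eps * np.outer(f_prime, f_prime))

    # Direct computation matches equation (34); both saturate below 1/eps
    print(N, I_eps, I0 / (1 + eps * I0))
```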
Perhaps surprisingly, although $f'f'^T$ correlations have a critical role in determining information, they are irrelevant for decoding, in the sense that they have no effect on the locally optimal linear estimator. To see this explicitly, note first of all that the locally optimal linear estimator, denoted $w^T$, generates an estimate of the stimulus near some particular value, $s_0$, by linearly operating on neural activity,
$$\hat{s} = s_0 + w^T \left(r - f(s_0)\right) \tag{35}$$
In the presence of the covariance matrix given in equation (31), the optimal weight, $w_{\mathrm{opt}}^T$, is given by
$$w_{\mathrm{opt}}^T = \frac{f'^T \left(\Sigma_0 + \varepsilon f' f'^T\right)^{-1}}{f'^T \left(\Sigma_0 + \varepsilon f' f'^T\right)^{-1} f'} \tag{36}$$
where we have dropped, for clarity, the explicit dependence on $s_0$. Using equation (33), this reduces to
$$w_{\mathrm{opt}}^T = \frac{f'^T \Sigma_0^{-1}}{f'^T \Sigma_0^{-1} f'} \tag{37}$$
Thus, the locally optimal linear decoder does not need to know the size of the $f'f'^T$ correlations.
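This independence is easy to confirm numerically: in the sketch below (again with an illustrative random $\Sigma_0$ and $f'$), the weights computed from equation (36) with $\varepsilon > 0$ coincide with those from equation (37):

```python
import numpy as np

def w_opt(Sigma, f_prime):
    """Locally optimal linear readout, w = Sigma^{-1} f' / (f'^T Sigma^{-1} f')."""
    v = np.linalg.solve(Sigma, f_prime)
    return v / (f_prime @ v)

rng = np.random.default_rng(1)
N, eps = 100, 0.05
f_prime = rng.normal(size=N)
A = rng.normal(size=(N, N))
Sigma0 = A @ A.T / N + np.eye(N)                       # illustrative Sigma_0
Sigma_eps = Sigma0 + eps * np.outer(f_prime, f_prime)  # add the f'f'^T component

# The weights are identical: the decoder is blind to the size of eps
print(np.allclose(w_opt(Sigma0, f_prime), w_opt(Sigma_eps, f_prime)))  # True
```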
In hindsight this makes sense: $f'f'^T$ correlations shift the hill of activity, and there is, quite literally, nothing any decoder can do about this. This suggests that these correlations are in some sense special. To determine just how special, we ask what happens when we add correlations in a different direction, say correlations of the form $uu^T$, where $u$ is not parallel to $f'$. In that case, the covariance matrix becomes (with a normalization added for convenience only)
$$\Sigma_u(s) = \Sigma_0(s) + \varepsilon\, \frac{f'(s)^T \Sigma_0^{-1}(s)\, f'(s)}{u^T \Sigma_0^{-1}(s)\, u}\, u u^T \tag{38}$$
Repeating the steps leading to equation (34), we find that
$$I_u \equiv f'(s)^T \Sigma_u^{-1}(s)\, f'(s) = I_0 \sin^2\theta + \frac{I_0 \cos^2\theta}{1+\varepsilon I_0} \tag{39}$$
where $I_0$ is defined in equation (29) and
$$\cos\theta \equiv \frac{f'(s)^T \Sigma_0^{-1}(s)\, u}{\left[\left(f'(s)^T \Sigma_0^{-1}(s)\, f'(s)\right)\left(u^T \Sigma_0^{-1}(s)\, u\right)\right]^{1/2}} \tag{40}$$
Whenever $\theta \neq 0$, meaning $u$ is not parallel to $f'(s)$, the first term in equation (39) is proportional to $I_0$, so information does not saturate as $N$ goes to infinity. Thus, in the large $N$ limit, $f'(s)f'(s)^T$ correlations are the only ones that cause saturation.
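Equation (39) can likewise be checked numerically. In the sketch below (illustrative random choices of $\Sigma_0$, $f'$ and $u$, as before), the directly computed $I_u$ matches $I_0\sin^2\theta + I_0\cos^2\theta/(1+\varepsilon I_0)$ and keeps growing with $N$:

```python
import numpy as np

rng = np.random.default_rng(2)
eps = 0.01

for N in (100, 400, 1600):
    f_prime = rng.normal(size=N)
    u = rng.normal(size=N)                    # generically not parallel to f'
    A = rng.normal(size=(N, N))
    Sigma0 = A @ A.T / N + np.eye(N)          # illustrative positive-definite Sigma_0

    S0f = np.linalg.solve(Sigma0, f_prime)    # Sigma_0^{-1} f'
    S0u = np.linalg.solve(Sigma0, u)          # Sigma_0^{-1} u
    I0 = f_prime @ S0f
    uSu = u @ S0u
    cos2 = (f_prime @ S0u) ** 2 / (I0 * uSu)  # cos^2(theta), from equation (40)

    # Covariance with a uu^T component, normalized as in equation (38)
    Sigma_u = Sigma0 + eps * (I0 / uSu) * np.outer(u, u)
    I_u = f_prime @ np.linalg.solve(Sigma_u, f_prime)

    # Direct computation agrees with equation (39), and I_u keeps growing with N
    print(N, I_u, I0 * (1 - cos2) + I0 * cos2 / (1 + eps * I0))
```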