The above analysis implicitly assumes that the minimum of the cos

The above analysis implicitly assumes that the minimum of the cost function over the allowed range of weights corresponds to a local minimum, so that the first derivative is zero and the second derivatives characterize deviations from the minimum. However, because Dale’s law constrains

the weights to be strictly nonnegative or nonpositive, the best-fit parameters can occur on the boundary of the permitted set of weights. In such cases, we also ATM Kinase Inhibitor chemical structure computed the gradient of the cost function to determine the direction of greatest sensitivity to infinitesimal changes in weights. However, for changes in weights large enough to lead to noticeable mistuning, the increase in the cost function due to linear changes along the gradient direction were much smaller than the quadratic changes determined by the sensitivity matrix (Figure S6G). In addition, because the gradient vector reflected weights that were prevented by Dale’s law from changing signs, its direction corresponded to increasing magnitudes of all zero-valued weights and therefore overlapped with eigenvector 2. Thus, for the circuits analyzed here, the gradient provided little additional information beyond that provided by the sensitivity matrix. This work was supported by NSF grant IIS-1208218-0 (M.S.G., E.R.F.A.), NIH grant

R01 MH069726 (M.S.G.), a Sloan Foundation Research Tofacitinib order Fellowship (M.S.G.), a Burroughs Wellcome Collaborative Research Travel Grant (M.S.G.), a UC Davis Ophthalmology Research to Prevent Blindness grant (M.S.G.), a Wellesley College Brachmann-Hoffman Fellowship (M.S.G.), a Burroughs Wellcome Career Award at the Scientific Interface (E.R.F.A.), and the Searle Scholars program (E.R.F.A.). We thank Guy Major, Jennifer Raymond, Sukbin Lim, Andrew Miri, Brian Mulloney, Michael Wright, Melanie Lee, and Jochen Ditterich TCL for helpful comments on this work and Melanie Lee for computational assistance. “
“We choose between objects based on their values, which we learn from past experience with rewarding consequences (Awh et al., 2012 and Chelazzi et al., 2013).

The values of some objects change flexibly, and we have to search valuable objects based on their consequent outcome (Barto, 1994, Dayan and Balleine, 2002, Padoa-Schioppa, 2011 and Rolls, 2000). On the other hand, the values of some other objects remain unchanged, and we have to choose the valuable objects based on the long-term memory. Since the stable value formed by repetitive experiences is reliable, we may consistently choose the object regardless of the outcome (Ashby et al., 2010, Balleine and Dickinson, 1998, Graybiel, 2008, Mishikin et al., 1984 and Wood and Neal, 2007). Both flexible and stable value-guided behaviors are critical to choose the valuable objects efficiently. If we rely only on flexible values, we would always have to make an effort to find valuable objects by trial and error.

Comments are closed.