The Free Energy Principle and Active Inference in Cognitive Dynamics
This overview of the free energy principle by Karl Friston delves into how conscious operations are linked to inferring causes of sensations, emphasizing the necessity of probabilistic beliefs about external states. The discussion includes topics like embodied exchange with the world, ergodic systems, active inference, and epistemic behavior emerging from these beliefs.
Download Presentation
Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
E N D
Presentation Transcript
International Research Training Group (IRTG) The Brain in Action I am therefore I think Karl Friston, University College London Abstract: TThis overview of the free energy principle offers an account of embodied exchange with the world that associates conscious operations with actively inferring the causes of our sensations. Its agenda is to link formal (mathematical) descriptions of dynamical systems to a description of perception in terms of beliefs and goals. The argument has two parts: the first calls on the lawful dynamics of any (weakly mixing) ergodic system from a single cell organism to a human brain. These lawful dynamics suggest that (internal) states can be interpreted as modelling or predicting the (external) causes of sensory fluctuations. In other words, if a system exists, its internal states must encode probabilistic beliefs about external states. Heuristically, this means that if I exist (am) then I must have beliefs (think). The second part of the argument is that the only tenable beliefs I can entertain about myself are that I exist. This may seem rather obvious; however, if we associate existing with ergodicity, then (ergodic) systems that exist by predicting external states can only possess prior beliefs that their environment is predictable. It transpires that this is equivalent to believing that the world and the way it is sampled will resolve uncertainty about the causes of sensations. We will conclude by looking at the epistemic behavior that emerges under these beliefs, using simulations of active inference. Key words: active inference autopoiesis cognitive dynamics free energy epistemic value self-organization . .
What does it mean to be embodied? The statistics of life Markov blankets and ergodic systems simulations of a primordial soup The anatomy of inference graphical models and predictive coding canonical microcircuits Action and perception action and its observation simulations of saccadic searches
How can the events in space and time which take place within the spatial boundary of a living organism be accounted for by physics and chemistry? (Erwin Schr dinger 1943) The Markov blanket as a statistical boundary (parents, children and parents of children) Internal states External states Sensory states Active states
The Markov blanket in biotic systems Sensory states = ( , , ) s a + s f s s = ( , , ) s a + f ( , , ) s a f External states Internal states ( , , ) f s a a a Active states
lemma: any (ergodic random) dynamical system (m) that possesses a Markov blanket will appear to engage in active inference = + ( ) f x x ( | p x m ) = ( | p x m ) ( ) f p The Fokker-Planck equation And its solution in terms of curl-free and divergence-free components = = ) ln ( | ( | p x m ) 0 ( ) f x ( ) Q p x m
But what about the Markov blanket? = s a ( , , ) s = = ( ) s ( ) ln ( | p s m ) f Q Perception ( ) ( ) ln ( | p s m ) f s Q Action a a = ln ( | p s m ) Reinforcement learning, optimal control and expected utility theory Value Pavlov = = ln ( | p s m ) F Infomax, minimum redundancy and the free-energy principle Surprise Barlow [ ln ( | = )] E p s m Self-organisation, synergetics and Entropy t homoeostasis Haken = ( | p s m ) Bayesian brain, evidence accumulation and predictive coding Model evidence Helmholtz
Overview The statistics of life Markov blankets and ergodic systems simulations of a primordial soup The anatomy of inference graphical models and predictive coding canonical microcircuits Action and perception action and its observation simulations of saccadic searches
Simulations of a (prebiotic) primordial soup Position Short-range forces Strong repulsion Weak electrochemical attraction
Finding the (principal) Markov blanket Markov blanket matrix encodes the children, parents and parents of children = + + T T B A A A A Markov Blanket = [B [eig(B) > ]] Adjacency matrix Markov Blanket 20 Hidden states 40 60 80 Sensory states Active states Internal states 100 120 20 40 60 80 100 120 Element Does action maintain the structural and functional integrity of the Markov blanket (autopoiesis) ? Do internal states appear to infer the hidden causes of sensory states (active inference) ?
Active lesion Markov blanket 8 8 6 6 4 4 2 2 0 0 -2 -2 -4 -4 -6 -6 -8 -8 -8 -6 -4 -2 0 2 4 6 8 -8 -6 -4 -2 0 2 4 6 8 Position Position Autopoiesis, oscillator death and simulated brain lesions Sensory lesion Internal lesion 8 8 6 6 4 4 2 2 0 0 -2 -2 -4 -4 -6 -6 -8 -8 -8 -6 -4 -2 0 2 4 6 8 -8 -6 -4 -2 0 2 4 6 8 Position Position
Decoding through the Markov blanket and simulated brain activation True and predicted motion 0 Christiaan Huygens Motion of external state -0.1 -0.2 Predictability -0.3 8 6 -0.4 4 100 200 300 400 500 2 Position Time 0 -2 Internal states -4 -6 5 -8 -5 0 5 10 Position Modes 15 20 25 30 100 200 300 400 500 Time
The existence of a Markov blanket necessarily implies a partition of states into internal states, their Markov blanket (sensory and active states) and external or hidden states. Because active states change but are not changed by external states they minimize the entropy of internal states and their Markov blanket. This means action will appear to maintain the structural and functional integrity of the Markov blanket (autopoiesis). Internal states appear to infer the hidden causes of sensory states (by maximizing Bayesian evidence) and influence those causes though action (active inference)
Overview The statistics of life Markov blankets and ergodic systems simulations of a primordial soup The anatomy of inference graphical models and predictive coding canonical microcircuits Action and perception action and it observation simulations of saccadic searches
Objects are always imagined as being present in the field of vision as would have to be there in order to produce the same impression on the nervous mechanism - von Helmholtz Hermann von Helmholtz Richard Gregory Geoffrey Hinton The Helmholtz machine and the Bayesian brain Thomas Bayes Richard Feynman
Objects are always imagined as being present in the field of vision as would have to be there in order to produce the same impression on the nervous mechanism - von Helmholtz Hermann von Helmholtz Richard Gregory Impressions on the Markov blanket s S
Bayesian filtering and predictive coding = = ( ) s ( ) ( , ) F s f Q D prediction update = ( ) s g prediction error
Making our own sensations sensations predictions Prediction error Action Perception Changing sensations Changing predictions
Hierarchical generative models what where A simple hierarchy the v (3) v (3) v (2) (2) v Descending predictions x (2) (2) x (2) (2) x x v (2) v (1) (2) v (1) v x (1) (1) x (1) (1) x x Ascending prediction errors v (1) v (0) (1) v (0) v Sensory fluctuations = D
David Mumford Predictive coding with reflexes Action = (1) a s oculomotor signals reflex arc proprioceptive input pons Perception retinal input Prediction error (superficial pyramidal cells) frontal eye fields geniculate = ( ) i ( 1) ( ) i ( ) i i ( ) g ( ) i Top-down or backward predictions Expectations (deep pyramidal cells) Bottom-up or forward prediction error = ( ) i ( ) i ( ) i ( ) i ( ) i D ( ) i visual cortex
Biological agents minimize their average surprise (entropy) They minimize surprise by suppressing prediction error Prediction error can be reduced by changing predictions (perception) Prediction error can be reduced by changing sensations (action) Perception entails recurrent message passing to optimize predictions Action makes predictions come true (and minimizes surprise)
Overview The statistics of life Markov blankets and ergodic systems simulations of a primordial soup The anatomy of inference graphical models and predictive coding canonical microcircuits Action and perception action and its observation simulations of saccadic searches
Action with point attractors v (2) x (1) v (1) visual input V J = + s (0,0) 1x Descending proprioceptive predictions 1J = ( )) (1) v (1)( ( ) v s a g proprioceptive input 2x x x 1 = + s v (1) = ( , v v v , ) 2J V 1 2 3 2 = s (1) v a a
Action with itinerant attractors Heteroclinic cycle (central pattern generator) (1) x (2) x Descending proprioceptive predictions action observation 0.4 0.6 position (y) 0.8 1 1.2 1.4 0 0.2 0.4 0.6 0.8 1 1.2 1.4 0 0.2 0.4 0.6 0.8 1 1.2 1.4 position (x) position (x) = s (1) v a a
Overview The statistics of life Markov blankets and ergodic systems simulations of a primordial soup The anatomy of inference graphical models and predictive coding canonical microcircuits Action and perception action and its observation simulations of saccadic searches
= [ ln ( | , ) ln ( m | ) ln ( | u + ln ( , )] q ( , ) F s ) E p s p p u m u q Entropy Likelihood Empirical priors Prior beliefs Free energy Energy I am [ergodic] therefore I think [I will minimise free energy] ( ) = + | ) ln ( u ln | [ln ( | ) ln ( | )] u p u m E p s p q , ( | ) u q s Expected energy Expected entropy = + [ln ( | )] [ [ ( D q | , )|| ( s u | )]] u q s u E p s m q s u E q ( | ) ( | ) Epistemic value or information gain Extrinsic value KL or risk-sensitive control Bayesian surprise and Infomax Expected utility theory In the absence of ambiguity: In the absence of prior beliefs about outcomes: In the absence of uncertainty or risk: = [ [ ( E D q | , )|| ( s u | )]] u q = = = [ln ( | )] E P s m [ln ( [ ( D q | ) u ln ( | )] u E p q q q q | )|| ( u | )] u p Bayesian surprise Extrinsic value = [ ( , | )|| ( u | ) ( u Q s | )] u D q s q Predicted divergence Predicted mutual information
= [ ln ( | , ) ln ( m | ) ln ( | u + ln ( , )] q ( , ) F s ) E p s p p u m u q Entropy Likelihood Empirical priors Prior beliefs Free energy Energy I am [ergodic] therefore I think [I will minimise free energy] ( ) = = ln | [ [ ( D q | , )|| ( s u | )]] u ( ) u p u m E q ( | ) q s u Epistemic value or information gain ( ) t ( ) s t S ?(? Extrinsic value stimulus visual input salience sampling Sampling the world to resolve uncertainty
Parietal (where) Frontal eye fields , x p x v p v x Visual cortex , v q qs x Pulvinar salience map Fusiform (what) , x q ( ) u x p a , v p oculomotor reflex arc Superior colliculus ps
Saccadic eye movements Saccadic fixation and salience maps Action (EOG) 2 Hidden (oculomotor) states 0 -2 200 400 600 800 1000 1200 1400 time (ms) Visual samples Posterior belief 5 vs. vs. Conditional expectations about hidden (visual) states 0 -5 200 400 600 800 1000 1200 1400 time (ms) And corresponding percept
Hermann von Helmholtz Each movement we make by which we alter the appearance of objects should be thought of as an experiment designed to test whether we have understood correctly the invariant relations of the phenomena before us, that is, their existence in definite spatial relations. The Facts of Perception (1878) in The Selected Writings of Hermann von Helmholtz, Ed. R. Karl, Middletown: Wesleyan University Press, 1971 p. 384
Thank you And colleagues: And thanks to collaborators: Micah Allen Felix Blankenburg Andy Clark Peter Dayan Ray Dolan Allan Hobson Paul Fletcher Pascal Fries Geoffrey Hinton James Hopkins Jakob Hohwy Mateus Joffily Henry Kennedy Simon McGregor Read Montague Tobias Nolte Anil Seth Mark Solms Paul Verschure Rick Adams Ryszard Auksztulewicz Andre Bastos Sven Bestmann Harriet Brown Jean Daunizeau Mark Edwards Chris Frith Thomas FitzGerald Xiaosi Gu Stefan Kiebel James Kilner Christoph Mathys J r mie Mattout Rosalyn Moran Dimitri Ognibene Sasha Ondobaka Will Penny Giovanni Pezzulo Lisa Quattrocki Knight Francesco Rigoli Klaas Stephan Philipp Schwartenbeck And many others
Time-scale Free-energy minimisation leading to 3 10 s Perception and Action: The optimisation of neuronal and neuromuscular activity to suppress prediction errors (or free- energy) based on generative models of sensory data. 0 10 s Learning and attention: The optimisation of synaptic gain and efficacy over seconds to hours, to encode the precisions of prediction errors and causal structure in the sensorium. This entails suppression of free-energy over time. 3 10 s Neurodevelopment: Model optimisation through activity- dependent pruning and maintenance of neuronal connections that are specified epigenetically 6 10 s Evolution: Optimisation of the average free-energy (free-fitness) over time and individuals of a given class (e.g., conspecifics) by selective pressure on the epigenetic specification of their generative models. 15 10 s
Searching to test hypotheses life as an efficient experiment = + ( , ) ( | [ ln ( ( )| p s t ) ( | ) )] H S H S m E H S = + = [ ( | ( ))] s t m E H S t t Free energy principle minimise uncertainty = | , )]} ( ) t argmin{ [ ( H q