Masters Thesis 2020

3 minute read

Date:

This post is another retrospective, but instead of a conference or journal paper it takes a look at my masters thesis, titled “Modelling Learning and Decision Making Under Information Processing Constraints”. This blog post will go through the begninning stages of the project and how it ultimately narrowed down the focus of the thesis and project into what it eventually became.

Very early on in my PhD, I became interested my advisor Chris Sims’ previous work using infromation theory, specifically mutual information, as a tool to understand the cognitive costs of behaviour in learning and decision making tasks. While this may seem somewhat narrow, there are many possible applications of information theory in this way, and a wide range of psychological experimentation that has been done looking into modelling constrained cognition. We became interested in bandit learning tasks due to their simplicity, long history, and the publically available datasets with human participant responses. Since this was early on in my PhD and a while before I would eventually propose my own experiment to run, it made sense to test the ideas I had on another similar task that could guide my research in my PhD.

This inspired the project I worked on that would become an abstract paper in the Reinforcement Learning and Decision Making conference, and a slight extension of that paper is essentially the final section of my masters thesis. However, when I began the work in writing up my thesis, I realized I had become interested in the decision making setting during my economic modelling course. A large portion of this course was modelling decision making under risk and uncertainty, which is closely related to the decision making that takes place when learning in the bandit setting. I was wondering if the same concepts of mutual information and behavioural complexity could be used in this slightly different setting.

The key difference in the decision making setting is that the outcome utiltiies and probabilities that determine optimal behaviour are given directly, instead of being learned through experience as in the learning setting. This second interest and application ended up being roughly a third to a half of the content in my thesis, as it required a long background on decision making in various settings, which are some of the oldest and most well studied phenomenon in cognitive science. Taking a look back at my thesis, I realize it may seem like a lot of extra work to include only a slightly different phenomenon, but I am glad I did include it as it allowed me to relate and contrast my understanding of cognition with similar accounts from a wide variety of cognitive modells.

Broadly this experience in extending my ideas into related domains taught me that it is important to show as broad an application as possible, or at least as you find interesting. Often these models and approaches are applied to a very small domain which can make it difficult to relate to human cognition as we know it is extremely broad in its application. Additionally I was able to expand on my knowledge of cognitive modelling methods in similar domains as the ones I had experience in, which I am similarly greatful for.