PhD Defence • Artificial Intelligence | Autonomous Vehicles • Empirical Game Theoretic Models for Autonomous Driving: Methods and Applications | Cheriton School of Computer Science

Please note: This PhD defence will take place in DC 3317 and online.

Atrisha Sarkar, PhD candidate
David R. Cheriton School of Computer Science

Supervisor: Professor Krzysztof Czarnecki

In recent years, there has been enormous public interest in autonomous vehicles (AV), with more than 80 billion dollars invested in self-driving car technology. However, for the foreseeable future, self-driving cars will interact with human driven vehicles and other human road users, such as pedestrians and cyclists. Therefore, in order to ensure safe operation of AVs, there is need for computational models of humans traffic behaviour that can be used for testing and verification of autonomous vehicles. Game theoretic models of human driving behaviour is a promising computational tool that can be used in many phases of AV development. However, traditional game theoretic models are typically built around the idea of rationality, i.e., selection of the most optimal action based on individual preferences. In reality, not only is it hard to infer diverse human preferences from observational data, but real-world traffic shows that humans rarely choose the most optimal action that a computational model suggests. Through the lens of behavioural game theory, my thesis bridges the gap between observational naturalistic behaviour and game theory to create models of traffic behaviour that can have versatile applications in AV development, including testing, verification, and motion planning.

The first part of the thesis makes a set of methodological contributions towards creating models of traffic behaviour from naturalistic datasets using behavioural game theory. Based on the structure of a hierarchical game, the thesis first presents various design choices available in the construction of a game, and evaluates the models based on naturalistic data. Next, the thesis focuses on construction of agent utilities. Driving is a multi-objective task, and humans aggregate objectives of safety and progress in a context and individual specific manner. It is challenging to infer the parameters of multiobjective utility aggregation solely from observations because of a number of unobserved variables. Based on the concept of rationalizability, the thesis develops algorithms for estimating multiobjective aggregation parameters for two aggregation methods, weighted and satisficing aggregation, and two reasoning processes, strategic and nonstrategic.

In the final methodological contribution, the thesis addresses two key challenges of building traffic behaviour models using dynamic games; model instability and model uncertainty. Model instability arises when a class of boundedly rational agents have no reason to adhere to elementary models over time in the game. The thesis addresses this problem by developing a nonstrategic yet sophisticated finite-state transducer-based model of level-0 behaviour within the level-k framework. Model uncertainty arises when agents are free to follow any model of reasoning as is often the case in naturalistic data. This problem is addressed by developing a generalised cognitive hierarchy model consisting of three layers, nonstrategic, strategic, and robust. Each layer can hold multiple behaviour models, and the chapter develops solutions for heterogeneous models based on the consistency of beliefs over observations.

Building on the game theoretic models, the second part of the thesis demonstrates the application of the models by developing novel safety validation methodologies for testing AV planners. The first application is based on a Quantal Best Response model to create interpretable variations of lane change behaviour. The proposed model is shown to be effective for generating both rare-event situations and to replicate the typical behaviour distribution observed in naturalistic data. The second application is safety validation of strategic planners in situations of dynamic occlusion. Using the concept of hypergames, in which different agents have different views of the game, the thesis develops a new safety surrogate metric, dynamic occlusion risk (DOR), that can be used to evaluate the risk associated with each action in situations of dynamic occlusion. The thesis concludes with a taxonomy of strategic interactions that maps complex design specific strategies in a game to a simpler taxonomy of traffic interactions. Regulations around what strategies an AV should execute in traffic can be developed over the simpler taxonomy, and a process of automated mapping can protect the proprietary design decisions of an AV manufacturer.

Location Information

Location Address: DC - William G. Davis Computer Research Centre
200 University Avenue West
DC 3317 | Online PhD defence
Waterloo, ON, CA N2L 3G1

Location coordinates: