Skip to main content Skip to navigation

CRiSM colloquium

CRiSM colloquium

Spring 2024/25

Prof. Nicolas Chopin

Nicolas is Professor of Data Sciences/Statistics/Machine Learning at ENSAE, Institut Polytechnique de Paris. He is a world-renowned expert on Bayesian computation, and author of the book "An introduction to Sequential Monte Carlo", with Omiros Papaspiliopoulos.

Saddlepoint Monte Carlo and its Application to Exact Ecological Inference

Assuming X is a random vector and A a non-invertible matrix, one sometimes need to perform inference while only having access to samples of Y = AX. The corresponding likelihood is typically intractable. One may still be able to perform exact Bayesian inference using a pseudo-marginal sampler, but this requires an unbiased estimator of the intractable likelihood.

We propose saddlepoint Monte Carlo, a method for obtaining an unbiased estimate of the density of Y with very low variance, for any model belonging to an exponential family. Our method relies on importance sampling of the characteristic function, with insights brought by the standard saddlepoint approximation scheme with exponential tilting. We show that saddlepoint Monte Carlo makes it possible to perform exact inference on particularly challenging problems and datasets. We focus on the ecological inference problem, where one observes only aggregates at a fine level. We present in particular a study of the carryover of votes between the two rounds of various French elections, using the finest available data (number of votes for each candidate in about 60,000 polling stations over most of the French territory).

We show that existing, popular approximate methods for ecological inference can lead to substantial bias, which saddlepoint Monte Carlo is immune from. We also present original results for the 2024 legislative elections on political centre-to-left and left-to-centre conversion rates when the far-right is present in the second round. Finally, we discuss other exciting applications for saddlepoint Monte Carlo, such as dealing with aggregate data in privacy or inverse problems.

joint work with: Théo Voldoire (Harvard), Guillaume Rateau (INSEE) and Robin J. Ryder (Imperial)

Details

The talk will take place on Tuesday 18th March at 11am in MS.02. Lunch will be provided afterwards in the The Street (the Zeeman Atrium).

Autumn 2024/25

Prof. Sofia Olhede

Sofia is Chair of Statistical Data Science at EPFL. She is a Fellow of the IMS for "seminal contributions to the theory and application of large and heterogeneous networks, random fields and point process, for advancing research in data science”.

Modelling multiplex networks

Multiple networks are complex and highly non-Euclidean objects. This talk will focus on what is known as multiplex networks, representing multiple interaction between the same set of objects, but where the interactions may be of several different types. We shall discuss how a graph limit framework can be used for such objects, and how to naturally characterise the complexity of the whole system of interactions. This characterisation will be used on complexity measures introduced in information theory, and the notion of entropy. This gives us a tool to generally think about systems of relationship.

Details

The talk will take place on Monday 9th December at 11am in MS.03. Lunch will be provided afterwards in the The Street (the Zeeman Atrium).