Events
Statistics Seminar - Scalable Estimation of Probit Models with Crossed Random Effects
Location: MB0.07
by Professor Cristiano Varin (Ca’ Foscari University of Venice, Italy)
Abstract: Crossed random effects structures arise in many scientific contexts. They raise severe computational problems with likelihood and Bayesian computations scaling like N^(3/2) or worse for N data points. In this paper we develop a composite likelihood approach for crossed random effects probit models. For data arranged in R rows and C columns, the likelihood function includes a very difficult R + C dimensional integral. The composite likelihood we develop uses the marginal distribution of the response along with two hierarchical models. The cost is reduced to O(N) and it can be computed with R + C one-dimensional integrals. We find that the commonly used Laplace approximation has a cost that grows superlinearly. We get consistent estimates of the probit slope and variance components from our composite likelihood algorithm. We also show how to estimate the covariance of the estimated regression coefficients. The algorithm scales readily to a data set of five million observations from Stitch Fix with R + C > 700,000.
This is joint work with Ruggero Bellio (Udine), Swarnadip Ghosh (Stanford) and Art B. Owen (Stanford).