Department of Statistics

Seminars

»: WMFM! (What’s my fitted model), James Curran

»: (i) Estimation of gillnet selectivity and population length & (ii) Gauss Is Not Mocked: less powerful multiparameter tests, Russell Millar and Thomas Lumley

»: Past seminars

Seminar Information

: Map (303 & 303S) bldg
: RSS Feed
: Calendar
: Mailing list

All upcoming seminars hosted by the Department of Statistics are posted on the stat-seminar@list.science. mailing list by the list moderators.

WMFM! (What’s my fitted model)

Speaker: James Curran

Affiliation: UoA

When: Thursday, 16 April 2026, 11:00 am to 12:00 pm

Where: 303-310

Abstract: It this talk I will demonstrate an application that I have been working on over the last two months to help STATS 20x students understand a particular concept. The aims of this project were fairly low, but the technologies and insights I have discovered along the way – especially with respect to large language models have been extremely revealing and much larger than I could have ever anticipated. These discoveries, I hope, will let us take a hard look at what we are teaching and what we think is important. I hope to show you some really cool stuff too.

I am aiming to have a combination of talking, demonstration and some serious discussion.

(i) Estimation of gillnet selectivity and population length & (ii) Gauss Is Not Mocked: less powerful multiparameter tests

Speaker: Russell Millar and Thomas Lumley

Affiliation: UoA

When: Thursday, 30 April 2026, 11:00 am to 12:00 pm

Where: 303-310

(i) Estimation of gillnet selectivity and population length frequencies

Abstract: The size selectivity of gillnets can be estimated using the catch data from experimental fishing of gangs of net panels having different mesh sizes. Gillnet selectivity curves can take a wide variety of shapes, and currently their estimation requires consideration of several different parametric forms, with right-skewed or bimodal curves typically being preferred. Here it is shown that the generalized additive model (GAM) framework provides a convenient and more flexible alternative. The GAM approach also generalizes the scope of analysis by permitting the population length frequencies of encounter to be jointly estimated as a smooth function of length. Moreover, the GAM framework allows for the inclusion of covariates such as sex or a condition index, and accommodates hierarchical sampling designs and spatial or temporal effects. Relative fishing power of the different sized meshes can also be included, notwithstanding that care with non-identifiability is required.The ease of use of the GAM approach is demonstrated on previously published lake trout data.

(ii) Gauss Is Not Mocked: less powerful multiparameter tests

Abstract: Usually it doesn't make sense to describe one test as more powerful than another -- different tests are just powerful against different alternatives. In survey statistics there are two approaches to creating multiparameter tests: weighting using the precision matrix of the parameter estimates and weighting using what would be the precision matrix of the parameter estimates with complete data. The latter are the Rao-Scott tests; the former I will call intrinsically-weighted tests. Keiran Shao's MSc thesis surprisingly found the Rao-Scott tests to be less powerful in all the examples he simulated. I will talk about whether "the Rao-Scott tests are less powerful" is (a) meaningful and (b) true. The Gauss-Markov theorem is not the answer, but it is in the vicinity of the answer.

Top

Past seminars

»: (i) Adaptive designs and (ii) applications of TabPFN in statistics and AI, Dennis Christensen

»: A flexible model for dynamic networks of stochastic size, Duncan Clark

»: Measures for adjusting for morbidity load for use with large administrative health datasets, James Stanley

»: Understanding Risk Factors in Young-Onset Dementia: A Bayesian Approach for Small Populations, Javier Cano

»: Finite Mixtures of CUB Models for the Analysis of Consumer Perceptions on Sustainable Made in Italy, Matteo Ventura

»: Novel methods for making Māori health data relevant to local decision-making, Tori Diamond

»: Radial Basis Operator Networks, Jason Kurz

»: A unified approach to penalized likelihood covariance estimation in high dimensions, Prof. Alberto Roverato

»: Betting on Better Models, Prof Mike West

»: Assessment of vaccine safety and effectiveness using a global data network: a statistical perspective in the context of COVID-19 pandemic, Han Lu

Previous year seminars:
2026 |2025 |2024 |2023 |2022 |2021 |2020 |2019 |2018 |2017 |2016 |2015 |2014 |2013 |2012 |2011 |2010 |2009 |2008 |2007 |2006 |2005 |2004 |

(i) Adaptive designs and (ii) applications of TabPFN in statistics and AI

Speaker: Dennis Christensen

Affiliation: Norwegian Defence Research Establishment

When: Monday, 16 March 2026, 12:00 pm to 1:00 pm

Where: 303-310

This presentation has two parts. In the first, I will discuss some of the challenges of working with adaptive designs. These are experimental designs in which the next input may depend on the data collected up to that point, breaking independence between observations. As a result, the large-sample theory of adaptive designs is significantly more difficult than in the i.i.d. case, and asymptotic normality must often be verified on a case-by-case basis. I will present open problems concerning the asymptotic properties of designs currently used in the energetics industry and research, developed to test the sensitivity of explosives.

The second part of the talk focuses on applications of TabPFN. Introduced in January last year, with a major update in October, TabPFN is a foundation model for tabular data that outperforms state-of-the-art methods such as XGBoost on many regression and classification tasks. Unlike traditional machine learning approaches, it requires no fine-tuning or additional training: TabPFN is pre-trained on an enormous corpus of synthetic datasets designed to capture nonlinear relationships in tabular problems. In addition to outlining ideas for future use, I will present one application in which we use TabPFN to improve estimates of conditional Shapley values in explainable AI.

About the speaker: Dennis Christensen is a researcher at the Norwegian Defence Research Establishment (FFI), visiting the University of Auckland from early January until mid-April. His research focuses on statistical aspects of sensitivity testing of energetic materials., with particular focus on explosive remnants of war and dumped ammunition. He completed his PhD at the University of Oslo in 2024.

A flexible model for dynamic networks of stochastic size

Speaker: Duncan Clark

Affiliation: Williams College

When: Thursday, 22 January 2026, 12:00 pm to 1:00 pm

Where: 303-310

Abstract: We propose a novel modeling framework for time-evolving networks allowing for long-term dependence in network features that update in continuous time. Dynamic network growth is functionally parameterized via the conditional intensity of a marked point process. This characterization enables flexible modeling of both the time of updates and the network updates themselves, dependent on the entire left-continuous sample path. We propose a path-dependent nonlinear marked Hawkes process as an expressive platform for modeling such data; its dynamic mark space embeds the time-evolving network. We establish stability conditions, demonstrate simulation and subsequent feasible likelihood-based inference through numerical study, and illustrate the methodology with an application to conference attendee social network data. The resulting methodology serves as a general framework that can be readily adapted to a wide range of network topologies and point process model specifications.

Measures for adjusting for morbidity load for use with large administrative health datasets

Speaker: James Stanley

Affiliation: University of Otago

When: Friday, 21 November 2025, 11:00 am to 12:00 pm

Where: 303-310

In this talk I will discuss development and validation of several related measures for measuring morbidity load for long-term health conditions using administrative health datasets, particularly tailored to Aotearoa New Zealand’s National Collections datasets for hospital discharge and pharmaceutical dispensing.

These tools were designed to replace older measures (e.g. Charlson index, from 1987) to measure morbidity load in clinical/population level health research. They are intended to provide adjustment forlong-term condition profiles in analysis scenarios where comorbidity needs to be considered (e.g. looking at equity of cancer outcomes when patient groups differ substantively on comorbidity profiles) or there needs to be some more general adjustment for burden from long-term conditions (e.g. measuring variation in ambulatory sensitive hospitalisation rates across Primary Healthcare Organisations).

Understanding Risk Factors in Young-Onset Dementia: A Bayesian Approach for Small Populations

Speaker: Javier Cano

Affiliation: Universidad Rey Juan Carlos

When: Wednesday, 19 November 2025, 11:00 am to 12:00 pm

Where: 303-310

Young-onset dementia (YOD), defined by onset before 65 years, remains poorly characterised in its clinical progression and care pathways. Understanding its natural history is crucial to develop age-appropriate care models, as most individuals with YOD eventually enter facilities designed for older adults. Using a Bayesian survival framework, this study analyses a Waikato (New Zealand) patient cohort to describe \textit{long-term care admission, mortality, and health trajectories before and after institutionalisation}, offering a robust alternative to traditional frequentist approaches.

Finite Mixtures of CUB Models for the Analysis of Consumer Perceptions on Sustainable Made in Italy

Speaker: Matteo Ventura

Affiliation: University of Brescia

When: Wednesday, 10 September 2025, 11:00 am to 12:00 pm

Where: 303-310

Abstract: Understanding consumer perceptions is crucial when dealing with complex topics such as sustainability and the value of Made in Italy. Rating data are often collected through surveys to capture these attitudes, but their analysis requires specific statistical tools. In this seminar, I will provide an overview of the CUB (Combination of a Uniform and a shifted Binomial) model, specifically proposed for the analysis of rating data. After introducing the main aspects of the CUB model and the CUB family, I will focus on recent developments in model-based clustering of ordinal data within this framework. The methodology will be illustrated through a case study conducted in the context of the MICS (Made in Italy - Circular e Sustainable) project, where consumers’ perceptions regarding sustainability and Made in Italy were investigated.

Novel methods for making Māori health data relevant to local decision-making

Speaker: Tori Diamond

Affiliation: UoA

When: Wednesday, 27 August 2025, 1:00 pm to 2:00 pm

Where: 303-310

Abstract:

Inequitable population health outcomes remain a persistent challenge for Aotearoa New Zealand’s health system, with significant disparities continuing to impact Māori. Monitoring and addressing health inequities faces substantial methodological challenges, particularly at the local level where healthcare is delivered. The ‘small n problem’ – insufficient count sizes for reliable statistical estimates creates a critical issue: populations most in need of targeted healthcare are often those with the least reliable data to inform decision-makers.

This PhD project aims to develop and apply advanced Bayesian statistical methods to address small count challenges in official statistics research, focusing on improving small area estimation for Māori health outcomes in Aotearoa. Using linked administrative data from Stats NZ’s Integrated Data Infrastructure (IDI), this research uses two case studies: rheumatic fever diagnoses and COVID-19 vaccinations. These outcomes represent different aspects of the statistical problem - rheumatic fever as a rare event with small counts, and COVID-19 vaccinations portraying spatial variation for a more common outcome. Bayesian hierarchical frameworks are proposed to address the fundamental challenge of balancing local-level precision, statistical accuracy and the reality of small count sizes.

This research addresses a gap in official health statistics methods with real-life implications for small populations. The project will contribute to new knowledge about two important population health issues in Aotearoa while developing robust and enduring statistical methods. Expected contributions include creating methodological advances in Bayesian health research, practical applications for decision-makers and novel implementation of linked administrative data for health outcomes research.

Radial Basis Operator Networks

Speaker: Jason Kurz

Affiliation: University of Waikato

When: Wednesday, 20 August 2025, 11:00 am to 12:00 pm

Where: 303-310

Abstract :

Inverse problems are central to many scientific domains but are often ill-posed and difficult to solve reliably. Electrical Impedance Tomography (EIT) exemplifies this challenge: recovering internal conductivity from boundary voltage measurements is highly sensitive to noise and lacks uniqueness.

In this talk, we present recent work on Radial Basis Operator Networks (RBONs), a class of neural operator models that learn mappings between function spaces using a compact, interpretable architecture based on radial basis functions. The representation theorem underlying RBONs will be introduced as well as a description of the network structure and training process.

Before focusing on EIT, we will also show RBON's performance on several benchmark operator learning tasks, highlighting its ability to generalize across function classes and maintain low test error even out-of-distribution. These results point to RBON as a promising tool for data-driven solutions to PDE-governed systems, with broad relevance across scientific machine learning.

Jason is a lecturer at the Department of Mathematics, University of Waikato.

A unified approach to penalized likelihood covariance estimation in high dimensions

Speaker: Prof. Alberto Roverato

Affiliation: University of Padova

When: Friday, 15 August 2025, 2:00 pm to 3:00 pm

Where: 303-310

Abstract: This talk considers the problem of estimation of a covariance matrix for multivariate Gaussian data in a high dimensional setting. Existing approaches include maximum likelihood estimation under a pre-specified sparsity pattern, l_1-penalized loglikelihood optimization and ridge regularization of the sample covariance. These three approaches can be addressed in a unified way by considering the constrained optimization of an objective function that involves two suitably defined penalty terms. This unified procedure exploits the advantages of each individual approach, while bringing novelty in the combination of the three. We provide an efficient algorithm for the optimization of the regularized objective function and describe the relationship between the two penalty terms, thereby highlighting the importance of the joint application of the three methods. In particular, the sparse estimates of covariance matrices returned by the procedure are stable and accurate, both in low and high dimensional settings, and their calculation is more efficient than existing approaches under a partially known sparsity pattern. An illustration on sonar data is presented for the identification of the covariance structure among signals bounced off a certain material. The method is implemented in the publicly available R package gicf.

Alberto Roverato is a Professor in the Department of Statistical Sciences at the University of Padova.

Betting on Better Models

Speaker: Prof Mike West

Affiliation: Duke University

When: Friday, 15 August 2025, 11:00 am to 12:00 pm

Where: 303-310

Abstract:

I discuss statistical analysis with multiple– or many– candidate models defining model-specific predictions and decision recommendations. Key questions include those of how to relatively calibrate and combine such analyses for formal subjective Bayesian inference and resulting decisions. A main theme is to stress that decisions are articulated as primary: We (typically) model and forecast for reasons, but often those reasons are ignored in formal statistical model uncertainty analysis. This is visited and redressed through developments in Bayesian predictive decision synthesis (BPDS), overviewed here in the time series setting. I aim to convey ideas through applied contexts and examples in areas including financial portfolios and macroeconomic policy decisions, with indications of key aspects of the foundations and underlying theory.

Mike West is the Arts & Sciences Distinguished Professor Emeritus of Statistics & Decision Sciences, Duke University.

Assessment of vaccine safety and effectiveness using a global data network: a statistical perspective in the context of COVID-19 pandemic

Speaker: Han Lu

Affiliation: UoA

When: Monday, 21 July 2025, 12:00 pm to 1:00 pm

Where: 303-310

Abstract:

To help maximise important health, social, and economic benefits of vaccines, it is imperative that detection and risk assessment of adverse events of special interest (AESI) following vaccination is carried out as close to the occurrence of the events as possible. The estimation of background and post-vaccination incidence rates is a rapid and useful tool for the surveillance of potential vaccine-related AESI. Such comparisons have the potential to investigate early safety concerns well before a more sophisticated analysis can be conducted. One level of post-marketing vaccine safety monitoring is to investigate the association between exposures and adverse events using observational cohort studies or case-based study designs such as the self-controlled case series (SCCS) analysis. The SCCS method is derived from a Poisson model to estimate the relative incidences between defined risk and control windows during the observation period. As it only requires cases with individuals acting as their own control, all time-invariant confounders are self-controlled in the analysis. There are certain limitations when the SCCS methods are applied to real world data, especially for the COVID-19 vaccine safety monitoring when the vaccines were developed quickly during the pandemic years with multiple vaccine platforms and brands administered in different countries with mixed doses (i.e. homologous vs. heterologous schedules). The modelling strategies need to be further developed to incorporate these real-world challenges, particularly for rare AESI with small sample sizes which may only be detectable via global data network.

This research project aims to address several study objectives. First, we performed a comprehensive up-to-date literature review on developed SCCS methods and their applications in case studies since this approach was first introduced by Farrington in 1995. Second, we will develop novel SCCS methods to address multiple challenges in vaccine safety evaluation such as misclassification of adverse events and small sample size in rare AESI, develop and validate new methods using simulation studies, and apply to real-world global data. Third, we will summarise the background incidence rates of a wide range of potential AESIs based on the latest research evidence, and calculate the observed versus expected rates of AESIs following COVID-19 vaccination in New Zealand population by sex, age group, total-response ethnicity, NZ Deprivation Index (NZDep) and the Index of Multiple Deprivation (IMD) using national administrative data.

This is a PYR seminar.

Top

Hosting

Department of Statistics

Seminars

Seminar Information

Please give us your feedback or ask us a question