Research

My research interests lie broadly at the intersection of time series and learning theory, with a central focus on developing valid statistical inference methods for data exhibiting complex dependency structures. Most recently, I've been exploring the emerging statistical foundations of large language models.

Preprints

  • Fast segmentation of watermarked texts from large language models through epidemic change-points framework.
    S. Bonnerjee, S. Roy, S. Karmakar
    Major Revision at Journal of American Statistical Association, 2026
  • Stability beyond bounded differences: sharp generalization bounds under finite Lp moments.
    Q. Lei, S. Bonnerjee, Y. Han, W. B. Wu
    Under review at ICML 2026.
  • Identifying stability regions of SGD with constant learning rates.
    O. Goldreich, S. Bonnerjee, Q. Lei, J. Li, W. B. Wu
    Under review at ICML 2026.
  • Testing synchronization of change-points for multiple time series.
    S. Bonnerjee, S. Karmakar, M. Cheng, W. B. Wu
    Major Revision at Biometrika, 2025
    🏆 IMS Hannan Graduate Student Travel Award, 2025
  • How Private is Your Attention? Bridging Privacy with In-Context Learning.
    Under Review at TMLR, 2025

Journal Publications

  • Gaussian Approximation For Non-stationary Time Series with Optimal Rate and Explicit Construction.
    S. Bonnerjee, S. Karmakar, W. B. Wu
    Annals of Statistics, 52(5): 2293–2317, 2024
  • A Generalized Epidemiological Model with Dynamic and Asymptomatic Population.
    A. Ghatak, S. S. Patel, S. Bonnerjee, S. Roy
    Statistical Methods in Medical Research, 31(11): 2137–2163, 2022

Conference Publications