TDAbench 2026 | Benchmark Dataset for Astronomical Transients

What We Aim to Achieve

Goals

Our workshop is structured around seven key objectives that will guide our collaborative efforts

Define Classification Taxonomy

Establish a comprehensive and scientifically rigorous taxonomy for astronomical transient classification that serves as the foundation for benchmark development.

Curate High-Quality, Open Data

Compile and validate a diverse dataset of well-characterized transients with real (i.e., messy and heterogeneous) multi-band light curves and spectroscopic classifications that is made freely available to the community.

Standardize Evaluation Metrics

Develop and agree upon standardized metrics and evaluation protocols for comparing machine learning classification methods.

Identify Key Challenges

Document the main challenges in transient classification and establish research priorities for the community to address.

Build Community Infrastructure

Create sustainable infrastructure for dataset hosting, model evaluation, and community engagement using standard tools in the machine learning community.

Foster Long-term Collaboration

Establish working groups and communication channels to continue development and maintenance of the benchmark dataset.

Set a TDA Benchmark Standard

Set the standard for the first TDAbench release and set the foundation for future releases that include LSST and Roman data..

The Benchmark

Data

Building a Comprehensive Benchmark Dataset

The TDAbench dataset will serve as the definitive benchmark for evaluating machine learning approaches to astronomical transient classification. Our goal is to create a dataset that is:

Representative of the full diversity of transient phenomena
Well-characterized with high-confidence labels
Multi-wavelength and multi-epoch coverage
Balanced to address class imbalance challenges
Documented with comprehensive metadata
Designed for realistic evaluation scenarios
Messy and capture all the challenges associated with real observations that are difficult to recreate in simulations

The dataset will include photometric light curves, host galaxy properties, contextual information, and where available, spectroscopic classifications from major transient surveys. The recently complete Bright Transient Survey (BTS) conducted by the Zwicky Transient Facility (ZTF), which has spectroscopically classified more than 10k transients will serve as the preliminary basis for the benchmark dataset. Relevant observations from other surveys will be added during the course of the workshop.

Transient Classification Benchmark Coming March 2026

Mark Your Calendar

Important Dates

Key deadlines and milestones for TDAbench 2026

Jan 5

Registration Opens

Online registration portal opens for all participants. Early registration is encouraged due to limited capacity.

Jan 31

Abstract Submission Deadline

There will be a very limited number of slots for (brief) contributed talks on topics relevant to building the benchmark dataset. There will be multiple (extended) poster sessions where participants can highlight their ongoing work on transient classification. All submissions will be reviewed by the scientific organizing committee.

Feb 6

Abstract Acceptance Notification

Authors will be notified of abstract acceptance and presentation format (oral or poster).

Jan 31

Early Registration Deadline

Last day for early bird registration rates. Hotel block reservations recommended by this date.

Feb 23

Registration Closes

Final deadline for workshop registration. Late registrations may be accepted on a space-available basis.

Mar 9-12

TDAbench 2026 Workshop

Four days of collaborative sessions, presentations, and working group meetings at SkAI Institute, Chicago.