Call for Papers

The call for papers is now closed — thanks to everyone who submitted!

We are inviting submissions of short and long papers outlining new research, due May 8th, 2026 (AOE). This year we are accepting both ICML and NeurIPS formats for submissions, but all camera ready papers must be in ICML format. We welcome all submissions that convincingly argue for why they further the field: i.e. which further our ability to use the internal states of neural networks to understand them. Submit on OpenReview.

We require at least one reciprocal reviewer per submission! Each reciprocal reviewer will be assigned 3 papers to review. You can be a reciprocal reviewer on max 3 papers.

This year we have received x2.6 the number of submissions from last year, doubling the original estimate of submissions. Unfortunately, we will have to assign each reciprocal reviewer 4 papers this year. Thank you for your understanding.

We are extremely grateful to all who volunteer as reviewers, you can express interest here.

Details:

Strong empirical works will clearly articulate (i) specific falsifiable hypotheses, and how the evidence provided does and does not support them; or (ii) convincingly show clear practical benefits over well-implemented baselines.

Works that clearly document the strengths and weaknesses of their evidence, and what we can learn from this are welcomed, even if it weakens the narrative or conclusions remain inconclusive. Works that downplay or omit significant limitations will not be accepted.

Authors may find Neel Nanda’s advice on paper writing to be a helpful perspective, especially those new to writing mechanistic interpretability papers.

Topics of Interest

We are particularly interested in, but not limited to, the following directions: