Adaptive Recurrence as Algorithmic Time for Length Generalization in Addition

2nd Workshop on Compositional Learning: Safety, Interpretability, and Agents at the International Conference on Machine Learning (ICML), - Jul 2026 Open Access
Associated documents :  
Length generalization asks whether a model can reuse a learned computation beyond the lengths seen in training. Arithmetic makes this test precise because a model trained on short additions should apply the same digit-wise rule for more steps, rather than fit patterns tied to the training range. Many strong extrapolation results, however, rely on explicit aids that tell the model how digits should be aligned. We ask whether a model can length-generalize without such aids. We train small looped transformers with adaptive halting on a controlled decimal-addition task, without positional encodings or arithmetic-specific side information. The models extrapolate beyond the training range across multiple training regimes. To understand why, we analyze their internal dynamics and find that recurrent depth becomes organized as algorithmic time: later answer positions are resolved at later recurrent phases. These results suggest that adaptive recurrence can turn repeated latent computation into a less hand-engineered route for applying a digit-wise rule beyond the training length.

 

@InProceedings{IWL26,
 	 author =  {Ibrahimli, Imran and Wermter, Stefan and Lee, Jae Hee},
 	 title = {Adaptive Recurrence as Algorithmic Time for Length Generalization in Addition},
 	 booktitle = {2nd Workshop on Compositional Learning: Safety, Interpretability, and Agents at the International Conference on Machine Learning (ICML)},
 	 journal = {}
 	 editors = {}
 	 number = {}
 	 volume = {}
 	 pages = {}
 	 year = {2026},
 	 month = {Jul},
 	 publisher = {}
 	 doi = {}
 	 url = {https://openreview.net/forum?id=XW18V4H9sq},
 }