Shared Gradient Discovery and Superposition: Learning Dynamics of Generalization in LLMs

Shared Gradient Discovery and Superposition: Learning Dynamics of Generalization in LLMs

Abstract

Abstract

Venue

Venue

ICLR 2026 Workshop on Scientific Methods for Understanding Deep Learning (Sci4DL 2026)