====== Gradient Highway ====== The property of [[concepts:residual_connections|residual connections]] that allows gradients to flow directly through the identity path during backpropagation, bypassing layer transformations. This enables stable training of very deep networks. See also: [[concepts:residual_connections]], [[concepts:vanishing_gradients]], [[papers:attention_residuals]]