====== Neural Scaling ====== The broad phenomenon where [[concepts:llm|model]] performance improves predictably with increases in parameters, data, or compute. [[concepts:scaling_laws|Scaling laws]] formalize these relationships mathematically. See also: [[concepts:scaling_laws]], [[concepts:chinchilla_scaling]], [[papers:attention_residuals]]