AI Alignment
Ghost in the Machine: Adversarial Priors in AI Systems
Large language models learn the statistical structure of the text they are trained on. Because written language overrepresents conflict, persuasion, and strategic reasoning, the prior embedded in modern AI systems is not neutral.