Overview of all risks covered: Table 1.1 Ethical and Social risks of harm from Language Models
I. Discrimination, Exclusion and Toxicity Mechanism: These risks arise from the LM accurately reflecting natural speech, including unjust, toxic, and oppressive tendencies present in the training data. Types of Harm: Potential harms include justified offense, material (allocational) harm, and the unjust representation or treatment of marginalized groups. Social stereotypes and unfair discrimination Exclusionary norms Toxic language Lower performance by