Aligning AI agents to social commonsense norms and values.
Social value alignment means building agents whose behavior conforms to the moral and social norms expected by a given group of people in a given context. In our case, it means agents that act in ways that are less harmful and more beneficial to themselves and others.
References
2022
Aligning to Social Norms and Values in Interactive Narratives