An Enactive Approach to Value Alignment in Artificial Intelligence: A Matter of Relevance

In Vincent C. Müller (ed.), Philosophy and Theory of Artificial Intelligence 2021. pp. 119-135 (2022)

Abstract

The “Value Alignment Problem” is the challenge of how to align the values of artificial intelligence with human values, whatever they may be, such that AI does not pose a risk to the existence of humans. A fundamental feature of how the problem is currently understood is that AI systems do not take the same things to be relevant as humans do, whether turning humans into paperclips in order to “make more paperclips” or eradicating the human race to “solve climate change”. Specifically, existing approaches to alignment appear to be concerned with how AI might *solve* problems in the relevant way. This paper presents and explores an approach to alignment rooted in the Enactive Theory of mind. It offers an alternative conception of the alignment problem: “how do we make relevant to AI what is relevant to humans?” On this conception, alignment is concerned with building AI so that it *discerns* and defines the problem in the relevant way. The Alignment Problem is thereby shown to be the same problem as the Frame Problem. The paper concludes with a consideration of the tradeoffs between these two conceptions of the alignment problem.

Author's Profile

Michael Cannon
Eindhoven University of Technology
