Deontology and Safe Artificial Intelligence

William D'Alessandro
University of Oxford

Philosophical Studies (forthcoming)

Abstract

The field of AI safety aims to prevent increasingly capable artificially intelligent systems from causing humans harm. Research on moral alignment is widely thought to offer a promising safety strategy: if we can equip AI systems with appropriate ethical rules, they'll be unlikely to disempower, destroy or otherwise seriously harm us. Deontological morality looks like a particularly attractive candidate for an alignment target, given its popularity, relative technical tractability and commitment to harm-avoidance principles. I argue that the connection between moral alignment and safe behavior is more tenuous than many have hoped. In general, AI systems can possess either of these properties in the absence of the other, and we should favor safety when the two conflict. In particular, advanced AI systems governed by standard versions of deontology need not be especially safe.

