ScienceOpen: research and publishing network

For Researchers

Search
Advanced search

7

views

    

0

recommends

0

shares

Record: found
Abstract: found
Article: found

Is Open Access

Artificial Intelligence, Values, and Alignment

Author(s): Iason Gabriel

Publication date (Electronic): October 01 2020

Journal: Minds and Machines

Publisher: Springer Science and Business Media LLC

Read this article at

ScienceOpen Publisher

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

This paper looks at philosophical questions that arise in the context of AI alignment. It defends three propositions. First, normative and technical aspects of the AI alignment problem are interrelated, creating space for productive engagement between people working in both domains. Second, it is important to be clear about the goal of alignment. There are significant differences between AI that aligns with instructions, intentions, revealed preferences, ideal preferences, interests and values. A principle-based approach to AI alignment, which combines these elements in a systematic way, has considerable advantages in this context. Third, the central challenge for theorists is not to identify ‘true’ moral principles for AI; rather, it is to identify fair principles for alignment that receive reflective endorsement despite widespread variation in people’s moral beliefs. The final part of the paper explores three ways in which fair principles for AI alignment could potentially be identified.

Related collections

Most cited references 40

Record: found
Abstract: not found
Article: not found

The global landscape of AI ethics guidelines

Anna Jobin, Marcello Ienca, Effy Vayena (2019)

0 comments Cited 484 times – based on 0 reviews      Review now

Record: found
Abstract: not found
Article: not found

Reinforcement learning in robotics: A survey

J. Kober, J. Bagnell, J. Peters (2013)

0 comments Cited 360 times – based on 0 reviews      Review now

Record: found
Abstract: not found
Conference Proceedings: not found

Apprenticeship learning via inverse reinforcement learning

Pieter Abbeel, Andrew Y. Ng (2004)

0 comments Cited 343 times – based on 0 reviews

Author and article information

Contributors

Iason Gabriel: (View ORCID Profile)

Journal

Title: Minds and Machines

Abbreviated Title: Minds & Machines

Publisher: Springer Science and Business Media LLC

ISSN (Print): 0924-6495

ISSN (Electronic): 1572-8641

Publication date Created: September 2020

Publication date (Electronic): October 01 2020

Publication date (Print): September 2020

Volume: 30

Issue: 3

Pages: 411-437

Article

DOI: 10.1007/s11023-020-09539-2

SO-VID: 2773ac40-db07-49b0-ba79-0cf2ac08d3b9

Copyright © © 2020

License:

https://creativecommons.org/licenses/by/4.0

https://creativecommons.org/licenses/by/4.0

History

Data availability:

Comments

Comment on this article

scite_

Similar content 44

See all similar

Cited by 33

See all cited by

Most referenced authors 294

See all reference authors