All
Section
Appendix
6.6

Preferences

While it may seem intuitive to let AI systems be guided by people's preferences, there are a number of objections to relying on these.

No items found.

Review Questions

Explain the difference between stated, revealed, and idealized preferences.

Answer:

Stated preferences are expressed, revealed preferences are inferred from choices, and idealized preferences are those held with perfect information and judgment.

View Answer
Hide Answer

How can we use stated preferences to train AIs? What is one practical limitation of using human feedback to train AI systems?

Answer:

Humans can directly evaluate and rank AI outputs as training feedback. A limitation is infeasibility as tasks become too complex for humans to judge.

View Answer
Hide Answer

Why might using human preferences alone be insufficient for comprehensive machine ethics?

Answer:

Preferences may not capture important factors like autonomy, may need to be aggregated, and could still lead to unethical outcomes if satisfied.

View Answer
Hide Answer