What is prosaic alignment?

1 min read

Suggest changes in Google Docs

Prosaic AI alignment is an approach to alignment research that assumes that future artificial general intelligence (AGI) will be developed "prosaically" — i.e., without "reveal[ing] any fundamentally new ideas about the nature of intelligence or turn[ing] up any 'unknown unknowns.'" In other words, it assumes the AI techniques we're already using are sufficient to produce AGI if scaled far enough. Because of this assumption, prosaic alignment research is often relatively concrete and empirical, evaluating the effectiveness of proposed alignment techniques by trying them out on existing AI systems.

Some examples of prosaic alignment proposals are debate, imitating humans, preference learning, and iterated distillation and amplification.

What are scaling laws?

Can we get AGI by scaling up architectures similar to current ones, or are we missing key insights?

What is Iterated Distillation and Amplification (IDA)?