What is superalignment?
The superalignment team was a division within OpenAI.
The team's primary strategy was to develop tools to train and align AIs that are themselves powerful enough to help with alignment research. They planned to use these AIs to offload a growing proportion of alignment research tasks from human researchers while always keeping humans "in the loop." They expected other labs to develop AIs capable of producing research on AI capabilities, and they wanted to make sure that when this happened, a framework would be available to repurpose these AIs towards alignment.
To ensure that other labs would adopt any successful techniques they discovered, they intended to focus much of their attention on publicly testing their solutions.
Further reading: