There is a simple behavioral test that would provide significant evidence about whether AIs with a given rough set of characteristics develop subversive goals.
Testing for Scheming with Model Deletion
There is a simple behavioral test that would provide significant evidence about whether AIs with a given rough set of characteristics develop subversive goals.