Just for kicks, I looked at the newly released dataset used for Reflection 70B to see how bad it is...