I still don't understand why that would cause testing to be hard, but since you won't explain it... I guess it leaves me thinking the problem might not be entirely them.
Sounds more like the problem is means-testing with strict cutoffs. Mean-testing is good in helping to even the field, the problem is that (some of) our current programs don't slowly fade out as you climb up.
Can you elaborate a bit on how such testing is done, or share a good article on the topic? It sounds like a hard problem to need to get things right this bad, or else.
reply