It would be interesting to separate sensitivity (how often does the test give a positive result on good candidates?) from specificity (how often does the test give a negative result on bad candidates?) for each of these tests. I'd guess a fizz-buzz-style test has good sensitivity but terrible specificity: nearly every good candidate passes it, but so do plenty of bad ones.
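
To make the two numbers concrete, here is a rough sketch in Python. The function names and the candidate counts are made up purely for illustration; they are not measurements of fizz-buzz or of any real interview test.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of good candidates who get a positive result."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of bad candidates who get a negative result."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts, invented for this example only.
good_pass, good_fail = 95, 5   # good candidates: passed / failed the test
bad_pass, bad_fail = 60, 40    # bad candidates: passed / failed the test

sens = sensitivity(true_pos=good_pass, false_neg=good_fail)
spec = specificity(true_neg=bad_fail, false_pos=bad_pass)

print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

With counts like these the test would rarely reject a good candidate but would wave through most bad ones, which is the pattern guessed at above. Getting real numbers would require knowing, by some independent measure, which candidates were actually good.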