Flies In the Ointment

© Copyright 2005, Paul Kislanko

Kenneth Massey's College Football Ranking Comparison rates the rankings by how well they correlate to the average of all ranks (by left-to-right order) and includes data for ranking violations. "Rankings violations" are based upon the number of games in which a team currently ranked worse won the game.

For teams, Dr. Massey's page only shows the mean (the top-to-bottom order), median, and standard deviation of the computers' rankings for each team. It occurred to me that I could use the same kind of analysis as for the rankings to find which teams have caused the most trouble for the computers trying to rank them.

Taking the computer rankings^*1 and the schedule of games played, for each team I calculated:

the number of games for which each computer currently has the winner rated higher, and the number for which winner is rated lower, and a "won-lost" style percentage.
An "upset factor"^*2 equal to the sum of the differences in rankings for games in which the current computer ranking is inconsistent with the game result, summed over all computers.
A "surprise factor" that is just the sum of the upset factor divided by the percentage of computer ranks that have the winner above the loser for the team's games.

Each of these measures a different aspect of how hard a team is to be characterized by a ranking of all teams. Do not mistake these for rankings of team quality. For example, one of the teams for which all of the computers have the team ranked below all of the teams it's lost to and ahead of all the teams it's beaten is Temple, which is pretty much ranked below any team it could possibly play and dutifully lost to all teams it has played.

Through November 13, with 85 computer rankings the top 25 teams in terms of hardest for the computers to deal with are shown below. The "Surprise Factor" is a combination of the number of unexplainable results combined with the magnitude of the "upsets." TCU and UCLA, for instance, make the top 25 because of just one inexplicable loss.

Team Rec Conf SF

1 San Diego St 4-6 MW 17799

2 SMU 3-6 CUSA 15307

3 Iowa State 7-3 B12 14479

4 Arizona 3-7 P10 12975

5 Florida St 7-3 ACC 12925

6 Memphis 4-5 CUSA 12703

7 Central Michigan 5-5 MAC 12311

8 North Carolina St 4-5 ACC 10871

9 Middle Tenn St 3-5 SBC 9161

10 Oregon St 4-6 P10 9057

11 Northern Illinois 5-4 MAC 8739

12 Pittsburgh 5-5 BigE 8725

13 TCU 10-1 MW 8274

14 Vanderbilt 4-6 SEC 8048

15 Wake Forest 4-7 ACC 7896

16 UCLA 9-1 P10 6474

17 Georgia Tech 6-3 ACC 6442

18 Ball State 4-6 MAC 6248

19 Houston 5-4 CUSA 6245

20 Akron 4-5 MAC 6172

21 Clemson 6-4 ACC 5260

22 South Florida 5-3 BigE 5124

23 Oklahoma St 4-5 B12 5056

24 Kansas 5-5 B12 4986

25 Missouri 6-4 B12 4907

Of course, since a team can be on this list because it won a game it shouldn't have, another team might be on it for having lost the same game. The games that have caused the most trouble are shown below. The first ten are off the scale, because no computer has the winner ranked higher even after the fact.

10/1/2005 ------ UTEP 20 @ Memphis 27

10/6/2005 ------ North Carolina St 17 @ Georgia Tech 14

10/8/2005 ------ San Diego St 10 @ UNLV 13

11/12/2005 ------ Texas Tech 17 @ Oklahoma St 24

11/5/2005 ------ UCLA 14 @ Arizona 52

11/5/2005 ------ North Carolina St 20 @ Florida St 15

9/10/2005 ------ San Diego St 29 @ Air Force 41

9/10/2005 ------ TCU 10 @ SMU 21

9/17/2005 ------ Michigan St 44 @ Notre Dame 41

9/24/2005 ------ Northern Illinois 42 @ Akron 48

9/26/2005 296091 Tennessee 30 @ LSU 27

9/24/2005 255818 Eastern Michigan 23 @ Central Michigan 20

10/8/2005 170318 Baylor 23 @ Iowa State 13

10/8/2005 155091 Ball State 60 @ Western Michigan 57

10/1/2005 153818 Iowa State 20 @ Nebraska 27

10/1/2005 145000 Clemson 27 @ Wake Forest 31

9/3/2005 144773 Rutgers 30 @ Illinois 33

10/6/2005 117909 Florida Atlantic 28 @ UL Lafayette 10

10/15/2005 116409 South Florida 17 @ Pittsburgh 31

11/5/2005 113091 Florida Intl 31 @ UL Monroe 29

9/5/2005 100818 Miami-Florida 7 @ Florida St 10

11/5/2005 85182 Boston College 14 @ North Carolina 16

9/9/2005 80469 Pittsburgh 10 @ Ohio 16

10/15/2005 72182 Penn State 25 @ Michigan 27

10/1/2005 68594 BYU 10 @ San Diego St 31

9/10/2005 60682 North Texas 14 @ Middle Tenn St 7

10/15/2005 58136 Iowa State 24 @ Missouri 27

Note 1	I actually included 7 computer rankings that are not reported by Massey, but the only one worth mentioning is the Sagarin "Predictor" ranking. The others are part of a separate study, and of those only Boyd Nation's ISR really contributes to the conclusions of this article.
Note 2	This metric was suggested by Adam Holtz, who feared I might actually use it for more than he suggested, which I did.

	Team	Rec	Conf	SF
1	San Diego St	4-6	MW	17799
2	SMU	3-6	CUSA	15307
3	Iowa State	7-3	B12	14479
4	Arizona	3-7	P10	12975
5	Florida St	7-3	ACC	12925
6	Memphis	4-5	CUSA	12703
7	Central Michigan	5-5	MAC	12311
8	North Carolina St	4-5	ACC	10871
9	Middle Tenn St	3-5	SBC	9161
10	Oregon St	4-6	P10	9057
11	Northern Illinois	5-4	MAC	8739
12	Pittsburgh	5-5	BigE	8725
13	TCU	10-1	MW	8274
14	Vanderbilt	4-6	SEC	8048
15	Wake Forest	4-7	ACC	7896
16	UCLA	9-1	P10	6474
17	Georgia Tech	6-3	ACC	6442
18	Ball State	4-6	MAC	6248
19	Houston	5-4	CUSA	6245
20	Akron	4-5	MAC	6172
21	Clemson	6-4	ACC	5260
22	South Florida	5-3	BigE	5124
23	Oklahoma St	4-5	B12	5056
24	Kansas	5-5	B12	4986
25	Missouri	6-4	B12	4907