Counting Votes - Some Real Data

© Copyright 2006, Paul Kislanko

Thanks to Rick Workman and The Workman Poll I have some real voters' ballots with which to illustrate the difference the way a poll is counted can make.

The poll is a traditional "top 25", with an interactive ballot that requires 25 teams to be ranked and doesn't allow duplicate rankings. (It's the nicest online ranked ballot I've ever used.)

49 teams were ranked in at least one of 52 ballots. There was a remarkable degree of consensus in this ballot set - the 24 teams that were ranked on a majority of ballots can be ordered so that every team is ranked higher than or equal to all teams below it by a majority of voters.

This is not usually possible, but when at least one team pairwise defeats all other teams that team is said to be the Condorcet winner. When there is a subset of teams such that every team within the subset has a pairwise advantage over all teams outside the subset, it is called a Smith Set. In this example, the first 21 teams listed form a Smith set, and Ohio State is the Condorcet Winner.

When we count the votes using the standard "25 points for 1st, 24 for 2nd, etc." approach we're using a modified form of the Borda Count, and I showed last time that this can result in a team having a higher ranking even if a majority of voters had it ranked lower than another team that got a worse rank. We can see some of the same effects in this week's Workman Poll when we count the ballots four different ways.

Now, remember these are all results from exactly the same ballots. Does this group of fans collectively think Florida is third or do they think the Gators are sixth? Oregon is the eighth-best team in the country, or they're 12th. There's a famous (to the Nobel Prize committee, anyway) theorem to the effect that there's no perfect way to decide, but we can categorize the methods by their attributes.

Cond
The Condorcet order is such that every higher-listed team has more ballots where it is listed higher than every lower-listed team. As I noted above this is not always possible, because there may be "cycles" where more voters rank team A higher than team B and team B higher than team C, but more voters rank team C higher than team A. In this example there was also a tie - 46 voters ranked Florida St, Missouri or both, and 23 ballots had FSU higher than Mizzou, with 23 having Mizzou higher than FSU. A typical way to break the tie is to use the stronger defeat over the next non-tied team. In this case FSU's 23-21 margin over Rutgers trumps Missouri's 21-20 margin.
Maj
is my implementation of what is known as Bucklin counting. A team is assigned the highest rank for which a majority of the voters rank the team. If every voter ranks every team (and there are an odd number of voters) this is equivalent to the median rank. There are often ties for this rank, so to order the teams I use the Count of votes for that rank or higher, and if that is also a tie, the Real Borda count for the tied teams.
RealB
is based upon the original definition of the Borda Count. For each ballot, points are assigned to the team at each rank based upon how many teams it is ranked higher than. Since there are 119 teams in Division 1-A, a first place vote is worth 118 points, a second place vote is worth 117, etc. So one vote for 20th is worth 99 points, and six votes for 25th is worth 564.
25Pts
This is the traditional system that assigns 25 points for 1st and 1 point for 25th on each ballot. If one voter ranks team A 20th and six voters rank team B 25th the ranking "says" those teams are equally-regarded by the group, which is not likely what all but one of the group would say.
25Pts RealB Maj Cond
1 Ohio State Ohio State Ohio State Ohio State
2 USC USC USC USC
3 Auburn Florida Auburn Auburn
4 Florida Auburn Michigan Michigan
5 Michigan Michigan West Virginia Florida
6 West Virginia Louisville Florida West Virginia
7 Louisville West Virginia Louisville Louisville
8 Texas Oregon Texas Texas
9 Virginia Tech Georgia LSU Virginia Tech
10 Georgia Texas Virginia Tech LSU
11 Oregon Virginia Tech Georgia Georgia
12 LSU Iowa Oregon Oregon
13 Iowa TCU Iowa Iowa
14 TCU LSU TCU Notre Dame
15 Notre Dame Notre Dame Notre Dame TCU
16 Tennessee California Tennessee Oklahoma
17 Oklahoma Oklahoma Oklahoma Tennessee
18 California Boise St California California
19 Boise St Tennessee Clemson Clemson
20 Clemson Clemson Boise St Boise St
21 Nebraska Nebraska Nebraska Nebraska
22 Missouri Rutgers Rutgers Florida St*
23 Rutgers Missouri Missouri Missouri*
24 Florida St Florida St Florida St Rutgers
Diff Team 25Pts Realb Maj Cond
0 Ohio State 1 1 1 1
0 USC 2 2 2 2
1 Auburn 3 4 3 3
3 Florida 4 3 6 5
1 Michigan 5 5 4 4
2 West Virginia 6 7 5 6
1 Louisville 7 6 7 7
2 Texas 8 10 8 8
2 Virginia Tech 9 11 10 9
2 Georgia 10 9 11 11
4 Oregon 11 8 12 12
5 LSU 12 14 9 10
1 Iowa 13 12 13 13
2 TCU 14 13 14 15
1 Notre Dame 15 15 15 14
3 Tennessee 16 19 16 17
1 Oklahoma 17 17 17 16
2 California 18 16 18 18
2 Boise St 19 18 20 20
1 Clemson 20 20 19 19
0 Nebraska 21 21 21 21
1 Missouri 22 23 23 23
2 Rutgers 23 22 22 24
2 Florida St 24 24 24 22

When the AP and USA Today (nee UPI) polls' only purpose was to fill newspaper space on a non-gameday and give fans something to talk about, it really didn't matter how they came up with the answer to "who's number 1?" But now that the "polls" are being used to decide which teams get a $14-17 million payday, it is worth scrutinizing the methods by which they make such decisions. As a minimum they should publish the number of votes for each rank for all teams receiving votes. As I've noted before in order to achieve "transparency in the BCS process" we don't need to see the individual ballots, nor do we need to know who voted how - we just need the counts of votes for each rank.. From that we can calculate each of the orders except the Condorcet order, which usually doesn't exist.

Ballot Summary

Rank Team Maj Cnt Borda Best #Ballots RealB
1 Ohio State 1 34 1258 1 52 6094
2 Southern California 2 28 1158 1 52 5994
3 Auburn 4 31 1110 1 51 5853
4 Michigan 5 27 1021 1 51 5764
5 West Virginia 5 27 1003 1 51 5746
6 Florida 6 35 1033 1 52 5869
7 Louisville 7 27 927 1 52 5763
8 Texas 9 28 807 4 50 5457
9 LSU 10 28 732 2 47 5103
10 Virginia Tech 10 27 790 2 50 5440
11 Georgia 11 29 741 3 51 5484
12 Oregon 12 27 734 3 52 5570
13 Iowa 13 30 655 6 50 5305
14 TCU 15 28 577 4 50 5227
15 Notre Dame 15 28 504 6 49 5061
16 Tennessee 17 30 420 3 42 4326
17 Oklahoma 18 28 396 10 45 4581
18 California 19 30 359 8 46 4637
19 Clemson 20 31 315 9 39 3942
20 Boise St 20 27 330 3 44 4422
21 Nebraska 22 30 271 9 38 3805
22 Rutgers 23 27 201 3 34 3363
23 Missouri 24 27 259 6 33 3328
24 Florida St 24 27 179 12 30 2969
25 Boston College     76 12 21 2029
Rank Team Maj Cnt Borda Best #Ballots RealB
26 Texas Tech     108 12 20 1968
27 Purdue     104 12 18 1778
28 Wake Forest     131 8 16 1619
29 Georgia Tech     72 12 15 1467
30 Texas A&M     129 6 14 1431
31 Houston     101 5 11 1124
32 Wisconsin     72 13 11 1095
33 Alabama     60 2 9 897
34 Arizona St     42 17 8 786
35 Michigan St     25 19 8 769
36 Navy     27 14 5 492
37 Penn State     24 16 5 489
38 Washington     27 14 4 399
39 Arkansas     14 16 4 386
40 Tulsa     24 9 3 303
41 UCLA     16 17 3 295
42 Southern Miss     15 18 3 294
43 Utah     8 23 3 287
44 Air Force     29 8 2 215
45 South Carolina     8 19 2 194
46 Miami-Florida     4 22 1 97
47 Iowa State     2 24 1 95
48 Kansas     1 25 1 94
48 South Florida     1 25 1 94