Counting Votes - Some Real Data
Thanks to Rick Workman and The Workman Poll I have some real voters' ballots with which to illustrate the difference the way a poll is counted can make.
The poll is a traditional "top 25", with an interactive ballot that requires 25 teams to be ranked and doesn't allow duplicate rankings. (It's the nicest online ranked ballot I've ever used.)
49 teams were ranked in at least one of 52 ballots. There was a remarkable degree of consensus in this ballot set - the 24 teams that were ranked on a majority of ballots can be ordered so that every team is ranked higher than or equal to all teams below it by a majority of voters.
This is not usually possible, but when at least one team pairwise defeats all other teams that team is said to be the Condorcet winner. When there is a subset of teams such that every team within the subset has a pairwise advantage over all teams outside the subset, it is called a Smith Set. In this example, the first 21 teams listed form a Smith set, and Ohio State is the Condorcet Winner.
When we count the votes using the standard "25 points for 1st, 24 for 2nd, etc." approach we're using a modified form of the Borda Count, and I showed last time that this can result in a team having a higher ranking even if a majority of voters had it ranked lower than another team that got a worse rank. We can see some of the same effects in this week's Workman Poll when we count the ballots four different ways.
Now, remember these are all results from exactly the same ballots. Does this group of fans collectively think Florida is third or do they think the Gators are sixth? Oregon is the eighth-best team in the country, or they're 12th. There's a famous (to the Nobel Prize committee, anyway) theorem to the effect that there's no perfect way to decide, but we can categorize the methods by their attributes.
- Cond
- The Condorcet order is such that every higher-listed team has more ballots where it is listed higher than every lower-listed team. As I noted above this is not always possible, because there may be "cycles" where more voters rank team A higher than team B and team B higher than team C, but more voters rank team C higher than team A. In this example there was also a tie - 46 voters ranked Florida St, Missouri or both, and 23 ballots had FSU higher than Mizzou, with 23 having Mizzou higher than FSU. A typical way to break the tie is to use the stronger defeat over the next non-tied team. In this case FSU's 23-21 margin over Rutgers trumps Missouri's 21-20 margin.
- Maj
- is my implementation of what is known as Bucklin counting. A team is assigned the highest rank for which a majority of the voters rank the team. If every voter ranks every team (and there are an odd number of voters) this is equivalent to the median rank. There are often ties for this rank, so to order the teams I use the Count of votes for that rank or higher, and if that is also a tie, the Real Borda count for the tied teams.
- RealB
- is based upon the original definition of the Borda Count. For each ballot, points are assigned to the team at each rank based upon how many teams it is ranked higher than. Since there are 119 teams in Division 1-A, a first place vote is worth 118 points, a second place vote is worth 117, etc. So one vote for 20th is worth 99 points, and six votes for 25th is worth 564.
- 25Pts
- This is the traditional system that assigns 25 points for 1st and 1 point for 25th on each ballot. If one voter ranks team A 20th and six voters rank team B 25th the ranking "says" those teams are equally-regarded by the group, which is not likely what all but one of the group would say.
| 25Pts | RealB | Maj | Cond |
1 | Ohio State | Ohio State | Ohio State | Ohio State |
2 | USC | USC | USC | USC |
3 | Auburn | Florida | Auburn | Auburn |
4 | Florida | Auburn | Michigan | Michigan |
5 | Michigan | Michigan | West Virginia | Florida |
6 | West Virginia | Louisville | Florida | West Virginia |
7 | Louisville | West Virginia | Louisville | Louisville |
8 | Texas | Oregon | Texas | Texas |
9 | Virginia Tech | Georgia | LSU | Virginia Tech |
10 | Georgia | Texas | Virginia Tech | LSU |
11 | Oregon | Virginia Tech | Georgia | Georgia |
12 | LSU | Iowa | Oregon | Oregon |
13 | Iowa | TCU | Iowa | Iowa |
14 | TCU | LSU | TCU | Notre Dame |
15 | Notre Dame | Notre Dame | Notre Dame | TCU |
16 | Tennessee | California | Tennessee | Oklahoma |
17 | Oklahoma | Oklahoma | Oklahoma | Tennessee |
18 | California | Boise St | California | California |
19 | Boise St | Tennessee | Clemson | Clemson |
20 | Clemson | Clemson | Boise St | Boise St |
21 | Nebraska | Nebraska | Nebraska | Nebraska |
22 | Missouri | Rutgers | Rutgers | Florida St* |
23 | Rutgers | Missouri | Missouri | Missouri* |
24 | Florida St | Florida St | Florida St | Rutgers |
|
Diff | Team | 25Pts | Realb | Maj | Cond |
0 | Ohio State | 1 | 1 | 1 | 1 |
0 | USC | 2 | 2 | 2 | 2 |
1 | Auburn | 3 | 4 | 3 | 3 |
3 | Florida | 4 | 3 | 6 | 5 |
1 | Michigan | 5 | 5 | 4 | 4 |
2 | West Virginia | 6 | 7 | 5 | 6 |
1 | Louisville | 7 | 6 | 7 | 7 |
2 | Texas | 8 | 10 | 8 | 8 |
2 | Virginia Tech | 9 | 11 | 10 | 9 |
2 | Georgia | 10 | 9 | 11 | 11 |
4 | Oregon | 11 | 8 | 12 | 12 |
5 | LSU | 12 | 14 | 9 | 10 |
1 | Iowa | 13 | 12 | 13 | 13 |
2 | TCU | 14 | 13 | 14 | 15 |
1 | Notre Dame | 15 | 15 | 15 | 14 |
3 | Tennessee | 16 | 19 | 16 | 17 |
1 | Oklahoma | 17 | 17 | 17 | 16 |
2 | California | 18 | 16 | 18 | 18 |
2 | Boise St | 19 | 18 | 20 | 20 |
1 | Clemson | 20 | 20 | 19 | 19 |
0 | Nebraska | 21 | 21 | 21 | 21 |
1 | Missouri | 22 | 23 | 23 | 23 |
2 | Rutgers | 23 | 22 | 22 | 24 |
2 | Florida St | 24 | 24 | 24 | 22 |
|
When the AP and USA Today (nee UPI) polls' only purpose was to fill newspaper space on a non-gameday and give fans something to talk about, it really didn't matter how they came up with the answer to "who's number 1?" But now that the "polls" are being used to decide which teams get a $14-17 million payday, it is worth scrutinizing the methods by which they make such decisions. As a minimum they should publish the number of votes for each rank for all teams receiving votes. As I've noted before in order to achieve "transparency in the BCS process" we don't need to see the individual ballots, nor do we need to know who voted how - we just need the counts of votes for each rank.. From that we can calculate each of the orders except the Condorcet order, which usually doesn't exist.
Ballot Summary
Rank | Team | Maj | Cnt | Borda | Best | #Ballots | RealB |
1 | Ohio State | 1 | 34 | 1258 | 1 | 52 | 6094 |
2 | Southern California | 2 | 28 | 1158 | 1 | 52 | 5994 |
3 | Auburn | 4 | 31 | 1110 | 1 | 51 | 5853 |
4 | Michigan | 5 | 27 | 1021 | 1 | 51 | 5764 |
5 | West Virginia | 5 | 27 | 1003 | 1 | 51 | 5746 |
6 | Florida | 6 | 35 | 1033 | 1 | 52 | 5869 |
7 | Louisville | 7 | 27 | 927 | 1 | 52 | 5763 |
8 | Texas | 9 | 28 | 807 | 4 | 50 | 5457 |
9 | LSU | 10 | 28 | 732 | 2 | 47 | 5103 |
10 | Virginia Tech | 10 | 27 | 790 | 2 | 50 | 5440 |
11 | Georgia | 11 | 29 | 741 | 3 | 51 | 5484 |
12 | Oregon | 12 | 27 | 734 | 3 | 52 | 5570 |
13 | Iowa | 13 | 30 | 655 | 6 | 50 | 5305 |
14 | TCU | 15 | 28 | 577 | 4 | 50 | 5227 |
15 | Notre Dame | 15 | 28 | 504 | 6 | 49 | 5061 |
16 | Tennessee | 17 | 30 | 420 | 3 | 42 | 4326 |
17 | Oklahoma | 18 | 28 | 396 | 10 | 45 | 4581 |
18 | California | 19 | 30 | 359 | 8 | 46 | 4637 |
19 | Clemson | 20 | 31 | 315 | 9 | 39 | 3942 |
20 | Boise St | 20 | 27 | 330 | 3 | 44 | 4422 |
21 | Nebraska | 22 | 30 | 271 | 9 | 38 | 3805 |
22 | Rutgers | 23 | 27 | 201 | 3 | 34 | 3363 |
23 | Missouri | 24 | 27 | 259 | 6 | 33 | 3328 |
24 | Florida St | 24 | 27 | 179 | 12 | 30 | 2969 |
25 | Boston College | | | 76 | 12 | 21 | 2029 |
Rank | Team | Maj | Cnt | Borda | Best | #Ballots | RealB |
26 | Texas Tech | | | 108 | 12 | 20 | 1968 |
27 | Purdue | | | 104 | 12 | 18 | 1778 |
28 | Wake Forest | | | 131 | 8 | 16 | 1619 |
29 | Georgia Tech | | | 72 | 12 | 15 | 1467 |
30 | Texas A&M | | | 129 | 6 | 14 | 1431 |
31 | Houston | | | 101 | 5 | 11 | 1124 |
32 | Wisconsin | | | 72 | 13 | 11 | 1095 |
33 | Alabama | | | 60 | 2 | 9 | 897 |
34 | Arizona St | | | 42 | 17 | 8 | 786 |
35 | Michigan St | | | 25 | 19 | 8 | 769 |
36 | Navy | | | 27 | 14 | 5 | 492 |
37 | Penn State | | | 24 | 16 | 5 | 489 |
38 | Washington | | | 27 | 14 | 4 | 399 |
39 | Arkansas | | | 14 | 16 | 4 | 386 |
40 | Tulsa | | | 24 | 9 | 3 | 303 |
41 | UCLA | | | 16 | 17 | 3 | 295 |
42 | Southern Miss | | | 15 | 18 | 3 | 294 |
43 | Utah | | | 8 | 23 | 3 | 287 |
44 | Air Force | | | 29 | 8 | 2 | 215 |
45 | South Carolina | | | 8 | 19 | 2 | 194 |
46 | Miami-Florida | | | 4 | 22 | 1 | 97 |
47 | Iowa State | | | 2 | 24 | 1 | 95 |
48 | Kansas | | | 1 | 25 | 1 | 94 |
48 | South Florida | | | 1 | 25 | 1 | 94 |
|