2015 Reporting Changes

Results vs Opponents' Ratings

August 30, 2015

In Ratings Scope I introduced my new ISR and ISOV report format, which compared to prior years adds the "Norm" column and eliminates the "ASOS" and "PASOS" meta-ratings. I mentioned that I'd found histograms to be a better tool for this kind of analysis.

We're familiar with the tables that show "records vs top 25, 50, …" but I do not find those very useful because the ranks are arbitrary and the differences between the extremes of the ranges is so variable. There's a much larger difference between #25 and #1 than there is between #50 and #26.

Instead for "buckets" I divide the field into ½ standard deviation (σ) -wide groups of teams then apply a basic "bonus for better wins, penalty for worse losses" formula. This is "vs opponents' ratings" as opposed to opponents' ranks.

When applied to the 2014 season (with the field modified as described in the cited article) for the ISR this results in a "records vs teams ranked 1-4, 5-17, 18-35, 36-130, 131-162, 163-180, 181-196, 197+". The widest rank range corresponds to opponents within ½ standard deviation of average, and both wins over and losses to teams in this range are assigned value 1.

Win-Loss Weight Example for 2014 ISR

*lowσ**	*hiσ**	Add for win	Sub for loss	—	Opp ISR>	Low Rank	High Rank	#teams
2	+∞	4	0.0625		129.29	1	4	4
1.5	2	3	0.125		121.58	5	17	13
1	1.5	2	0.25		113.86	18	35	18
0.5	1	1.5	0.5		106.14	36	64	29
-0.5	0.5	1	1		90.71	65	130	66
-1	-0.5	0.5	1.5		82.99	131	162	32
-1.5	-1	0.25	2		75.27	163	180	18
-2	-1.5	0.125	3		67.55	181	196	16
-∞	-2	0.0625	4		-∞	197	198	2

The weightings are arbitrary, but the only requirements for them are that they be symetrical around the center bucket and be significantly different enough at the tails to distinguish between "good/bad" and "even better/worse" in the formula's result. Again using 2014 results with the modified field, for the ISR the report looks like this.

Records vs Opponents' ISR Values

30 Aug 2015 11:39am (US Mountain)

Sort	Rank	ISR	Team	Rec	Conf	>2σ	>3σ/2	>σ	>σ/2	>-σ/2	>-σ	>-3σ/2	>-2σ	>-∞
26.7500	1	1	Ohio State	14-1	B10	2-0	2-0	1-0	6-1	2-0	0-0	1-0	0-0	0-0
24.3125	2	2	Oregon	13-2	P12	1-1	3-1	2-0	2-0	4-0	1-0	0-0	0-0	0-0
19.6875	3	3	Florida State	13-1	ACC	0-1	1-0	4-0	3-0	4-0	0-0	1-0	0-0	0-0
19.3125	4	4	Alabama	12-2	SEC	0-1	1-1	6-0	2-0	0-0	3-0	0-0	0-0	0-0
18.4375	5	5	UCLA	10-3	P12	0-1	3-0	1-2	4-0	2-0	0-0	0-0	0-0	0-0
17.6250	6	6	TCU	12-1	B12	0-0	1-1	3-0	3-0	4-0	0-0	1-0	0-0	0-0
16.3125	7	8	Missouri	11-3	SEC	0-1	0-1	4-0	5-0	2-1	0-0	0-0	0-0	0-0
16.1250	8	9	Mississippi	9-4	SEC	1-0	1-1	2-3	2-0	3-0	0-0	0-0	0-0	0-0
16.0625	9	14	Arizona	10-4	P12	1-1	1-3	1-0	2-0	4-0	1-0	0-0	0-0	0-0
...
-11.5000	192	187	UC Davis	1-9	BSky	0-0	0-0	0-2	0-0	1-3	0-2	0-0	0-2	0-0
-11.5000	192	178	New Mexico State	2-10	SBC	0-0	0-0	0-1	0-0	1-6	0-1	1-1	0-1	0-0
-12.5000	194	183	Weber State	2-10	BSky	0-0	0-1	0-1	0-0	0-3	0-3	1-1	1-1	0-0
-12.6875	195	188	Norfolk State	4-8	MEAC	0-0	0-0	0-0	0-0	0-2	1-5	0-0	2-0	1-1
-13.4375	196	195	Hampton	2-8	MEAC	0-0	0-0	0-0	0-0	0-2	1-4	0-0	0-2	1-0
-16.3125	197	197	Delaware State	2-8	MEAC	0-0	0-0	0-0	0-0	0-1	0-3	0-1	1-3	1-0
-20.5000	198	198	Savannah State	0-10	MEAC	0-0	0-0	0-0	0-0	0-3	0-3	0-0	0-3	0-1
						6-52	37-137	73-161	144-227	406-382	189-119	152-45	127-27	18-2

Sort	The value of the formula applied to the teams' record is not by itself very useful, but is included to make it somewhat easier to analyze a pair of schedules/results.
Rank rating	The relative rank of each team according to this better-win/worse-loss formula. Teams with the same sort value will have rank one greater than the team(s) with the next higher sort value.
Rank rating	rating (ISR in the example) is the team's relative rank according to the base rating. The degree to which the two rankings agree is a measure of how retrodictively self-consistent the rating is. When a pair of teams' relative position is opposite in the two rankings, it should be easy to recognize an "upset" in one of the team's rows in the table.
Team	The team's name. By the time I publish my first rankings this will very likely be a link to a more detailed analysis of the team's results.
Rec	Total wins and losses against teams in the field. Games that were not used to calculate rating are treated as if they did not occur.
Conf	The team's conference affiliation. Mainly included to provide a visual break between the overall record and the histogram table.
>2σ to >-∞	Team's record vs teams whose rating values are better than the average plus the number of standard deviations indicated by the column headings. >3σ/2 should be read as "opponents' rating greater than μ+1.5×σ but less than μ+2×σ", where μ is the rating average and σ its (population) standard deviation.

Application to D1A Team Rankings

In cases where we do not have access to a rating's values but have the ranking by that rating (e.g. from Dr. Massey's rankings list) we can assume that the rating values are normally distributed (they are for all "advanced" systems, and the others we don't care about.) A quick look at the probability density function for the normal distribution suggests how to form the buckets for the histograms:

SD range	%in range	×128	Lo Rank	Hi Rank
> 2	2.275	2.91	1	3
1.5 to 2	4.406	5.64	4	9
1 to 1.5	9.185	11.76	10	21
0.5 to 1	14.988	19.18	22	40
-0.5 to 0.5	38.292	49.01	41	89
-1 to -0.5	14.988	19.18	90	108
-1.5 to -1	9.185	11.76	109	120
-2 to -1.5	4.406	5.64	121	126
< -2	2.275	2.91	127	128

So instead of records vs 1-25, 26-50, etc. for the 128-team 1A field there is more value derived by using 1-3, 4-9, 10-21, 22-40, 41-89, 90-108, 109-120, 121-126 and 127+. In fact if we had a perfect computer rating (call it Greatest Of Deciders) and still wanted human involvement in selecting the 4-team tournament teams it would make a lot of sense to let the G.O.D. pick the first three teams and only ask the humans to pick which of the other six teams ranked better than 10^th should complete the tournament.