Last year I used what I called a "surprise factor" to assign a weight to "upsets" according to the various rating systems. I never really liked the weighting I used in the Weighted Retrodictive Ranking Violation report, because the weights tended to zero too quickly. An upset of #25 only contributed .04 × ΔScore×ΔRank.
During the college baseball season I finally got around to defining an improvement, using these criteria:
The improved calculation is:
| |||||
|
For FBS football with 120 teams, N=25 seems a logical choice, and the resulting weightings by loser's rank look like this:

With the new calculation, the WRRV for 2007 looks like this. Be careful with the interpretations, QPR is only "best" by this metric because it only ranks 50 teams.
| Top 10 Games for 28 Aug-1 Sep | ||||||||||||||||||||||||||||||||||||||||||
|
To find the most interesting games, just use Game Interest = (M+1−MAX(R1,R2))׃(R1,R2) (where M is the number of ranked teams - in all of this, M+1 is the rank assigned to any unranked team.)
The coefficient (M+1)-(rank of lower-rated team) just gives a multiplier of 1 for the worst team up to 120 (for D-1A) for a game involving the #1 team. The highest value for a regular-season game in 2008 based upon 2007 rankings is 1086.9 for #4 Georgia at #1 LSU. In fact, the top 9 are all games involving LSU, but this is ok - games involving the #1 are inherently of interest. Even if the opponent's rank is really bad, the game is interesting because an upset would be a really big one.
For the first weekend, the algorithm gives these as the top 10 games.
If strikes me that if we choose N so that playing the team ranked last counts as close to ½ as we can get we can define another reasonable measure of schedule strength. The derivation of that will come in another essay.