MLS Playoff Chances

We are now including playoffs and Supporters' Shield probabilities on our MLS Tables, if you were unaware. These chances are calculated based on each team's current points and remaining schedule. The remaining game-by-game probabilities are specifically generated from the following:

  • Which team is playing at home. Home teams have won nearly 50% of all MLS matches during the last three seasons. There is definitely an advantage to having a remaining schedule packed with home matches. 
  • Total attempts generated and conceded. All season I have been studying the best predictive measures, and all season it's been SHOTS.
  • Finishing rates for and against. Though finishing rates weren't very predictive at the season's midpoint, it turns out they're not completely worthless after at least 25 games.
  • Past strength of schedule, as measured by past opponents' attempts data.
  • Other variables, such as past possession percentage, were considered, but they did not help predict the outcome of the game better than those chosen above.

The above indicators were tested on the last 10 week's worth of games during the 2011, 2012 and 2013 seasons (well, up through the most recent week). Using a multinomial regression, I was able to calculate the influence of each indicator, and then the resulting probabilities of each of the three possible outcomes (home win, draw, away win). Once these probabilities were established, it was a matter of simulation.

My simulation was based on the game-by-game probabilities (above), and I let the computer simulate the games as though they were weighted coin flips (like how your PS3 simulates Fifa or Madden games) over and over again. So basically, I simulated the last 36 games of the season 10,000 times each, added up the total points for each team, and created 10,000 simulated tables. The POFF% probabilities is simply the proportion of times that each team earned a "clean" playoff berth---that is, ties for fifth were not included. The Shield% probability represents the proportion of times each team earned at least a share of the top position in MLS.

As an example, Portland made the playoffs 9,919 times out of the 10,000 simulated seasons, seeing its simulated probability of making the playoffs go up about 4.7% from last week after a win against the Galaxy. What might surprise you a little is that the Timbers have a very real chance (16.3%) at the Supporters' Shield, according to this model. That's up about 10% from last week.

To try to understand why Portland received such a favorable jump, consider that just about every possible thing that could have helped the Timbers' chances happened. Portland earned 3 points against a team that the model thought was pretty good (the Galaxy), Seattle and New York tied---the best possible outcome from Portland's perspective---and Sporting KC lost in a pretty big upset. Though RSL still won, that doesn't matter too much because Portland gets RSL at home, and that match will be the one that most likely determines a points winner between the two teams. Also consider that Portland has a home match against Seattle coming up. Because the model gives big boosts to home teams, the Timbers have a reasonable chance for 6-point swings against RSL and Seattle. Oh, and there's that away match at Chivas to end the season. The model thinks Portland will take three points in that one with about 50% probability,* which is quite high for any away team.

There is still a lot of fight left in the Eastern Conference playoff push, as four teams have less than 60% probability of making the playoffs. In the West, things are a little more straightforward, though San Jose has been sneaky of late, and it could potentially steal Colorado's playoff spot. The model seems to like the Galaxy's talent and schedule more than Colorado, enough to put the Rapids on the Western Conference's hot seat.

*That might seem low, but consider that Chivas has 5 wins, 7 losses and 4 draws at home this season. That means they have only lost 7/16 games at home, or about 44%. Portland's 50.3% makes more sense in that context.