Season Previews / SEBA Projections

2018 initial SEBA projections

Photo: Ryan Griffith

Editor’s note: This is the latest piece of PSP’s season preview on Philadelphia Union. Read the other posts of the season preview here.

Below are the initial 2018 MLS season forecasts using data from games through February 25, 2018.

What remains the same

We build a model to predict the probabilities for Win/Loss/Tie for every match, and then apply those odds to a large amount of season-simulations to get an estimate of how the season may turn out given where we are now.

The parts of the model that remain for last year are Team, Opponent, and Home/Away. There are changes that will be described below.

Also remaining as part of the model’s weighting structure is how previous matches inform the model based upon their player-minutes distribution. For example, if Philadelphia starts the season with a starting lineup, but by June four of the starters have lost their playing time, we might not find the opening day as useful for assessing the “Philadelphia Union’s” importance as one where the rosters are much more similar.

We are also still using goal differential to weight match results, by assuming that a 2-1 victory was less assured than a 4-1 victory, and therefore that the model should take model errors to the latter more seriously than the former.

*Last year I introduced, mid-season, Plus/Minus values for players based upon how goals were scored when a player is on the field. The Plus/Minus values won’t be in this initial, post, but they will be returning.

What changed from last year

The biggest change is that I’ve now added Travel Distance as a factor into the model. Last season I did some data exploration for potential causes of home field advantage, and I found that Travel Distance could sometimes have a decent effect on the Away team performance (even though the effect is not necessarily what we’d expect).

I also intend to add short rest (and short upcoming rest) into the model in the next couple of weeks, but I’m just not there yet (and it should have a modest effect based upon previous research).

When doing early-season forecasts, instead of manually setting a parameter to adjust the confidence of manual predictions, I am now looking back at the previous season to set this parameter. For example, with currently only 2017 data to predict the 2018 season, I look back at how predictive 2016 data was for the 2017 season to set the confidence of my 2018 model.

Graphics.  I’m working to embed interactive charts into these posts instead of the many images I copied/pasted previously to demonstrate results. For the moment it’s not quite available on PSP directly, but for the time being I’ll post both the old format as well as some links to the interactive charts so that you can peruse however you’d like. Hopefully, these charts will also get better over time. Let me know if you have a suggestion.

CONCACAF Champions League: I’m now forecasting the 2018 CCL Championship winners. To help with this, I’m bringing in data from the Costa Rica, Honduras, Panama, El Salvador, and Dominican top domestic leagues (I was already using Mexico’s) to help with those predictions.

I am no longer going to do power rankings comparisons (with other media outlets), because it was a fair amount of manual work. I’d rather focus that time on researching the data instead.

Graphics

You can view a more interactive chart here

Power Rankings

The “Power Rankings” we concoct are the actual “strength” of the team according to competitive expectations. They are computed showing average expected points result if every team in MLS played every team both home and away.

SEBA has Philadelphia in 10th.

Currently, Philadelphia’s performance expectation is set by the following distribution of minutes:

Playoffs probability and more

Philadelphia has a 53.1% chance of making the playoffs.

Philadelphia is started off with a 3.2% chance of winning the Supporters’ Shield.

The Union has a 3.4% chance of winning the MLS Cup.

As the 2018 US Open Cup’s format has not yet been announced, I have made some assumptions based upon last year. The most challengable assumption I had to make to adapt to 2018 is that I have the 5 eligible NASL clubs entering the competition in the first round along with other sub-2nd-division clubs.

Philadelphia is currently marked as having a U.S. Open Cup win probability of 4.2%.

 

The 2019 CONCACAF Champions League qualification from the U.S. was supposed to be the 2017 & 2018 winners of the U.S. Open Cup and the MLS Cup. This is why Kansas City has already qualified. However, since Toronto (not eligible for a U.S. qualification slot) won the MLS Cup, that slot (and any other future Canadian or Kansas City wins) will be given to the club with the most regular season points over both 2017 and 2018.

Philadelphia is presently listed as having an 8.1% chance of qualifying for the 2019 edition of the CONCACAF Champions League.

The following are the probabilities of each final playoffs placement for Philadelphia.

The following shows the summary of the simulations in an easy table format.

TeamConfptsWLTGFGAGDpowerRkpwrScrPlyfsMLSCupShldUSOCChmpCANChmpCCLQualCCLChmpconfRegSeasFinishregSeasFinishplyfFinish
New York Red BullsEast71227562332921.85210030.71000NA69.70112.9
Atlanta UnitedEast69217670442611.87110024.100NA100NA223.3
Sporting Kansas CityWest62188865402531.73610014.900NA100NA133.6
Seattle SoundersWest591811552371551.6131008.100NA8.10244.4
Los Angeles FCWest57169968521641.651007.700NA7.7NA355.2
FC DallasWest5716995244861.5711004.200NA4.20466.4
New York CityEast561610859451481.5471003.300NA3.3NA376.7
Portland TimbersWest54151095448671.5541001.800NA1.8NA588.2
DC UnitedEast511411960501091.5211003.500NA3.5NA497.3
Columbus CrewEast51141194345-2111.41000.700NA0.7NA5109.3
Philadelphia UnionEast50151454950-1101.4161000.600NA0.6NA6119.9
Real Salt LakeWest49141375558-3151.3191000.300NA0.3NA61210.9
Los Angeles GalaxyWest481312966642141.3220000NA0NA71313
Vancouver WhitecapsWest47131385467-13161.227000NA00NA81414
Montreal ImpactEast46141644753-6121.388000NA00NA71515
New England RevolutionEast411013114955-6181.1890000NA0NA81616
Houston DynamoWest381016858580171.199000100NA100NA91717
Minnesota UnitedWest36112034971-22210.9630000NA0NA101818
Toronto FCEast36101865964-5131.342000NA100100091919
Chicago FireEast3281884861-13191.0680000NA0NA102020
Colorado RapidsWest3181973663-27220.9260000NA00112121
Orlando CityEast2882244374-31200.9750000NA0NA112222
San Jose EarthquakesWest2142194971-22230.8060000NA0NA122323

The following shows the expectations for upcoming Philadelphia matches:

 

7 Comments

  1. This is without Dockal and Accam being factored in? Does is not include the other MLS additions, too? It’s waaaaay more optimistic than I am, anyway.

    • Chris Sherman says:

      That’s correct. Right now, the player-scheme is only to weight how informative past matches are for determining the current strength of a club. Once the first match starts Accam, Fontana, and Trusty should factor in more and the model will update. Ultimately, I don’t have a way to forecast how additions will perform until matches start rolling in (although players no longer at the club at all should affect the team’s assessment even now).

      Also keep in mind that while the Union’s rank is 10th, the relative strength of all the clubs is currently very close due to the model’s uncertainty. Even Toronto, where we actually have some match data from CCL, is only currently estimated to score an additional 0.24 points/per/game than the Union.

  2. Interesting that Philly and NE should be battling it out for the last playoff spot when most pundits think they’ll be fighting for the cellar.
    .
    Your last graph cuts off at the Seattle game and is missing the last few games of the season. I’m guessing their expected points against Seattle will plummet once you start factoring in short rest since it involves long travel and short rest on both ends (hopefully they’ll be able to use that as a throwaway and not play anyone over the age of 25 except McCarthy).

    • Chris Sherman says:

      Perhaps, but my research showed that long distances were quite favorable to Away clubs compared with mid-range-distances, possibly due to clubs taking extra time/preparations for the coast-to-coast trips. Additionally, short rest did seem to have an effect, but had a smaller impact than many of us would expect.

  3. Old Soccer Coach says:

    Looking forward to the new Additions, and the graphic interface. Many thanks. Clearly a busy off-season with a lot of effort on our behalf.
    .
    How did you generate any data for LAFC? You may have already explained that but my brain did not catch it.

    • Chris Sherman says:

      Thanks! LAFC doesn’t have any data, which means the model treats it like an average club as far as performance, so that will likely change rapidly as data comes in

  4. I was getting a bit worried the SEBA analysis column wasn’t coming back this season. Keep up the great work Chris!

Leave a Reply to JB Cancel

Your email address will not be published. Required fields are marked *

*