A Look At The Weekend Performances Within the Final Third

I just recently discovered that the Golazo web application---with up-to-the-minute statistical game information courtesy of MLSsoccer.com---provides details for teams in the final third. Sadly, they don't keep the data available for long, so I took advantage of a late evening to tally up the following numbers over the nine MLS matches this weekend. Below are the passing rates for each team in the final third of the pitch.

Team         Pass Completions  Pass Attempts  Pass Percentage  Opponent
Union        68                105            74.20%           Vancouver
RSL          68                98             72.40%           NYRB
NYRB         89                131            69.10%           RSL
San Jose     60                97             67.50%           Portland
Houston      92                141            67.30%           Chicago
Portland     91                146            66.40%           San Jose
New England  72                121            66.20%           DC
DC           69                115            64.10%           New England
Sporting KC  75                121            63.50%           Montreal
Colorado     64                113            63.30%           LA Galaxy
LA Galaxy    69                121            63.20%           Colorado
Toronto FC   85                137            62.90%           Columbus
Columbus     54                107            62.50%           Toronto FC
Seattle      67                117            61.50%           Chivas
Chicago      57                103            61.10%           Houston
Vancouver    51                102            55.30%           Union
Chivas       54                108            52.00%           Seattle
Montreal     31                80             50.80%           Sporting KC

There really isn't much in the way of true context at this point. Some of this information is still about style rather than performance, but comparing the results and then applying some added data will be interesting. I wonder what, if anything, can be determined by this type of data.

I'd love to hear input from all you smart people.

ASA Podcast XV: The One Where We Talk About Home Field Advantage

We're back with Episode 15. In this episode we talk about the USMNT and the Gold Cup Final (which the US has now already won), we talk a bit about home field advantage, and we finish up with a little game of Marry, Boff, Kill, announcer style. We must decide between Eric Wynalda, Gus Johnson and Ian Darke. You can find us on both iTunes and Stitcher... please be kind with your ratings.

[audio http://americansocceranalysis.files.wordpress.com/2013/07/asa-episode-xv.mp3]

A little information on: PDO

With the holiday behind us we can once again return to the business at hand. The halfway mark is upon us and MLS has given us an exciting and very tight race across both the Western and Eastern conferences. With that comes another week of podcasts. #AnalysisEvolved. This week we plan on talking a bit about a statistic by the name of PDO. Despite what you might expect from a statistic named with three seemingly random letters, this is not an acronym. It's pronounced how it sounds.

Originally a hockey metric, PDO is simply the sum of save percentage and scoring percentage, multiplied by 1000. The rest of the history as it applies to soccer isn't necessarily important.
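To make the arithmetic concrete, here is a minimal Python sketch of that definition. It assumes both percentages are measured against shots on target, which is how the metric is usually computed in soccer; the function name and the sample counts are made up for illustration and are not real team data.

```python
def pdo(goals_for, shots_on_target_for, goals_against, shots_on_target_against):
    """PDO = (scoring percentage + save percentage) * 1000.

    Assumption: both percentages use shots on target as the denominator.
    """
    scoring_pct = goals_for / shots_on_target_for
    # Every shot on target that did not become a goal counts as a save.
    save_pct = 1 - goals_against / shots_on_target_against
    return 1000 * (scoring_pct + save_pct)


# Hypothetical team: 20 goals on 70 shots on target, 15 conceded on 65 faced.
example = pdo(20, 70, 15, 65)
print(f"PDO: {example:.0f}")
```

A team that finishes and concedes at exactly the same rate lands on 1000, which is why values well above or below 1000 are the ones expected to drift back toward the middle.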

A great introduction to the idea and how it applies to the sport is given by Tyler Dellow of mc79hockey.com, and there is another introduction on the site Pension Plan Puppet by one "Skinny Fish."

Both sites give examples of how PDO can potentially isolate a team's performance over the course of a season and compare those to past performances, various incarnations of the team, and of course, other teams.

I can't say for certain who was the first to apply this team-level analysis to soccer (Grayson confirmed he was the first...). But the oldest article I can find referencing its usage within the sport comes from the ever-smart and sophisticated Canuck, James Grayson. His series of introductions to the metric is linked below.

A Premier: PDO

PDO – part I

PDO – part II

Along with an explanation of the stat and some information about how it regresses to the mean---because it's fantastic at doing that---there is also a bit of information about how it can be used to compare different clubs to one another.

Basically, it comes down to being one of the best barometers of a team. While we can look at point totals and standings in the table, PDO can reasonably tell us if a team is overperforming or underperforming.

I'm not in any way an expert on this stat. There are of course some occasions where you may run into issues trying to apply it to a specific scenario, and I could point anyone in search of more answers on the subject in better directions than toward myself. I could easily name a dozen or so people who are much better versed in this metric than I am.

However, since we are going to talk about it on our podcast this weekend, I wanted to give the reader/listener an opportunity to find some quick and easy references to the material before hearing us discuss it.

I'll have updated PDO standings for you all tomorrow, which will lead into our discussions on Saturday.

Possession Confusion Update

I wrote back in May about the paradoxical nature of OPTA's possession statistic in MLS---how more possession corresponds to better shot ratios, better shot ratios correspond to better goal differentials, but somehow more possession does not correspond to better goal differentials when we control for certain variables. In fact, I found that once I controlled for the teams playing in a given game, possession had a negative correlation with goal differential and winning. The new data agrees with the old. Correlations suggest that team possession still correlates positively with scoring attempts (p-value = 0.01), scoring attempts still correlate positively to goal differential (p-value = 0.02), and now with more data, possession is also positively correlated to goal differential (p-value = 0.01). That all seems to line up with logic, but the paradox from before still exists.

When I look game-by-game and control for the home and away teams, in-game possession has a positive correlation to shot ratio, but a negative correlation to goal differential. In other words, the team that has more possession in a given game tends to also earn more shot attempts, but still loses more frequently than we would expect. As mentioned in the first article back in May, this seems paradoxical. I had some theories in that article, but reader David Stringer got me to think about another logical explanation.

Teams that develop leads tend to sit back more defensively, and often are satisfied allowing the opponent to possess all it wants in less dangerous parts of the pitch. A team that has a lead in the second half probably got that lead because it was generating more opportunities (read: attempts). It makes sense that the team that eventually went on to win also produced better shot ratios early on before getting the lead. After getting the lead, the team in front was willing to give up extreme possession relative to a more neutral shot rate. Thus it ends the game with poor possession, but a still favorable shot rate.

Just a theory, and I'd love to hear about other ideas! The stats are definitely not lying. These correlations are very real, but the causes for the possession paradox are still elusive.

ASA Podcast XI: The One Where We Talk Gold Cup XI And MLS Best XI

It's all about XI. We talk about Eddie Johnson's golden dome, run down the results of the US Open Cup (spoiler alert: Matty was 4/4 with his predictions), cover the possible Gold Cup starting XI, and then talk about our personal starting XI in MLS. Enjoy! My apologies for not getting this up sooner, as I ran into a bit of a hiccup yesterday. Hopefully that's behind me and we can press forth.

[audio http://americansocceranalysis.files.wordpress.com/2013/07/asa-episode-xi.mp3]

Introducing Shot Locations

On the site and on the podcast we have discussed shot rates an awful lot. A team’s shot rate is simply how many shots it has taken divided by how many shots it has conceded to its opponents. Whenever I make a Game-of-the-Week prediction on the podcast, you’ll hear me use two primary pieces of information: which team is at home and which team has recorded the better shot rate. In general, shot rates help to explain not only the relative number of scoring opportunities a team has given itself, but also the relative number of scoring opportunities it is likely to get in the future. It’s predictive. There are, however, some conspicuous outliers in the league: teams that just don’t seem to follow the rules. Harrison wrote earlier this week about Montreal’s shot data. While Montreal gives up far more shots than it earns for itself, Harrison pointed out that Marco Di Vaio and company also place the ball quite well, finding the lower corners a high percentage of the time.
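For anyone who wants to compute shot rates at home, here is a small Python sketch of the definition above. The function name is just an illustration; the sample numbers are Portland's first six games as quoted in the site's "Prediction versus Explanation" post (89 attempts taken, 57 allowed).

```python
def shot_rate(shots_for, shots_against):
    """Shots taken divided by shots conceded.

    A value above 1.0 means the team is outshooting its opponents.
    """
    return shots_for / shots_against


# Portland through six games: 89 attempts for, 57 against.
portland = shot_rate(89, 57)
print(f"Portland shot rate: {portland:.2f}")
```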

Perhaps Montreal’s own finishing rate is for real. But I won’t be convinced about the low rate at which teams have finished against Montreal before first delving into some new numbers. We have our own shot location data here at American Soccer Analysis, now, and I’m going to use it.

Scoring Zones

I have broken the field down into six primary scoring zones (seen to the right) in the hopes of accounting for the difficulty of both angle and distance. It is possible that some teams earn a higher quality of opportunities rather than a higher quantity, or vice versa. In addition to recording where each team gets its own shots, I have also gathered the locations of the shots that each team has given up defensively from each zone. Here are some interesting tidbits about Montreal’s defense.

Despite being ahead much of the time—which would seemingly encourage low-quality attempts—Montreal still gives up a league-average proportion of shots from high-scoring zones one and two. In fact, if Montreal’s opponents had finished their attempts from zones one and two at the league average clip, Montreal would have given up six additional goals this season. However, including all six zones, Montreal would have given up just two additional goals due to some unlucky results from distance.

Because Montreal has played a wide range of opponents, it would make sense that its goal scoring rates against would stabilize to something close to league norms. It turns out, for the most part, that those rates have stabilized. The zones help to control for difficulty of shots, and Montreal’s defense isn’t getting particularly lucky based on the shots it is allowing. The major controversy still lies in the Impact’s offense, and whether or not it can sustain a league-leading finishing rate. According to its shot locations, the Impact "should have" scored eight fewer goals this season.
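The "should have" calculation is simple enough to sketch in Python. This is only an illustration of the idea, not the actual spreadsheet behind the numbers above: the league-average finishing rates, the shot counts per zone, and the goal total are all hypothetical placeholders.

```python
# Hypothetical league-average finishing rates (goals per shot) for zones 1-6.
LEAGUE_RATE = {1: 0.30, 2: 0.15, 3: 0.07, 4: 0.05, 5: 0.03, 6: 0.02}


def expected_goals(shots_by_zone, rates=LEAGUE_RATE):
    """Goals a team 'should have' scored or allowed at league-average finishing."""
    return sum(shots * rates[zone] for zone, shots in shots_by_zone.items())


# Hypothetical shots allowed by one defense, keyed by zone.
shots_allowed = {1: 10, 2: 40, 3: 30, 4: 40, 5: 50, 6: 20}
goals_allowed = 14  # hypothetical actual total, own goals excluded

expected = expected_goals(shots_allowed)
print(f"Expected goals allowed: {expected:.1f}")
print(f"Actual minus expected: {goals_allowed - expected:+.1f}")
```

A negative difference means the defense has allowed fewer goals than its shot locations suggest; the same function works for a team's own attack by passing in the shots it has taken.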

On the flip side we have Sporting Kansas City. Unlike Montreal, the Wiz have dominated the league all season in shot rates, and yet find themselves third in the East in points per match. Could quality of shots be playing a role?

Possibly. Sporting KC gets more shots from zones two and four than the league average team, and those tend to be decent scoring zones. SKC has outscored its opponents by five goals on the season, but with average finishing rates from each zone, one would expect a goal differential closer to +7 or +8. SKC has underachieved by only about two goals according to the shot locations data. How much of that difference is skill versus luck is still well beyond this blogger, but maybe someday...

*Own goals are taken out of the shot locations data.

Montreal Impact And Shot Placement

We like raw numbers around these parts. The lower the common denominator, the better. But we like numbers in general; it's as if we are... kind of involved. There isn't much in the way of discrimination. You can take numbers, and they can tell a story. Numbers can be just as biased as any news reporter or general fan, too. They can also help give us insight into a specific question that we may have. A popular question around these parts is simply: why is Montreal so good? This is a club racing towards an opportunity for the Supporters' Shield. They sit 4th in the table with 26 points, two points behind leaders FC Dallas, and have at least two games in hand on all clubs above them in the standings. Obviously, they are in very good shape, with a chance to run away with hardware this season. So how are they doing it?

Well, the one specific point of contention for us is their shooting. Currently the Impact are 5th in the league in shots on target per match and even further down the pipe at 14th with total shots attempted per match. So the question then becomes, how have they scored 1.69 goals a game, good for best in all of MLS?

They're shooting the lights out. Well, sort of. The ball is ending up in the back of the net at unusually high rates. Matthias and I have pretty much just chalked this up to being an irregularity, an outlier, and one that will eventually see the Impact coming back down to earth.

And yet, they haven't.

Montreal have the highest goal scoring rate in the league, yet have the same goal differential as the New England Revolution, who sit 11th in the Supporters' Shield table. Six of their eight wins have come by a single-goal margin, which tells us they've been strong in holding their leads.

It's obviously something that could, and likely will, involve much further investigation as time permits. But I did formulate some interesting enough thoughts while digging through Whoscored.com and Squawka data.

Goal Locations

A good 80% of the goals are in high-percentage conversion locations on the frame, predominantly low and presumably away from the keeper. You can see that trend continue with their overall shot selection.

shot locations

The majority of their shots are, again, in great places, with one third of the total shots in the lower half of the frame.

I'm not at this point sold that the Impact are going to come back down to earth with their conversion ratio. It's not so much that they are taking shots, but the type of shots they are taking. Marco Di Vaio is 36 and with that comes experience and intelligence.

He understands what he's doing. I believe that his effort to place high percentage shots is not only a skill; it's purposeful, and it's a game plan.

I'm not sure if they can continue to win in their +1 goal states, but their defence* has been very good thus far. It's possible, considering their current form, that they have a legit shot at the Supporters' Shield at year's end.

Then again, we just may have to dig deeper into this.**

*Editor's note: Harrison is turning Redcoat on us.

**Editor's note: We will.

Prediction versus Explanation

There is a subtle, yet very important, distinction between explanation and prediction in most sports, and Major League Soccer is no different. I don’t intend to make this long or particularly math heavy, so hang on. Here’s a simple example of what I’m talking about when I refer to explanation. In its first six games of the season, the Portland Timbers recorded 89 attempts and allowed just 57 to their opponents. During that same time, Portland scored ten goals while allowing eight. I might explain that the Timbers’ +2 goal differential was due—at least in part—to earning more offensive opportunities than their opponents.

Here’s another example, but this time in regards to prediction. In its first six games, the New England Revolution scored two goals while allowing six to its opponents. During its next six games, New England scored eight goals while allowing just three to its opponents. Using just New England as an example, it would seem as though goal differential in the past (-4) poorly predicted goal differential in the future (+5).

Of course, we have nineteen teams, not two, so I sorted through all nineteen teams looking for patterns. Here is what I found.

A team’s goal differential during its first six games explained its total points over that same time period extremely well (R² was 77%). This is not surprising. Teams that tend to score more goals than their opponents also tend to win more games. Nothing shocking there.

However, a team’s goal differential in the first six games of the season provided no help in predicting its total points over the next six games. Here’s the plot on that one:

GD vs. Future Points - 6 weeks 2013

There is virtually no relationship between how well a team scored before, and then how many points it earned later. In other words, goal differentials are not predictive over six games.
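If you want to run the same explanation-versus-prediction check yourself, a rough Python sketch follows. It uses a hand-rolled Pearson correlation (for a single predictor, R² is just the correlation squared), and the six teams' worth of numbers are invented purely to show the mechanics; they are not the real 2013 data behind the plot.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical data: one entry per team.
gd_first_six = [-4, -2, 0, 1, 2, 3]       # goal differential, games 1-6
pts_first_six = [3, 5, 8, 9, 11, 13]      # points earned, games 1-6
pts_next_six = [9, 7, 8, 10, 6, 8]        # points earned, games 7-12

explain_r2 = pearson_r(gd_first_six, pts_first_six) ** 2
predict_r2 = pearson_r(gd_first_six, pts_next_six) ** 2
print(f"Explanation R^2: {explain_r2:.2f}")
print(f"Prediction R^2:  {predict_r2:.2f}")
```

Swapping `gd_first_six` for each team's attempts differential is all it takes to test whether shots do a better job on the prediction side.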

But if you’re convinced the lack of predictive ability is completely due to a small sample size of twelve total games, check this out. A team’s attempts differential in its first six games shows a statistically significant correlation to both its future goal differential and points earned:

AD vs. GD and AD vs. Pts

 

Because it’s sports, prediction is never going to be precise, and these aren't perfect correlations at all. But I find it particularly impressive that over just twelve total games, the attempts data from a team’s first six games shows statistically significant predictive ability of the team’s results in the next six games.

If you’ve listened to our Game-of-the-Week section during our podcasts, you hear us talking a lot about shot ratios. This post hopefully clarified why we do that. Past shot ratios are better than past results at predicting future results.

Game Of The Week Review: Montreal Impact Visit Sporting Kansas City

I know I shouldn't be surprised by the Impact stealing a match on the road, especially considering Sporting's lack of strength at home in their recent string of results. Still, with all the statistical pointers, it's quite uncanny that they came up with even a point, let alone all three.

SKC-IMP

It's hard to look at the tackles, interceptions and clearances and not think that they're a byproduct of the Impact largely being on their heels for the majority of the match. That is in large part due to the style that the Montreal Impact implements. The team as a whole has functioned with 48% possession through 12 matches and even less possession (44%) in away games. It's not a bad thing, but it naturally produces more defensive events.

Much of our discussion during the podcasts has dealt with shots and their predictive nature. Montreal has been at the forefront of the discussion, with amazing results despite being outshot on both total attempts at goal (12 to 15 per game) and actual shots on target (4.9 to 5.2). The Montreal Impact is currently sporting 26 points with a goal differential of +7. Not to mention they are boasting the highest conversion rate in the league at 15.3%, better than the next highest (FC Dallas, 13.9%) by nearly a point and a half.

Matthias, Drew and I have discussed whether or not Montreal can continue to maintain such a high finishing rate. It's a legitimate question given the situation but, as pointed out by Ravi Ramineni in a discussion this morning on Twitter, the problem with making such assertions is that we're looking purely at shot totals rather than at the quality of each shot.

However, while it's interesting enough to question whether or not the Impact are going to stick around and continue to score goals at their current rate, I'm going to leave that for another day. It's even more interesting that Kansas City came up with twice the number of attempts on goal and then only scored once. That one goal came from a foul that was made right on the line of the 18-yard box. Had the linesman not been on his game, that call could have easily been a free kick.

The question that I really have is more about why Sporting was unable to build upon their chances. Looking at the number of clearances that the Impact had, I kind of wondered if they just couldn't maintain the needed pressure upon Troy Perkins' goal.

Kansas City Attempts  Name             Minutes
FIRST HALF
Miss                  Joseph Peterson  6'
Attempted blocked     Paulo Nagamura   19'
Miss                  Claudio Bieler   25'
Miss                  Claudio Bieler   42'
Goal                  Claudio Bieler   49'
SECOND HALF
Miss                  Seth Sinovic     49'
Miss                  Claudio Bieler   56'
Miss                  Kei Kamara       60'
Attempted Save        Benny Feilhaber  65'
Miss                  Aurélien Collin  69'
Attempted Save        Paulo Nagamura   70'
Miss                  Paulo Nagamura   71'
Attempted blocked     Claudio Bieler   76'
Miss                  C.J. Sapong      78'
Attempted Save        Joseph Peterson  82'
Attempted Save        Aurélien Collin  85'
Miss                  Joseph Peterson  90'
Attempted Save        Claudio Bieler   92'
Miss                  Kei Kamara       94'

SKCTimeline

Looking at this you can see three real bunches: first from the 69th to the 71st minute, again in the 76th and 78th minutes, and then in the final moments of the game a solid run from the 90th to the 94th, ending with Kei Kamara's shot that just drifted wide.
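Those bunches can also be picked out mechanically. Here is a short Python sketch that groups attempts falling within a couple of minutes of each other; the two-minute gap and the function name are my own assumptions for illustration, while the minutes are Sporting's second-half attempts from the table above.

```python
def find_bunches(minutes, max_gap=2, min_size=2):
    """Group attempts whose consecutive gaps are at most `max_gap` minutes.

    Only groups with at least `min_size` attempts are returned.
    The thresholds are arbitrary choices, not an established standard.
    """
    bunches, current = [], []
    for minute in sorted(minutes):
        if current and minute - current[-1] > max_gap:
            # Gap too large: close the current group and start a new one.
            if len(current) >= min_size:
                bunches.append(current)
            current = []
        current.append(minute)
    if len(current) >= min_size:
        bunches.append(current)
    return bunches


# Sporting KC second-half attempts, by minute.
second_half = [49, 56, 60, 65, 69, 70, 71, 76, 78, 82, 85, 90, 92, 94]
bunches = find_bunches(second_half)
print(bunches)
```

With a two-minute gap this picks out the same three spells described above; loosening the gap to three or four minutes starts merging them into longer periods of pressure.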

Ultimately, I'm more inclined to believe that Sporting did just as much to not earn a result as the Impact did to really earn one. But while most people would be willing to chalk this game up to luck, I just think it's the largest example of what the Impact do well, and that's disrupting opposing teams while allowing the Impact to sit in their own defensive third. I'm still not inclined, as I'm sure Matty isn't either, to give the Impact the full rights of being a team that is "for real". But they certainly continue to prove their case week in and week out.

Finding Numbers: WhoScored And Their Expanding American Statistics

I've stated that one of the goals for this site is the production of numbers, but also generally using them as bookmarks to bits of information. I haven't been too good about this second part, but I'm going to try to get better. One thing that was pointed out to me this last week (h/t Brian Stern) was that WhoScored has at long last increased the information that they provide on MLS. We've had a link to WhoScored for some time, but generally speaking they haven't been very good about keeping the information updated, keeping it accurate, or producing anything worth visiting on a consistent basis. But the fact that they have some basic visuals and do have good content for most of the rest of the world makes it a worthy site. There was also the hope that they would finally find the time to spend on the American-based league.

Well, now they have and it's pretty awesome.

For instance, if I wanted to know who averages the most passes for the San Jose Earthquakes, I could go in real quickly and see that Sam Cronin averages 45.2 passes per game, more than seven above the next man, Ramiro Corrales (37.9), and that Cronin has completed 475 of 633 passes. The only thing at this point in time that could be better is for it to give a spray chart of where he likes to pass and to whom.
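Raw totals like these are easy to turn into rates. As a quick Python sketch (the function name is mine, the numbers are the Cronin and Corrales figures quoted above):

```python
def completion_pct(completed, attempted):
    """Pass completion percentage from raw counts."""
    return 100 * completed / attempted


cronin_pct = completion_pct(475, 633)
per_game_gap = 45.2 - 37.9  # Cronin's passes per game minus Corrales'

print(f"Cronin completion rate: {cronin_pct:.1f}%")
print(f"Passes-per-game gap to Corrales: {per_game_gap:.1f}")
```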

I could see that, according to the stats, Philadelphia has yet to score purely on a counter attack and seems to favor carrying the ball down the right side of the pitch, as they use that side 37% of the time. Alternatively we can see that DC United has taken 65% of its shots from the middle of the pitch. But mostly what it does is allow us to put things in context.

Is it unusual that Houston attempts short passes 80% of the time, or is that normal? Well I can see that DC United and the Columbus Crew make short passes at 78%, but short passes make up 82% of the Montreal Impact's pass attempts. There isn't a wrong or right amount of short passes, but it does help us understand the specific influence of the style and attack.

You may or may not have already seen WhoScored and you may or may not have seen that they updated their American side site. That's cool. This is for those that didn't hear. We're trying to make it easier for people to conduct their own analysis and do so in the most educated way possible.

As a side note, it looks like they plan to further expand the information they provide on the US leagues, as they have an NASL section dated for later this year, probably anticipated for the second half of that league's season. That could further help us when attempting to gather information on some of the US Open Cup teams and some of the lesser known players who don't have an opportunity to have their name shine.