Sports Reference Blog

Archive for the 'Features' Category

Senior National Team Data Now on FBref

20th March 2020

FBref already covers a wide variety of competitions around the globe, and continues to expand with its latest addition of senior national team competitions. There's a lot involved in that, but the headline is the FIFA Men's World Cup, which we now have data for from its first incarnation in 1930 to the most recent competition in 2018. That means we finally have pages for classic footballing legends Diego Maradona and Pele, while also filling out modern players who've made their mark on the global stage such as Miroslav Klose and Ronaldo. Peruse through the histories of countries that have a nicely stocked trophy case as well as some nations that have struggled historically. In addition to the World Cup, several major continental competitions such as the European Championship, Copa America and the Asian Cup are covered for recent years. International friendlies data will also be available on FBref.

The full Men's World Cup addition of course complements our existing collection of FIFA Women's World Cup history. One of the competitions included in this national team addition is the SheBelieves Cup, which we now have data for since its inaugural 2016 edition. Other continental women's national team competitions are included in this addition as well.

Here is the full list of competitions now available on FBref with this announcement:

World Cup - 1930-2018 (entire history)
European Championship - 2000-2020
AFC Asian Cup - 2000-2019
FIFA Confederations Cup - 2001-2017
UEFA Women's Championship - 2001-2017
OFC Nations Cup - 2007-2008-2016
Africa Cup of Nations - 2010-2019
CONCACAF Women's Championship - 2014-2018
AFC Women's Asian Cup - 2014-2018
International Friendlies (Men) - 2014-2020
CONCACAF Gold Cup - 2015-2019
Copa America - 2015-2020
SheBelieves Cup - 2016-2020
Africa Women Cup of Nations - 2016-2018
Copa América Femenina - 2018
OFC Women's Nations Cup - 2018
UEFA Nations League - 2018-2019
Algarve Cup - 2019-2020
International Friendlies (Women) - 2019-2020

Qualifiers
FIFA World Cup Qualification — CONCACAF - 1998-2018
FIFA World Cup Qualification — UEFA - 1998-2018
FIFA World Cup Qualification — CAF - 2002-2022
FIFA World Cup Qualification — CONMEBOL - 2002-2018
FIFA World Cup Qualification — AFC - 2002-2022
FIFA World Cup Qualification — OFC - 2014-2018
WCQ — UEFA (W) - 2019

AFC Asian Cup qualification - 2000-2019
UEFA Euro Qualification - 2008-2020
Africa Cup of Nations qualification - 2015-2021
UEFA Women's EURO Qualification - 2017-2021
AFC Women's Asian Cup Qualification - 2018

Player statistics from national team competitions can be viewed in the statistics tables of player pages by selecting the National Team tab, which will show their stats for every competition we currently cover. We are excited about this addition and hope to get to a place where we can also further fill out the statistics of the myriad of historical players we now have pages for. You can keep up with the latest additions of competitions coverage and new features here on the Sports Reference Blog, or by signing up for the This Week in Sports Reference mailing list. Feel free to send us any questions or suggestions through our feedback form or FBref's official Twitter account.

 

Posted in Announcement, Data, FBref, Features, History | Comments Off on Senior National Team Data Now on FBref

NFL100 Awards on PFR

16th March 2020

Part of the celebration of the National Football League's 100th anniversary included celebrating the top 100 in the league's history in various categories. Pro Football Reference has collected the major awards for display on the site: top 100 games of all time, top 100 teams of all time, and the NFL's 100th Anniversary All-Time Team. You can find them all linked on PFR's Awards index. The top 100 games list is unique in that we include a summary of what made the games notable, as well as a link to YouTube for games that the NFL has officially uploaded in full in case you want to go through history in the offseason.

If you have any questions or suggestions, feel free to contact us through our feedback form or Pro Football Reference's official Twitter account. Thanks for following us!

Posted in Announcement, Awards, Data, Features, History, Pro-Football-Reference.com, Super Bowl | Comments Off on NFL100 Awards on PFR

2020 WAR Update

16th March 2020

As we approach the beginning of the 2020 season, we have made some updates to our Wins Above Replacement calculations.  You may notice some small changes to figures as you browse the site. As always, you can find full details on how we calculate WAR here.

Defensive Runs Saved Changes

Last week, we updated Defensive Runs Saved (DRS) totals across the site with new figures from Baseball Info Solutions.  The new methodology involves breaking down infielder defense using the PART system - assigning run values to Positioning, Air Balls, Range, and Throwing.  Under the new system, an infielder’s total DRS is the sum of his Air Balls, Range, and Throwing runs saved, while Positioning runs saved are credited to the team as a whole.  You can read more about the updates in the Sports Info Solutions blog.  The PART system applies to all infielders since 2013.

Folding these numbers into WAR, we see some significant changes for individual player seasons.  The 2019 Oakland A’s get even more recognition for defense on the left side of their infield, with shortstop Marcus Semien gaining 0.7 WAR and third baseman Matt Chapman gaining 1.6 WAR from the new DRS numbers, lifting both players above Mike Trout and into second and third place respectively on the 2019 AL WAR leaderboard.  Chapman’s 1.6 additional WAR represents the largest single-season change in this update.

On the other end of the spectrum, we see Adrian Beltre with the most significant drop in this update, losing 1.5 WAR in 2015.

Since we use DRS to measure the quality of a team’s defense, these new values also impact pitcher WAR values.  Team total DRS changed by as much as 46 runs for a given team and season - the 2019 Dodgers defense improved from 75 DRS to 121 DRS by non-pitchers under the new system.  Once applied to a specific pitcher, however, the changes to WAR are much smaller in magnitude than the changes to individual fielders. The most extreme example is Hyun-Jin Ryu, who pitched 182.2 innings in front of the 2019 Dodgers defense.  Considering the Dodgers defense to be 46 runs better across the entire season, and considering that Ryu was the pitcher for 13.52% of the Dodgers’ balls in play in 2019, we adjust our expected runs allowed for Ryu by 6.2 runs for the season. After following the rest of the steps in our pitching WAR calculation, the end result is a drop of 0.3 WAR for the season.  All other changes to pitching WAR from this change to team defense are smaller than Ryu’s 0.3 WAR drop in 2019.

Park Factors

Park factors for 2018 have been re-computed to include the 2019 season, since WAR uses a three-year average for park factors when computing pitching WAR.  The most significant change here is the Miami Marlins, whose pitching park factor rose from 90 to 95 (where <100 represents a pitcher’s park and >100 represents a hitter’s park).  José Ureña sees the biggest benefit from this, with his 2018 WAR rising by 0.7 wins. All other changes to pitching WAR from updated park factors are smaller than Ureña’s 0.7 WAR gain in 2018.

New Game Logs from Retrosheet (1904-1907)

Last month, we updated the site with new data from Retrosheet, including new game logs for players from 1904 to 1907.  Having game-level data allows us to be more precise in our WAR calculations, since we can consider the specific ballparks a pitcher played in and the opponents he faced.

Take Christy Mathewson in 1907 as an example.  Prior to this change, we used the league average (excluding his team) of 3.36 runs per nine innings as the expected quality of his opposition.  However, with game-level data, we can see that Mathewson’s actual opponents averaged 3.55 runs per nine innings, showing that Mathewson was probably used strategically and started more games against better opponents.  Indeed, Mathewson pitched in 10 of the Giants’ 22 games against the league’s best offense, the Pirates, as well as 7 of the Giants’ 22 games against the Cubs, the NL’s second-best offense. Against the Dodgers and Cardinals, who each struggled offensively and scored fewer than 3 runs per game, Mathewson pitched in just 8 games total.

Knowing this about his usage, we can set more accurate expectations for how many runs an average player would have allowed under Mathewson’s circumstances.  By adjusting the quality of his opposition, we expect an average pitcher to have allowed about 7 more runs over the course of the season, resulting in a bump of 0.9 WAR in 1907.  All other changes to pitching WAR from new game log data are smaller than Mathewson’s 0.9 WAR gain in 1907.

Baserunning and Double Plays from Play-by-Play Data (1931-1947)

When calculating runs from baserunning and double plays, we use play-by-play data from seasons where it is complete enough to credit players for things like scoring from first on a double, advancing from first to third on a single, and hitting into fewer double plays than expected.

In the past, we have taken play-by-play data into account back to 1948 for baserunning and double plays, because the data further back than that has been incomplete and could give players an advantage in their WAR simply by having more complete play-by-play records than their peers.  As this data has become more complete over time, we have moved this cutoff back to 1931. The data is still somewhat sparse for games that took place during World War II (1943-45), but we felt it was worth including those years as well.

Pete Reiser of the Brooklyn Dodgers was skilled at taking extra bases, and it showed in the play-by-play accounts.  In 1942, he took extra bases at a rate of 55%, compared to the league average of 45%. Additionally, the Dodgers were tied with the Cardinals as the league’s top scoring offense, so Reiser had many opportunities to put his speed to use.  He scored from first on doubles a league-leading ten times in just 15 opportunities, and also scored from second on a single 24 times, good for 5th in the NL that year, in just 29 opportunities. Using this play-by-play data while computing WAR gives Reiser an additional 1.2 WAR in 1942.  All other changes to batting WAR from this change are smaller than Reiser’s 1.2 WAR gain in 1942.

Caught Stealing Totals from Game Logs (1926-1940)

When crediting runners for how many runs they contributed with their baserunning, we take into account their stolen base and caught stealing totals.  Caught stealing totals are missing for many players between 1926 and 1940, but we have complete game logs for players in that span.

In the past, when we didn’t have a caught stealing total for a player, we would estimate how many times they were likely to have been caught stealing based on the league’s stolen base success rate and the ways the player reached base during the season.

We are now using actual caught stealing totals from the players’ game logs, so there are some changes for players who did considerably better or worse than we had been estimating.

Take, for example, Freddie Lindstrom.  In 1928, the Giants third baseman stole 15 bases, but his official season stat line does not have caught stealing available.  Previously, we had estimated that he was caught stealing 11.57 times, based on everything else we knew about his performance and the league he played in.  However, game logs indicate that Lindstrom was caught 21 times, nearly twice as often as we had estimated. This difference gets folded into our baserunning runs calculation and results in a drop of 0.4 WAR.  All other changes to batting WAR from this change are smaller than Lindstrom’s 0.4 WAR drop in 1928.

Biggest Career Movers

Hall of Famer Ernie Lombardi sees the biggest change to his career WAR with this update, sinking from 46.8 WAR to 39.5 WAR, a drop of 7.3 wins.  The largest gain goes to infielder Lonny Frey, who picks up 5.2 wins. Both these players played in the 1930s and 1940s and saw big changes because of their baserunning.  Lombardi is known for being one of the slowest runners in baseball history, and this update shows that the numbers back that reputation. Frey was a fast runner in an era where stolen bases were rare, so he has been underrated to this point when it comes to his baserunning contributions.

On the mound, previously cited Hall of Famer Christy Mathewson is the big winner.  As discussed above, his WAR now recognizes how his manager would use him against tougher opponents, and he sees his career WAR jump by 2.2 wins.  Barney Pelty experiences the biggest drop of 1.9 wins.

We’ve highlighted some of the more extreme changes here, but to see full lists of the largest changes to season and career WAR totals, please see the spreadsheet here.

We're very excited about these new additions and hope you enjoy them as well. Thanks to Baseball Info Solutions for their contributions. Please let us know if you have any comments, questions or concerns.

Posted in Advanced Stats, Announcement, Baseball-Reference.com, Data, Features, History, Leaders, Play Index, Statgeekery, WAR | 5 Comments »

Box Scores Since 1904 & Play-by-Play Since 1918 Now on Baseball Reference

20th February 2020

Thanks to the efforts of our friends at Retrosheet, we have added box scores back to the 1904 season to Baseball Reference. Previously, our game log coverage was back to 1908. Additionally, we have added partial play-by-play coverage for games games as far back as 1918. Previously, our oldest play-by-plays were from 1925. Since our last major Retrosheet update, the final two missing full play-by-plays of 1973 were added which means we now have complete PBP data back to that season now. In addition to the boxes and PBPs themselves, this update allows for a variety of new information searchable in the play index, as well as new rows of information in team/player/league statistics tables.

Here are some examples of the new information/searches available on the site.

If you have any questions about our data coverage, you can always see it here.

We're very excited about these new additions and hope you enjoy them, as well. Please let us know if you have any comments, questions or concerns.

And thanks again to Retrosheet!

Posted in Announcement, Baseball-Reference.com, Data, Features, General, History, Play Index | 7 Comments »

Every Buzzer-Beater in NBA History Added to Basketball-Reference

17th February 2020

After years poring over play-by-plays, watching videos (tough, I know) and reading thousands of game stories in newspaper archives, Basketball-Reference has compiled the first comprehensive list of every buzzer-beating game-winning shot in the history of the NBA and the BAA. To date, there have been 772 such shots in NBA history, including free throws with time expired. I'm defining game-winning buzzer-beaters as successful shots taken with the shooter's team tied or trailing which left no time on the clock after going through the net. These are true game-enders leaving no opportunity for the opponent to respond. As Tim Duncan knows, there are no such thing as game-winning buzzer-beaters that leave even 0.4 seconds on the clock.

Read the rest of this entry

Posted in Announcement, Basketball-Reference.com, Data, Features, History | 21 Comments »

Every Buzzer-Beater in NCAA Tournament History

6th February 2020

We have compiled every game-winning buzzer-beater in NCAA Tournament history. Since you cannot advance the ball to half-court with a timeout in college basketball, we have been lenient with how we define "buzzer-beating game-winner" and included all shots in the final 2.0 seconds of a game that put the winning team in the lead (so either tied or trailing at time of shot, and leading afterwards). However, it wasn't until the 1993-94 season that the clock automatically stopped on makes in late-game situations. Consequently, we have included some shots from 1993 and earlier that were made with more than 2 seconds left, but which left the opponent with 2 seconds or fewer left to respond by the time they chased down the make to inbound or call a timeout. An additional wrinkle is that the NCAA added tenths of a second to the clock for the 1990-91 season, but just had whole numbers for earlier seasons. One notable exception is Tate George's buzzer-beater against Clemson, since that was played in an NBA arena (The Meadowlands).

A few notes about how this data was compiled:

        • We read recaps or watched video of every NCAA Tournament game decided by three points or fewer. It's possible there was a game with free throws shot after time expired due to a technical foul (or something else) that we missed because the final margin ended up being 4+ points. Same with a three-pointer made at the buzzer on which the player was also fouled or a team that scored multiple times in the final two seconds.
        • The distances and assists listed are unofficial, gathered from play-by-plays, video review and newspaper accounts. Distances sometimes varied in different accounts, so we used the distances listed in the most comprehensive game stories we could find.

      If you have any additions to this data, please
      let us know

Posted in Announcement, CBB at Sports Reference, Data, Features, Uncategorized | 2 Comments »

Teammates/Opponents Finder Now on Basketball-Reference

31st January 2020

Earlier this season, there was some buzz that LeBron James had lost to Kemba Walker for the first time in their 29 head-to-head appearances. However, did you ever consider who LeBron loses most often to? If you set a minimum of 10 head-to-heads (regular season and playoffs), that would be Patrick McCaw, who is 4-1 against James in the regular season and 8-1 against him in the playoffs.

This is now more easily searchable thanks to Basketball-Reference's new Teammates And Opponents tool, located in the Frivolities section of the site. This will produce a list of either every player your choice, in this case, LeBron, has played against, or played with. As an example of the teammates function, here's a link to every player Russell Westbrook has played with. You may be surprised that among players who've been in 50 games with Westbrook, Hasheem Thabeet has the highest winning percentage with him.

So try out our new Teammates and Opponents tool and see what interesting results you can find! If you have any questions or suggestions, feel free to contact us through our feedback form.

Posted in Announcement, Basketball-Reference.com, Features, History, Trivia | 7 Comments »

Headshots Added to College Basketball Reference

22nd January 2020

On our sites that cover the pro sports, we at Sports Reference make an effort to include portraits of the players being covered, as there is power to attaching a name and stats to a face. With that in mind, we're happy to announce both the addition of headshots for players currently participating in the 2019-20 college basketball season, as well as pictures for 500 major historical players.

If you're subscribed to our College Basketball Stathead daily recap newsletter, you may have noticed we recently added the headshots of the top 5 players for each night to give your inbox a splash of color. (And if you're not subscribed but are interested, sign up here!) Luka Garza, Markus Howard, Xavier Tillman, they've all got their faces on the site now.

As for the historical players, check out Bill Walton when he was playing college ball instead of commentating on it, or Tony Bennett's big smile back when he was shooting threes for Green Bay instead of coaching the Cavaliers. Charles Barkley when he had hair, or David Thompson rocking the 'fro at NC State are also choice additions.

We do plan to incorporate historical coach headshots in the near future as well. If you have any questions or suggestions, feel free to contact us through our feedback form. Thanks for following us!

Posted in Announcement, CBB at Sports Reference, Data, Features, History | 2 Comments »

Post-Shot xG for Goalkeepers Now on FBref

22nd January 2020

As part of our partnership with StatsBomb, FBref continues to incorporate more advanced statistics for players in the major international leagues. Additions have included expected goals, passing data and plus/minus, which you can read more about in this blog post. We have now added post-shot expected goals for goalkeepers, which uses expected goals on on-target shots to measure the shot-stopping ability of goalkeepers.

This information is available in competitions' advanced goalkeeping page. For example, here is a link to the advanced goalkeeping table for the 2019-20 English Premier League. You can read FBref's xG explainer as well as Statsbomb's PSxG explainer for more information on how these figures are calculated.

You can keep up with the latest additions of competitions coverage and new features here on the Sports Reference Blog, or by signing up for the This Week in Sports Reference mailing list. Feel free to send us any questions or suggestions through our feedback form or FBref's official Twitter account.

Posted in Advanced Stats, Announcement, Data, FBref, Features | 1 Comment »

Provisional 2019 Approximate Value Now on PFR

2nd January 2020

With the season concluded, we're pleased to report that we've added 2019 Approximate Value (AV) numbers to the site for all NFL players. Note that these numbers are just provisional right now; the final numbers will be released after the Pro Bowl rosters and All-Pro rosters are finalized. However, there's already some interesting preliminary information to take a look at.

As of now, Lamar Jackson is the clear AV leader at 26, tied for the all-time record with LaDainian Tomlinson's 2006 MVP season. Michael Thomas, Patrick Mahomes, Dont'a Hightower and Dak Prescott round out the top 5.

Not sure what AV is? To learn more about PFR's attempt to put a single number on each player-season since 1960 (for the purposes of comparing players across position and era), check out this link. Feel free to send us feedback via our site's form.

Posted in Announcement, Features, Pro-Football-Reference.com | 2 Comments »