Sports Reference Blog

Archive for the 'Data' Category

Full Coaching Histories Since 1990 Added to Pro-Football-Reference

16th July 2018

On Pro-Football-Reference, while the majority of our statistics are for players, we also have a good amount of coaching data, including team ranks during a coach's tenure as well as coaching history. One of our offseason projects at PFR was to fill out the coaching history of every head coach and coordinator since 1990, and we have pushed that data out for the public eye today.

Some examples:

Tom Cable is currently an assistant head coach and offensive line coach for the Seattle Seahawks, and he's been on the West Coast for the grand majority of his career, ever since he started off as a graduate assistant for Idaho and San Diego State in the late 1980s.

David Shula had an ignominious head coaching tenure for the Bengals of the 1990s, but now we've added in his 1991 year as a WR coach that got him promoted to head coach the ensuing season.

Lane Kiffin has been out of the NFL for a while, but we've added in his latter-day college experience as an offensive coordinator for Alabama and a head coach for Florida Atlantic. Never forget he got his head coaching gig with the Raiders from a college offensive coordinator position, so anything is possible.

To get at least one actually successful coach in here, Dick Vermeil now has his 1970s stints with UCLA as well as his 1969 season as the Los Angeles Rams' special teams coordinator included in his history.

If you have any questions or suggestions, feel free to contact us through our feedback form.

Posted in Announcement, Data, History, Pro-Football-Reference.com, Trivia | 2 Comments »

Data Coverage Map Added to College Basketball Reference

10th July 2018

We've made lots of additions to the player data on College Basketball Reference in recent years. In an effort to make it a little more clear what data we have and what data we don't have, we've made this coverage map back to 1947-48.

Our rosters are essentially complete back to 1992-93, and our coverage of box score stats is also nearly complete back to that year (aside from minutes played, turnovers, personal fouls and offensive/defensive rebounds).

For seasons before 1992-93, it's important to note that our rosters are not 100% complete. It was NCAA practice for many years to combine the stats for non-rotation players on team stat sheets under the label "Others." So for many teams we have stats for the eight, nine, ten, etc. players who received significant playing time, but not for the players with negligible playing time. Additionally, we're missing 621 school seasons completely (mostly in the 40s and 50s). So when we say we have points for 98.8% of players in 1947-48, that means we have points for 98.8% of the players in our database for that season (which excludes players lumped in as "Others" or players on teams we are completely missing).

The link to this coverage map can be found both in the Player Season Finder tool and in our About section.

If you happen to have any of the data we're missing, please let us know. Over time, we hope to fill in as many gaps as possible.

Posted in Announcement, CBB at Sports Reference, Data, History | No Comments »

WNBA Season Advanced Stats Now Available on Basketball-Reference

19th June 2018

As the 2018 WNBA season continues on through the summer, we have now added an Advanced Stats section to the season stats pages. We already included advanced stats on players' individual pages, so you already knew Brittney Griner had a 10.2 block percentage during her 2015 Defensive Player of the Year performance, but now you can see her performance with the context of the rest of the league. You could also take a look at the gap in win shares between Nneka Ogwumike's 2016 MVP season and the rest of her peers. Of course, we'll continue to update 2018 advanced stats along with the classics as this season unfolds.

Posted in Advanced Stats, Announcement, Basketball-Reference.com, Data | No Comments »

Basketball-Reference Adds Historical Playoff Series Outcome Data

18th June 2018

While the NBA is in offseason mode, teams are still making moves to improve themselves, and the same goes for Basketball-Reference as well. With that in mind, we've added a page that details historical data for seven-game playoff series based on various game results. Just to give an example, if we had just watched the Cavs go down 2-3 to the Celtics in the Eastern Conference Playoffs, you'd look for the Team is Down column, find the 2-3 section, and locate their predicament where they'd lost the first two away games, won their first two home games and then lost the third away game, displayed as two red A's, two green H's and one more red A. Then, you'd find that prior to the Cavs pulling off the series win, teams in that situation were 2-40, with only the '07 Utah Jazz and the '08 San Antonio Spurs able to pull off the same feat. We have overall winning records for both the modern 2-2-1-1-1 format, as well as the classic 2-3-2 format.

If you have any questions or suggestions, feel free to contact us through our feedback form.

Posted in Announcement, Basketball-Reference.com, Data, Features, History, Playoffs | No Comments »

Massive Update to College Basketball Player Class Data

1st June 2018

We have made a huge update to our player class data on College Basketball Reference. Previously, we had this data more or less complete back to 1992-93, and fairly spotty for previous seasons. However, we realized that for players for whom we have four seasons of data who played during years with freshman eligibility that we could infer what their freshman, sophomore, junior and senior seasons were. Likewise, we could do the same for players with three years of data during the years freshmen were ineligible to play varsity. So we have now identified freshman, sophomore, junior and senior seasons for 13,842 additional players in our database.

Read the rest of this entry

Posted in Announcement, CBB at Sports Reference, Data, History | 1 Comment »

Hockey-Reference Adds NHL Game Logs Back to 1917-18

26th April 2018

In a huge addition, we at Hockey-Reference are glad to announce that we now have all NHL regular season box scores available back to the beginning of the NHL, the 1917-18 season. With this, we can now fill out the gamelogs section of many great players' careers. For example, you can now go through Wayne Gretzky's incredible 1985-86 season where he recorded a point in all but 3 games. Or you could go find Bobby Orr's 1970-71 season when he finished with a career high 139 points.

All of these game logs are also searchable on our Play Index now, which is also a great new asset to have. Searching for most penalty minutes by one team in a game now displays both teams in the famous 1981 Bruins-North Stars brawl game; the North Stars held the record until the Flyers broke it in a 2004 game against the Senators. Looking for most goals by a player in a game will now lead you to Joe Malone's record day in 1920, when he scored seven for Quebec in what ended up being a 10-6 victory over the Toronto St. Patricks. We should put a reminder that there are some categories, such as saves and shots against, that were not recorded in the earlier days of the league.

So now there's a lot more history to dig through, and we hope you enjoy our presentation of the "new" box scores! Please note that, in some cases, older box scores are lacking times for penalties and/or goals. Also, as a result of this new data, some long-established season totals no longer add up. This is something we plan on addressing shortly. Please let us know if you have any questions or comments.

Posted in Announcement, Data, History, Hockey-Reference.com, Play Index | 10 Comments »

Save Percentage and Goalie Minutes Coverage Extended

24th April 2018

Hockey-Reference has now added goaltenders' Shots Against data back to the 1955-56 season; we previously only had that back to 1983-84. This addition enables us to calculate season and career save percentages for all goaltenders, including several Hall of Famers. For example, a contender for greatest goalie of all time, Jacques Plante, is now listed with a career .9196 SV%, putting him at eighth all time and fifth among retired players. Plante also now occupies two of the top five spots in the single-season Save Percentage leaderboard. Johnny Bower is the top addition to the career Save Percentage leaderboard, as his career .9219 SV% puts him at third all time and second among retired players, only trailing Dominik Hasek.

We also now have data available that gives us precise Time On Ice numbers for goalies down to the second, going all the way back to the beginning of the NHL. As a result, there have been some changes to the Minutes Played data, and subsequent re-calculations of everyone's Goals Against Average. This pertains, mainly, to goalies prior to 1999. We always had Time On Ice data for goalies from 1999 onward.

We pride ourselves on providing the most accurate information available and we hope fans of NHL history enjoy this addition. Please let us know if you have any questions or comments.

Posted in Announcement, Data, History, Hockey-Reference.com, Leaders, Statgeekery | 2 Comments »

Postponement and Cancellation Info Added To Baseball Team Schedules

24th April 2018

Thanks to the efforts of David Vincent (R.I.P.), Baseball Reference has added originally scheduled dates of games that were made up later in the season or unplayed due to other circumstances. We have now incorporated that information into our team schedules from 1877 up to 2016, with the 2017 and 2018 seasons coming later on.

This information, which is part of the Retrosheet database, also includes the reason for the postponement or cancellation of the game. So now it's easier to know the 2007 Indians played their first "home" series in Milwaukee due to snow in Cleveland. In addition to weather-related postponements, we also mark games that were pushed due to more tragic events, such as the 1968 Pirates and several other teams that season delaying their openers due to the funeral for Martin Luther King Jr.

As a final note, a reminder that the reason there was a second game that had to be forefited on Disco Demolition Night was because of a rainout for a game previously scheduled on May 2, 1979.

For more information on the source and other contributors to this project, please refer to Retrosheet.

Posted in Announcement, Baseball-Reference.com, Data, Trivia | 3 Comments »

2018 WAR Update

15th March 2018

As you browse Baseball Reference, you might notice some subtle changes to WAR figures on the site. There are four main reasons for this:

  1. Park factors for recent seasons have been re-computed to be three-year rolling averages. For instance, 2016 Park Factors now encompass 2015-2017. This is something that needs to be done when seasons end.
  2. We've incorporated restated and expanded fielding statistics from Sports Info Solutions. SIS's Defensive Runs Saved forms the basis for our Defensive WAR calculations since 2003. From 2011 on they recalibrated data using their timer measures to measure ball hang time. There was also some recalculation based on changing shift methodology. Though we're now publishing their catcher framing stat (Strike Zone Runs Saved), we have not incorporated it into WAR at this time.
  3. Pitchers who received time as position players (whether PH or in the field) are now being treated as part-time pitchers and part-time position players. Previously we treated them as full-time pitchers. Some pitchers like Red Ruffing, Bob Lemon and Jim Kaat appeared in many games as a position player, pinch hitter or pinch runner. We used to credit these PAs as pitchers, which overvalued their offensive contributions. To handle this, we compute a percentage of time as a non-pitcher and make an adjustment.
  4. Also, we have incorporated a good deal more Retrosheet data which has affected the years we can compute more advanced fielding and baserunning measures. We're now able to roll these measures back to 1953. Another important change is that with Retrosheet gamelogs back to 1908, we can now use their IP data back to that year to get starter/relief IP splits. Some pitcher WAR changes for 1908-12 are due to WAR now being calculated using gamelog IP rather than the "official" total listed on the player's stat line. The biggest difference here was the appropriately named Bugs Raymond. The "official" record credits him with 324.1 IP that season, but the gamelogs come out to 304.1, which significantly impacted his WAR calculation (see below). For further reading on discrepancies between "official" records and more recently produced gamelogs, please read this excellent explainer by Retrosheet's Dave Smith.

For further details on WAR and its calculation, please see this WAR explainer.

Read the rest of this entry

Posted in Advanced Stats, Announcement, Baseball-Reference.com, Data, Statgeekery, Uncategorized, WAR | 8 Comments »

Pitch Framing Measures Added to Baseball Reference

8th March 2018

Our friends at Sports Info Solutions (formerly known as Baseball Info Solutions) have provided us with a pitch framing measure back to 2011, which we have added to Baseball Reference. Before I explain any further, if you're unfamiliar with the concept of pitch framing please read Mike Fast's 2011 article on the topic and Ben Lindbergh's 2013 follow up.

The stat that we have added is called Strike Zone Runs Saved. It represents the runs saved by catcher framing. In our tables, it's labeled RszC and it's available from 2011 to the present. While this statistic is a potential component of Defensive Runs Saved (and therefore WAR), please note that we have elected to not integrate this number into DRS (or WAR) at this time. We may elect to do so in the future, but for now we agree with Bill James's stance that waiting for further research is a good idea.

Read the rest of this entry

Posted in Advanced Stats, Announcement, Baseball-Reference.com, Data, Features, Statgeekery, WAR | 3 Comments »