|
||||
| ||||
|
|
#1 |
|
All Star Starter
Join Date: Jun 2004
Posts: 1,644
|
Suggestion/Wondering about CatoBase
I'm sure its been asked, and its something jcato certainly has considered, but why does CatoBase not calculate winshares for past years? It should have all the necessary stats at its finertips, so why if I start a historical season in 1941, as I just did, can it not calculate Win Shares for Ted Williams in 1939 and 1940 season? It wouldnt even have to calculate them every time, just do it once for every year that has reall been played and add it to the CatoBase database. Once you know Wagner's win shares in 1911, you don't need to recalcuate it.
I'd love for this to happen, because its unfair to see someone else leading in career win shares only because 5 years of DiMaggio's career hasn't been counted. |
|
|
|
|
#2 |
|
Minors (Triple A)
Join Date: Jan 2004
Posts: 200
|
Unfortunately, this is not the case. There is no where near the stats available for past seasons that are needed. You can't even correctly calculate OBP with the stats found in the career CSV, let alone Win Shares.
|
|
|
|
|
#3 |
|
All Star Starter
Join Date: Jun 2004
Posts: 1,644
|
Oh, I understand OOTP may not make the stats avaiable, but there shouldn't be a need. You can get these stats anywhere. In fact, there must be a database already made of this stuff. Although it might be only extant on Bill James' computer. Let me put it another way through example:
If I start a season in 1941, The game and CatoBase includes Ted Williams stats for 1939 and 1940. It has all the Home Runs and Strikeouts he had those 2 years. Why don't we have win shares? Its just another stat. Add a column to the Lahman database. CatoBase would not need to calculate previous seasons, just show them and incorprate them into career stats and all-time highs. Like, we already know Wagner hold the record for Win Shares in a season with 59. You don't need to go back and recalculate it, just like you don't need to recalculate Wagner's batting average that year. Its a stat we already have and already incorporate. Why can't this be done with Win Shares? Does that make sense? |
|
|
|
|
#4 | |
|
All Star Starter
Join Date: Jan 2003
Location: Champaign, IL
Posts: 1,175
|
Quote:
If so, then they'd need to be imported into OOTP along with the rest of the stats, and then exported using CatoBase. Rather than being re-calculated, I guess they could just be read as a value. But only if they're included in the historical stats you've imported, I'm sure. If not, then CatoBase has to calculate from the stats that are included, and right now, I don't think that's possible because of incomplete information. jcato will correct me if I'm wrong, I'm sure.
__________________
Gordy Hulten Owner / General Manager Red Willow Roadrunners -- Kennel Series Champions: 1951, 1959, 1964, 1965, 1972, 1975, 1980, 1982, 1983 Dog Days Baseball - "The World's Best Online OOTP League" Creator inactive: Republican League - OOTP 2009 Dynasty inactive: Republican League Dynasty - Version 2.0 inactive: Republican League Dynasty |
|
|
|
|
|
#5 |
|
Hall Of Famer
Join Date: Jul 2002
Location: Kentucky
Posts: 2,601
|
When i try to run Catobase i get a "path not found", "league.csv" not found?
|
|
|
|
|
#6 | |
|
Hall Of Famer
Join Date: Dec 2001
Location: In front of some barbecue and a cold beer
Posts: 9,490
|
Quote:
http://www.baseballgraphs.com/details.html#sharecalc (I recommend that web site to you all) "How are Win Shares calculated? Okay, here's how Bill James calculates Win Shares. Ready? First, you divide responsibility for a team's wins between the offense (batting and baserunning) and defense (pitching and fielding). You do this by calculating the team run differential through a method James calls Marginal Runs. You first calculate the average number of runs scored per team in the league. You next adjust your team's runs scored and runs allowed for the ballpark in which they played half their games (i.e. home games). Then you add together two figures: all runs scored over 52% of the league average (credited to the offense), and all runs allowed less than 152% of the league average (credited to the defense). This is total marginal runs. Next, you take the percent of marginal runs contributed by the offense, multiply it by the number of wins times three. This is the total number of offensive Win Shares. You do the same thing for defensive Win Shares. Next, you attribute offensive Win Shares to individual players. This is done through two key metrics: Runs Created and Outs Made. Runs Created is a formula built by James and refined over the years. It starts with the basic equation of OBP times total bases and then adds player credit for other factors, including stolen bases, caught stealing, grounding into double plays, batting average and home runs with runners in scoring position and the kitchen sink. Runs Created is calculated for every single batter, including pitchers (if they're in the National League). Next, you subtract the league 'background' Runs Created (52% of the league average) from each player's Runs Created based on the number of Outs Made by that batter, adjust it for ballpark, and credit each player with the result; essentially individual marginal runs created. Add these up for all players and use each player's percentage of the whole to allocate offensive Win Shares to each. Note that any player whose Runs Created are less than 52% of the league average runs created per out is credited with no Win Shares. This doesn't happen very often (except for pitchers). That was the easy part. Now you've got to deal with the defense. The first step is to divide defensive Win Shares between pitching and fielding. This done through a complicated formula that accounts for FIP elements that can be attributed only to pitchers (home runs, walks and strikeouts) as well as a team's DER (Defensive Efficiency Ratio, adjusted for the ballpark) and other fielding statistics such as passed balls, errors and double plays. Typically, about 70% of defensive Win Shares are credited to pitching, and 30% to fielding. The Win Shares system is bound so that pitching never is credited with less than 60%, or more than 75%, of defensive Win Shares. Next, you allocate pitching Win Shares to individual pitchers. This is accomplished through an even more complicated formula that starts with each pitcher's marginal runs not allowed (same approach as team marginal runs not allowed), wins, losses and saves. Special consideration is given to relievers by estimating the number of high-leverage innings they pitched (ninth innings with one-run leads are more important than first innings with no score) and something called "Component ERA" which is essentially ERA re-calculated according to the actual underlying run elements. Finally, pitchers are deducted Win Shares if they are absolutely lousy hitters. Call this the "Dean Chance" factor. All these elements are then mixed together in a complicated formula to allocate pitching Win Shares to individual pitchers. As in offensive Win Shares, any pitcher who gives up more than 152% of league-average Runs Scored (adjusted for ballpark) does not receive any credit for pitching Win Shares. One note: responsibility for unearned runs is split 50/50 between pitching and fielding. Which leads us to the next, most complicated step: allocating fielding Win Shares to fielding positions, and then to individual fielders. The calculations differ for each position. Essentially, James has selected four defensive statistics to evaluate positions. Here they are by position, listed in order of importance: Catchers: Caught Stealing, Errors, Passed Balls and Sacrifice Hits Allowed First Basemen: Plays Made, Errors, Arm Rating and Errors by third basemen and shortstops Second Basemen: Double Plays, Assists, Errors and Putouts Shortstops: Assists, Double Plays, Errors and Putouts Third Basemen: Assists, Errors, Sacrifice Hits Allowed and Double Plays Outfielders: Putouts, Team DER, Arm Elements and Assists and Errors Lots of things to note about the fielding calculations. First, the statistics are adjusted based on the number of innings a lefthander pitches for the team, which has an impact on which side of the field batters hit the ball to. Second, these stats are calculated as a proportion of the team's total, divided by the league-average proportions of the total. In other words, if a shortstop has 50 assists and his team has 100 assists in total, he receives just as much credit as the shortstop who has 100 assists and plays on a team with 200 assists in total. This is important, because it adjusts the fielding stats for the fact that fielders may be playing behind pitchers with certain tendencies such as giving up more ground balls vs. fly balls. Third, double plays are only factored in as a proportion of potential double plays. If teams don't have a lot of runners on first, they have less of a chance to turn double plays, and Win Shares takes this into account. Fourth, team DER is used to credit outfielders with fielding Win Shares because it is James' observation that outfielders have a much larger impact on DER than infielders. James acknowledges that there is some 'circular logic' here. Fifth, there is a final element included in the formula to allocate fielding Win Shares to individual fielders. This element is called 'Range Bonus Play.' It particularly impacts outfielders in the following manner: if one outfielder handles more opportunities per inning played than the other outfielders on the team, he will be credited with more fielding Win Shares. This especially impacts centerfielders, who typically handle more chances per inning played than the corner outfielders." That's not a small undertaking and it requires a *lot* of data.
__________________
Senior member of the OOTP boards/grizzled veteran/mod maker/surly bastage If you're playing pre-1947 American baseball, then the All-American Mod (a namefiles/ethnicites/nation/cities file pack) is for you. |
|
|
|
| Bookmarks |
| Thread Tools | |
|
|