|
||||
| ||||
|
|||||||
| Earlier versions of OOTP: General Discussions General chat about the game... |
![]() |
|
|
Thread Tools |
|
|
#61 | |
|
Global Moderator
Join Date: Nov 2002
Location: Queens, NY
Posts: 9,848
|
Quote:
You know what would be interesting to see? The difference in regressions done for different eras. Basically, you run them for a deadball-era league, a league from the 1950s, and a modern league. You could use "reclculate for historical accuracy" or something to get league totals, and see what happens.
__________________
My music "When the trees blow back and forth, that's what makes the wind." - Steven Wright Fjord emena pancreas thorax fornicate marmalade morpheme proteolysis smaxa cabana offal srue vitriol grope hallelujah lentils |
|
|
|
|
|
|
#62 |
|
All Star Reserve
Join Date: Jun 2002
Location: Pittsburgh PA
Posts: 912
|
Elendil
Just curious as to what ratings you used for batters. It might be interesting to see if you set all batters talents to something around 50 or 60 (just something that's all the same). I think that may make a more controlled environment.
__________________
"And Shepherds we shall be, For thee, my Lord, for thee. Power hath descended forth from Thy hand, Our feet may swiftly carry out Thy commands. So we shall flow a river forth to Thee, And teeming with souls shall it ever be. In Nomeni Patri Et Fili Spiritus Sancti." |
|
|
|
|
|
#63 | |
|
Hall Of Famer
Join Date: Dec 2003
Location: the dynasty forum
Posts: 2,318
|
Quote:
__________________
Heaven is kicking back with a double Talisker and a churchwarden stuffed with latakia. Last edited by Elendil; 05-11-2005 at 02:47 PM. |
|
|
|
|
|
|
#64 |
|
Hall Of Famer
Join Date: Dec 2003
Location: the dynasty forum
Posts: 2,318
|
Just to satisfy my own curiosity, I altered the league stats to make the league totals similar to those for the 1986 AL (a DH is used for both leagues), and then simulated the season and did all these analyses again. As I expected, movement was now important and statistically significant. However, the results on none of the variables were extremely strong. The basic fact is that ERA is a good, but not great, measure of pitcher effectiveness. There's a lot of noise in ERA coming from the way it's counted (e.g., if an error was made in an inning, anything you give up with two outs doesn't count as ER's) and from luck (deviations in BABIP, streaky hitting by opponent, unusually good baserunning by opponent). But in general, stuff, control, and movement all behave as we expect. Which rating is most important depends on what kind of league you have. If homers are abnormally low, then movement doesn't mean much. If strikeouts are abnormally high, then stuff doesn't mean much. If walks are abnormally low, then control doesn't mean much.
__________________
Heaven is kicking back with a double Talisker and a churchwarden stuffed with latakia. |
|
|
|
|
|
#65 |
|
All Star Reserve
Join Date: Feb 2003
Location: Ottawa
Posts: 818
|
Too much time on your hands...
|
|
|
|
|
|
#66 | |
|
Hall Of Famer
Join Date: Jul 2004
Location: High and outside
Posts: 4,031
|
Quote:
__________________
![]() ![]()
|
|
|
|
|
|
|
#67 | |
|
Hall Of Famer
Join Date: Feb 2002
Location: watching: DArwin's missing link in action
Posts: 3,112
|
Quote:
__________________
Senior Senor Member of the OOTP Boards Pittsburgh Playmates- OTBL |
|
|
|
|
|
|
#68 |
|
Hall Of Famer
Join Date: Jan 2002
Location: Orlando, FL
Posts: 3,827
|
I remember the last time he offered something productive. Wait, no I don't.
Elindil, a league that I was a part of for an inaugural season (OTBA) used the same creation modifiers that you used to generate its pool of players. The league totals were also very low in walks and had less homers than expected. Has this shown up consistently using Skydog's modifiers?
__________________
"Read books, get brain." Last edited by Dagrims; 05-11-2005 at 05:19 PM. |
|
|
|
|
|
#69 | |
|
All Star Reserve
Join Date: Dec 2001
Posts: 887
|
Quote:
__________________
Realy good musition of many insterments, including the hyperbolic vitriol. |
|
|
|
|
|
|
#70 | |
|
Hall Of Famer
Join Date: Feb 2002
Location: watching: DArwin's missing link in action
Posts: 3,112
|
Quote:
__________________
Senior Senor Member of the OOTP Boards Pittsburgh Playmates- OTBL |
|
|
|
|
|
|
#71 | |
|
All Star Reserve
Join Date: Dec 2001
Posts: 887
|
Quote:
__________________
Realy good musition of many insterments, including the hyperbolic vitriol. |
|
|
|
|
|
|
#72 | |
|
Hall Of Famer
Join Date: Dec 2003
Location: the dynasty forum
Posts: 2,318
|
Quote:
__________________
Heaven is kicking back with a double Talisker and a churchwarden stuffed with latakia. |
|
|
|
|
|
|
#73 | |
|
Hall Of Famer
Join Date: Jul 2003
Location: Gassin' Kurds
Posts: 2,019
|
Quote:
|
|
|
|
|
|
|
#74 | ||
|
Hall Of Famer
Join Date: Feb 2003
Location: , "
Posts: 3,082
|
Quote:
Quote:
__________________
Brookline Maccabees. RIP |
||
|
|
|
|
|
#75 | |
|
Hall Of Famer
Join Date: Dec 2003
Location: the dynasty forum
Posts: 2,318
|
Quote:
http://www.ootpdevelopments.com/boar...=defense+study In looking at it again, it examines fielding pct. as well, but I think the criticisms of the methodology are sound.
__________________
Heaven is kicking back with a double Talisker and a churchwarden stuffed with latakia. |
|
|
|
|
|
|
#76 |
|
All Star Reserve
Join Date: May 2004
Posts: 721
|
My two cents:
A quick look at your regressions suggest a likely problem with multicollinearity. Your regressions have lots of interactive variables (movement * control, control * stuff) right? If so, then many of these independent variables might be highly correlated with each other. If this is the case, you can't trust the coefficients of your regression. You can get a reasonably good R2 in such regressions but the coefficient estimates and t-stats can be wacko. Possible solutions: (1) check for correlation among independent variables and, maybe, if 2 variables are highly correlated only include one of them in the regression: first try one and then the other. Include the one that gives you the best R2 (or best "theoretical justification"). (2) start with just a few basic variables and then add one-by-one other variables and only if these new variables bump up your adjusted R2 include them otherwise into the trashheap they go. You might also look for heterocedasticity by looking at your residuals. This might help you better identify where possible non-linearities/regression misspecifications exist. |
|
|
|
|
|
#77 |
|
Hall Of Famer
Join Date: Dec 2003
Location: the dynasty forum
Posts: 2,318
|
I think you're right about multicollinearity. That's the reason I also tried running the analyses with the independent variables pared down to just stuff, movement, control, & velocity. In those analyses I got stronger results on the individual coefficients, but significance levels were still in the 90-99% range, nothing above that. I think those are reasonable findings, both because ERA has noise in it and because ratings (rightly) don't completely determine performance over the course of a single season, even one with 190 games. Sometimes players will perform above or below their ratings.
__________________
Heaven is kicking back with a double Talisker and a churchwarden stuffed with latakia. |
|
|
|
|
|
#78 |
|
Bat Boy
Join Date: Oct 2004
Location: Illinois
Posts: 8
|
Thanks Elendil and others for your statistical analyses. I've been wanting to do the same thing for months now, but was waiting until my semester was over (I'm a college professor). I agree with the point that was made yesterday about multicollinearity, particularly given the use of both the ratings variables and squared ratings variables simultaneously in a regression model. There will obviously be a high degree of multicollinearity between those paired ratings variables, thus potentially confounding your results. I would suggest using only one or the other of each of the paired ratings variables (i.e. use either stuff or squared stuff, whichever correlates most strongly with ERA, in the model, but not both) to help in this regard. The other thing I would recommend in looking at your output is to increase the sample size considerably. The independent variable-to-sample size ratio being used in some of these models is far below most accepted standards. One way to alleviate this problem would be to run maybe ten simulated leagues and use all of the data together rather than just a single season of a single league. In this regard, I would recommend running single seasons of multiple separate leagues rather than running multiple seasons of the same league to avoid violating regression's independence of obversation assumption. Further, this might help eliminate an league-by-league anomolies that occur, such as your previous post that one league you ran seemed to have a tendency towards being a pitcher's league.
Thanks for your research into this. Of all the posts/threads we see on the message boards, this is one that I feel may contribute towards making a better game product in the future. |
|
|
|
![]() |
| Bookmarks |
|
|