|
||||
| ||||
|
|||||||
| Bug Reports Forum Have a bug to report? Please post here. |
![]() |
|
|
Thread Tools |
|
|
#1 |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Potential Issue with reversed RHB/LHB pefromance against RHP
When doing a bunch of statistical analysis looking at ratings and impact on the game engine, I noticed that something that is thought to be conventional wisdom is out of whack when it comes to the batter vs pitcher match-up and what side of the plate the batter is on. The conventional wisdom is that opposite side of the plate matchups benefit the hitter since they can see the ball come out of the pitcher's hand better. However, when looking at 22, that does not appear to be the case at the moment.
I fired up an MLB quickstart in both build 30 and 34. I retired every LHP in the world, and ran the AI on every team. Every team was set to play in the same ballpark, with all park factors set to 1.000. So there's only RHP in this entire world, and park factors are not an issue. I then ran a ton of simulation module stuff (every team versus every other team in the league for 2,430 games), with some of these players getting more than 70,000 games in this test. When I do some linear regression on wOBA to hitting stats, the side of the plate the batter is on seems reversed to what would be expected. Mainly LHB perform worse against RHB against RHP when all other things are controlled. This screenshot contains the coefficients for each variable in the model, with bats_1 being RHB and bats_2 being LHB. All things considered, LHB would have a wOBA eight points lower compared to a RHB against RHP. ![]() I thought I had messed up the test, and created a new MLB quickstart and tried it again, only to see the same results. I thought this might be a sim module thing, so I used that same world, and simed an entire season using the game engine (not sure if Sim Module uses the same engine), and the same results appeared. When looking at OPS, RHB would expect to have a 30 point bump to OPS compared to a LHB when all their ratings are the same. I haven't looked at LHPs and whether RHB would get a boost or suffer a penalty in that scenario. But I thought I would bring it up. |
|
|
|
|
|
#2 |
|
OOTP Developments
Join Date: Aug 2007
Location: Nice, Côte d'Azur, France
Posts: 21,473
|
Doing anything as unexpected as retiring all the pitchers of a specific handedness is very likely to throw the results you're getting off, as the engine isn't really designed to deal with that sort of situation.
So I'm not really sure if there's too much we can take from this. I'm sure Matt will see this and might take a look if he sees anything that triggers any alarms for him, but I suspect this is likely just a byproduct of the testing environment.
__________________
lukas@ootpdevelopments.com PreOrder Out of the Park Baseball 26! Need to upload files for us to check out? Instructions can be found here |
|
|
|
|
|
#3 | |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Quote:
This was not how things worked in 21, and as an additional measure I simply changed every pitcher that was left-handed to right-handed, and the same thing happened. |
|
|
|
|
|
|
#4 |
|
OOTP Developments
Join Date: Aug 2007
Location: Nice, Côte d'Azur, France
Posts: 21,473
|
Ok, thanks. Appreciate the clarification.
__________________
lukas@ootpdevelopments.com PreOrder Out of the Park Baseball 26! Need to upload files for us to check out? Instructions can be found here |
|
|
|
|
|
#5 |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Just to confirm I'm not crazy. Created an MLB quickstart. Changed that park factors to 1.000 and did nothing to what hand the pitchers throw. Disabled injuries, position player fatigue, and disabled suspensions. Ran computer AI at the start, and then disabled the AI from making any changes to rosters.
Simulated the entire season in the engine (not sim module). Pulled the splits versus RHP and compared them to ratings in the database and LHB are still expected to perform worse against RHP than RHB.
|
|
|
|
|
|
#6 | |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Quote:
Same test as this, except looking at LHP splits and the same thing holds true: opposite side of the plate hitters are expected to perform worse. Took the same league, same simulation. Looked at LHP splits and the "vsl" hitting ratings in the database. Ran some regressions against it, and LHB perform better against LHP than RHB do. bats_1 = RHB bats_2 = LHB
|
|
|
|
|
|
|
#7 |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
I uploaded a jupyter notebook walking through the steps to reach the conclusion that I have posted here. Unless someone can show me otherwise, I am convinced that lefty/right matchups, and vice versa, are not working as expected in OOTP22.
|
|
|
|
|
|
#8 |
|
Minors (Rookie Ball)
Join Date: Jul 2018
Posts: 35
|
I think the difference in performance is dictated by the ratings, not the handedness. The handedness effects ratings. It looks like he didn't give everyone the same ratings, and the distribution of ratings with assumption there are no interaction terms (there should be), make the results not that useful. Multicolinearity wouldn't matter much here I think, but that should also be checked for.
Checking the interaction of pitcher and batter ratings would be important too, and I don't think they are all linear either. I want to see some residuals. The difference between 40 and 65 control will reduce walks a lot more than the difference between 65 and 90 will, for example. How does pitcher control and batter eye interact with walk rate? And consider that the pitcher control is probably not linear related to batter performance on its own, either. |
|
|
|
|
|
#9 | ||||
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Quote:
Of course ratings will dictate the performance of a player, but they aren't the only thing. It's important to remember the pitcher also has ratings against both RHB and LHB. Just like batters will tend to have better splits against one hand than the other, pitchers do to. The side of the plate they bat from is important to judge what impact those pitcher ratings have on a hitter's expected outcomes. Quote:
Quote:
I went back and added residual plots and r2 scores for each model that was produced. The r2 scores for tests 1 and 4 are all higher than .90, where the scores for tests 2 and 3 are much lower, largely because it was a single season with significantly lower number of plate appearances for every player make the outcomes much more varied. Quote:
Last edited by thegoldengod; 04-05-2021 at 05:55 PM. |
||||
|
|
|
|
|
#10 |
|
Minors (Rookie Ball)
Join Date: Jul 2018
Posts: 35
|
> You're going to have to explain why it would be important to give everyone the same ratings when you're trying to test this. The whole point of linear regression is to extract the importance of each variable within the model.
> Yes, there are some events and ratings that are not linear If the distribution of batting ratings isn't the same for LHBs and RHBs (and I don't see any reason to assume they have the same distribution), then you need to consider nonlinearity as well as interaction terms to say if LHB or RHB is actually overperforming or not. |
|
|
|
|
|
#11 | |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Quote:
Again, individual events like walk rate, home run rate and strikeout rate are non-linear. However, for those events it's just better to create a low and high model based on whether the player in question has a low/high rating in whatever tool mainly governs that event. I am not attempting to predict any of those. Simple linear regression is more than adequate for what I am attempting to do. Also, if 22 is working as intended, why is 21 showing the exact opposite results? Last edited by thegoldengod; 04-05-2021 at 10:02 PM. |
|
|
|
|
|
|
#12 |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Updated the notebook to include t-tests and histograms for the two ratings that were shown to not have the same distribution.
|
|
|
|
|
|
#13 |
|
OOTP Developer
Join Date: Jun 2009
Location: Here and there
Posts: 15,956
|
One thing related to this to keep in mind is that the game tries for the right balance as a whole. Without any adjustments, then things get double-penalized, since most RHB are rated better vs L than vs R, and similarly for pitchers. We did make sure that the league balance came out closer to real balance.
So, depending on what you are looking at, yeah, you might see cases which may end up being backwards when you at look at it one way, but without that then the league devolves too much into platoons. |
|
|
|
|
|
#14 | |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Quote:
Matt, are you saying the dev team looked at the code and it's working as intended? |
|
|
|
|
|
|
#15 |
|
OOTP Developer
Join Date: Jun 2009
Location: Here and there
Posts: 15,956
|
|
|
|
|
|
|
#16 |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Appreciate the effort to look into it. I brought up MLB splits from 2020, and it appears that indeed LHB either perform a few points worse, or about even as RHB against RHP, and suffer heavily against LHP. So if the desire was to replicate last year, that seems to be the case then.
|
|
|
|
|
|
#17 | |
|
Minors (Rookie Ball)
Join Date: Jul 2018
Posts: 35
|
Since you are getting different results in ootp21 than ootp22, then there is something that changed and I imagine the findings are good enough without considering nonlinearity and interaction of terms
Quote:
|
|
|
|
|
|
|
#18 | |
|
OOTP Developer
Join Date: Jun 2009
Location: Here and there
Posts: 15,956
|
Quote:
|
|
|
|
|
|
|
#19 | |
|
Minors (Double A)
Join Date: Sep 2018
Posts: 110
|
Quote:
What I am saying is that a LHB has a .010 "hole" to dig out of simply because they're a LHB against RHP when you look at wOBA. That's the equivalent to about 30 points in BABIP, 18 points in Power, and and 30 points in Eye against RHP (1 - 200 scale). A LHB could have the exact same ratings as a RHB against RHP, and would be in that .010 hole against a RHB based on the data from the engine. If the desire was to reduce that double-penalty, what I've been able to gather suggests you've gone too far in the other direction. I posted the data from the first test directly to the github for anyone to look at and do with as they please. But if the desired result is being achieved, it is what it is. Last edited by thegoldengod; 04-06-2021 at 12:41 AM. |
|
|
|
|
![]() |
| Bookmarks |
|
|