Old 09-21-2017, 03:16 PM   #2
fhomess
Hall Of Famer
 
Join Date: Nov 2002
Posts: 3,483
Fun project. Don't make the mistake of thinking the OOTP data dump is a well-thought-out database. There's a lot of good information in it, but you're going to pull your hair out trying to make sense of parts where there is no sense to be made. If something doesn't make sense, like a game_id column in players_career stats, you can usually assume it's just not reporting anything useful.

I took a quick glance through your blog and have a couple of notes:
  • You say you're going to adjust your statistics using the park factors from the parks table. You will get bad results doing this. Those values are in-game configuration settings, not park factors calculated from actual results, and they don't change from year to year. If you want to adjust the stats, you have to calculate park factors yourself from home/road splits and save them off somewhere.
  • Players_career_fielding_stats:
    • rto = Runners Thrown Out (for catchers)
    • ipf = innings played fraction, counted in thirds of an inning. Innings played = (3*IP + IPF)/3. The pitching stats table has this, too, but for whatever reason it also includes an outs column that does the 3*IP + IPF calculation for you.
    • Detailed UZR data isn't anywhere in the data dump
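
To make the two calculations above concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not anything taken from the data dump itself: the function names are hypothetical, and the park factor shown is only the simplest runs-per-game home/road ratio (total runs scored plus allowed per home game, divided by the same rate on the road). More careful park factors adjust for schedule, regress toward 1.0, and average over several seasons.

```python
from fractions import Fraction


def outs_recorded(ip: int, ipf: int) -> int:
    """Total outs, as in the post's formula: 3*IP + IPF."""
    return 3 * ip + ipf


def innings_played(ip: int, ipf: int) -> Fraction:
    """Exact innings played: (3*IP + IPF) / 3."""
    return Fraction(outs_recorded(ip, ipf), 3)


def basic_park_factor(home_runs: float, home_games: int,
                      road_runs: float, road_games: int) -> float:
    """Simplified runs park factor from home/road splits (an assumption,
    not a formula given in the post).

    home_runs / road_runs are runs scored PLUS runs allowed by the team
    in its home / road games. A value above 1.0 suggests the park
    inflates scoring; below 1.0 suggests it suppresses scoring.
    """
    if home_games <= 0 or road_games <= 0:
        raise ValueError("need at least one home game and one road game")
    road_rate = road_runs / road_games
    if road_rate == 0:
        raise ValueError("road scoring rate is zero; factor is undefined")
    return (home_runs / home_games) / road_rate


# Hypothetical numbers, purely for illustration.
example_outs = outs_recorded(120, 2)        # 120 full innings plus 2 outs
example_innings = innings_played(120, 2)    # exact value in thirds
example_pf = basic_park_factor(810, 81, 729, 81)
```

Since the parks table values never change, the computed factor would be stored per park and per season in a table of your own and joined in when adjusting stats.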
__________________
StatsLab- PHP/MySQL based utilities for Online Leagues
Baseball Cards - Full list of known templates and documentation on card development.