|
||||
|
10-10-2014, 05:57 PM | #1 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
Interest in a complete U.S. world_default?
Looking through the world_default, there are many glaring issues, especially for the United States, of which the data is easily accessible.
There are longitudes and latitudes missing from many, cities which do not have a current population of 1000 or more, and many cities that do have a 1000+ population are missing as well. Thankfully the Census makes the data easily accessible, as I was able to download basic data about every town, city, CDP, etc. in about two minutes, and place it in Excel. From my count, there are 15,041 (discluding Puerto Rico) "cities" that have 1000+ populations, from Abbeville, AL, to Zwolle, LA. Would there be any interest in such a file? If so, would there be any willing helpers? EDIT: U.S. ONLY file is complete. Download HERE. Download the source files (not the actual XML), and miscellaneous data files HERE. Last edited by Ike348; 11-01-2014 at 11:27 AM. |
10-10-2014, 08:53 PM | #2 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
I'm just starting to fool around with this and it looks like the abbreviations are easy enough to build using a simple formula: =UPPER(LEFT(B2,3)) It doesn't work if you have a two-word city for which you would want to use the initials, but I found another formula to do that.
It will also be tough to replace the appendices to the place name. The Census data has "Birmingham city" instead of just Birmingham, which makes sense, but is still a pain. I could do a case sensitive find and replace for " CDP", " city", " town", etc., but that might work for every single one. Obviously the toughest part will be location data. I've been looking around the Census website for some location data for these places, and the best I've found are some KMLs that can overlay the place boundaries in Google Earth. I could drag the mouse over these places, but it would be better if there was location data for these places the same as population. Then there is the not-too-simple matter of IDs. I think the first release might be one with all of the world IDs first (how I will re-order them, I have no idea), and then the U.S. IDs. I don't know yet. This will be my official progress thread BTW. |
10-11-2014, 12:03 AM | #3 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
Well I found Gazetteer files that have long/lats for all of the places. Now it's just the IDs.
|
10-13-2014, 09:32 PM | #4 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
Tags needed progress
id: for U.S. ONLY file name: YES pop: YES lid: YES time_zone: YES observes_dst: YES lat: YES long: YES abbr: YES altitude: not necessary, so N/A, http://www.ootpdevelopments.com/boar...ml#post3295242 Last edited by Ike348; 10-25-2014 at 12:27 PM. |
10-20-2014, 09:11 PM | #5 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
Hit a little wrinkle in that all townships were not included in the original file. Exchanged a few e-mails with a guy at the Census bureau and he was able to sort it all out. Progress back on track now.
|
10-24-2014, 08:39 PM | #6 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
Almost done with the first release, which will be U.S. only. Unfortunately, there is no easy way to mass edit all of the IDs, so when I do the complete world, all of the US cities will be 91000 and up. I can edit them fine when I'm building the file, but if the file is already created and I don't have the source, the there is no easy way.
What I still have to do: - Add county descriptors for places with same name in a state. This is only for the 11 states I used towns and townships for. - Copy and paste. Features: - Coordinates for all places, rounded to 1/100th of a degree. - Proper DST for some cities in Arizona (Navajo Nation follows DST, so Arizonan NN cities follow it too) - Of course, updated populations for 2010 Census. - Proper ID alphabetization (for U.S. only) - Conditional abbreviations. If a place is more than one word, then the initials are used. (San Bernardino now SB, not SAN) All other places use first 3 letters. For hyphenated places, the first 3 letters are still used. However, when a county descriptor is included, the abbreviation is A(C, or something like it. If you happen to have a league with any of these towns, they can easily be changed in the league setup. - 18,077 places with a population of 1000+ people, from Abbeville, AL to Zwolle, LA Notes: - In order to get townships like my town Maplewood, NJ in, some sacrifices had to be made: - Many townships have the same name in those states with townships. - Some instances where "XXX township" and "XXX city" may exist, and since the two have different coordinates, populations, etc., they both will be in the file. Happens mostly in Michigan, but other cases as well. I'd rather have both in the file than make some arbitrary conditions about which to include. Full documentation will be included in a read_me. Last edited by Ike348; 10-25-2014 at 12:27 PM. |
10-24-2014, 10:26 PM | #7 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
Just awaiting one last e-mail and then I will copy and paste the final thing together. ETA for first release probably next weekend, or during the week if I have some free time (homework and family-shared computer).
ETA for full world release probably 2 weeks. |
10-29-2014, 07:27 PM | #8 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
Well dealing with MCDs is a pain (19 Washington townships in Pennsylvania!), so I'm just going to use places for these states. Yes, some towns won't be included, but it doesn't affect populations that much. For example, New York has 78% of its total population represented in just places. However, Vermont only has 33%, but you could chalk that up to not having many communities over 1000. New Hampshire and Maine also have less than 50% represented.
|
10-29-2014, 08:04 PM | #9 |
Hall Of Famer
Join Date: Jun 2014
Location: Juust a bit outside...
Posts: 5,607
|
This is great work. I can't wait to use it
__________________
"Cannonball Coming!" Go Bucs!! Founder and League Caretaker of the Professional Baseball Circuit, www.probaseballcircuit.com An Un-Official Guide to Minor League Management in OOTP 21 Ratings Scale Conversion Cross-Reference Cheat Sheet |
10-29-2014, 08:06 PM | #10 |
OOTP Developments
Join Date: Aug 2007
Location: Nice, Côte d'Azur, France
Posts: 19,757
|
This looks really cool. Looking forward to it!
|
10-29-2014, 08:08 PM | #11 |
OOTP Developments
Join Date: Aug 2007
Location: Nice, Côte d'Azur, France
Posts: 19,757
|
I live in Maine, and the deal in Northern NE is that there are just so many tiny towns. Villages really. So those numbers sound pretty accurate to me.
|
10-29-2014, 08:23 PM | #12 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
I'm almost done, I just have to do some more county addendums (x county). Then I have to do a fair amount of copy/paste. Done for today, there is a World Series game on... Have Friday night free though so I will try and put in mad work on it then.
Glad to see someone is looking forward to it! |
10-31-2014, 06:34 PM | #13 |
All Star Starter
Join Date: May 2013
Location: Philadelphia
Posts: 1,245
|
Done with the U.S. only file. Requires some testing before release though.
EDIT: Posted file in first post. Fixed a couple errors. Should work fine. Read through the documentation for a little information. Complete world will not be done tonight, but will most likely be finished by Tuesday. Then comes the world with U.S. states as nations. Last edited by Ike348; 10-31-2014 at 07:39 PM. |
Bookmarks |
Thread Tools | |
|
|