01-04-2013, 02:53 PM | #61 |
Hall Of Famer
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
|
I just finished selecting the Czech first names - I have 472 of them, although I may decide to remove some while assigning frequencies.
I kept a few Polish, Germanic and Hungarian names, some probably popular names that are not Czech (like the already mentioned Nikolas, Sebastian or Leonard), and variants of Czech and foreign names (Dominik - Dominic, Nikolas - Nicolas, Kvido - Quido etc.). If you have any advice or comments on what I just wrote, please let me know. Otherwise please wait until I complete and upload it so you can check it out. PS: I found a nice source for surnames and will do them next. |
01-04-2013, 05:29 PM | #62 |
Minors (Single A)
Join Date: Jun 2012
Posts: 87
|
Following on from this, I was thinking about the English name sets. Given that a statistically noteworthy share of the English population is ethnically Asian (I'm thinking specifically of the Pakistani Muslim community), should the set include the most common names among that demographic?
Pro: It would be more realistic to have an English regen called Mohammed Hussain.
Cons: How do you guarantee that the ethnic first names and surnames match up, so that you don't get cross-ethnicity names that are super rare in real life, such as Jason Mohammed? And if FaceGen appears in FHM, would there be potential realism issues with an obviously ethnic name being assigned to a Caucasian face? |
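One way to address the matching concern could be sketched as follows: pick a demographic group first, then draw both the first name and the surname from that same group. This is only an illustration. The group names, shares and name lists are made-up placeholders, and nothing in this thread says the game works this way.

```python
import random

# Hypothetical groups: (share of the population, first names, surnames).
# The shares and names are placeholders, not real statistics.
GROUPS = {
    "english": (0.95, ["Jason", "Oliver"], ["Smith", "Taylor"]),
    "pakistani": (0.05, ["Mohammed", "Imran"], ["Hussain", "Khan"]),
}


def generate_matched_name(groups, rng):
    """Pick a group by its share, then take both name parts from that group."""
    keys = list(groups)
    shares = [groups[key][0] for key in keys]
    key = rng.choices(keys, weights=shares, k=1)[0]
    _, first_names, surnames = groups[key]
    return key, f"{rng.choice(first_names)} {rng.choice(surnames)}"


if __name__ == "__main__":
    rng = random.Random(2013)
    for _ in range(5):
        print(generate_matched_name(GROUPS, rng))
```

Because both parts come from one group, a combination like Jason Mohammed cannot be produced by this sketch. Allowing it occasionally would need an extra, explicitly chosen mixing probability.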
01-04-2013, 07:07 PM | #63 |
Hall Of Famer
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
|
Czech first names are done. I have 455 of them. The file is attached to this post so you can look at it. I am open to your suggestions.
The names come from Aktuální a podrobná statistika jmen (Czech), and their frequencies are based on the real data but tweaked according to JeffR's advice in this thread. Do you have a piece of software for generating player names? I could use something like that for testing. I'd like to generate a bunch of names and check them, in case the file is compiled the wrong way. If it is, I could try to repair it. |
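A minimal sketch of such a tester, assuming the name set can be loaded as (name, weight) pairs. The game's actual file format is not shown in this thread, and the sample names and weights below are placeholders rather than values from the attached file.

```python
import random

# Hypothetical (name, weight) pairs standing in for the real name files.
FIRST_NAMES = [("Jan", 50), ("Petr", 45), ("Kvido", 1)]
LAST_NAMES = [("Novák", 30), ("Svoboda", 25), ("Dočkal", 2)]


def generate_names(first, last, count, seed=None):
    """Draw `count` full names, picking each part in proportion to its weight."""
    rng = random.Random(seed)
    first_names, first_weights = zip(*first)
    last_names, last_weights = zip(*last)
    firsts = rng.choices(first_names, weights=first_weights, k=count)
    lasts = rng.choices(last_names, weights=last_weights, k=count)
    return [f"{f} {l}" for f, l in zip(firsts, lasts)]


if __name__ == "__main__":
    for name in generate_names(FIRST_NAMES, LAST_NAMES, 20, seed=1):
        print(name)
```

Generating a few hundred names and scanning them by eye is usually enough to spot a name that appears far too often or a weight that was mistyped.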
01-04-2013, 07:12 PM | #64 | |
Guest
Posts: n/a
|
Quote:
|
|
01-05-2013, 12:02 AM | #65 |
Hall Of Famer
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
|
OK, now I have finished the last names as well. Here they are (all 10,000 of them).
The file is based on this source: Četnost českých příjmení (Czech). There are some (not many) Vietnamese and other foreign (e.g. Polish or German) surnames, which I think is completely OK because such surnames are used in the Czech Republic. Vietnamese surnames are used only by Vietnamese people, but their children will play hockey as well.
I have modified the frequencies in a few ways:
- The most common names' priorities were lowered.
- The Vietnamese and some other "less-Czech" surnames were lowered notably.
- The surnames of famous real-world hockey players were lowered somewhat too.
- Other smaller changes.
Now it would be nice to generate some names to test whether the set is done right. And of course you are welcome to go through the files and point out possible mistakes or ideas for improving the set. |
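The kind of frequency adjustment described above could be sketched like this. It is one possible approach, not necessarily the one used for the attached file: the cap, the penalty factors and all counts below are made-up assumptions.

```python
# Hypothetical raw counts and penalty factors; none of these are real data.
EXAMPLE_COUNTS = {"Novák": 35000, "Svoboda": 26000, "Nguyen": 21000, "Jágr": 500}
EXAMPLE_PENALTIES = {"Nguyen": 0.1, "Jágr": 0.5}


def adjust_weights(counts, cap, penalties):
    """Clip very common names at `cap`, then scale down penalised names."""
    adjusted = {}
    for name, count in counts.items():
        weight = min(count, cap)             # lower the most common names
        weight *= penalties.get(name, 1.0)   # lower selected names further
        adjusted[name] = max(1, round(weight))  # keep every name drawable
    return adjusted


if __name__ == "__main__":
    print(adjust_weights(EXAMPLE_COUNTS, cap=20000, penalties=EXAMPLE_PENALTIES))
```

Keeping the penalties in a separate table means the raw counts from the source stay untouched, so the tweaks can be revised later without rebuilding the list.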
01-05-2013, 12:56 AM | #66 | ||
Moderator
Join Date: May 2012
Location: Canada
Posts: 367
|
Quote:
Quote:
I noticed a few sets are missing. Given what's already planned in there, IMO there should also be Bulgarian, Turkish, Greek, Mongolian and Armenian ones. There should also be something from South Africa. Belgian should also be split into Belgian Dutch and Belgian French, similar to Switzerland. I reckon that adds a fair amount of work for it all to be ready for release day, though... Perhaps as a bonus if time allows, and/or a post-release expansion? I have a small Bulgarian set ready that could do (240 + 517) or be expanded as needed. The others wouldn't be very long or difficult to build, especially since I think they might not need to be as thorough as the bigger nations, at least for the time being. Other than that, I currently have (South) Korean in progress. It should be ready this weekend; chances are I can clear a second one by Monday as well. - YZG |
||
01-05-2013, 02:43 AM | #67 | ||
FHM Producer
Join Date: Apr 2002
Location: Kelowna, BC
Posts: 16,625
|
Quote:
Quote:
|
||
01-05-2013, 02:51 AM | #68 |
FHM Producer
Join Date: Apr 2002
Location: Kelowna, BC
Posts: 16,625
|
No, this looks good - I can't test it yet in the game because it's going to require a bunch of changes to the world.xml file to switch over from the temporary OOTP files we're using, but I don't see any problems with it. And 450 names is a big improvement over the 130 or so the original file had, nice work.
|
01-05-2013, 03:06 AM | #69 | |
FHM Producer
Join Date: Apr 2002
Location: Kelowna, BC
Posts: 16,625
|
Quote:
Ladislav Rut
Vladislav Havlíček
Jindřich Hrubec
Ladislav Nosek
Petr Fojtík
Vojtěch Heřmánek
Antonín Štěpaník
Aleš Rybka
Petr Dočkal
Patrik Potěšil
Ladislav Řehoř
Oldřich Vaca
David Sitár
René Petráš
Karel Křížek
Martin Novák
Marián Schuster
Miloš Hanek
Antonín Nesvadba
Karel Hofman |
|
01-05-2013, 05:30 AM | #70 |
Minors (Single A)
Join Date: Jun 2012
Posts: 87
|
Just out of interest how many names do we expect to find in a set? The English first names are currently at 250, which seems a bit low but covers pretty much all of the names that have a frequency of 0.25% or greater in real life.
I guess I can add a few more rare ones to go down to those with a frequency of 0.1% if that is in keeping with the other name sets? |
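The effect of moving the cutoff from 0.25% to 0.1% can be checked with a small filter like the one below. This is a sketch only: the counts are invented for illustration and are not taken from any real census table.

```python
# Hypothetical counts per first name (total of 1000), not real census data.
EXAMPLE_COUNTS = {"Oliver": 700, "Jack": 290, "Fred": 8, "Roy": 2}


def names_above_share(counts, min_share):
    """Keep names whose share of all counted people is at least `min_share`."""
    total = sum(counts.values())
    return {name: c for name, c in counts.items() if c / total >= min_share}


if __name__ == "__main__":
    print(sorted(names_above_share(EXAMPLE_COUNTS, 0.0025)))  # 0.25% cutoff
    print(sorted(names_above_share(EXAMPLE_COUNTS, 0.001)))   # 0.1% cutoff
```

With these placeholder counts, the 0.25% cutoff drops Roy while the 0.1% cutoff keeps all four names.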
01-05-2013, 05:40 AM | #71 | |
All Star Starter
Join Date: Oct 2011
Location: UK
Posts: 1,209
|
Quote:
__________________
Webmaster of The Blue Line Eastside Hockey Manager & Franchise Hockey Manager community and resource |
|
01-05-2013, 07:15 AM | #72 |
Minors (Single A)
Join Date: Jun 2012
Posts: 87
|
|
01-05-2013, 07:40 AM | #73 | |
All Star Starter
Join Date: Oct 2011
Location: UK
Posts: 1,209
|
Quote:
If we're using Table 6, I'd suggest widening the net somewhat. For example, the name "Fred" ought to be included IMO - he's at #576 (0.02%). What do you think of going down to about #1158, where we have Roy (i.e. those with a count of at least 22)? Once the non-English/Welsh sounding names are taken out, I think we'd be left with just under 1000 names - which is equivalent to just under two distinct names per English/Welsh player in the game DB. Off-topic a little, but I can't believe so few babies were called the likes of "Roy" in 2011. But then I guess that's a sign of the times and/or me getting old!
|
01-05-2013, 08:49 AM | #74 | ||
Hall Of Famer
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
|
Quote:
BTW Petr Fojtík is my actual friend.
Quote:
|
||
01-05-2013, 09:42 AM | #75 |
All Star Starter
Join Date: Oct 2011
Location: UK
Posts: 1,209
|
I think Sebastian posted somewhere here a while back that they would look into FaceGen, but it's not yet certain whether it will make it into the game.
01-05-2013, 09:44 AM | #76 |
Hall Of Famer
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
|
|
01-05-2013, 11:00 AM | #77 |
Hall Of Famer
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
|
Could a moderator edit this thread's title to reflect the fact that not only Czech names are the subject of discussion here? I would do it myself but haven't found out how and I'm not sure if I have the needed permissions.
|
01-05-2013, 05:30 PM | #78 | |
Minors (Single A)
Join Date: Jun 2012
Posts: 87
|
Quote:
Duh! Oh, and as for expanding the database, my thought was to include all the English names with 50 or more occurrences, then add any others of note below that point, similar to what you suggested.
Last edited by grindline; 01-05-2013 at 05:34 PM. |
|
01-08-2013, 02:40 PM | #79 |
All Star Starter
Join Date: Oct 2011
Location: UK
Posts: 1,209
|
I've completed the Irish and Scottish names and have sent them to Grindline for double-checking. Once he's okay'ed them, I'll put them into a text file using the correct format.
I have frequency figures from the 2011 census for first names for both countries. There aren't any comprehensive frequency figures for surnames, so they'll each need to be set to '1' like the English surnames.
01-11-2013, 12:12 AM | #80 |
Moderator
Join Date: May 2012
Location: Canada
Posts: 367
|
Are Asian names in-game intended to use the Given name / Surname format or the Surname / Given name one?
I am attaching two CSV sets here. The first is a small Bulgarian set (239 + 552), if you're interested in adding it. It was built from a couple of lists of people, and names are weighted according to their occurrence in those lists. The second is a Korean one (607 + 182). I looked into reusing the one that comes with OOTP, but found that, like the Russian set, it includes both male and female names. Moreover, the names are transliterated according to several romanization systems. So I rebuilt the set from scratch. It features 182 of the ±250 known Korean surnames, weighted according to real-life proportions. Given names have been compiled from various lists of people. A few have been made more common according to their frequency of occurrence in the lists and according to data on the most common given names for babies over the past few decades. I also applied the Revised Romanization of Korean to all gathered names to keep them uniform. I see some names are slightly different in North Korea, but it might not be worth the hassle for now. I see the default OOTP Chinese names are a real mess as well. It seems there's a mix of Chinese and Taiwanese names in there. I'll build a fresh one. - YZG |
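Weighting names by how often they occur across several lists of people, as described for the Bulgarian and Korean sets, can be sketched with a simple counter. The rosters below are hypothetical placeholders, and this is not necessarily the exact procedure used to build the attached files.

```python
from collections import Counter

# Hypothetical source lists of given names (e.g. team rosters).
ROSTER_A = ["Min-jun", "Ji-ho", "Min-jun"]
ROSTER_B = ["Ji-ho", "Min-jun", "Seo-jun"]


def weights_from_lists(*name_lists):
    """Count how often each name appears across all lists; the count is its weight."""
    counter = Counter()
    for names in name_lists:
        # Strip stray whitespace and skip empty entries before counting.
        counter.update(n.strip() for n in names if n.strip())
    return counter


if __name__ == "__main__":
    for name, weight in weights_from_lists(ROSTER_A, ROSTER_B).most_common():
        print(name, weight)
```

The counts only reflect the lists that were collected, so a small sample can overstate a name that happens to repeat. That is one reason to nudge a few weights by hand using published data on the most common names.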