Home | Webstore
Latest News: OOTP 25 Available - FHM 10 Available - OOTP Go! Available

Out of the Park Baseball 25 Buy Now!

  

Go Back   OOTP Developments Forums > Prior Versions of Our Games > Franchise Hockey Manager > FHM - General Discussions

Reply
 
Thread Tools
Old 01-01-2013, 08:45 AM   #21
no way
Minors (Triple A)
 
no way's Avatar
 
Join Date: Dec 2012
Location: Plzeň, Czech Republic
Posts: 296
Quote:
Originally Posted by geckon View Post
I can have a look at them too. Have you made any changes already? I would like not to duplicate our work.
Here is what I did, if you want you can check it.
Attached Files
File Type: zip 26 czech.zip (6.1 KB, 48 views)
no way is offline   Reply With Quote
Old 01-01-2013, 09:05 AM   #22
Kebabking
Bat Boy
 
Join Date: Dec 2012
Posts: 5
no way: I'll PM you some surnames really quick, hope it helps

edit: message sent, hope it arrived - I don't have it in my mailbox for some reason.

Last edited by Kebabking; 01-01-2013 at 09:39 AM.
Kebabking is offline   Reply With Quote
Old 01-01-2013, 09:36 AM   #23
geckon
Hall Of Famer
 
geckon's Avatar
 
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
Quote:
Originally Posted by no way View Post
I have first names done and last names about 40%. I was thinking that I would give it to you after I was done to check it because more eyes see more (prostě víc očí víc vidí )
Quote:
Originally Posted by no way View Post
Here is what I did, if you want you can check it.
Thanks, I'll look at it during today.
geckon is offline   Reply With Quote
Old 01-01-2013, 10:12 AM   #24
geckon
Hall Of Famer
 
geckon's Avatar
 
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
Quote:
Originally Posted by JeffR View Post
The name files will be different from OOTP, the European ones in particular in OOTP aren't good enough for hockey. I've been working on them, but they've been taking a backseat to other priorities for a while. Czechs are done, sort of: I did them with an English keyboard, so the special characters are missing. If someone who speaks Czech would like to make themselves useful and edit those in (and correct any other spelling issues with the names), here are the files.
Jeff, I just wanna ask a few questions before I'll make any changes...

1) What encoding the file with the Czech special characters included should have? I suppose it's UTF-8 but I rather ask...

2) Where did you get those names? There are some (e.g. Jeremy, Georg, Nikolas etc.) I don't consider Czech although I admit there could be a Czech man using such name - but it would be _very_ rare. Is it possible to delete names from the file to make it more accurate?

3) Is it possible to add new names to the files? I missed some names after a brief scan of the files (like Miroslav or Radovan).

4) I suppose the first number after a name expresses how often the name should occur. Is that right? Did you get those numbers from some statistics? There are some which I feel they are not right (like Jaromír or Josef) although I don't have any real numbers in my hands now. I could probably find and use some statistics to correct the numbers. Is such effort desired?

5) What does the second number express?

I'd like to improve at least the first names file since I think the first names are more annoying if they are done wrong. There are quite enough surnames so they won't repeat so much but if there is a problem with first names' frequency and you see a lot of lets say Rostislavs (which is not so much common Czech name) you will be disappointed.
My plan is to add more names to the file, take some name frequency statistics and correct the freq numbers accordingly. I must admit though that I think I won't make it all done today. Would you like to see such work done or are you not interested in this?
Thanks.

Last edited by geckon; 01-01-2013 at 10:13 AM.
geckon is offline   Reply With Quote
Old 01-01-2013, 12:58 PM   #25
geckon
Hall Of Famer
 
geckon's Avatar
 
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
Time for a little update and few more questions

I searched for Czech name databases and found some. I have three possible plans. Please tell me which one you find better for the game.

1) Use the database provided by Czech ministry for internal affairs. It contains all names used in Czech Republic and can be sorted according to frequency of newborns in any year since 1890. So I could pick one year and take lets say 200 most used names that year for newly born children.
Advantages:
- Old names used lets say fifty years ago won't be included (or less often) although people with such names still live.
Disadvantages:
- Statistical error. Let's say that in the picked year there were uncommonly few Jans born although normally this name is very common. The resulting file would be affected by such abnormality.
- I would have to sort female names out by hand since the database contains male and female names together.

2) Use another database which is probably based on the same or similar data but shows only overall numbers of living persons with given names.
Advantages:
- It doesn't have the statistical problem described above.
- I can filter out only male names automatically so less work for me
Disadvantages:
- It includes older names that are not used much nowadays.
- It includes some Slovak names which is I think mostly reflecting Slovaks who stayed in Czech Republic after Czechoslovakia split-up (so no the people born after 1993).

3) Just take the file given by JeffR and edited by no way and edit it more - make some corrections, add missing names, delete few names etc.
Advantages:
- Probably the fastest option. (Maybe the 2nd one could be faster, hard to say.)
Disadvantages:
- Certainly not as accurate as the others.

Questions:
- (Valid only for the first plan) What year to pick? Since I suppose the first newgens will be generated for the 2013/2014 season it should probably be something like 1990-2000. Do you agree? Which one would you pick and why?
- (For developers only) What range is used for the name frequency in the name generator? I need to know that to be able to normalize the real frequency numbers.
- How many names would you include into that file?

I personally prefer probably the second option for it should probably be both fast and precise enough.

Last edited by geckon; 01-01-2013 at 01:26 PM.
geckon is offline   Reply With Quote
Old 01-01-2013, 09:12 PM   #26
JeffR
FHM Producer
 
JeffR's Avatar
 
Join Date: Apr 2002
Location: Kelowna, BC
Posts: 16,623
Quote:
Originally Posted by geckon View Post
1) What encoding the file with the Czech special characters included should have? I suppose it's UTF-8 but I rather ask...
Yes, UTF-8.

Quote:
2) Where did you get those names? There are some (e.g. Jeremy, Georg, Nikolas etc.) I don't consider Czech although I admit there could be a Czech man using such name - but it would be _very_ rare. Is it possible to delete names from the file to make it more accurate?
I started with the names of all the Czech nationality players in the database and then some from things like older seasons, lists of Czech sportsmen, and so on. So "Jeremy" is from Jeremy Tichý (Plzen U-20 player), Nikolas is from Nikolas Tverdak (Chomutov U-20), etc. I'd like to keep the really rare names in, but if anything looks completely wrong, take it out.

Quote:
3) Is it possible to add new names to the files? I missed some names after a brief scan of the files (like Miroslav or Radovan).
Sure, go ahead.

Quote:
4) I suppose the first number after a name expresses how often the name should occur. Is that right? Did you get those numbers from some statistics? There are some which I feel they are not right (like Jaromír or Josef) although I don't have any real numbers in my hands now. I could probably find and use some statistics to correct the numbers. Is such effort desired?
Yes, all the frequency numbers are added together, then the chance of an individual name being used is X/Total.) Those files are based mainly on the number of players with that name appearing in the database, with a few modifications to make the more common names more frequent.

Quote:
5) What does the second number express?
It's the nationality identifier for the name, 26 is Czech.

Quote:
My plan is to add more names to the file, take some name frequency statistics and correct the freq numbers accordingly. I must admit though that I think I won't make it all done today. Would you like to see such work done or are you not interested in this?
Definitely, go ahead and do it. I don't need it right away, the other name files are a long way from being done - I just happened to have the Czech one done already.

Quote:
I searched for Czech name databases and found some. I have three possible plans. Please tell me which one you find better for the game.
I think 1) sounds like the best option. The main problem with 2) is the Slovak names - the Slovaks will have their own name list, and the way naming works, we can just specify that 2% (or whatever the exact number is) of Czech-nationality players get Slovak names - so I'd like to keep the name lists (European ones, at least) as single-nationality as possible.

Quote:
(Valid only for the first plan) What year to pick? Since I suppose the first newgens will be generated for the 2013/2014 season it should probably be something like 1990-2000. Do you agree? Which one would you pick and why?
Maybe 1993-present. Since the names are going to be for future players, I think current naming trends should be reflected. (Even though it's a little disturbing to see character names from Twilight, Harry Potter, and so on in the newest lists.)

Quote:
(For developers only) What range is used for the name frequency in the name generator? I need to know that to be able to normalize the real frequency numbers.
The frequency numbers are added together for each nationality and then the chance of an individual name appearing is (its frequency number/total number). So they're only relative to each other and there's no specific range to use. But don't try to make it represent the real life numbers perfectly - it's better to set a cutoff where all names below a certain percentage of the population get a 1, set the frequencies for the more common names in proportion to that - and then lower those numbers for the common names by at least 50% (for some reason, if you try to match real life perfectly, it just doesn't look right - there are far too many common names.)

Quote:
How many names would you include into that file?
As many as you want. The Canadian and American last name lists will be 8000-9000 names long, so if you want to have a few thousand, no problem.

Thanks for the help. Again, take your time, it doesn't take long for us to incorporate them into the names file once we have them.
JeffR is offline   Reply With Quote
Old 01-01-2013, 09:18 PM   #27
JeffR
FHM Producer
 
JeffR's Avatar
 
Join Date: Apr 2002
Location: Kelowna, BC
Posts: 16,623
Quote:
Originally Posted by Strelka View Post
If you would like me to look at Russian/Ukrainian/Belorussian I can do that.
Sure, I don't have them done for Ukraine/Belarus yet (I was planning on separate files for all 3), but here's Russia. Unfortunately, it's a mess - I made the mistake of adding in the OOTP Russian names, which are terrible - feminine names, lots of Central Asian names, and so on. I've tried to clean it up, but from what Alessandro tells me, I missed a lot. So if you can get rid of some of the obvious problems, that would be great.
Attached Files
File Type: zip 34 russian.zip (17.6 KB, 38 views)
JeffR is offline   Reply With Quote
Old 01-01-2013, 09:28 PM   #28
JeffR
FHM Producer
 
JeffR's Avatar
 
Join Date: Apr 2002
Location: Kelowna, BC
Posts: 16,623
And as long as I'm asking for help, I might as well post this: here's the (provisional) ethnicities.txt file for the game. The ones marked with asterisks are already done (and a few others will probably be lifted entirely from OOTP, when their names are good enough for our purposes, e.g. Latin American and Japanese names), but the others need work. If anyone wants to help with any of these, speak up. Some of them will need to be built from the ground up, but for most I can provide you with beginning files built from the player database.
Attached Files
File Type: txt ethnicities - Copy.txt (1.4 KB, 97 views)
JeffR is offline   Reply With Quote
Old 01-02-2013, 04:52 AM   #29
Strelka
Minors (Rookie Ball)
 
Strelka's Avatar
 
Join Date: Feb 2012
Posts: 36
Quote:
Originally Posted by JeffR View Post
Sure, I don't have them done for Ukraine/Belarus yet (I was planning on separate files for all 3), but here's Russia. Unfortunately, it's a mess - I made the mistake of adding in the OOTP Russian names, which are terrible - feminine names, lots of Central Asian names, and so on. I've tried to clean it up, but from what Alessandro tells me, I missed a lot. So if you can get rid of some of the obvious problems, that would be great.
I fixed the first names, it was easy. A few completely not Russian names, a few Central Asian names, and several of the same name transliterated in a different style - I deleted the repeats and kept the same style throughout.

Now the last names - wow. It is going to take me some time to go over 5000 names, I'd guess maybe 1000 are Central Asian. And while yes you have your Uzbek here and there that is a Russian who plays hockey, his name isn't going to be "Ivan." So with the way the engine selects names I think it's best to leave them out of the Russian name file.
Strelka is offline   Reply With Quote
Old 01-02-2013, 06:45 AM   #30
grindline
Minors (Single A)
 
Join Date: Jun 2012
Posts: 87
Quote:
Originally Posted by JeffR View Post
And as long as I'm asking for help, I might as well post this: here's the (provisional) ethnicities.txt file for the game. The ones marked with asterisks are already done (and a few others will probably be lifted entirely from OOTP, when their names are good enough for our purposes, e.g. Latin American and Japanese names), but the others need work. If anyone wants to help with any of these, speak up. Some of them will need to be built from the ground up, but for most I can provide you with beginning files built from the player database.
I have never done anything like this before but I would love to help in some way.

I suppose the obvious thing would be to do the English ones (given I am from England) but if any really random sets need building from the ground up and you can't find anyone with any expertise I don't mind having a go.
grindline is offline   Reply With Quote
Old 01-02-2013, 07:45 AM   #31
geckon
Hall Of Famer
 
geckon's Avatar
 
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
Quote:
Originally Posted by JeffR View Post
I started with the names of all the Czech nationality players in the database and then some from things like older seasons, lists of Czech sportsmen, and so on. So "Jeremy" is from Jeremy Tichý (Plzen U-20 player), Nikolas is from Nikolas Tverdak (Chomutov U-20), etc. I'd like to keep the really rare names in, but if anything looks completely wrong, take it out.
Yes as I said - there can be person with such name but it is very very rare. And it's not like it is rare Czech name (like for example an old one) but it's an inspiration from other languages. I will see how many names I'll get from the database etc and then I'll decide what to do with such names. But when I'll be deleting Slovak names I think I should delete those as well.


Quote:
Originally Posted by JeffR View Post
I think 1) sounds like the best option. The main problem with 2) is the Slovak names - the Slovaks will have their own name list, and the way naming works, we can just specify that 2% (or whatever the exact number is) of Czech-nationality players get Slovak names - so I'd like to keep the name lists (European ones, at least) as single-nationality as possible.
I could remove the Slovak names if I'd use the 2nd option but if you think 1) is better anyway I can do that.
The possibility of Czech player getting Slovak name seems cool to me. Would it be possible (and usable) something like a very very little chance of getting any language's name for a newgen with any nationality? Name files could be purely national and you could still have Jeremy Tichý this way.


Quote:
Originally Posted by JeffR View Post
Maybe 1993-present. Since the names are going to be for future players, I think current naming trends should be reflected. (Even though it's a little disturbing to see character names from Twilight, Harry Potter, and so on in the newest lists.)
OK, I can remove female names and then sum numbers of newly born people with given names from years 1993-2012 and build the file based on that.
If there will be Harry Potter Novák (or similar) as a Czech name in the statistics I promise I will remove it ;-))
(Btw any possibility to have women hockey? Not that I would need it, it just crossed my mind :-))


Quote:
Originally Posted by JeffR View Post
The frequency numbers are added together for each nationality and then the chance of an individual name appearing is (its frequency number/total number). So they're only relative to each other and there's no specific range to use. But don't try to make it represent the real life numbers perfectly - it's better to set a cutoff where all names below a certain percentage of the population get a 1, set the frequencies for the more common names in proportion to that - and then lower those numbers for the common names by at least 50% (for some reason, if you try to match real life perfectly, it just doesn't look right - there are far too many common names.)
OK. Do you have any suggestion what cutoff to use? By common names you mean only the most common ones or any names above the cutoff?


Quote:
Originally Posted by JeffR View Post
Thanks for the help. Again, take your time, it doesn't take long for us to incorporate them into the names file once we have them.
OK, I'll try to do it the most properly when there is enough time. If you would need it for some reason just tell me and I will try to do it faster.
geckon is offline   Reply With Quote
Old 01-02-2013, 08:44 AM   #32
no way
Minors (Triple A)
 
no way's Avatar
 
Join Date: Dec 2012
Location: Plzeň, Czech Republic
Posts: 296
To geckon:
if you want some hlep, just contact me and I'll do whatever I can or you need
no way is offline   Reply With Quote
Old 01-02-2013, 08:54 AM   #33
geckon
Hall Of Famer
 
geckon's Avatar
 
Join Date: Nov 2012
Location: Czech Republic
Posts: 2,077
Quote:
Originally Posted by no way View Post
To geckon:
if you want some hlep, just contact me and I'll do whatever I can or you need
Thanks for the offer. This will probably be a one-man work but if I find something I could use help with I'll write you.
geckon is offline   Reply With Quote
Old 01-02-2013, 09:03 AM   #34
archibalduk
All Star Starter
 
archibalduk's Avatar
 
Join Date: Oct 2011
Location: UK
Posts: 1,209
Quote:
Originally Posted by grindline View Post
I have never done anything like this before but I would love to help in some way.

I suppose the obvious thing would be to do the English ones (given I am from England) but if any really random sets need building from the ground up and you can't find anyone with any expertise I don't mind having a go.
The North American names from OOTP would be a good starting point for the English names. I guess it'd be just a case of taking out the non-English names from the list.

In case it is of any help, I have attached a spreadsheet of British players broken down by nationality (i.e. England, Scotland, etc). This covers players in the EIHL, EPL and NIHL-1. Maybe this might be useful to anybody who looks at the British names (please note that I don't have any names from the Rep of Ire - hence that worksheet is blank).

Some other name resources:

SCOTLAND
- Common male Scottish forenames: Scottish First Names: Scots names -Women
- Common Scottish surnames: Appendix:Scottish surnames - Wiktionary
- Top 100 Scottish surnames (2001 Scottish census): http://www.gro-scotland.gov.uk/files...es_tablea1.pdf
- Top 300 Scottish surnames (1901 Scottish census): http://www.gro-scotland.gov.uk/files...es_tablea5.pdf

ENGLAND & WALES
- Top 100 English male baby names & top 100 Welsh baby names (2011 census - see Table 2 for England and Table 3 for Wales - Table 6 has a list of the top 6083 boys names for England and Wales and lists their frequency): http://www.ons.gov.uk/ons/rel/vsob1/...boys--2011.xls
- Common English & Welsh surnames: http://en.wiktionary.org/wiki/Append...land_and_Wales)

OTHER
- Common surnames for various other countries: Category:Surname appendices - Wiktionary
Attached Files
File Type: xls British & Irish Players.xls (74.0 KB, 54 views)
__________________
Webmaster of The Blue Line Eastside Hockey Manager & Franchise Hockey Manager community and resource

Last edited by archibalduk; 01-02-2013 at 09:04 AM.
archibalduk is offline   Reply With Quote
Old 01-02-2013, 11:41 AM   #35
grindline
Minors (Single A)
 
Join Date: Jun 2012
Posts: 87
Ok. I will try some of the British or Irish name files if the ootp American names file is available.
grindline is offline   Reply With Quote
Old 01-02-2013, 01:48 PM   #36
Whitebear
Minors (Single A)
 
Join Date: Dec 2012
Location: Gatineau (Ottawa Region)
Posts: 70
Quote:
Originally Posted by dave1927p View Post
Another annoying thing is getting french surnames and first names for people born in ontario or western canada. Sure there is the odd one but it's rare. In ootp it's way too common getting Jean-Pierre Lefebvre born in Calgary.
BTW it's not that rare 21.7% French and 20.6% other language in Canada

Mother tongue (percentage distribution), Canada, provinces and territories, 2011 Census

Richard, Lafleur, Lemieux, Roy, Beliveau, Plante, Bourque, Trottier, Bossy, Potvin, Dionne, Perreault, Brodeur, Fleury, Marleau, Bričre, Giroux, Bergeron etc..
Whitebear is offline   Reply With Quote
Old 01-02-2013, 02:18 PM   #37
tward13
All Star Starter
 
Join Date: Jan 2003
Posts: 1,957
Quote:
Originally Posted by archibalduk View Post
The North American names from OOTP would be a good starting point for the English names. I guess it'd be just a case of taking out the non-English names from the list.

In case it is of any help, I have attached a spreadsheet of British players broken down by nationality (i.e. England, Scotland, etc). This covers players in the EIHL, EPL and NIHL-1. Maybe this might be useful to anybody who looks at the British names (please note that I don't have any names from the Rep of Ire - hence that worksheet is blank).

Some other name resources:

SCOTLAND
- Common male Scottish forenames: Scottish First Names: Scots names -Women
- Common Scottish surnames: Appendix:Scottish surnames - Wiktionary
- Top 100 Scottish surnames (2001 Scottish census): http://www.gro-scotland.gov.uk/files...es_tablea1.pdf
- Top 300 Scottish surnames (1901 Scottish census): http://www.gro-scotland.gov.uk/files...es_tablea5.pdf

ENGLAND & WALES
- Top 100 English male baby names & top 100 Welsh baby names (2011 census - see Table 2 for England and Table 3 for Wales - Table 6 has a list of the top 6083 boys names for England and Wales and lists their frequency): http://www.ons.gov.uk/ons/rel/vsob1/...boys--2011.xls
- Common English & Welsh surnames: http://en.wiktionary.org/wiki/Append...land_and_Wales)

OTHER
- Common surnames for various other countries: Category:Surname appendices - Wiktionary
My Scottish ancestors must have all come over and taken their names with them. I see the Gardiners and Orrs in 1901 but not in the recent list! At least the Russells are still around.
tward13 is offline   Reply With Quote
Old 01-02-2013, 03:54 PM   #38
CeeBee
Minors (Double A)
 
CeeBee's Avatar
 
Join Date: Apr 2009
Location: Chase BC Canada
Posts: 105
Quote:
Originally Posted by Whitebear View Post
BTW it's not that rare 21.7% French and 20.6% other language in Canada

Mother tongue (percentage distribution), Canada, provinces and territories, 2011 Census

Richard, Lafleur, Lemieux, Roy, Beliveau, Plante, Bourque, Trottier, Bossy, Potvin, Dionne, Perreault, Brodeur, Fleury, Marleau, Bričre, Giroux, Bergeron etc..
How many of those were NOT born in Quebec though? 2? and those 2 have first names of Bryan and Theoran though Trottier's and Fleury's have been Quebecois as well.
CeeBee is offline   Reply With Quote
Old 01-02-2013, 04:28 PM   #39
Whitebear
Minors (Single A)
 
Join Date: Dec 2012
Location: Gatineau (Ottawa Region)
Posts: 70
Quote:
Originally Posted by CeeBee View Post
How many of those were NOT born in Quebec though? 2? and those 2 have first names of Bryan and Theoran though Trottier's and Fleury's have been Quebecois as well.
Bryan Trottier Val-Marie SK
Denis and Jean Potvin Ottawa, Ont
Patrick Marleau, Aneroid SK (Don't know if he speaks french?)
Claude Giroux Hearst, Ont
Alex Plante Brandon MB

Well five on 18 is about 27% out of Quebec !

I think it's fair to say that there's enough french name in North America even in USA but it doesn't mean they speak french.
Whitebear is offline   Reply With Quote
Old 01-02-2013, 04:53 PM   #40
Nino33
Major Leagues
 
Nino33's Avatar
 
Join Date: Oct 2012
Posts: 320
The post you replied to initially referred to players "born in Ontario or western Canada"

Your response indicating 21.7% French didn't refer to western Canada and Ontario but rather all of Canada

The data you linked to indicates 1.5%-4.1% French as mother tongue in western Canada and Ontario

Last edited by Nino33; 01-02-2013 at 04:54 PM.
Nino33 is offline   Reply With Quote
Reply

Bookmarks

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -4. The time now is 08:31 PM.

 

Major League and Minor League Baseball trademarks and copyrights are used with permission of Major League Baseball. Visit MLB.com and MiLB.com.

Officially Licensed Product – MLB Players, Inc.

Out of the Park Baseball is a registered trademark of Out of the Park Developments GmbH & Co. KG

Google Play is a trademark of Google Inc.

Apple, iPhone, iPod touch and iPad are trademarks of Apple Inc., registered in the U.S. and other countries.

COPYRIGHT © 2023 OUT OF THE PARK DEVELOPMENTS. ALL RIGHTS RESERVED.

 

Powered by vBulletin® Version 3.8.10
Copyright ©2000 - 2024, vBulletin Solutions, Inc.
Copyright © 2020 Out of the Park Developments