A quick update on this. Just by taking material from names files created by fans for OOTP 6 and earlier versions, we have at least a rough start for the following languages/nationalities:
English
Latin American / Spanish
Japanese
German
French
Italian
Finnish
Swedish
Russian
Czech
Polish
Scottish
Arabic
Irish
I'm also trying to figure out how to work these in:
Fantasy/Tolkien
Scandinavian/Viking
Waiting for Bobble's Hobbit files to show up in my in-box.
Some key nationalities I'm missing so far include:
Korean
Chinese/other Asian
Norwegian
Danish
Dutch
Portuguese
Brazilian
Greek
Anything from Africa
Please let me know if you have any leads on these.
Incidentally, my goal is to make distinct data by country, and not by language. In other words, I'd love to separate U.S. English, U.K. English, Australian, etc. Similarly, there should be different values for Latin American versus Spanish names, and so on. However, this may not end up being practical, for time or resource reasons. Still, I'll do my best! If it ends up being impossible, we'll just have one large data set for Spanish, English, Arabic, etc.
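To make the country-versus-language idea a bit more concrete, here is a rough Python sketch of how a lookup with a fallback could work. This is only an illustration, not how the names files are actually stored: the `NAME_POOLS` table, the `pool_for` function, the language/country codes, and the handful of sample names are all made up for the example.

```python
# Hypothetical layout: name pools keyed by (language, country).
# A country of None marks the one large, language-wide data set.
NAME_POOLS = {
    ("es", "MX"): ["Hernández", "López"],           # sample data only
    ("es", None): ["García", "Hernández", "López"],  # sample data only
    ("en", None): ["Smith", "Jones"],                # sample data only
}

def pool_for(language, country):
    """Return the country-specific pool if one exists,
    otherwise fall back to the language-wide pool."""
    specific = NAME_POOLS.get((language, country))
    if specific:
        return specific
    return NAME_POOLS.get((language, None), [])

# Mexico has its own pool; Argentina falls back to general Spanish.
mexico = pool_for("es", "MX")
argentina = pool_for("es", "AR")
```

The nice part of a layout like this is that country-specific data can be added one country at a time, and anything not yet split out still gets names from the larger language set.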
The data right now is not very clean, and I suspect it still contains a large number of duplicates. Even so, there are currently about 40,000 unique last names and 8,000 unique first names in the data set, which is about twice the number of names in the original OOTP.
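For anyone curious what cleaning out the duplicates might look like, here is a minimal Python sketch. It assumes the names are just a list of strings and that two entries count as the same name if they differ only in capitalization, stray spaces, or the way accented characters are encoded. The `normalize` and `dedupe` functions are hypothetical helpers for this example, not part of any OOTP tool.

```python
import unicodedata

def normalize(name):
    """Build a comparison key: collapse whitespace, unify the
    Unicode encoding of accented letters, and ignore case."""
    cleaned = " ".join(name.split())
    return unicodedata.normalize("NFC", cleaned).casefold()

def dedupe(names):
    """Drop duplicates, keeping the first spelling seen and the
    original order. Blank entries are skipped."""
    seen = set()
    unique = []
    for raw in names:
        key = normalize(raw)
        if not key or key in seen:
            continue
        seen.add(key)
        unique.append(" ".join(raw.split()))
    return unique

# Sample input only; the last García uses a combining accent.
raw_names = ["Smith", "smith ", " SMITH", "Garc\u00eda", "Garci\u0301a", ""]
cleaned_names = dedupe(raw_names)
```

Note that this only catches exact matches after normalization. Variant spellings of the same name (with and without accents, for instance) would still count as separate entries, which may or may not be what we want.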