I'm looking to generate some database test data, specifically table columns containing people's names. To get a good indication of how well indexing works for name-based searches, I want the data to be as close as possible to real-world names and their true frequency distribution, e.g. lots of different names with frequencies that follow some power law.
Ideally, I'm looking for a freely available data file that lists each name followed by a single frequency value (or, equivalently, a probability).
Anglo-Saxon names would be fine, although names from other cultures would also be useful.
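To make the intended use concrete, here is a rough sketch of how I'd consume such a file, assuming one name per line followed by whitespace and a single numeric frequency. The function names and the sample rows are hypothetical; the numbers are invented placeholders, not real statistics.

```python
import io
import random
from itertools import accumulate


def load_name_frequencies(lines):
    """Parse 'NAME FREQUENCY' lines into two parallel lists.

    Assumed format: the frequency is the last whitespace-separated field,
    so names containing spaces are kept intact.
    """
    names, weights = [], []
    for line in lines:
        parts = line.rsplit(None, 1)
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        names.append(parts[0])
        weights.append(float(parts[1]))
    return names, weights


def sample_names(names, weights, count, seed=None):
    """Draw `count` names with probability proportional to their weight."""
    rng = random.Random(seed)
    cumulative = list(accumulate(weights))
    return rng.choices(names, cum_weights=cumulative, k=count)


# Invented placeholder rows, only to show the assumed file layout.
SAMPLE = """\
SMITH 1000
JONES 500
TAYLOR 250
ZYLSTRA 1
"""

if __name__ == "__main__":
    names, weights = load_name_frequencies(io.StringIO(SAMPLE))
    for name in sample_names(names, weights, 10, seed=42):
        print(name)
```

The weights do not need to be normalised, so raw counts and probabilities would work equally well as the frequency column.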