June 10, 2009 at 7:30 am
I would like to utilize the fuzzy search capabilities of SQL when querying my database. In particular, I would like to use it when searching for people by name. However, I need a master names list to run a comparison against. Where does one get such a list? I need one that not only covers American names, but European, Asian, Middle Eastern, Indian, etc... Is there a place you can download such lists? Or purchase them?
Thanks
June 10, 2009 at 7:37 am
this might help a little bit:
the Us Census Bureau put together the most common male first names, female first names, and most common last names:
http://www.census.gov/genealogy/names/names_files.html
90% of the names in the Us have those values...when i went over it though, i wish i had the other 10% to get a complete database.
that list might be a good start for you to use for some tables to compare against.
Lowell
June 10, 2009 at 7:43 am
You could also try sites like NameTrends.net, Parents.com, or even PregnancyWeekly.com. I think there's also a BabyNames.com. Not sure how international any of those are, but they might have some use.
- Gus "GSquared", RSVP, OODA, MAP, NMVP, FAQ, SAT, SQL, DNA, RNA, UOI, IOU, AM, PM, AD, BC, BCE, USA, UN, CF, ROFL, LOL, ETC
Property of The Thread
"Nobody knows the age of the human race, but everyone agrees it's old enough to know better." - Anon
June 10, 2009 at 11:58 am
Thanks for the responses. I downloaded the census data and it will do for now. Now I have a follow up question as it relates to these tables.
I have been work with Ira Warren Whiteside's Fuzzy Match function (see [/url]
What I would like to do is have a string provided by a user run against the Last Name table (refNamesList_LastName) and return everything with a ranking of .95 or higher. Is this possible? I've been trying to do it for the last couple of hours without success. Thanks
June 15, 2009 at 7:16 am
It is possible. Looks like what you'd need to do is take that function, and use it in a Cross Apply against the table and the input parameter (user-provided value).
Take a look at Cross Apply in Books Online, see if that helps.
- Gus "GSquared", RSVP, OODA, MAP, NMVP, FAQ, SAT, SQL, DNA, RNA, UOI, IOU, AM, PM, AD, BC, BCE, USA, UN, CF, ROFL, LOL, ETC
Property of The Thread
"Nobody knows the age of the human race, but everyone agrees it's old enough to know better." - Anon
June 15, 2009 at 7:17 am
After reading that article, it also looks to me like you might want to take a look at the SoundEx and Difference functions in BOL. They might simplify what you're trying to do.
- Gus "GSquared", RSVP, OODA, MAP, NMVP, FAQ, SAT, SQL, DNA, RNA, UOI, IOU, AM, PM, AD, BC, BCE, USA, UN, CF, ROFL, LOL, ETC
Property of The Thread
"Nobody knows the age of the human race, but everyone agrees it's old enough to know better." - Anon
Viewing 6 posts - 1 through 5 (of 5 total)
You must be logged in to reply to this topic. Login to reply