January 18, 2010 at 9:55 am
Is there an SSIS transform available that would help with this issue? I've reviewed fuzzy lookup and term extract, but don't think they will get to where I'm trying to go.
Scenario:
Table1 - raw data with several columns. The specific column that I'm working with is called Item_Keyword. The specific data entries for this column vary depending on data entry personnel and clinic.
For example: the Item_Keyword column may have entries like simplex, palacos osteobond, and cobalt. I know these to all be essentially the same as Bone Cement.
I want to update a new column in Table1 based on a lookup versus a Categorization_table which has 2 columns: KeyWord and Category.
For example: The Bone Cement category would have 4 rows in the Categorization_table.
Keyword Category
simplex Bone Cement
palacos Bone Cement
osteobond Bone Cement
cobalt Bone Cement
I have 23 distinct categories at this time.
Is there an SSIS transform available to do this work?
Thanks, BPH
BPH
January 18, 2010 at 10:50 am
You almost answered your own question. Create a lookup table in SQL Server and then use the Lookup transformation in your dataflow.
The absence of evidence is not evidence of absence
- Martin Rees
The absence of consumable DDL, sample data and desired results is, however, evidence of the absence of my response
- Phil Parkin
January 18, 2010 at 1:46 pm
That's the path I have taken today and I need to do some fine tuning on getting the matches to hit more frequently. Thanks for the reply.
BPH
January 18, 2010 at 3:05 pm
Another alternative, of course, is to do it all in one hit at the end, using a simple UPDATE query.
The absence of evidence is not evidence of absence
- Martin Rees
The absence of consumable DDL, sample data and desired results is, however, evidence of the absence of my response
- Phil Parkin
Viewing 4 posts - 1 through 3 (of 3 total)
You must be logged in to reply to this topic. Login to reply