Help me with using Table values CLR function in OLE DB command in data flow task

  • HI All, 

     I am struggling to use table value CLR function in OLE DB command in data flow task.

    I am trying to give the input from OLEDB source , each row input to a CLR table valued Function which in tern will give more rows like table ,which i wanted to insert in a table via OLE DB destination. 

    I am unable to do it in OLE DB command. 

    I tried like below in OLE DB Command.

    select col1, col2,col3
    from dbo.clrfunction(?,?,?,?)

    but col1, col2 , col3 are not coming in available output. 

    Have any one faced these type of scenario ? Please help me> thanks in advance

  • JoNTSQLSrv - Tuesday, November 27, 2018 12:15 AM

    HI All, 

     I am struggling to use table value CLR function in OLE DB command in data flow task.

    I am trying to give the input from OLEDB source , each row input to a CLR table valued Function which in tern will give more rows like table ,which i wanted to insert in a table via OLE DB destination. 

    I am unable to do it in OLE DB command. 

    I tried like below in OLE DB Command.

    select col1, col2,col3
    from dbo.clrfunction(?,?,?,?)

    but col1, col2 , col3 are not coming in available output. 

    Have any one faced these type of scenario ? Please help me> thanks in advance

    Why are you using OLEDB Command here, and not just OLEDB source and OLEDB destination?

    The absence of evidence is not evidence of absence.
    Martin Rees

    You can lead a horse to water, but a pencil must be lead.
    Stan Laurel

  • Phil Parkin - Tuesday, November 27, 2018 5:23 AM

    JoNTSQLSrv - Tuesday, November 27, 2018 12:15 AM

    HI All, 

     I am struggling to use table value CLR function in OLE DB command in data flow task.

    I am trying to give the input from OLEDB source , each row input to a CLR table valued Function which in tern will give more rows like table ,which i wanted to insert in a table via OLE DB destination. 

    I am unable to do it in OLE DB command. 

    I tried like below in OLE DB Command.

    select col1, col2,col3
    from dbo.clrfunction(?,?,?,?)

    but col1, col2 , col3 are not coming in available output. 

    Have any one faced these type of scenario ? Please help me> thanks in advance

    Why are you using OLEDB Command here, and not just OLEDB source and OLEDB destination?

    Good question.
    Thanks for your reply.

    I am trying to use Balance data distributor, i.e., from OLEDB source to BDD to split  million of records into 4 parts  for parallel processing. 

    so after BDD i need to send the input to CLR function which inturn will give me more records for 1 row.

     So i am searching a SQL processing task in Data flow but  only OLEDB can handle SQL so i am trying to do with that. 

    Can you please tell me if i am doing right ? or is there any way ? 

    I am trying with script component but i feel like it  will be performance killer.

    Please help me out

  • JoNTSQLSrv - Tuesday, November 27, 2018 6:19 AM

    Phil Parkin - Tuesday, November 27, 2018 5:23 AM

    JoNTSQLSrv - Tuesday, November 27, 2018 12:15 AM

    HI All, 

     I am struggling to use table value CLR function in OLE DB command in data flow task.

    I am trying to give the input from OLEDB source , each row input to a CLR table valued Function which in tern will give more rows like table ,which i wanted to insert in a table via OLE DB destination. 

    I am unable to do it in OLE DB command. 

    I tried like below in OLE DB Command.

    select col1, col2,col3
    from dbo.clrfunction(?,?,?,?)

    but col1, col2 , col3 are not coming in available output. 

    Have any one faced these type of scenario ? Please help me> thanks in advance

    Why are you using OLEDB Command here, and not just OLEDB source and OLEDB destination?

    Good question.
    Thanks for your reply.

    I am trying to use Balance data distributor, i.e., from OLEDB source to BDD to split  million of records into 4 parts  for parallel processing. 

    so after BDD i need to send the input to CLR function which inturn will give me more records for 1 row.

     So i am searching a SQL processing task in Data flow but  only OLEDB can handle SQL so i am trying to do with that. 

    Can you please tell me if i am doing right ? or is there any way ? 

    I am trying with script component but i feel like it  will be performance killer.

    Please help me out

    The architecture of your solution is inherently slow, unfortunately – if I am understanding things right.
    You have millions of source rows, which you'd like to split and process in parallel ... good start.
    But, for each of those rows, you wish to call a CLR which itself generates multiple rows of data per input row. This is your pain point. Can this part be re-architected to be set-based, rather than RBAR?

    The absence of evidence is not evidence of absence.
    Martin Rees

    You can lead a horse to water, but a pencil must be lead.
    Stan Laurel

  • Phil Parkin - Tuesday, November 27, 2018 6:36 AM

    JoNTSQLSrv - Tuesday, November 27, 2018 6:19 AM

    Phil Parkin - Tuesday, November 27, 2018 5:23 AM

    JoNTSQLSrv - Tuesday, November 27, 2018 12:15 AM

    HI All, 

     I am struggling to use table value CLR function in OLE DB command in data flow task.

    I am trying to give the input from OLEDB source , each row input to a CLR table valued Function which in tern will give more rows like table ,which i wanted to insert in a table via OLE DB destination. 

    I am unable to do it in OLE DB command. 

    I tried like below in OLE DB Command.

    select col1, col2,col3
    from dbo.clrfunction(?,?,?,?)

    but col1, col2 , col3 are not coming in available output. 

    Have any one faced these type of scenario ? Please help me> thanks in advance

    Why are you using OLEDB Command here, and not just OLEDB source and OLEDB destination?

    Good question.
    Thanks for your reply.

    I am trying to use Balance data distributor, i.e., from OLEDB source to BDD to split  million of records into 4 parts  for parallel processing. 

    so after BDD i need to send the input to CLR function which inturn will give me more records for 1 row.

     So i am searching a SQL processing task in Data flow but  only OLEDB can handle SQL so i am trying to do with that. 

    Can you please tell me if i am doing right ? or is there any way ? 

    I am trying with script component but i feel like it  will be performance killer.

    Please help me out

    The architecture of your solution is inherently slow, unfortunately – if I am understanding things right.
    You have millions of source rows, which you'd like to split and process in parallel ... good start.
    But, for each of those rows, you wish to call a CLR which itself generates multiple rows of data per input row. This is your pain point. Can this part be re-architected to be set-based, rather than RBAR?

    Thats amazing. Thanks for your set-based idea. Even i was thinking the same. I know RBAR will be performance killer , any how i just done for testing. 

    Can you please suggest me or give idea on how to do set based using CLR function(because Table values function itself is RBAR)  in data flow task  ? 
    I know we can try from control flow by using execute sql task. Is this what you meant?

  • JoNTSQLSrv - Tuesday, November 27, 2018 10:47 AM

    Phil Parkin - Tuesday, November 27, 2018 6:36 AM

    The architecture of your solution is inherently slow, unfortunately – if I am understanding things right.
    You have millions of source rows, which you'd like to split and process in parallel ... good start.
    But, for each of those rows, you wish to call a CLR which itself generates multiple rows of data per input row. This is your pain point. Can this part be re-architected to be set-based, rather than RBAR?

    Thats amazing. Thanks for your set-based idea. Even i was thinking the same. I know RBAR will be performance killer , any how i just done for testing. 

    Can you please suggest me or give idea on how to do set based using CLR function(because Table values function itself is RBAR)  in data flow task  ? 
    I know we can try from control flow by using execute sql task. Is this what you meant?

    Can you provide an overview of what the CLR function does? I think it would be useful background.

    Can the CLR be changed to provide output for a range of input rows, instead of only one? (Eg, WHERE STATUS = 'A', or , WHERE DATE >='20180101' AND DATE < '20190101')

    If the CLR could be changed (or a new CLR created) in this manner, you should be able to use it in your OLEDB source.

    The absence of evidence is not evidence of absence.
    Martin Rees

    You can lead a horse to water, but a pencil must be lead.
    Stan Laurel

  • Phil Parkin - Tuesday, November 27, 2018 11:06 AM

    JoNTSQLSrv - Tuesday, November 27, 2018 10:47 AM

    Phil Parkin - Tuesday, November 27, 2018 6:36 AM

    The architecture of your solution is inherently slow, unfortunately – if I am understanding things right.
    You have millions of source rows, which you'd like to split and process in parallel ... good start.
    But, for each of those rows, you wish to call a CLR which itself generates multiple rows of data per input row. This is your pain point. Can this part be re-architected to be set-based, rather than RBAR?

    Thats amazing. Thanks for your set-based idea. Even i was thinking the same. I know RBAR will be performance killer , any how i just done for testing. 

    Can you please suggest me or give idea on how to do set based using CLR function(because Table values function itself is RBAR)  in data flow task  ? 
    I know we can try from control flow by using execute sql task. Is this what you meant?

    Can you provide an overview of what the CLR function does? I think it would be useful background.

    Can the CLR be changed to provide output for a range of input rows, instead of only one? (Eg, WHERE STATUS = 'A', or , WHERE DATE >='20180101' AND DATE < '20190101')

    If the CLR could be changed (or a new CLR created) in this manner, you should be able to use it in your OLEDB source.

    Thanks  Phil Parkin 

    CLR will take 1 row as input and calculate some business logic(using c#) and give us 6 rows or sometimes more depending upon category. 

    In Sp we usually call CLR like below,


    Insert into output_table(col1,col2,col3)
    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180101' AND DATE < '20190101'

    so I am trying to do the above  in SSIS so when i try this inside data flow task i ended up BDD and  ole db command so i was struggling. 

    So my understanding on your idea is , splitting the million of input rows into 3 batch(3 oledb source) and inside each source changing where clause and applying CLR directly in OLEDB source itself right ? 

    like below 

    1'st oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180101' AND DATE < '20180501'

    2nd oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180601' AND DATE < '20180901'

    3rd oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20181001' AND DATE < '20190101'

    so finally , into oledb destination using union all task right ? 

    Please correct me if i am wrong

  • JoNTSQLSrv - Tuesday, November 27, 2018 11:13 PM

    Phil Parkin - Tuesday, November 27, 2018 11:06 AM

    JoNTSQLSrv - Tuesday, November 27, 2018 10:47 AM

    Phil Parkin - Tuesday, November 27, 2018 6:36 AM

    The architecture of your solution is inherently slow, unfortunately – if I am understanding things right.
    You have millions of source rows, which you'd like to split and process in parallel ... good start.
    But, for each of those rows, you wish to call a CLR which itself generates multiple rows of data per input row. This is your pain point. Can this part be re-architected to be set-based, rather than RBAR?

    Thats amazing. Thanks for your set-based idea. Even i was thinking the same. I know RBAR will be performance killer , any how i just done for testing. 

    Can you please suggest me or give idea on how to do set based using CLR function(because Table values function itself is RBAR)  in data flow task  ? 
    I know we can try from control flow by using execute sql task. Is this what you meant?

    Can you provide an overview of what the CLR function does? I think it would be useful background.

    Can the CLR be changed to provide output for a range of input rows, instead of only one? (Eg, WHERE STATUS = 'A', or , WHERE DATE >='20180101' AND DATE < '20190101')

    If the CLR could be changed (or a new CLR created) in this manner, you should be able to use it in your OLEDB source.

    Thanks  Phil Parkin 

    CLR will take 1 row as input and calculate some business logic(using c#) and give us 6 rows or sometimes more depending upon category. 

    In Sp we usually call CLR like below,


    Insert into output_table(col1,col2,col3)
    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180101' AND DATE < '20190101'

    so I am trying to do the above  in SSIS so when i try this inside data flow task i ended up BDD and  ole db command so i was struggling. 

    So my understanding on your idea is , splitting the million of input rows into 3 batch(3 oledb source) and inside each source changing where clause and applying CLR directly in OLEDB source itself right ? 

    like below 

    1'st oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180101' AND DATE < '20180501'

    2nd oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180601' AND DATE < '20180901'

    3rd oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20181001' AND DATE < '20190101'

    so finally , into oledb destination using union all task right ? 

    Please correct me if i am wrong

    This sounds good to me
    Ensure that an appropriate index is in place on your source table to keep the initial batched SELECT running well.

    The absence of evidence is not evidence of absence.
    Martin Rees

    You can lead a horse to water, but a pencil must be lead.
    Stan Laurel

  • Phil Parkin - Wednesday, November 28, 2018 5:26 AM

    JoNTSQLSrv - Tuesday, November 27, 2018 11:13 PM

    Phil Parkin - Tuesday, November 27, 2018 11:06 AM

    JoNTSQLSrv - Tuesday, November 27, 2018 10:47 AM

    Phil Parkin - Tuesday, November 27, 2018 6:36 AM

    The architecture of your solution is inherently slow, unfortunately – if I am understanding things right.
    You have millions of source rows, which you'd like to split and process in parallel ... good start.
    But, for each of those rows, you wish to call a CLR which itself generates multiple rows of data per input row. This is your pain point. Can this part be re-architected to be set-based, rather than RBAR?

    Thats amazing. Thanks for your set-based idea. Even i was thinking the same. I know RBAR will be performance killer , any how i just done for testing. 

    Can you please suggest me or give idea on how to do set based using CLR function(because Table values function itself is RBAR)  in data flow task  ? 
    I know we can try from control flow by using execute sql task. Is this what you meant?

    Can you provide an overview of what the CLR function does? I think it would be useful background.

    Can the CLR be changed to provide output for a range of input rows, instead of only one? (Eg, WHERE STATUS = 'A', or , WHERE DATE >='20180101' AND DATE < '20190101')

    If the CLR could be changed (or a new CLR created) in this manner, you should be able to use it in your OLEDB source.

    Thanks  Phil Parkin 

    CLR will take 1 row as input and calculate some business logic(using c#) and give us 6 rows or sometimes more depending upon category. 

    In Sp we usually call CLR like below,


    Insert into output_table(col1,col2,col3)
    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180101' AND DATE < '20190101'

    so I am trying to do the above  in SSIS so when i try this inside data flow task i ended up BDD and  ole db command so i was struggling. 

    So my understanding on your idea is , splitting the million of input rows into 3 batch(3 oledb source) and inside each source changing where clause and applying CLR directly in OLEDB source itself right ? 

    like below 

    1'st oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180101' AND DATE < '20180501'

    2nd oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20180601' AND DATE < '20180901'

    3rd oledb source

    Select b.col1,b.col2,b.col3
    from Input_table a 
    cross apply dbo.fn_clr(a.col1,a.col2,a.col3) b
    where a.col1  >='20181001' AND DATE < '20190101'

    so finally , into oledb destination using union all task right ? 

    Please correct me if i am wrong

    This sounds good to me
    Ensure that an appropriate index is in place on your source table to keep the initial batched SELECT running well.

    Thanks a lot Phil Parkin 🙂 yes i will check index as well

Viewing 9 posts - 1 through 8 (of 8 total)

You must be logged in to reply to this topic. Login to reply