Complex XML Source Import

  • Hi All,

    I've an interesting challenge, I have just received a new set of XML files which we need to shred and import into our data warehouse. The files are anywhere between 100mb and 2GB and have been set up to use 3x XSD files. The files can have 20-30 node levels and there are multiple namespaces referred to in each.

    My intention is to try and flatten the data in a number of the nodes and import them into their relevant downstream tables, however I'm running into issues trying to get any data out of the files. The issue appears to relate to the multiple name spaces and having multiple xsd files. Using the XML source I can only refer to one XSD, I understand needs to be the top level XSD file.

    My questions are as follows:

    1- How can I use SSIS to Import an XML file with 3x XSD file references?

    2- If SSIS can't do this, what other options are there?

    I appreciate the help, while I've imported many xml files in the past, I haven't encountered this issue before

    Regards,

    Steve

  • Quick thought, use XQuery in SQL Server and wildcard name spaces, ie. *:element

    😎

  • I was thinking of that, however I'd be concerned with performance as I'd have to process in 2GB files, would that not kill the server?

  • Not if done correctly, done many times larger files this way.

    😎

Viewing 4 posts - 1 through 3 (of 3 total)

You must be logged in to reply to this topic. Login to reply