Possible RAM issue ? Or ???

  • I have an SSIS package that is trying to process a 5.4 GB flat file composed of variable-length records, each of which has a record type that is used for a conditional split. I currently have it set up to sort two of the record types so that I can merge join them during the import, which saves a step later in query processing and reduces stored data volume. On smaller files this has worked fine, but in processing this behemoth it created almost 10 GB of temp files (causing a number of disk space issues, which I've been able to get past), and once it finally had enough disk space for the 147 files it created, it failed as follows:

    Error: 0xC0047022 at Data Flow Task, SSIS.Pipeline: SSIS Error Code DTS_E_PROCESSINPUTFAILED. The ProcessInput method on component "Sort TYPE 75 by JOURNAL ID" (384) failed with error code 0x8007000E while processing input "Sort Input" (385). The identified component returned an error from the ProcessInput method. The error is specific to the component, but the error is fatal and will cause the Data Flow task to stop running. There may be error messages posted before this with more information about the failure.

    Error: 0xC0047022 at Data Flow Task, SSIS.Pipeline: SSIS Error Code DTS_E_PROCESSINPUTFAILED. The ProcessInput method on component "Sort TYPE 70 by JOURNAL ID" (437) failed with error code 0xC0047020 while processing input "Sort Input" (438). The identified component returned an error from the ProcessInput method. The error is specific to the component, but the error is fatal and will cause the Data Flow task to stop running. There may be error messages posted before this with more information about the failure.

    The two Sort components referred to feed a Merge Join, but the failure happens before the package starts to work on that step. The two Sort components are green, however, and I don't know why. The total number of rows going to each of these sorts is just over 1.2 million, based on running this package manually.

    I haven't been able to find anything useful on the web, but perhaps I just haven't searched for the right thing yet. The server I'm running the package on only has 12 GB of RAM, so that might be an issue. I did direct BufferTempStoragePath to the data drive (SAN-based, 4 GB/sec SAN fabric). There are just over 4.5 million rows in the flat file.

    Does anyone have any guidance on this one?

    P.S. The final commit for a number of the other outputs of the conditional split occurred before this failure, and yet no component turned red...

    Steve (aka sgmunson) 🙂 🙂 🙂
    Rent Servers for Income (picks and shovels strategy)

  • If you can find any way that you can re-engineer this process so that you do not do any sorting or grouping within SSIS and instead use the SQL Server engine, your problems will likely go away.

    The absence of evidence is not evidence of absence
    - Martin Rees
    The absence of consumable DDL, sample data and desired results is, however, evidence of the absence of my response
    - Phil Parkin

  • I can certainly re-engineer this, but I was hoping for some clear indication as to exactly what this error really means before I go off and do something different that will likely cause other kinds of performance issues.

    Steve (aka sgmunson) 🙂 🙂 🙂
    Rent Servers for Income (picks and shovels strategy)
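
    On the question of what the error codes mean: the inner codes in the two messages above are standard 32-bit HRESULT values, and they can be split into their bit fields by hand. The sketch below is only an illustration (the function names and the two small lookup tables are made up for this example and are not part of SSIS). It shows that 0x8007000E carries facility 7 (FACILITY_WIN32) and code 14 (ERROR_OUTOFMEMORY), that is, the usual E_OUTOFMEMORY value, which is consistent with the RAM suspicion for the first Sort. The second code, 0xC0047020, carries facility 4 (FACILITY_ITF), so its meaning is defined by the component that raised it rather than by Windows, and the sketch does not try to name it.

```python
# Hypothetical helper for illustration only; not part of SSIS or any library.
FACILITY_NAMES = {4: "FACILITY_ITF", 7: "FACILITY_WIN32"}
WIN32_ERROR_NAMES = {14: "ERROR_OUTOFMEMORY"}


def decode_hresult(hresult: int) -> dict:
    """Split a 32-bit HRESULT into severity, facility and code fields."""
    hresult &= 0xFFFFFFFF  # also accepts the signed (negative) decimal form
    return {
        "failure": bool(hresult >> 31),       # top bit set means failure
        "facility": (hresult >> 16) & 0x7FF,  # 11-bit facility field
        "code": hresult & 0xFFFF,             # low 16 bits
    }


def describe(hresult: int) -> str:
    """Render the decoded fields as a short human-readable string."""
    fields = decode_hresult(hresult)
    facility = FACILITY_NAMES.get(fields["facility"], "facility %d" % fields["facility"])
    text = "%s, code %d" % (facility, fields["code"])
    # Only Win32-facility codes can be looked up in the Win32 error table.
    if fields["facility"] == 7 and fields["code"] in WIN32_ERROR_NAMES:
        text += " (%s)" % WIN32_ERROR_NAMES[fields["code"]]
    return text


if __name__ == "__main__":
    for value in (0x8007000E, 0xC0047020):
        print("0x%08X: %s" % (value, describe(value)))
```

    Decoding the value only says that the Sort component reported an out-of-memory condition; it does not say which allocation failed or how much memory the sort would have needed.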
