How can I compare two HL7-like text files?

  • Where I work, we receive HL7-like text files each weekday from an external source. One of my users believes that the data may be duplicated, so he's asked me to check the last few days' HL7-like text files for duplicates.

    What techniques or apps can I use to check for duplicates between two or more HL7-like files?


    I call these text files "HL7-like" because they leave off a couple of standard segments from the beginning of each record. I don't know why; nobody tells me anything when I ask, so I'm left to speculate. My guess is that some people at my work and at the agency that supplies us with the data via secure FTP decided they could shorten each record by cutting the segments that would always be the same. Of course, I could be wrong. That's just my best guess.

    Kindest Regards, Rod

  • Hi Rod, cheers. I'm not sure about professional tools, but if you're up for rolling your own, you could try .NET and System.IO:

    https://docs.microsoft.com/en-us/troubleshoot/dotnet/csharp/create-file-compare

    https://stackoverflow.com/questions/1358510/how-to-compare-2-files-fast-using-net
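
    If a script is easier than a .NET project, the same whole-file check fits in a few lines. This is a minimal sketch in Python (swapped in here for .NET), assuming the goal is only to spot files that are exact copies of each other. The function names `file_digest`, `files_identical`, and `group_identical_files` are made up for illustration.

```python
import filecmp
import hashlib
from pathlib import Path


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_identical(path_a, path_b):
    """Return True when both files contain exactly the same bytes."""
    # shallow=False forces a content comparison instead of trusting os.stat() data.
    return filecmp.cmp(path_a, path_b, shallow=False)


def group_identical_files(paths):
    """Group file names by content digest; keep only groups with 2+ files."""
    groups = {}
    for path in paths:
        groups.setdefault(file_digest(path), []).append(Path(path).name)
    return [names for names in groups.values() if len(names) > 1]
```

    A check like this only flags files that match byte for byte. A single changed character, line ending, or reordered record makes two files look different, so it answers "was the same file sent twice?" rather than "was the same record sent twice?".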

     

    No one shall drive us out of the paradise that Cantor has created for us

  • Take a look at Beyond Compare from Scooter Software. That said, comparing HL7 messages isn't going to be easy. The same message could easily show up as a duplicate in a different format, or in a different position in the file.
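
    One way around the format and position problem is to compare records rather than whole files. The sketch below is in Python and rests on assumptions that need checking against the real files: records are taken to be separated by blank lines, fields by `|`, and whitespace or trailing empty fields are treated as formatting noise. `split_records`, `record_key`, and `find_duplicates` are hypothetical helper names, not part of any HL7 library.

```python
import hashlib
from collections import defaultdict


def split_records(text):
    """Split file text into records, each a list of segment lines.

    ASSUMPTION: records are separated by one or more blank lines.
    Adapt this to however the real files delimit records.
    """
    normalized = text.replace("\r\n", "\n").replace("\r", "\n")
    records, current = [], []
    for line in normalized.split("\n"):
        if line.strip():
            current.append(line)
        elif current:
            records.append(current)
            current = []
    if current:
        records.append(current)
    return records


def record_key(segments):
    """Hash a record after light normalization so formatting noise is ignored."""
    cleaned = []
    for segment in segments:
        # ASSUMPTION: surrounding whitespace and trailing empty fields
        # ("PID|1||" vs "PID|1") do not change the meaning of a segment.
        cleaned.append(segment.strip().rstrip("|"))
    return hashlib.sha256("\n".join(cleaned).encode("utf-8")).hexdigest()


def find_duplicates(files):
    """Return {record key: [(file name, record index), ...]} for repeated records.

    `files` maps a file name to that file's full text.
    """
    seen = defaultdict(list)
    for name, text in files.items():
        for index, segments in enumerate(split_records(text)):
            seen[record_key(segments)].append((name, index))
    return {key: places for key, places in seen.items() if len(places) > 1}
```

    Because every record is reduced to a hash of its normalized text, a record repeated at a different position or in a different day's file still lands under the same key. Records that differ in a timestamp or message ID would not match; handling those would mean dropping the volatile fields inside `record_key` before hashing.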

    Jeffrey Williams
    “We are all faced with a series of great opportunities brilliantly disguised as impossible situations.”

    ― Charles R. Swindoll

