How to convert PDF data to a database table in SQL Server

  • If I use PowerQuery inside my data flow, I can read just the table and tell PowerQuery to ignore the rest. Then it's stupid easy. No need to use Excel at all.

  • Emperor100 wrote:

    First convert the PDF to Excel format. Remove any unwanted lines, rows, or characters. Verify the data is correctly formatted, then save it as a .csv file. Use the Import/Export Wizard to import the data into a SQL Server table.

    I'll try that and see what happens.  The challenge will be getting the 776 pages done in my lifetime!

    Rick
    Disaster Recovery = Backup ( Backup ( Your Backup ) )
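
With 776 pages, the "remove any unwanted lines" step is the part worth scripting. Below is a minimal sketch in Python, assuming the PDF has already been exported to CSV text and that the table has a known, fixed number of columns. The function names `clean_rows` and `clean_csv` are made up for illustration. It keeps the first header row, drops blank lines, drops the header repeated on later pages, and drops any line whose column count does not match.

```python
import csv
import io


def clean_rows(rows, expected_columns):
    """Yield the header once, then only well-formed data rows."""
    header = None
    for row in rows:
        cells = [(cell or "").strip() for cell in row]
        if not any(cells):
            continue  # blank line
        if len(cells) != expected_columns:
            continue  # page title, footer, or other stray text
        if header is None:
            header = cells  # first well-formed row is treated as the header
            yield cells
        elif cells != header:
            yield cells  # skips the header repeated at the top of later pages


def clean_csv(text, expected_columns):
    """Return cleaned CSV text built from raw exported CSV text."""
    reader = csv.reader(io.StringIO(text))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerows(clean_rows(reader, expected_columns))
    return out.getvalue()
```

Filtering on column count is blunt: a real data row that the export split incorrectly would be dropped too, so it is worth comparing row counts before and after against a few pages checked by hand.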

  • If your data is in tables in the PDF file, you can tell PowerQuery to return just the table. Is this a publicly available PDF file? (likely not)... I wanted to see if I could get PowerQuery to read it and then maybe dump that into a table.

    (I have sooo wished for a PowerQuery transform in SSIS since forever, so I could read from a PDF or whatever and then send the data anywhere I want...) Oh right, PYTHON.
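
For the Python route, here is a rough sketch of what "read the tables from the PDF and dump them into a table" could look like. It makes several assumptions: the third-party `pdfplumber` library is one possible choice for table extraction (others exist), the function names `extract_table_rows` and `load_rows` and the table and column names passed in are hypothetical, and the target table already exists. The loader accepts any DB-API connection that uses `?` placeholders, which is the style `pyodbc` uses for SQL Server.

```python
def extract_table_rows(pdf_path):
    """Return every table row found in the PDF as a list of cell lists."""
    # Third-party dependency (assumed choice); imported lazily so that
    # load_rows below can be used and tested without it installed.
    import pdfplumber

    rows = []
    with pdfplumber.open(pdf_path) as pdf:
        for page in pdf.pages:
            for table in page.extract_tables():
                rows.extend(table)
    return rows


def load_rows(conn, table, columns, rows):
    """Insert rows through a DB-API connection that uses '?' placeholders.

    Rows whose length does not match the column list are skipped.
    Returns the number of rows sent to the database.
    """
    # Table and column names are interpolated, so they must come from
    # trusted code, never from user input.
    column_list = ", ".join(f"[{name}]" for name in columns)
    placeholders = ", ".join("?" for _ in columns)
    sql = f"INSERT INTO [{table}] ({column_list}) VALUES ({placeholders})"
    data = [tuple(row) for row in rows if len(row) == len(columns)]
    cur = conn.cursor()
    cur.executemany(sql, data)
    conn.commit()
    return len(data)
```

In use, something like `load_rows(conn, "staging", ["id", "name"], extract_table_rows("report.pdf"))` would do the whole trip, where `conn` comes from `pyodbc.connect(...)` with your own connection string. Loading everything as text into a staging table first and casting to proper types in T-SQL afterwards tends to be more forgiving than typing the data on the way in.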
