January 23, 2007 at 9:24 am
My client would like to run full-text searches against a bunch of PDFs that are stored in the database as images. When using SQL Server's full-text indexing, I suppose I cannot query that column for words since it is binary. Is my assumption correct? If so, how would I go about doing this?
January 23, 2007 at 3:28 pm
Take a look at the following blog post; it may help you...
Full Text Search Adobe PDF files stored in SQL Server...
http://jtkane.spaces.live.com/Blog/cns!1pWDBCiDX1uvH5ATJmNCVLPQ!537.entry
MohammedU
Microsoft SQL Server MVP
January 24, 2007 at 12:11 pm
Wonderful article. Thank you.
January 29, 2007 at 5:50 am
Normally, to search document types that are not supported out of the box, you obtain an iFilter from the vendor. Adobe does have an iFilter that allows the full-text indexer to crawl these files.
I just searched Adobe's site by putting in PDF iFilter and this seems to be the most recent version of their filter.
http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611
There are other documents and instructions on their site. The MS indexer that is part of the OS also benefits from this iFilter.
Good luck. It is pretty easy to implement.
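For anyone finding this thread later, here is a rough sketch of how the pieces usually fit together on SQL Server 2005 once the PDF iFilter is installed on the server. Treat it as an outline under assumptions rather than a tested script: the table, column, key, and catalog names (dbo.Documents, DocId, FileExtension, Content, PK_Documents, DocCatalog) are made up for illustration, and the search term is just an example. The key idea is that a binary column can be full-text indexed as long as a second "type column" holds the file extension, so the indexer knows which filter to use.

```sql
-- Hypothetical schema: every object name below is an illustrative assumption.
CREATE TABLE dbo.Documents (
    DocId         INT IDENTITY(1,1) NOT NULL,
    FileExtension VARCHAR(10) NOT NULL,  -- e.g. '.pdf'; tells the indexer which iFilter applies
    Content       IMAGE NOT NULL,        -- the raw PDF bytes
    CONSTRAINT PK_Documents PRIMARY KEY (DocId)
);

-- Allow SQL Server to load filters registered with the operating system.
EXEC sp_fulltext_service 'load_os_resources', 1;
-- Assumption: the installed filter is not signed, so signature checks are relaxed.
EXEC sp_fulltext_service 'verify_signature', 0;

CREATE FULLTEXT CATALOG DocCatalog;

-- TYPE COLUMN links the binary column to the column holding the extension.
CREATE FULLTEXT INDEX ON dbo.Documents (Content TYPE COLUMN FileExtension)
    KEY INDEX PK_Documents
    ON DocCatalog;

-- Once the index has been populated, the binary column can be searched for words.
SELECT DocId
FROM dbo.Documents
WHERE CONTAINS(Content, '"invoice"');
```

You may need to restart the full-text service after changing the service settings before the new filter is picked up, and rows only become searchable after the index has been populated.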