PDA

View Full Version : 【Question】 SQL Server 2005 - Indexing PDF documents inside ZIP archives


dev_vb
January 27th, 2009, 02:33 AM
I'm trying to indenxing in SQL server 2005 , some HTML documents and Pdf documents. Each File is compressed in ZIP format and stored in a Image Field of the SQL.
I've used Citeknet ZIP IFilter 2.1 to index the Zip files.
The result is that , the HTML document (in ZIP) are correctly indexed and searched, while pdf document (in ZIP) are not indexed.

Test1.HTML -> Test1.Zip -> Stored in Image Field -> Index / Search OK
Test2.PDF -> Test2.Zip -> Stored in Image Field -> Index / Search NO

Foxit PDF IFilter works fine when the Image Field (in SQL) contains PDF file

Test2.PDF -> Stored in Image Field -> Index and Search OK

I need to search PDF documents compressed in Zip Files and stored in SQL Server 2005 , how can i do?

emily
February 3rd, 2009, 12:10 AM
Hello,
Thank you for your feedback.
As this issue, it caused by Citeknet Zip ifiler did not call Foxit PDF IFilter to extract the content from the PDF files during indexing. Please contact Citeknet company.
“Test2.PDF -> Stored in Image Field -> Index and Search OK “shows Foxit PDF IFilter is working well.
Thank you for your understanding.
Emily