Indexing PDFs – Can Someone Make it Easier Please?
Posted by workerthread on October 14, 2008
The two posts on this blog with the greatest number of hits are the one on configuring a PDF iFilter with WSS 3.0, and using Adobe Reader 9 with SharePoint. Almost every SharePoint implementation I’ve been involved in has required setting up a PDF iFilter and I would say that after standard Office documents (mostly Word but some PowerPoint and Excel), PDFs are the file type most commonly uploaded to SharePoint document libraries.
So please, could someone somewhere make it easier for SharePoint Admins to set up their servers for crawling and indexing PDF documents! I really would like to see the day when I don’t have to mess with registry settings and XML files to get this to work!
Sadalit Van Buren has a post on her wishlist for the next version of SharePoint. As she says, “forget the relationship with Adobe already, so that the Acrobat Filter is out of the box!”. I also spotted a post on the Res Cogitans blog with a speculative SharePoint v14 Feature List which also mentions PDF support as a “Probably” – I really, really hope so.
If this does happen, I would really like to get the metadata captured as well, in the same way as Office documents. PDF document properties generally look like this:
So of course I would like to get them automatically mapped to document library columns on upload. Bamboo Solutions are moving towards a solution with their pre-release PDF Document Parser, so maybe as this progresses at least one of my wishes will come true…
6 Responses to “Indexing PDFs – Can Someone Make it Easier Please?”
Sorry, the comment form is closed at this time.