|
|
Searching inside PDFs
belongs to nGenera Product Management ![]() by Alan Savoy on Aug 15, 2008 - 12:41 PM read 181 times |
Hey Vaughn,
I found your learning connection call enlightening.
At one point it sounded like your team is rewriting your previous research work as wikis so that they are searchable, since they are in PDF and other formats. While I can understand converting research to wiki so that it can be kept current, I wanted to let you know that there is technology out there that can allow search engines to see inside documents, and even the properties of the document.
I am only very familiar with SharePoint, and that uses the iFilter system to parse the inside of documents for its search engine. I know there are a number of 3rd party search engines out there that probably have this capability as well; I do not know if the Apps team is considering this.
It may be a good idea to bring in this search technology to help bridge the gap for your team and save some time. It would also be a great addition to the hub functionality.
-
re: Searching inside PDFs
a reply to Searching inside PDFs
by Vaughan Merlyn on Aug 16, 2008 - 05:32 AM read 78 timesThanks for your interest and for this information. I said we are planning to move some of the libary of research reports to a Wiki to make them searchable. As you point out, if the only purpose is to make them searchable, there are other (better!) ways to do that.
However, we have found that it is not just the searchability that makes Wiki's attractive - it is the whole concept of hyperlinking and truly collaborating in the use of a document (for example) - make it one's own, adapting and evolving it, even embedding things like videos and animations. We think that's an important part of how groups such as IT organizations will leverage Web 2.0 and our intellectual capital. The act of creating the Wiki's from Word or PDF files is relative mechanical - the value once they are there can be significantly enhanced over their flat, static source documents.




