Members: Join   Log In
Conv Alan Savoy
Rank_member
Searching inside PDFs Rt-50-16
belongs to nGenera Product Management  
by Alan Savoy on Aug 15, 2008 - 12:41 PM read 181 times
 

Hey Vaughn, 

I found your learning connection call enlightening. 

At one point it sounded like your team is rewriting your previous research work as wikis so that they are searchable, since they are in PDF and other formats.  While I can understand converting research to wiki so that it can be kept current, I wanted to let you know that there is technology out there that can allow search engines to see inside documents, and even the properties of the document.

I am only very familiar with SharePoint, and that uses the iFilter system to parse the inside of documents for its search engine.  I know there are a number of 3rd party search engines out there that probably have this capability as well; I do not know if the Apps team is considering this.

It may be a good idea to bring in this search technology to help bridge the gap for your team and save some time.  It would also be a great addition to the hub functionality.

  • Conv Vaughan Merlyn
    Rank_docent
    re: Searching inside PDFs Rt-50-16
    Icon-thread a reply to Searching inside PDFs
    by Vaughan Merlyn on Aug 16, 2008 - 05:32 AM read 78 times
     

    Thanks for your interest and for this information.  I said we are planning to move some of the libary of research reports to a Wiki to make them searchable.  As you point out, if the only purpose is to make them searchable, there are other (better!) ways to do that.

    However, we have found that it is not just the searchability that makes Wiki's attractive - it is the whole concept of hyperlinking and truly collaborating in the use of a document (for example) - make it one's own, adapting and evolving it, even embedding things like videos and animations.  We think that's an important part of how groups such as IT organizations will leverage Web 2.0 and our intellectual capital.  The act of creating the Wiki's from Word or PDF files is relative mechanical - the value once they are there can be significantly enhanced over their flat, static source documents.

Featured

nGen Collaboration v7.1
Project CBS
Project LIM
Project ITM
Wiki Archive
Concours Archive

Author Profile

Alan Savoy

Member Rank_member

Subscribe

Feed for nGenera Community:
Feed_small Public Secure_feed_16 Secure

Why subscribe? What is RSS?