I want to be able to add/delete documents, (mainly PDF’s) allow my users to add documents in a members based service. What is the best way to index these documents as they are uploaded/donated/or already exist in certain folders? Basically I am looking for a program of some sort that allows me to index them into a SQL DB or XML file where I can then write code to allow searching of the indexed information and pull up the correct documents. The basic need is what is the best tool to use to index the content so I have a db to pull from in search queries based on keywords so I can show the most relevant documents to their search text. Is there a program or plug in of somekind that already indexes PDF’s etc and places them into a db or xml format so I can use this info to create my own custom search script/results layout so they can open these docs if they find what they need?
Indexing = file name, file extension, and keywords within the document. I know MS can index the document names which is fine, but if I have a PDF called meridian.pdf, but it is actually for particular part numbers or equipment names or something, I need to know that as well. So I was looking for something that also indexed content within the documents not just the file names.