Here is my problem. My office has a large selection of files, and they want to have a nice interface where they can search keywords through the files. The simple solution to this would be to find a program that indexes the files and searches through them. But there is a problem with doing that. Some of the files are saved as PDFs and cannot be searched. So we ordered a program that converts the PDFs to word files. I've made the conversions, but the DOC files are not even close to carbon copies of the PDFs, but the words for the most part are there.
My idea, which leads to the question of whether or not this can be done, is to possibly put all of the files into an access database as OLE objects. This way, when I get results, with some simple programming I could display a link to the PDF file instead of the DOC file where the words were found. But I'm not sure if searching through objects in the database is possible. And, even if it is, would it be slow? And also if that is the case, can it be indexed to make it faster, seeing that I'm estimating it would be about 100 to 200 MB of documents?
My gut feeling is that you cannot search through the objects in the database, but I figured I'd give it a try. But if it can be done, can anyone offer some suggestions on exactly how?
Thanks for any help,
Tim
My idea, which leads to the question of whether or not this can be done, is to possibly put all of the files into an access database as OLE objects. This way, when I get results, with some simple programming I could display a link to the PDF file instead of the DOC file where the words were found. But I'm not sure if searching through objects in the database is possible. And, even if it is, would it be slow? And also if that is the case, can it be indexed to make it faster, seeing that I'm estimating it would be about 100 to 200 MB of documents?
My gut feeling is that you cannot search through the objects in the database, but I figured I'd give it a try. But if it can be done, can anyone offer some suggestions on exactly how?
Thanks for any help,
Tim