frustrating
Greetings:
I'm trying to create a small program that makes one particular job my company does significantly easier. The software our client uses is obnoxious: viewing documents to audit involves a lot of clicking through their web interface. There can be up to 1000 documents to audit, and each document can take 10 clicks just to view the image! I think it's possible to retrieve all of the documents and look at them without clicking, but that requires extracting variables that ultimately get inserted into a URL string to download the PDF files we want to audit.
I'm trying to run a script that does several things:
1. Goes to a URL and downloads its source code and saves it into a txt file.
2. Then takes the txt file and retrieves the variable I need from it (my client's system calls it transportID, but what I really need is the 9-digit number that comes after each one). There could be anywhere from 10-20 'transportIDs' per batch that need to get downloaded. They then need to be put into an Access table called TransportID.
3. The transportID gets inserted into a link which goes to the page where I can download my PDF.
4. Download all the PDFs into a folder.
From there I can create a form to view them as necessary.
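As a rough illustration of steps 3 and 4, here is a minimal Python sketch. The URL template is entirely hypothetical, since the client's real link format isn't given here, and the names `PDF_URL_TEMPLATE`, `build_pdf_url`, and `download_pdfs` are made up for the example. It also assumes the PDFs can be fetched without a login session, which may well not be true for the real site.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Hypothetical URL pattern (assumption): substitute the client's real link format.
PDF_URL_TEMPLATE = "https://client.example.com/documents/view?transportID={transport_id}"


def build_pdf_url(transport_id, template=PDF_URL_TEMPLATE):
    """Step 3: insert one transportID into the download link."""
    return template.format(transport_id=transport_id)


def download_pdfs(transport_ids, folder, template=PDF_URL_TEMPLATE):
    """Step 4: download one PDF per transportID into the given folder."""
    target = Path(folder)
    target.mkdir(parents=True, exist_ok=True)
    saved = []
    for transport_id in transport_ids:
        destination = target / f"{transport_id}.pdf"
        # Plain unauthenticated fetch; a site behind a login would need more than this.
        urlretrieve(build_pdf_url(transport_id, template), str(destination))
        saved.append(destination)
    return saved
```

Naming each file after its transportID keeps it easy to match a PDF back to its row in the TransportID table.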
I've been able to automate my source code download into txt just fine, but where I'm really getting stuck is step #2. What would be the best way to tell the system to basically "open the text file, search the whole file for anything that says 'transportID', then retrieve the 10 characters after 'transportID' and put them into a table"?
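One possible way to express that search is a regular expression. The following is a minimal Python sketch, not a definitive answer: it assumes each ID appears in the page source as the text `transportID`, followed by at most three separator characters (such as `=` or `: `), followed by exactly nine digits. The real markup may differ, so the pattern would need adjusting. The names `TRANSPORT_ID_PATTERN` and `extract_transport_ids` are made up for the example.

```python
import re

# Assumption: "transportID", up to three non-digit separator characters,
# then exactly nine digits (a tenth digit makes the match fail).
TRANSPORT_ID_PATTERN = re.compile(r"transportID\D{0,3}(\d{9})(?!\d)")


def extract_transport_ids(page_source):
    """Return the unique 9-digit numbers found after 'transportID', in order of first appearance."""
    seen = set()
    ids = []
    for match in TRANSPORT_ID_PATTERN.finditer(page_source):
        number = match.group(1)
        if number not in seen:  # skip IDs that repeat within the page
            seen.add(number)
            ids.append(number)
    return ids
```

Calling it on the saved file would look something like `extract_transport_ids(Path("source.txt").read_text())` after `from pathlib import Path`, where `source.txt` is a placeholder for wherever step 1 saved the page. Writing the resulting list into the Access table is a separate step that this sketch does not cover.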
It's a pretty obnoxious post, I know. Any ideas?