Although I work in a functional department (not IS), I've been instructed to develop an archive for our portion of our legacy system. (Other departments aren't interested in the historical data, for reasons that I can't fathom) That system was developed in DEC VAX and is a cobbled-together mish-mash of relational and flat structures.
I have identified at least seventeen files (tables) that are essential to mine the historical data in the legacy system. Two files have more than 300 fields; one of those files has more than 130K records. Some files that have fewer fields (on the order of 25) have well over a half-million records.
The system has a facility to export comma-separated, text-delimited files, but each field has to be listed by name. The system uses fixed-length records (and fields) that can be overlayed in multiple ways to define subfields. (For example, consider the consecutive characters 12212010 -- today's date. This could be referred to as a single 8-character date field, and positions 1 and 2 could be defined as a month field, and positions 1 through 4 could be defined as a month-day field, all simultaneously.) Because of the substantial overhead of the delimiters in a 340-field file (nearly than 1K per record, more than 100MB for the table), I am considering overlaying the record with a single field for export.
In converting the legacy data to our new system, I wrote code that translated the textfile output of data-definition reports into a database of fields and their properties. Since Access is a recursive tool (table definitions are themselves data in tables), I am hoping to manipulate the data-definition database into linked-table specifications.
(As for future use of the data: I would link the data in a FE-BE configuration, even though the back-end data would be static)
How do I achieve this? Can I push data to MSysIMEXSpecs and -Columns? Must I use Schema.ini? (If so, what is the best resource for learning about it?)
Thank you for any wisdom you can share! Please let me know if I need to clarify anything about this project. I think I'm in over my head, but that doesn't diminish the expectations of my supervisors.
I have identified at least seventeen files (tables) that are essential to mine the historical data in the legacy system. Two files have more than 300 fields; one of those files has more than 130K records. Some files that have fewer fields (on the order of 25) have well over a half-million records.
The system has a facility to export comma-separated, text-delimited files, but each field has to be listed by name. The system uses fixed-length records (and fields) that can be overlayed in multiple ways to define subfields. (For example, consider the consecutive characters 12212010 -- today's date. This could be referred to as a single 8-character date field, and positions 1 and 2 could be defined as a month field, and positions 1 through 4 could be defined as a month-day field, all simultaneously.) Because of the substantial overhead of the delimiters in a 340-field file (nearly than 1K per record, more than 100MB for the table), I am considering overlaying the record with a single field for export.
In converting the legacy data to our new system, I wrote code that translated the textfile output of data-definition reports into a database of fields and their properties. Since Access is a recursive tool (table definitions are themselves data in tables), I am hoping to manipulate the data-definition database into linked-table specifications.
(As for future use of the data: I would link the data in a FE-BE configuration, even though the back-end data would be static)
How do I achieve this? Can I push data to MSysIMEXSpecs and -Columns? Must I use Schema.ini? (If so, what is the best resource for learning about it?)
Thank you for any wisdom you can share! Please let me know if I need to clarify anything about this project. I think I'm in over my head, but that doesn't diminish the expectations of my supervisors.