As I said in my introduction to this site ([FONT="]Wandering in the dark)[/FONT], I have not used Access since the mid 1990’s. I understand the principles of a relational data bases design and the importance of data normalization.
Here is my dilemma:
I have a large number of excel spreadsheets ( appx. 125) that have been used to record data from clients that have been treated in a diabetes prevention program. These spreadsheets have been copied, renamed and used by others to add their data. There are some spreadsheets that have 128 columns. This is the result of adding new data to an original spreadsheet creating an enormous flat file. To compound the problem the multiple copies of the original spreadsheets do not all contain the same data.
What is the best way to sort all this information into clean non duplicated data that can be used in an Access database? Should I try to clean up the data in the spreadsheets first then import it into Access or would it be easier to import the raw spreadsheets and manipulate the data in Access?
I have very little experience with Visual Basic. I have seen on this site that it is often recommended as a solution for manipulating the data. What is the learning curve for me to be able to use that programming language?
What is the best way to sort all this information into clean non duplicated data that can be used in an Access database? Should I try to clean up the data in the spreadsheets first then import it into Access or would it be easier to import the raw spreadsheets and manipulate the data in Access?
I have very little experience with Visual Basic. I have seen on this site that it is often recommended as a solution for manipulating the data. What is the learning curve for me to be able to use that programming language?
Prior to dealing with the raw data I started to design my Access database. I have attached a copy of my first pass on a normalized database that will meet our needs. I am not sure if I have carried the normalization too fare, making my model too complicated. You will notice that all personal information about a client is separated from the metrics by a junction table which allows the sharing of metrics without revealing any personal information. This is a federal law (HIPAA) that requires that medical record privacy be guaranteed.
I have attached a copy of my Access design with only the empty tables and no forms or queries.
I have attached a copy of my Access design with only the empty tables and no forms or queries.
Any help or advice would be appreciated as I am not sure how to proceed.