tripptofer
Registered User.
- Local time
- Today, 09:41
- Joined
- Aug 12, 2009
- Messages
- 20
Hiya-
Apologies in advance, but I seem to have gotten stuck and could use a swift nudge in the right direction.
I had been bludgeoning excel to act as a quasi-database (b/c I was on a mac and it was the only thing I had) and have subsequently acquired a pc and access and so I am trying to slowly transform what I did before into a proper db.
I first mapped everything out on paper so that fields could be normalized properly from the get go, but then realized it was less typing to simply import the spreadsheet as a populated table.
What I would like to do is start with a massive amount of grants data from the NIH and then tease out their unique codes for recipient institutions, investigators and granting agencies. Then, using referential integrity I was hoping to backfill in as many of the holes as possible and correct any typos (deduping).
So the main data table would have fields including GrantID, RecipientID, RecipientOrgName, RecipientOrgCity, state, country; InvestigatorID, IvestigatorName, etc... Initially I can use all of the NIH unique IDs as primary keys, however eventually all will require their own internal PK as foreign data will be added and I can assume that not everyone in the world has been funded by the NIH.
I guess at this stage, I am mostly concerned with structuring everything as best as possible from the get go and keeping my brownian motion efforts to a minimum...
I had imported the main data table and built the accessory lookup tables which will be populated by the main data table however access is balking at me making anything other than non-referential relationships.
Do the supplemental tables need to be first populated? I thought it would simply be a left (?) join and having banged around for a while now on my own I am quietly convinced that a little knowledge is dangerous and I've missed something fundamental.
Any guidance would be appreciated...
Kind regards,
tripptofer
Apologies in advance, but I seem to have gotten stuck and could use a swift nudge in the right direction.
I had been bludgeoning excel to act as a quasi-database (b/c I was on a mac and it was the only thing I had) and have subsequently acquired a pc and access and so I am trying to slowly transform what I did before into a proper db.
I first mapped everything out on paper so that fields could be normalized properly from the get go, but then realized it was less typing to simply import the spreadsheet as a populated table.
What I would like to do is start with a massive amount of grants data from the NIH and then tease out their unique codes for recipient institutions, investigators and granting agencies. Then, using referential integrity I was hoping to backfill in as many of the holes as possible and correct any typos (deduping).
So the main data table would have fields including GrantID, RecipientID, RecipientOrgName, RecipientOrgCity, state, country; InvestigatorID, IvestigatorName, etc... Initially I can use all of the NIH unique IDs as primary keys, however eventually all will require their own internal PK as foreign data will be added and I can assume that not everyone in the world has been funded by the NIH.
I guess at this stage, I am mostly concerned with structuring everything as best as possible from the get go and keeping my brownian motion efforts to a minimum...
I had imported the main data table and built the accessory lookup tables which will be populated by the main data table however access is balking at me making anything other than non-referential relationships.
Do the supplemental tables need to be first populated? I thought it would simply be a left (?) join and having banged around for a while now on my own I am quietly convinced that a little knowledge is dangerous and I've missed something fundamental.
Any guidance would be appreciated...
Kind regards,
tripptofer