It's complicated, but.
1) run a scan of all the relevant raw files using a recursive function. Every specimen in the collection has been photographed. Each raw file is about 70mb.
2) record the family name of each box of images.
3) record the Accession number of each image in a box, along with...