Meeting Minutes from January 15th, 2006 The meeting in which we attempt to describe their structure, and our proposal, and further divide the labors. Their Current Structure: Two folders under a main directory Archives Archive97 Archive98 Archive99? These three contain the articles and images as well. The other pages were at top level in one of two forms monthYear.htm indexYear.htm ^ These files only reference pages in the articles Articles Every article they've ever produced (since 99) resides in this folder. This folder contains both the images and the web pages, which are named in no particular manner. They are for our purposes, completely random. 3-5 articles a month 12 montths a year 8 years 288 – 480 Articles Our Structure: Year Month timestamp#takedown timestamp#number#.ext Year is the the top most storage folder, this goes down into the twelve months of the year. From here, each folder contains articles and their images, named by a time stamp, and an incrementing number in case of multiple files with the same extension being submitted at the same time. Converter approaches: We can shift things by time stamp, and hope this works without breaking an unacceptably large number of the links. Create a program which mines the html files to find the linked images and preserves those links. Allow some poor work study student to convert the whole thing by hand. Dray is going to set up a development server and a repository for us to work from.