Progress on Migration of GrainGenes from ACEDB to RDBMS

Author: David Hummel (updated 23Sep2004 by Nancy Lui)



Current Status

Goals

Schema Design

The data contained in ACEDB is highly object-oriented in nature, existing in tree-like nested structures which are not directly translatable to a relational format. To maintain the complexity and breadth of data contained in ACEDB, several relational tables are required to express the data in a single class. These tables, linked together via foreign keys, allow for the one-to-many, many-to-one, and many-to-many relationships that exist in the tree-like ACEDB models. The following relational strategy was used:

Given the high degree of model complexity for certain ACEDB classes (i.e. Sequence), these classes require many SQL tables to express the data. To minimize the number of tables required, it was helpful to reduce the complexity of the class models, wherever possible, before deriving the SQL tables. The following steps were taken to simplify the models:

Considerations

Links

Miscellaneous Documents