MAGE-DB2
The MicroArray Gene Expression (MAGE) object model (OM)
is an emerging standard for the representation of microarray
data (Spellman
et al.). MAGE-OM provides the necessary data types
and relationships to support storage of very complete
descriptions of microarray experiments and the resulting
data including: a) description of RNA isolation, hybridization,
and scanning protocols; b) descriptions of chip architecture;
c) annotation data relating to chip probes; d) raw and
processed data at the lowest feature level of the arrays.
A MAGE-ML toolkit is available for exporting MAGE data
to XML-format for purposes of data distribution . We have
implemented a database called MAGE-DB2 that is a full relational mapping of the MAGE-OM optimized
to run on IBM’s scalable, parallel database DB2.
The database is supplemented with: a) data loaders for
importing chip description files for both cDNA, long-oligonucleotide
(Agilent) and Affymetrix arrays, annotation data, and
primary microarray data; b) a web-interface that enables
users to design, save and re-use descriptions of all protocols;
c) a natural language query builder that enables non-expert
users to query the database; d) data export tools that
output data stored in MAGE-DB2 to MAGE-ML format.
Protein-DB2
Protein-DB2 is a relational
database for primary proteomics experimental data. Its
schema also is an extension of the Proteomics Experiment Data
Repository (PEDRo) object model (Taylor
et al), now referred to as the Protein Standards Initiative
(PSI) object model. Protein-DB2 is designed to store complete descriptions of 2-D gel
experimental analyses output by the analysis tool Progenesis
(Nonlinear Dynamics Inc.), as well as results of subsequent
MS and MS/MS analysis. A suite of data translators which
convert binary files output by multiple mass spectrometry
instrumentation to either PSI-ML or mzXML formats
has been developed. Software for loading PSI-ML or mzXML
files into the database and associating these data with
particular spots has been developed. Software for linking
MS and MS/MS data from spot analysis to gels stored within
the database has been developed. Software for loading
the results of Mascot and Sequest MS data analysis and
linking this with MS and spot data has been developed. Versions
1 and 2 database source code is available at the CCBM website
(www.ccbm.jhu.edu).
The
Cardiac Anatomic Database System (CADS)
CADS is an object model and
relational database designed to store finite-element models
of imaged hearts along with MR imaging data measured at
each image voxel. It is currently used to store diffusion
tensor MR data describing fiber and laminar sheet organization
of imaged hearts. CADS is unique
in that it incorporates a visually driven query system
so that users may visualize imaged, segmented and rendered
hearts, graphically select voxel data within sub-regions
for analysis and specify arbitrary SQL queries to be executed
on the data within the selected sub-region. It is a powerful
tool for voxel by voxel comparative analysis of cardiac
imaging data. CADS is available
for download at www.ccbm.jhu.edu and is deployed and operational.
The Center for Cardiovascular Bioinformatics & Modeling - Model Source Code