about icmicm peopleicm researchicm portalsicm seminarsicm publicationsicm educationicm downloads


Contact Us Explore Our Institute News Organization Jobs Facilities Philanthropy

Facilities

 

Databases

MAGE-DB2
The MicroArray Gene Expression (MAGE) object model (OM) is an emerging standard for the representation of microarray data (Spellman et al.). MAGE-OM provides the necessary data types and relationships to support storage of very complete descriptions of microarray experiments and the resulting data including: a) description of RNA isolation, hybridization, and scanning protocols; b) descriptions of chip architecture; c) annotation data relating to chip probes; d) raw and processed data at the lowest feature level of the arrays. A MAGE-ML toolkit is available for exporting MAGE data to XML-format for purposes of data distribution . We have implemented a database called MAGE-DB2 that is a full relational mapping of the MAGE-OM optimized to run on IBM’s scalable, parallel database DB2. The database is supplemented with: a) data loaders for importing chip description files for both cDNA, long-oligonucleotide (Agilent) and Affymetrix arrays, annotation data, and primary microarray data; b) a web-interface that enables users to design, save and re-use descriptions of all protocols; c) a natural language query builder that enables non-expert users to query the database; d) data export tools that output data stored in MAGE-DB2 to MAGE-ML format.

Protein-DB2
Protein-DB2 is a relational database for primary proteomics experimental data. Its schema also is an extension of the Proteomics Experiment Data Repository (PEDRo) object model (Taylor et al), now referred to as the Protein Standards Initiative (PSI) object model. Protein-DB2 is designed to store complete descriptions of 2-D gel experimental analyses output by the analysis tool Progenesis (Nonlinear Dynamics Inc.), as well as results of subsequent MS and MS/MS analysis. A suite of data translators which convert binary files output by multiple mass spectrometry instrumentation to either PSI-ML or mzXML formats has been developed. Software for loading PSI-ML or mzXML files into the database and associating these data with particular spots has been developed. Software for linking MS and MS/MS data from spot analysis to gels stored within the database has been developed. Software for loading the results of Mascot and Sequest MS data analysis and linking this with MS and spot data has been developed. Versions 1 and 2 database source code is available at the CCBM website (www.ccbm.jhu.edu).


The Cardiac Anatomic Database System (CADS)
CADS is an object model and relational database designed to store finite-element models of imaged hearts along with MR imaging data measured at each image voxel. It is currently used to store diffusion tensor MR data describing fiber and laminar sheet organization of imaged hearts. CADS is unique in that it incorporates a visually driven query system so that users may visualize imaged, segmented and rendered hearts, graphically select voxel data within sub-regions for analysis and specify arbitrary SQL queries to be executed on the data within the selected sub-region. It is a powerful tool for voxel by voxel comparative analysis of cardiac imaging data. CADS is available for download at www.ccbm.jhu.edu and is deployed and operational.

 

Software


The Center for Cardiovascular Bioinformatics & Modeling - Model Source Code