XML-Based Access to Bioinformatics Data

A. Domain-Specific Markup Languages/Schema (DTD/XML Schema)

Language | Full Name | Developer | URL | Schema Format | Schema | Description
BSML | Bioinformatic Sequence Markup Language | LabBook | www.bsml.org | DTD | www.labbook.com/dtd/bsml2_2_annot.dtd | The Bioinformatic Sequence Markup Language is an open-standard protocol for encoding DNA, RNA, and protein sequence information and for its graphical display.
BIOML | BIOpolymer Markup Language | ProteoMetrics | www.bioml.com | DTD | www.bioml.com/BIOML/bioml.dtd | The BIOpolymer Markup Language (BIOML) is an XML language used to describe experimental information about proteins, genes, and other biopolymers.
PSDML | Protein Sequence Database Markup Language | National Biomedical Research Foundation | pir.georgetown.edu/pirwww | DTD | pir.georgetown.edu/pirwww/xml/psdml.dtd | The Protein Sequence Database Markup Language is an open-standard markup language used to store protein information in the Protein Information Resource (PIR) database.
AGAVE | Architecture for Genomic Annotation, Visualization and Exchange | DoubleTwist | www.agavexml.org | XML Schema | www.agavexml.org/schema/v3_0/agave.xsd | The Architecture for Genomic Annotation, Visualization and Exchange, developed by the AGAVE community led by DoubleTwist.
MSAML | Multiple Sequence Alignment Markup Language | Paul Gordon | maggie.cbr.nrc.ca/~gordonp/xml/MSAML | DTD | maggie.cbr.nrc.ca/~gordonp/xml/MSAML/dtd/msaml.dtd | The Multiple Sequence Alignment Markup Language was developed to make manipulation and extraction of multiple sequence alignment information easier by logically defining the parts of an alignment for use in an XML-based application.
GAME | Genome Annotation Markup Elements | BIOXML | www.bioxml.org/Projects/game | DTD | www.bioxml.org/dtds/game2/game.dtd | Genome Annotation Markup Elements is a markup language used in molecular biology for the annotation of biosequences.
MGED-ML | Microarray Gene Expression Database Markup Language | MGED Group/OMG | www.mged.org | DTD | ftp.omg.org/pub/docs/lifesci/01-11-02.dtd | The Microarray Gene Expression Database (MGED) group is a grassroots movement to promote the adoption of standards in microarray experiments and data. Its specific goals are to facilitate the establishment of gene expression databases, the comparability of microarray data from different sources, and the interoperability of different functional genomics databases and data analysis software. MGED-ML subsumes GeneXML, GEML, and MAML.
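
As the MSAML entry notes, a format that names the parts of an alignment makes them easy to extract in an XML-based application. The following is a minimal sketch of that idea in Python, using only the standard library. The element and attribute names (alignment, sequence, name) and the sample data are hypothetical and are not taken from the MSAML DTD or any other schema listed above.

```python
import xml.etree.ElementTree as ET

# Hypothetical alignment document. The element and attribute names are
# illustrative assumptions only; they do NOT come from the MSAML DTD.
SAMPLE = """\
<alignment id="demo">
  <sequence name="seqA">ACG-TACGT</sequence>
  <sequence name="seqB">ACGATAC-T</sequence>
  <sequence name="seqC">AC--TACGT</sequence>
</alignment>
"""


def read_alignment(xml_text):
    """Return {sequence name: aligned residues} for the sample layout."""
    root = ET.fromstring(xml_text)
    return {
        seq.get("name"): (seq.text or "").strip()
        for seq in root.findall("sequence")
    }


def column(alignment, index):
    """Return one alignment column as a string, in document order."""
    return "".join(row[index] for row in alignment.values())


alignment = read_alignment(SAMPLE)

# Count gap characters per sequence.
gap_counts = {name: row.count("-") for name, row in alignment.items()}

print(column(alignment, 3))
print(gap_counts)
```

Because each sequence is its own element, per-sequence and per-column operations reduce to simple lookups. A real application would use the element names defined by the schema it targets.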

 

B. Ontologies (RDF/RDFS/DAML+OIL)

Language | Full Name | Developer | URL | Schema Format | Schema | Description
GO | Gene Ontology | Gene Ontology Consortium | www.geneontology.org | DTD/RDF | ftp.geneontology.org/go/xml/dtd/go.dtd | The objective of GO is to provide controlled vocabularies for describing the molecular function, biological process, and cellular component of gene products. These terms are to be used as attributes of gene products by collaborating databases, facilitating uniform queries across them. The controlled vocabularies are structured to allow both attribution and querying at different levels of granularity.
MGED Ontology | Microarray Gene Expression Database Ontology | MGED Group | www.cbil.upenn.edu/Ontology/MGED_ontology.html | RDFS/DAML | www.cbil.upenn.edu/Ontology/biomaterial12.rdfs | The MGED project is charged with developing an ontology for describing samples used in microarray experiments.
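
The GO entry says that a structured vocabulary lets gene products be attributed and queried at different levels of granularity. The sketch below illustrates one way this can work: terms form a hierarchy, each gene product is annotated with a specific term, and a query on a broader term also returns products annotated to its descendants. The term IDs (EX:0001 and so on), the hierarchy, and the gene names are hypothetical; they are not real GO accessions or annotations.

```python
# Toy ontology: term -> list of parent terms. A list is used because a term
# in such a vocabulary may have more than one parent. All IDs are hypothetical.
PARENTS = {
    "EX:0001": [],           # molecular function (root)
    "EX:0002": ["EX:0001"],  # catalytic activity
    "EX:0003": ["EX:0002"],  # kinase activity
    "EX:0004": ["EX:0003"],  # protein kinase activity
    "EX:0005": ["EX:0001"],  # binding
}

# Hypothetical gene products, each annotated at its most specific known term.
ANNOTATIONS = {
    "geneA": {"EX:0004"},
    "geneB": {"EX:0003"},
    "geneC": {"EX:0005"},
}


def ancestors(term):
    """Return the term itself plus every term reachable via parent links."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(PARENTS[current])
    return seen


def annotated_to(term):
    """Gene products annotated to `term` or to any of its descendants."""
    return sorted(
        gene
        for gene, terms in ANNOTATIONS.items()
        if any(term in ancestors(t) for t in terms)
    )


print(annotated_to("EX:0004"))  # fine-grained query
print(annotated_to("EX:0002"))  # coarser query
print(annotated_to("EX:0001"))  # root-level query
```

The same annotation set answers both specific and general questions, which is what allows collaborating databases to be queried uniformly without agreeing on a single level of detail.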

 

C. Web Services (WSDL/SOAP/DAML-S/WSFL/XLANG)

I3C