Difference between revisions of "DAS/2.1/Spec"

From BioDAS
Jump to: navigation, search
m (DAS/2/Spec moved to DAS/2.1/Spec: To separate this evolving version of the DAS spec from the frozen DAS/2 version.)
(Wrote a lead to describe what this 2.1 stuff is all about. Link to stable 2.0 version.)
Line 1: Line 1:
 +
This '''DAS/2.1''' version is currently under development. Contributions to this new version are invited from both DAS/1 and DAS/2 developers with the goal of merging ideas from both camps to create a unified version of the spec suitable for all DAS-based applications.
 +
 +
Here's the [http://biodas.org/documents/das2/das2_protocol.html stable '''DAS/2.0''' specification].
 +
 
==Overview==
 
==Overview==
  
The DAS 2.0 specification addresses some shortcomings of the DAS 1.x protocol, including:
+
The DAS 2.x specification addresses some shortcomings of the DAS 1.x protocol, including:
  
 
* Better support for hierachical structures (e.g. gene -> transcripts -> exons)
 
* Better support for hierachical structures (e.g. gene -> transcripts -> exons)
Line 10: Line 14:
  
 
DAS is a protocol for sharing biological data.  This version of the
 
DAS is a protocol for sharing biological data.  This version of the
specification, DAS 2.0, describes features located on the genomic
+
specification, DAS 2.x, describes features located on the genomic
 
sequence.  Future versions will add support for sharing annotations of
 
sequence.  Future versions will add support for sharing annotations of
 
protein sequences, expression data, 3D structures and ontologies.  The
 
protein sequences, expression data, 3D structures and ontologies.  The
Line 16: Line 20:
 
large core shared with the protein sequence DAS.
 
large core shared with the protein sequence DAS.
  
A DAS 2.0 annotation server provides feature information about one or
+
A DAS 2.x annotation server provides feature information about one or
 
more genome sources.  Each source may have one or more versions.
 
more genome sources.  Each source may have one or more versions.
 
Different versions are usually based on different assemblies.  As an
 
Different versions are usually based on different assemblies.  As an
Line 34: Line 38:
 
associated with each annotation.  These are called feature filters.
 
associated with each annotation.  These are called feature filters.
  
DAS 2.0 is implemented using a
+
DAS 2.x is implemented using a
 
[http://www.ics.uci.edu/~fielding/pubs/dissertation/top.htm ReST architecture].
 
[http://www.ics.uci.edu/~fielding/pubs/dissertation/top.htm ReST architecture].
 
Each document (also called an entity or object) has
 
Each document (also called an entity or object) has
Line 46: Line 50:
 
==Retrieval of genomic sequence and annotation==
 
==Retrieval of genomic sequence and annotation==
  
[[DAS/2/Spec/Get-Genomic]]
+
[[DAS/2.1/Spec/Get-Genomic]] for read-only access to genomic sequence and annotation data sources.
  
 
==Stylesheet==
 
==Stylesheet==

Revision as of 21:16, 30 October 2008

This DAS/2.1 version is currently under development. Contributions to this new version are invited from both DAS/1 and DAS/2 developers with the goal of merging ideas from both camps to create a unified version of the spec suitable for all DAS-based applications.

Here's the stable DAS/2.0 specification.

Overview

The DAS 2.x specification addresses some shortcomings of the DAS 1.x protocol, including:

  • Better support for hierachical structures (e.g. gene -> transcripts -> exons)
  • Ontology-based feature annotations
  • Allow multiple formats, including formats only appropriate for some feature types
  • A lock-based editing protocol for curational clients
  • An extensible namespacing system that allows annotations in non-genomic coordinates (e.g. uniprot protein coordinates or PDB structure coordinates)

DAS is a protocol for sharing biological data. This version of the specification, DAS 2.x, describes features located on the genomic sequence. Future versions will add support for sharing annotations of protein sequences, expression data, 3D structures and ontologies. The genomic DAS interface is deliberately designed so there will be a large core shared with the protein sequence DAS.

A DAS 2.x annotation server provides feature information about one or more genome sources. Each source may have one or more versions. Different versions are usually based on different assemblies. As an implementation detail an assembly and corresponding sequence data may be distributed via a different machine, which is called the reference server.

Annotations are located on the genomic sequence with a start and end position. The range may be specified multiple times if there are alternate coordinate systems. An annotation may contain multiple non-continguous parts, making it the parent of those parts. Some parts may have more than one parent. Annotations have a type based on terms in SOFA (Sequence Ontology for Feature Annotation). Stylesheets contain a set of properties used to depict a given type.

Annotations can be searched by range, type, and a properties table associated with each annotation. These are called feature filters.

DAS 2.x is implemented using a ReST architecture. Each document (also called an entity or object) has a name, which is a URL. Fetching the URL gets information about the document. The DAS-specific documents are all in XML. Other data types have existing widely used formats, and sometimes more than one for the same data. A DAS server may provide a distinct document for each of these formats, along with information about which formats are available.

Retrieval of genomic sequence and annotation

DAS/2.1/Spec/Get-Genomic for read-only access to genomic sequence and annotation data sources.

Stylesheet

Writeback

Region locking

Assay Retrievals

Ontology Retrievals