Cancer Data Standards Repository Content Characterization

Frank Olken (LBNL)
2005-03-14

II. Core Content Characterization

Attributes marked with an asterisk (*) are considered mandatory.

Core Content Characterization
Attribute Value
Title* Cancer Data Standards Repository
Acronym caDSR

**One of the following (Web page(s), Identifier, or Contact Information) is mandatory:
Web page(s)** http://ncicb.nci.nih.gov/core/caDSR/
Identifier** http://ncicb.nci.nih.gov/core/caDSR/
Contact Information**

Denise Warzel, Assistant Director, caDSR, NCI Center for Bioinformatics, NIH, 6116 Executive Boulevard, Suite 408, Rockville, MD 20852. Email: warzeld@mail.nih.gov. Help desk email: ncicb@pop.nci.nih.gov

Inclusion Rationale*

This is a production ISO/IEC 11179 metadata registry.

Subject* biological, biomedical, medical or healthcare
Kind of Metadata*
  • data element characterization
    • definitions - natural language, logic-based
    • types
    • dimensionality / measurement units
  • UML object models: classes and attributes
Size statistics (estimated)*

Approximately 12,000 data elements in the registry (in various states of development).

Initial Submitter* Frank Olken (LBNL)
Date of Initial Survey* 2005-03-14
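
The two rules stated for the table above (every attribute marked * must have a value, and at least one of the attributes marked ** must be supplied) can be sketched as a small check. This is an illustrative sketch only: the function names, the dictionary representation of a record, and the abbreviated values are assumptions made for this example and are not part of any XMDR or caDSR software.

```python
# Illustrative sketch only: validate_core, has_value and the dict layout are
# hypothetical, not part of any XMDR or caDSR software.

# Attributes marked * in the Core Content Characterization table.
MANDATORY = [
    "Title",
    "Inclusion Rationale",
    "Subject",
    "Kind of Metadata",
    "Size statistics (estimated)",
    "Initial Submitter",
    "Date of Initial Survey",
]

# Attributes marked **: at least one of these must be supplied.
ONE_OF = ["Web page(s)", "Identifier", "Contact Information"]


def has_value(record, name):
    """True if the attribute is present and not blank."""
    value = record.get(name)
    return value is not None and str(value).strip() != ""


def validate_core(record):
    """Return a list of problems; an empty list means the record passes."""
    problems = [
        "missing mandatory attribute: " + name
        for name in MANDATORY
        if not has_value(record, name)
    ]
    if not any(has_value(record, name) for name in ONE_OF):
        problems.append("need at least one of: " + ", ".join(ONE_OF))
    return problems


# Values abbreviated from the page title and the table above.
cadsr = {
    "Title": "Cancer Data Standards Repository",
    "Acronym": "caDSR",
    "Web page(s)": "http://ncicb.nci.nih.gov/core/caDSR/",
    "Inclusion Rationale": "Production ISO/IEC 11179 metadata registry",
    "Subject": "biological, biomedical, medical or healthcare",
    "Kind of Metadata": "data element characterization; UML object models",
    "Size statistics (estimated)": "12K data elements",
    "Initial Submitter": "Frank Olken (LBNL)",
    "Date of Initial Survey": "2005-03-14",
}

print(validate_core(cadsr))
```

Under these assumptions the caDSR record passes with only the Web page(s) attribute supplied, since any one of the three ** attributes is sufficient.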

III. Supplementary Content Characterization

Attributes marked with an asterisk (*) are considered mandatory.

Supplementary Content Characterization
Attribute Value
Date* March 31, 2005 version 3.0
Creator Applications developed by NCI. Content primarily developed by NCI and NCI-funded cancer centers.
Publisher* National Cancer Institute Center for Bioinformatics (NCICB)
Description*

One of the problems confronting the biomedical data management community is the panoply of ways in which similar or identical concepts are described. Such inconsistency in data descriptors (metadata) makes it nearly impossible to aggregate and manage even modest-sized data sets well enough to ask basic questions of them. The NCI, together with partners in the research community, develops common data elements (CDEs) that are used as metadata descriptors for NCI-sponsored research and for the caCORE applications. The caCORE objects are represented by UML models. The UML model is used to facilitate a semi-automated load from caCORE UML into ISO/IEC 11179 Administered Components. This is discussed in more detail in the Application Developers section. The caDSR is a database and tool set that the NCI and its partners use to create, edit, and deploy the CDEs.

Language(s)* English only.
Graph-theoretic Classification* general simple directed graph (e.g., UMLS) - may contain cycles
Format / Schema(s)* XML, NCI's own schema (based on ISO/IEC 11179)
Media / Download* http://cdebrowser.nci.nih.gov
Constraint Specifications Referential integrity constraints from ISO/IEC 11179. Inclusion dependencies by NCI Enterprise Vocabulary Services.
Protocol(s) HTTP, Java and SOAP APIs.
Licensing Issues* None, open source, public domain
Export restrictions None
Subsets There is a query facility to select subsets.
Versions, Updates

As of 2005-03-14 the current version is 2.11; version 3.0 is scheduled for release on 2005-03-28. The registry is updated constantly, and individual items are versioned.

Documentation Available from the home page: a technical guide, user's guide, online documentation, release notes, and API guide. Documentation URL: http://ncicb.nci.nih.gov/core/caDSR/#SoftwareDocumentation
Character Set Encoding* UTF-8
Measurement units Not applicable.
Dataset / Standards Dependencies ISO/IEC 11179, NCI Thesaurus, MGED, UMLS, GO, see http://nciterms.nci.nih.gov and also http://ncimeta.nci.nih.gov
Related Datasets None
Software tools CDE Browser (see URL above)
Audience(s) Cancer researchers, clinical trials administrators
Citation No
Surveyor* Frank Olken (LBNL)
Date of Survey* 2005-03-14

IV. Content Characterization by XMDR Staff

The following data elements are to be supplied by XMDR project staff/collaborators.

Attributes marked with an asterisk (*) are considered mandatory.

Content Characterization by XMDR Staff
Attribute Value
MDR Participant Expertise Sherri De Coronado, NCI
MDR Evaluator* Frank Olken (LBNL)
Inclusion Priority* 1

Maintained by Frank Olken (LBNL) at Lawrence Berkeley National Laboratory, olken@lbl.gov. Last updated: 2005-03-15, Tuesday, 4:55 PM PST
