NASA SWEET Content Characterization

Gail Hodge (IIA)
2005-03-15



II. Core Content Characterization

The following attributes of the content characterization will be supplied by XMDR staff.

Attributes marked with an asterisk (*) are considered mandatory.

Title*

The title (name) of the metadata resource. Taken from Dublin Core Metadata Element Set, Version 1.1: Reference Description.



NASA Semantic Web for Earth and Environmental Terminology

Acronym

The acronym (if any) of the title (name) of the metadata resource.



NASA SWEET

One of the following (Web page(s), Identifier, or Contact Information) is mandatory:

Web page(s)

URL or URI for a web page which describes and/or provides access to the metadata resource.



http://sweet.jpl.nasa.gov/ontology/

Identifier

Unique identifier for the metadata resource, e.g., URN, URI, URL, ISBN, ISSN, DOI, etc.

Contact Information

Name of contact person, mailing address, email address, phone number, etc. for this metadata resource.

Rob Raskin

Jet Propulsion Lab, 300-320

Pasadena, CA 91109

(818) 354-4228

raskin@jpl.nasa.gov





Inclusion Rationale*

Why should we include this metadata content in the XMDR prototype? Which sponsoring organizations of XMDR are likely to be interested?



This provides access to several environmentally related ontologies. May also develop an interest in XMDR on the part of some staff at NASA. Mappings from other terminologies, including the Global Change Master Directory, are available which may add content applicable to one or more of the use cases. SWEET relies heavily on numerical values and problems with Protégé in this regard are noted. This may be another area in which this can be a valuable content testbed for the XMDR tools.

Subject*

Indicate application domain(s) of the metadata content metadata resource. See below. Taken from Dublin Core Metadata Element Set, Version 1.1: Reference Description.

Kind of Metadata*

Indicate type(s) of metadata included in the data set: Cf. "type" attribute from the Dublin Core Metadata Element Set, Version 1.1: Reference Description.

Size statistics(estimated)*

How big is the data set: number of terms or concepts (nodes), number of relationships (edges), number of constraints, size in bytes (in various formats/compressions), size in bytes of internal representations?

Initial Submitter*

Name of person who filled out this survey for this metadata resource in the initial phase. Also include email address.



Gail Hodge

gailhodge@aol.com

Date of Initial Survey*

Date that the initial phase of the survey was completed or updated. Used ISO 8601 date format, i.e., yyyy-mm-dd.



2005-03-10

III. Supplementary Content Characterization

The following attributes of the content characterization will be completed by XMDR staff. These attributes will be collected for those content collections which are considered to be high priority for inclusion in the prototype.

Attributes marked with an asterisk (*) are considered mandatory.

Date*

Publication date of most recent version of the metadata resource in ISO 8601 Date Format: YYYY-MM-DD. Taken from Dublin Core Metadata Element Set, Version 1.1: Reference Description.

Ontologies revised and validated October 10, 2004



Creator

Organization or person(s) responsible for the creation (authoring) of the metadata resource. Taken from Dublin Core Metadata Element Set, Version 1.1: Reference Description.

Team led by Rob Raskin (raskin@seastar.jpl.nasa.gov)



Publisher*

Organization or person(s) responsible for publishing / distributing the metadata resource. Note that we do not differentiate here between publishers and distributors. Taken from Dublin Core Metadata Element Set, Version 1.1: Reference Description.



NASA Jet Propulsion Laboratory

Description*

Additional textual description of the metadata resource.



This project provides a common semantic framework for various Earth science initiatives. SWEET developed these capabilities in the context of finding and using Earth science data and information.


The ontologies within the Semantic Web for Earth and Environmental Terminology (SWEET) provide an upper-level ontology for Earth system science. The SWEET ontologies include several thousand terms, spanning a broad extent of Earth system science and related concepts (such as data characteristics) using the OWL language. The ontologies can be downloaded from http://sweet.jpl.nasa.gov/sweet. To support such a large collection and adhere to the guiding principles, the concepts are divided, where possible, into orthogonal dimensions or facets in support of reductionism. The primary ontologies are shown in Figure and explained below. Each box represents a separate ontology, and a connecting line indicates where major properties are used to define concepts across ontology spaces.





You can probably grab some words from the documentation

Language(s)*

Language(s) of the content of the metadata resource. Is the metadata resource multi-lingual (e.g., thesauri)? Taken from the Dublin Core Metadata Element Set, Version 1.1: Reference Description.



English

Graph-theoretic Classification*

Indicate type of metadata according to the following graph-theoretic classification scheme:



My guess would be a faceted classification,

Format / Schemas(s)*

What file formats (ASN.1, XML, RDF, KIF, HDF5, netCDF, ... ) can the metadata resource be had in? What schemas are used (e.g., for XML, RDF, ...)



OWL

Media / Download*

Is the metadata resource available for download? What are the principal / mirror download sites ? What media types (CDROM, DVD, ...) are available (if any)?



Available for download from the web site

Constraint Specifications

Does the metadata resource include constraints? What kinds of constraints (keys, foreign keys, ....)? How are constraints specified (SQL, logic, RuleML, SWRL, Object Constraint Language, ...) ?

Protocol(s)

What protocols does the metadata publisher support for download, other access, e.g., FTP, HTTP, REST, SOAP, UDDI, LDAP, etc.



HTTP

Licensing Issues*

Open source, public domain, academic use, proprietary license, .... License agreement required ? Cost of license? Can content be redistributed or posted to the web?



None

Export restrictions

Restrictions on export / distribution ?



None

Subsets

Is there some subset of the metadata resource which would be of interest? Can we request / extract / query this subset for the originating site or will we have to obtain the complete metadata resource content and then perform the subset extraction query processing on our system?



There are several subsets available.

Earth Realms

Physical Phenomena (any transient feature)

Physical Processes

Physical Properties

Physical Substances

Sun Realms

Biosphere

Data

Human Activities

Material Things

Numerics

Space

Time

Units



Versions, Updates

What is the current version number of the metadata resource? How often is the metadata resource updated? How are updates named / distributed / propagated ?

Documentation

What documentation is available? Where / how to obtain documentation ? Format of documentation ?



Guide to SWEET Ontologies by Rob Raskin is available at http://sweet.jpl.nasa.gov/index.html

Character Set Encoding*

Is the data set encoded in ASCII, Unicode (and if so what character encoding UTF-8, UTF-16, ...), or other?



UTF-8

Measurement units

What system of measurement units (e.g., SI, cgs, US customary, ... ) is used (if any)? How are they encoded ?



Separate ontology for Units. Defined using Unidata’s UDUnits. Separate section in the documentation about Units.

Dataset / Standards Dependencies

Indicate any dependencies of this metadata resource on other metadata resources or standards, e.g., country codes, terminologies, chemical or biological nomenclature standards, etc.

Related Datasets

Other similar or related metadata resources.



Global Change Master Directory

A good list of metadata vocabularies is available from:

http://marinemetadata.org/vocabularies/refs



including another RDF/OWL implementation of the GCMD keywords

Software tools

What software tools are available to parse, load, convert, browse, edit, .... this metadata resource (type)?



Protégé’?

Audience(s)

Who is the primary intended audience for this data set? Expert researchers, DBAs, scientific users, agency staffers, librarians, statisticians, teachers, general public, college students, high school students, ...



The primary audience might have been scientific users, but could be educators and public.



Citation

How should the metadata resource be cited in publications, etc. ? Note that the preferred bibliographic citation is often a publication rather than the web site for a resource.



Probably use the Guide to SWEET Ontologies?

I am not sure if anything was published in the peer-reviewed literature.

Surveyor*

Person who filled out this section of the survey for this metadata resource. Contact info also.

Date of Survey*

Date this survey was completed / updated for this metadata resource.

IV. Content Characterization by XMDR Staff

The following data elements are to be supplied by XMDR project staff/collaborators.

Attributes marked with an asterisk (*) are considered mandatory.

XMDR Participant Expertise

Names of persons (if any) on XMDR project (and contact info) who are familiar with this data set. XMDR participant organizations who have copies of this metadata resource.



Gail Hodge has contact with GCMD and those at JPL who have worked on this ontology

XMDR Evaluator*

Names of persons on XMDR project (and contact info) who evaluated this metadata resource for inclusion (e.g., if the content survey was completed by someone at the content repository).



Gail Hodge

Inclusion Priority*

Priority suggested for acquisition and ingestion of this metadata resource.



Should be included since it constitutes an environmental rather than a health ontology. One or more can be selected. Suggest using Earth Realms and Biosphere. Time and Units may also be interesting.





Maintained by Frank Olken at Lawrence Berkeley National Laboratory. olken@lbl.gov Last updated: 2005-03-15, Tuesday