XMDR Content Survey - IETF RFC 3066 (Language Tags)

II. Core Content Characterization

Title*

IETF RFC 3066 - Tags for the Identification of Languages

Acronym

IETF3066

Web page(s)

http://rfc.net/rfc3066.html

Identifier

URI:ISO:2.16.840.1.113883.6.121

Contact Information

Harald Tveit Alvestrand
UNINETT
Pb. 6883 Elgeseter
N-7002 TRONDHEIM
NORWAY

EMail: Harald.T.Alvestrand@uninett.no
Phone: +47 73 59 70 94

Inclusion Rationale*

This standard describes how language tags are to be encoded and used throughout the WWW

Note: This specification obsoletes IETF 1766

Subject*

Kind of Metadata*

Size statistics(estimated)*

IETF 3066 doesn't specify language tags directly. It instead describes a mechanism for extending ISO 639 -1 Two character codes for the representation of names of languages and ISO 639-2 - Three character codes for the representation of languages

ISO 639-1 contains ~135 language codes

ISO 639-2 contains ~500 language codes

This is a flat structure so there are no edges

Initial Submitter*

Harold Solbrig solbrig@mayo.edu

Date of Initial Survey*

2005-03-10

III. Supplementary Content Characterization

The following attributes of the content characterization will be completed by XMDR staff. These attributes will be collected for those content collections which are considered to be high priority for inclusion in the prototype.

Attributes marked with an asterisk (*) are considered mandatory.

Date*

2005-01

Creator

Internet Engineering Task Force

Publisher*

Internet Assigned Numbers Authority

Description*

RFC 3066 specifies an identifier mechanism, a registration function for values to be used with that identifier mechanism, and a construct for matching against those values.

IANA maintains a registry of assigned language tags

Language(s)*

English

Graph-theoretic Classification*

Indicate type of metadata according to the following graph-theoretic classification scheme:

Format / Schemas(s)*

none

Custom Set of Text Tables

Media / Download*

None

Constraint Specifications

?

Protocol(s)

?

Licensing Issues*

ISO was recently convinced to make the ISO 639-1 and 639-2 publicly available.

Export restrictions

None

Subsets

None

Versions, Updates

Changes occur needed

Documentation

See: RFC3066

Character Set Encoding*

UTF-8

Measurement units

NA

Dataset / Standards Dependencies

ISO 639-1

ISO 639-2

ISO 3166

Related Datasets

Other similar or related metadata resources.

Software tools

Other tools as well

Audience(s)

Anyone

Citation

ISO 3066:2001 - Tags for the Identification of Languages. H. Alvestrand

Surveyor*

Harold Solbrig

Date of Survey*

2005-03-11

IV. Content Characterization by XMDR Staff

The following data elements are to be supplied by XMDR project staff/collaborators.

Attributes marked with an asterisk (*) are considered mandatory.

XMDR Participant Expertise

Names of persons (if any) on XMDR project (and contact info) who are familiar with this data set. XMDR participant organizations who have copies of this metadata resource.

XMDR Evaluator*

Names of persons on XMDR project (and contact info) who evaluated this metadata resource for inclusion (e.g., if the content survey was completed by someone at the content repository).

Inclusion Priority*

Priority suggested for acquisition and ingestion of this metadata resource.


Maintained by Frank Olken at Lawrence Berkeley National Laboratory. olken@lbl.gov Last updated: 2004-08-31, Tuesday

Valid XHTML 1.0!