Identifiers: Difference between revisions

Revision as of 13:30, 9 January 2006

Digital Object Identifier - a digital identifier for any object of intellectual property (from DOI FAQ, mEDRA and The Biology Wiki).

The DOI is a Handle System implementation.

The Handle System is a comprehensive system for assigning, managing, and resolving persistent identifiers, known as "handles," for digital objects and other resources on the Internet.

If you give each object a name (a handle) and associate that name with the object's location using the Handle System, then when the object moves you only have to update the handle record with the new location, rather than notify everyone who might want to find the object.

Proxy servers (DOI resolvers)

You can resolve a DOI by typing the proxy server name followed by your DOI into your browser's address bar, for example http://dx.medra.org/10.1000/182. To speed resolution, the proxy servers cache handle values, with the TTL set to 24 hours.
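As a minimal sketch of the rule above (proxy server name followed by the DOI), the following Python builds a resolver URL. The function name doi_to_url, the handling of an optional "doi:" label, and the percent-encoding of unsafe characters are assumptions for illustration, not part of any DOI specification quoted here.

```python
from urllib.parse import quote


def doi_to_url(doi: str, proxy: str = "http://dx.medra.org") -> str:
    """Return the proxy server name followed by the DOI (hypothetical helper)."""
    # Assumption: tolerate an optional "doi:" label in front of the DOI.
    if doi[:4].lower() == "doi:":
        doi = doi[4:]
    # Assumption: percent-encode characters that are unsafe in a URL path,
    # but keep "/" so the prefix/suffix separator stays readable.
    return f"{proxy.rstrip('/')}/{quote(doi, safe='/')}"


print(doi_to_url("10.1000/182"))
```

Suffixes that contain characters such as "<" or ";" (as in SICI-based DOIs) come out percent-encoded with this sketch.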

DOI lookup
An OpenURL resolver can be used to retrieve DOI metadata in XML.

Description of OpenURL standard: OpenURL is a syntax for embedding parameters such as identifiers and metadata in links.

On Making and Identifying a Copy

To obtain a DOI Prefix, you need to work either with a DOI Registration Agency or, for experimental or prototype purposes, with the International DOI Foundation. To obtain a DOI prefix for experimental use, write to the IDF at contact@doi.org, giving clear indication why it is required. Prefixes issued directly by the IDF will be at a cost of US$1,000 per prefix. These prefixes will be issued purely at the discretion of the IDF. List of agencies.

DOI Numbering

The DOI consists of a unique alpha-numeric character string divided in two parts: a prefix and a suffix. For example:

10.1000/abc

where:

10.1000 is the prefix
10 identifies the string as a DOI (distinguishes a DOI from any other implementation of the Handle System).
1000 identifies the publisher
abc is the suffix (identifying the digital object)
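The breakdown above can be sketched in Python as follows. The names DoiParts and split_doi are hypothetical, and the validation is deliberately minimal: it only checks the pieces described in this section (the "10" directory indicator, a publisher code after the dot, and a non-empty suffix after the first slash).

```python
from typing import NamedTuple


class DoiParts(NamedTuple):
    directory: str   # "10", marks the string as a DOI
    registrant: str  # identifies the publisher, e.g. "1000"
    suffix: str      # identifies the digital object, e.g. "abc"


def split_doi(doi: str) -> DoiParts:
    """Split a DOI into the parts described above (illustrative only)."""
    # The slash is reserved in the prefix, so the first slash ends the prefix.
    prefix, slash, suffix = doi.partition("/")
    if not slash or not suffix:
        raise ValueError(f"no suffix found in {doi!r}")
    # The dot is reserved too: it separates "10" from the publisher code.
    directory, dot, registrant = prefix.partition(".")
    if directory != "10" or not dot or not registrant:
        raise ValueError(f"prefix {prefix!r} does not look like a DOI prefix")
    return DoiParts(directory, registrant, suffix)


print(split_doi("10.1000/abc"))
```

Splitting at the first slash only means a suffix that itself contains slashes stays intact.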

The suffix can integrate other standard identifiers such as ISBN or ISSN, so the DOI allows publishers to maintain the standard identifiers already in use. The suffix is assigned by the publisher (registrant). It can be any alphanumeric string (any printable characters from the Universal Character Set (UCS-2) of ISO/IEC 10646, which is the character set defined by Unicode v2.0). The DOI is an "opaque string" or "dumb number": nothing at all can or should be inferred from the number in respect of its use in the DOI System.

Handle syntax imposes two constraints on the prefix -- both slash and dot are "reserved characters".

Publishers use many different schemes, all of which can form DOIs that can then be used together. For example:

Publisher A uses PII: S1384107697000225
Publisher B uses SICI: 0361-9230(1997)42:<OaEoSR>2.0.TX;2-B
Publisher C uses "C-numbers": JoesPaper56

These three schemes are not at all interoperable, but become so in the DOI system as:

DOI:10.2345/S1384107697000225
DOI:10.4567/0361-9230(1997)42:<OaEoSR>2.0.TX;2-B
DOI:10.6789/JoesPaper56

Each publisher can retain its own scheme and does not need to switch to a new one, though all publishers need to agree on a common metadata set for their DOIs.

Each DOI has a minimum set of metadata associated with it (the Kernel), and may also have additional metadata associated with it.

DOIs are case insensitive. All DOIs are converted to upper case upon registration.
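Because of this rule, two DOI strings should be compared only after case folding. A minimal sketch, assuming upper-casing as the canonical form (as the registration rule above suggests) and tolerating an optional "doi:" label; normalize_doi and same_doi are hypothetical helper names:

```python
def normalize_doi(doi: str) -> str:
    """Return an upper-case form of the DOI for comparison (illustrative)."""
    doi = doi.strip()
    # Assumption: an optional "doi:" label is not part of the identifier.
    if doi[:4].lower() == "doi:":
        doi = doi[4:]
    return doi.upper()


def same_doi(a: str, b: str) -> bool:
    """True if both strings name the same DOI, ignoring case."""
    return normalize_doi(a) == normalize_doi(b)


print(same_doi("doi:10.1006/jmbi.1998.2354", "10.1006/JMBI.1998.2354"))
```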

DOI Guidelines - sample DOIs, etc

Suffix nodes may be used to reflect hierarchical information or levels of granularity. For instance, the first node might be a multiple-letter code for the journal title, while successive nodes encode year of article acceptance and order of article acceptance. This is the scheme used by Academic Press, with resulting DOIs like doi:10.1006/jmbi.1998.2354.
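As a rough illustration, the nodes of such a suffix can be separated like this. The function name suffix_nodes is hypothetical, and the dot as node separator is inferred from the Academic Press example only. The DOI System itself treats the string as opaque, so a split like this is meaningful only to the registrant who defined the scheme.

```python
def suffix_nodes(doi: str) -> list[str]:
    """Split the suffix of a DOI into dot-separated nodes (illustrative)."""
    # Everything after the first slash is the suffix.
    _, _, suffix = doi.partition("/")
    # Assumption: the publisher separates its nodes with dots.
    return suffix.split(".")


journal, year, order = suffix_nodes("doi:10.1006/jmbi.1998.2354")
print(journal, year, order)
```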

Digital Object Identifiers (DOI) - An Embarrassment of Riches Part I and Part II

Multiple Resolution

operates on the premise that content, not its location, is identified.
enables content owners and distributors to identify their intellectual property with bound collections of related resources at a hyperlink's point of departure, instead of requiring a user to leave the page to go to a new location for further information.

PURL

PURL is not very useful because it is inherently dependent on DNS (from PURL evaluation)

LSID

Life Sciences Identifiers Specification
LSID project
LSID authority and web resolver
LSID best practices - A guide to deploying Life Science Identifiers - IBM
Build an LSID Resolution Service using the Java language - IBM tutorial
Build a life sciences collaboration network with LSID - IBM

Specification

"a standardized naming schema for biological entities in the Life Sciences domains"
An LSID consists of three scoping mechanisms: an authority, a namespace, and an identifier. It can also optionally contain a version, specified by a revision identifier.

urn:lsid:authority:namespace:identifier:revision

"URN"
"LSID"
authority identification (usually an Internet domain name)
namespace identification
object identification
optionally: revision identification. If the revision field is omitted, the trailing colon is also omitted.
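The layout described above can be sketched as a small parser. The names Lsid and parse_lsid are hypothetical, and the checks cover only what this section states (the "urn" and "lsid" labels, three mandatory fields, and an optional revision with no trailing colon); a real resolver would likely need stricter validation.

```python
from typing import NamedTuple, Optional


class Lsid(NamedTuple):
    authority: str           # usually an Internet domain name
    namespace: str
    identifier: str
    revision: Optional[str]  # None when the revision field is omitted


def parse_lsid(text: str) -> Lsid:
    """Parse urn:lsid:authority:namespace:identifier[:revision] (illustrative)."""
    parts = text.split(":")
    if len(parts) not in (5, 6):
        raise ValueError(f"expected 5 or 6 colon-separated fields in {text!r}")
    # Assumption: the "urn" and "lsid" labels are matched case-insensitively.
    if parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError(f"{text!r} does not start with urn:lsid:")
    authority, namespace, identifier = parts[2:5]
    revision = parts[5] if len(parts) == 6 else None
    # An empty field (including a trailing colon with no revision) is rejected.
    if not (authority and namespace and identifier) or revision == "":
        raise ValueError(f"empty field in {text!r}")
    return Lsid(authority, namespace, identifier, revision)


print(parse_lsid("URN:LSID:rcsb.org:PDB:1D4X:22"))
```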

Examples:

URN:LSID:rcsb.org:PDB:1D4X:22
URN:LSID:ncbi.nlm.nih.gov:GenBank.accession:NT_001063:2

Notes:

While an LSID is defined to be semantically opaque, the author of an LSID resolution service must interpret the encoding to resolve and return the correct data.
Since LSID resolution uses SRV records, your TLD does not have to point to the IP of your LSID server.

@@ Line 80: / Line 80: @@
 *[http://www.ibm.com/developerworks/opensource/library/os-lsid2/ Build a life sciences collaboration network with LSID] - IBM
 ===Specification===
-The LSID declaration consists of the following parts, separated by double colons:
+"a standardized naming schema for biological entities in the Life Sciences
+domains"<br/>
+An LSID consists of three scoping mechanisms: an authority, a namespace, and an identifier. It can also optionally contain a version, specified by a revision identifier.
+ urn:lsid:authority:namespace:identifier:revision
 *"URN"
 *"LSID"
@@ Line 91: / Line 94: @@
 *URN:LSID:rcsb.org:PDB:1D4X:22
 *URN:LSID:ncbi.nlm.nih.gov:GenBank.accession:NT_001063:2
+Notes:
+*While an LSID is defined to be semantically opaque, the author of an LSID resolution service must interpret the encoding to resolve and return the correct data.
+*Since LSID resolution uses [[Wikipedia:SRV_record|SRV records]], your TLD does not have to point to the IP of your LSID server.
