Identifiers

Miscellaneous

 * Uniform Resource Identifiers (URI): Generic Syntax (RFC 2396; since obsoleted by RFC 3986)
 * URI vs. URL
 * InChI: the IUPAC International Chemical Identifier

DOI
Digital Object Identifier - a digital identifier for any object of intellectual property (from DOI FAQ, mEDRA and The Biology Wiki).

The DOI is a Handle System implementation.

The Handle System is a comprehensive system for assigning, managing, and resolving persistent identifiers, known as "handles," for digital objects and other resources on the Internet.

If you give each object a name (a handle) and associate that name with the object's location using the Handle System, then when the object moves you only have to update the handle record with the new location, rather than notify everyone who might want to find the object.
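
The indirection described above can be illustrated with a toy in-memory registry. This is only a sketch: the `HandleRegistry` class, its method names, and the example.org locations are hypothetical and are not part of the real Handle System software or protocol.

```python
class HandleRegistry:
    """Toy stand-in for a handle service (hypothetical, in-memory only)."""

    def __init__(self):
        self._records = {}  # maps handle -> current location

    def register(self, handle, location):
        self._records[handle] = location

    def update(self, handle, new_location):
        # Only the handle record changes; the handle itself stays the same.
        if handle not in self._records:
            raise KeyError(f"unknown handle: {handle}")
        self._records[handle] = new_location

    def resolve(self, handle):
        return self._records[handle]


registry = HandleRegistry()
registry.register("10.1000/abc", "http://old.example.org/abc.pdf")

# A citation stores only the handle, never the location.
citation = "10.1000/abc"

# The object moves: one record is updated, and no citation needs to change.
registry.update("10.1000/abc", "http://new.example.org/papers/abc.pdf")
print(registry.resolve(citation))
```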

Description of OpenURL standard: OpenURL is a syntax for embedding parameters such as identifiers and metadata in links.

On Making and Identifying a Copy

To obtain a DOI prefix, you need to work either with a DOI Registration Agency or, for experimental or prototype purposes, with the International DOI Foundation (IDF). To obtain a DOI prefix for experimental use, write to the IDF at contact@doi.org, clearly indicating why it is required. Prefixes issued directly by the IDF cost US$1,000 per prefix and are issued purely at the discretion of the IDF. List of agencies.

DOI Numbering

The DOI consists of a unique alphanumeric character string divided into two parts, a prefix and a suffix, separated by a slash. For example, in the DOI 10.1000/abc:
 * 10.1000 is the prefix
 * 10 identifies the string as a DOI (distinguishes a DOI from any other implementation of the Handle System).
 * 1000 identifies the publisher
 * abc is the suffix (identifying the digital object)
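
As a rough illustration of this structure, the sketch below splits a DOI string into the parts listed above. The names `DoiParts` and `split_doi` are made up for this example. Splitting on the first slash only is a deliberate choice, since a suffix may itself contain slashes.

```python
from typing import NamedTuple


class DoiParts(NamedTuple):
    prefix: str      # e.g. "10.1000"
    directory: str   # "10", which marks the string as a DOI
    registrant: str  # e.g. "1000", which identifies the publisher
    suffix: str      # e.g. "abc", which identifies the digital object


def split_doi(doi: str) -> DoiParts:
    # Split on the first slash only; everything after it is the suffix.
    prefix, slash, suffix = doi.partition("/")
    if not slash or not suffix:
        raise ValueError(f"no prefix/suffix separator in {doi!r}")
    # Inside the prefix, the first dot separates "10" from the registrant code.
    directory, dot, registrant = prefix.partition(".")
    if directory != "10" or not dot or not registrant:
        raise ValueError(f"prefix {prefix!r} does not look like a DOI prefix")
    return DoiParts(prefix, directory, registrant, suffix)


print(split_doi("10.1000/abc"))
```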

The suffix can integrate other standard identifiers such as ISBN or ISSN. As a consequence, the DOI makes it possible to keep the standard identifiers already in use. The suffix is assigned by the publisher (registrant). The DOI suffix can be any alphanumeric string (any printable characters from the Universal Character Set (UCS-2) of ISO/IEC 10646, which is the character set defined by Unicode v2.0). The DOI is an "opaque string" or "dumb number": nothing at all can or should be inferred from the number in respect of its use in the DOI System.

Handle syntax imposes two constraints on the prefix: both slash and dot are "reserved characters" (the slash separates the prefix from the suffix, and the dot separates the parts of the prefix).

Publishers use many different identifier schemes, all of which can form DOIs that can then be used together. For example:

 * Publisher A uses PII: S1384107697000225
 * Publisher B uses SICI: 0361-9230(1997)42:2.0.TX;2-B
 * Publisher C uses "C-numbers": JoesPaper56

These three schemes are not at all interoperable, but become so in the DOI system as:

 * DOI:10.2345/S1384107697000225
 * DOI:10.4567/0361-9230(1997)42:2.0.TX;2-B
 * DOI:10.6789/JoesPaper56
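
The mapping from each publisher's own identifier to a DOI is plain concatenation, which the following sketch makes explicit. The `make_doi` helper is hypothetical; the prefixes and identifiers are the ones from the example above.

```python
def make_doi(prefix: str, local_id: str) -> str:
    """Join a registrant prefix and an existing identifier into a DOI."""
    return f"{prefix}/{local_id}"


# (prefix, scheme name, identifier in the publisher's own scheme)
publishers = [
    ("10.2345", "PII", "S1384107697000225"),
    ("10.4567", "SICI", "0361-9230(1997)42:2.0.TX;2-B"),
    ("10.6789", "C-number", "JoesPaper56"),
]

# The scheme name is ignored: the suffix is carried over unchanged.
dois = [make_doi(prefix, local_id) for prefix, _scheme, local_id in publishers]
for doi in dois:
    print("DOI:" + doi)
```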

Each publisher can retain its own scheme and does not need to switch to a new one, though all publishers need to agree on a common metadata set for their DOIs.

Each DOI has a minimum set of metadata associated with it (the Kernel), and may have additional metadata associated with it as well.

DOIs are case insensitive. All DOIs are converted to upper case upon registration.
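
In practice this means two DOI strings should be compared only after case folding. The sketch below is one possible way to do that. The helper names are made up, stripping a leading "doi:" label is an extra convenience assumed here, and only the ASCII letters a-z are folded, which is a cautious assumption for non-ASCII suffixes.

```python
def normalize_doi(doi: str) -> str:
    """Return an upper-cased form of a DOI suitable for comparison."""
    doi = doi.strip()
    # Assumption: an optional "doi:" label may precede the identifier.
    if doi.lower().startswith("doi:"):
        doi = doi[4:]
    # Fold only ASCII letters; leave every other character untouched.
    return "".join(ch.upper() if "a" <= ch <= "z" else ch for ch in doi)


def same_doi(a: str, b: str) -> bool:
    return normalize_doi(a) == normalize_doi(b)


print(same_doi("doi:10.1006/jmbi.1998.2354", "10.1006/JMBI.1998.2354"))
```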

DOI Guidelines - sample DOIs, etc

Suffix nodes may be used to reflect hierarchical information or levels of granularity. For instance, the first node might be a multiple-letter code for the journal title, while successive nodes encode year of article acceptance and order of article acceptance. This is the scheme used by Academic Press, with resulting DOIs like doi:10.1006/jmbi.1998.2354.

Digital Object Identifiers (DOI) - An Embarrassment of Riches Part I and Part II

Multiple Resolution
 * operates on the premise that content, not its location, is identified.
 * enables content owners and distributors to identify their intellectual property with bound collections of related resources at a hyperlink's point of departure, instead of requiring a user to leave the page to go to a new location for further information.

Fees

 * mEDRA: annual fee of $400 per 100 DOIs, $600 per 200 DOIs, etc.
 * crossref.org: annual fees of $250 and up; deposit fees of about $1 per item

Software
Proxy servers (DOI resolvers): You can resolve a DOI by typing the proxy server name followed by the DOI into your browser's address bar, for example http://dx.medra.org/10.1000/182. To speed up resolution, the proxy servers cache handle values, with the TTL set to 24 hours.
 * http://hdl.handle.net/
 * http://dx.doi.org/
 * http://dx.medra.org/
 * DOI lookup
 * OpenURL resolver can be used to retrieve DOI metadata in XML
 * HDL/DOI Protocol Handler for Mozilla
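
Building a resolver link from a proxy and a DOI can be sketched as follows. The `resolver_url` helper is hypothetical, and percent-encoding everything except the slash with `urllib.parse.quote` is a conservative assumption made here rather than a rule taken from the DOI documentation. The code only builds the URL; it performs no network request.

```python
from urllib.parse import quote

# Proxy servers from the list above.
PROXIES = [
    "http://hdl.handle.net/",
    "http://dx.doi.org/",
    "http://dx.medra.org/",
]


def resolver_url(doi: str, proxy: str = "http://dx.doi.org/") -> str:
    """Append a DOI to a proxy server address, escaping unsafe characters."""
    # Keep "/" literal so the prefix/suffix separator survives.
    return proxy.rstrip("/") + "/" + quote(doi, safe="/")


for proxy in PROXIES:
    print(resolver_url("10.1000/182", proxy))
```

Characters such as parentheses, colons and semicolons in a suffix come out percent-encoded, so the resulting string can be pasted into an address bar or embedded in a link.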

LSID

 * Life Sciences Identifiers Specification
 * LSID resolution project (old site?)
 * LSID Browser for Firefox
 * LSID Perl Toolkit
 * LSID authorities
 * BioPathways and their web resolver
 * University of Wisconsin CFL
 * LSID best practices - A guide to deploying Life Science Identifiers - IBM
 * Build an LSID Resolution Service using the Java language - IBM tutorial
 * Build a life sciences collaboration network with LSID - IBM
 * Firefox extensions
 * LSID: An Informatics Lifesaver - BioITWorld article
 * Metacat LSID support - implementation example
 * LSID Pros & Cons

Specification
"a standardized naming schema for biological entities in the Life Sciences domains"

An LSID consists of three scoping mechanisms: an authority, a namespace, and an identifier. It can also optionally contain a version, specified by a revision identifier. The general form is:

urn:lsid:authority:namespace:object:revision

The colon-separated parts are:
 * "URN"
 * "LSID"
 * authority identification (usually an Internet domain name)
 * namespace identification
 * object identification
 * optionally: revision identification. If the revision field is omitted, the trailing colon is also omitted.

Examples

 * URN:LSID:ebi.ac.uk:SWISS-PROT.accession:P34355:3
 * URN:LSID:rcsb.org:PDB:1D4X:22
 * URN:LSID:ncbi.nlm.nih.gov:GenBank.accession:NT_001063:2
 * URN:LSID:parts.mit.edu:BBa:B0030
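
The examples above can be taken apart with a simple colon split. The sketch below is illustrative only: `Lsid` and `parse_lsid` are made-up names, and the parser assumes that none of the fields contains a colon.

```python
from typing import NamedTuple, Optional


class Lsid(NamedTuple):
    authority: str
    namespace: str
    object_id: str
    revision: Optional[str]  # None when the LSID carries no revision


def parse_lsid(text: str) -> Lsid:
    parts = text.split(":")
    # "urn", "lsid", authority, namespace, object, plus an optional revision
    if len(parts) not in (5, 6):
        raise ValueError(f"expected 5 or 6 colon-separated parts in {text!r}")
    if parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError(f"{text!r} does not start with urn:lsid:")
    if any(part == "" for part in parts[2:]):
        raise ValueError(f"empty field in {text!r}")
    revision = parts[5] if len(parts) == 6 else None
    return Lsid(parts[2], parts[3], parts[4], revision)


print(parse_lsid("URN:LSID:ebi.ac.uk:SWISS-PROT.accession:P34355:3"))
print(parse_lsid("URN:LSID:parts.mit.edu:BBa:B0030"))
```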

PURL
PURL is not very useful because it's inherently dependent on DNS (from PURL evaluation).