LSID

Life Science Identifiers^[1]^[2] are a way to name and locate pieces of information on the web. Essentially, an LSID is a unique identifier for some data, and the LSID protocol specifies a standard way to locate the data (as well as a standard way of describing that data). They are a little like DOIs used by many publishers.

An LSID is represented as a uniform resource name (URN) with the following format:

urn:lsid:⟨Authority⟩:⟨Namespace⟩:⟨ObjectID⟩[:⟨Version⟩]

The lsid: namespace, however, is not registered with the Internet Assigned Numbers Authority (IANA), and so these are not strictly URNs or URIs.^[3]

LSIDs may be resolved in URLs, e.g. http://zoobank.org/urn:lsid:zoobank.org:pub:CDC8D258-8F57-41DC-B560-247E17D3DC8C

Controversy over the use of LSIDs edit

There has been a lot of interest in LSIDs in both the bioinformatics and the biodiversity communities, with the latter continuing to use them as a way of identifying species in global catalogues.^[4] However, more recently, as understanding has increased of how HTTP URIs can perform a similar naming task,^[5]^[6] the use of LSIDs as identifiers has been criticized^[7] as violating the Web Architecture good practice of reusing existing URI schemes.^[8] Nevertheless, the explicit separation of data from metadata; specification of a method for discovering multiple locations for data-retrieval; and the ability to discover multiple independent sources of metadata for any identified thing were crucial parts of the LSID and its resolution specification that have not successfully been mimicked by an HTTP-only approach.

The World Wide Web provides a globally distributed communication framework that is essential for almost all scientific collaboration, including bioinformatics. However, several limits and inadequacies were thought to exist, one of which was the inability to programmatically identify locally named objects that may be widely distributed over the network. This perceived shortcoming would have limited our ability to integrate multiple knowledgebases, each of which gives partial information of a shared domain, as is commonly seen in bioinformatics. The Life Science Identifier (LSID) and LSID Resolution System (LSRS) were designed to provide simple and elegant solutions to this problem, consistent with next-generation Semantic Web and semantic grid, based on the extension of existing internet technologies. However, it has more recently been pointed out that some of these perceived shortcomings are not intrinsic to HTTP URIs, and much (though not all) of the functionality that LSIDs provide can be obtained using properly crafted HTTP URIs.^[5]

Alternative identifiers for organisms edit

Alternative identifiers have been proposed for organisms, e.g. the DOI system. NamesforLife (N4L), a private company, set up a system to apply DOIs to organisms. For example, doi:10.1601/nm.3093 is the DOI for Escherichia coli, and doi:10.1601/tx.3093 is the corresponding taxon.^[9]

Notes edit

^ Clark T., Martin S., Liefeld T. Briefings in Bioinformatics 5.1:59-70, March 1, 2004.
^ Cover, Robin. "OMG Life Sciences Identifiers Specification (LSID)". xml.coverpages.org. Retrieved 2022-12-10.
^ "GitHub - tdwg/wiki-archive: Archive of the TDWG wiki". 2018-11-15.
^ Jones, Andrew C.; White, Richard J.; Orme, Ewen R. (2011-10-17). "Identifying and relating biological concepts in the Catalogue of Life". Journal of Biomedical Semantics. 2 (1): 7. doi:10.1186/2041-1480-2-7. ISSN 2041-1480. PMC 3245425. PMID 22004596.
^ ^a ^b "Converting New URI Schemes or URN Sub-Schemes to HTTP URIs". dbooth.org. Retrieved 2022-12-10.
^ "A precedent suggesting a compromise for the SWHCLS IG Best Practices from Henry S. Thompson on 2006-07-26 (public-semweb-lifesci@w3.org from July 2006)". lists.w3.org. Retrieved 2022-12-10.
^ "My conversation with Sean Martin about LSIDs from noah_mendelsohn@us.ibm.com on 2006-07-25 (www-tag@w3.org from July 2006)". lists.w3.org. Retrieved 2022-12-10.
^ "Architecture of the World Wide Web, Volume One". www.w3.org. Retrieved 2022-12-10.
^ Garrity, George M.; Lyons, Catherine (2003-01-01). "Future-Proofing Biological Nomenclature". OMICS: A Journal of Integrative Biology. 7 (1): 31–33. doi:10.1089/153623103322006562. PMID 12831553.

External links edit

LSID Resolution Project
LSID Assigning and Resolution Authority from The University of Texas at Austin
A Position on LSIDs - Reflections from someone involved in implementation and roll out of LSIDs

[1] Clark T., Martin S., Liefeld T. Briefings in Bioinformatics 5.1:59-70, March 1, 2004.

[2] Cover, Robin. "OMG Life Sciences Identifiers Specification (LSID)". xml.coverpages.org. Retrieved 2022-12-10.

[3] "GitHub - tdwg/wiki-archive: Archive of the TDWG wiki". 2018-11-15.

[4] Jones, Andrew C.; White, Richard J.; Orme, Ewen R. (2011-10-17). "Identifying and relating biological concepts in the Catalogue of Life". Journal of Biomedical Semantics. 2 (1): 7. doi:10.1186/2041-1480-2-7. ISSN 2041-1480. PMC 3245425. PMID 22004596.

[converting-5] "Converting New URI Schemes or URN Sub-Schemes to HTTP URIs". dbooth.org. Retrieved 2022-12-10.

[6] "A precedent suggesting a compromise for the SWHCLS IG Best Practices from Henry S. Thompson on 2006-07-26 (public-semweb-lifesci@w3.org from July 2006)". lists.w3.org. Retrieved 2022-12-10.

[7] "My conversation with Sean Martin about LSIDs from noah_mendelsohn@us.ibm.com on 2006-07-25 (www-tag@w3.org from July 2006)". lists.w3.org. Retrieved 2022-12-10.

[8] "Architecture of the World Wide Web, Volume One". www.w3.org. Retrieved 2022-12-10.

[9] Garrity, George M.; Lyons, Catherine (2003-01-01). "Future-Proofing Biological Nomenclature". OMICS: A Journal of Integrative Biology. 7 (1): 31–33. doi:10.1089/153623103322006562. PMID 12831553.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

LSID

Contents

Controversy over the use of LSIDs edit

Alternative identifiers for organisms edit

See also edit

Notes edit

External links edit