Linked data
|
|
This article's use of external links may not follow Wikipedia's policies or guidelines. Please improve this article by removing excessive or inappropriate external links, and converting useful links where appropriate into footnote references. (March 2012) |
In computing, linked data describes a method of publishing structured data so that it can be interlinked and become more useful. It builds upon standard Web technologies such as HTTP and URIs, but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers. This enables data from different sources to be connected and queried.[1]
Tim Berners-Lee, director of the World Wide Web Consortium, coined the term in a design note discussing issues around the Semantic Web project.[2] However, the idea is very old and is closely related to concepts including database network models, citations between scholarly articles, and controlled headings in library catalogs.[citation needed]
Principles
Tim Berners-Lee outlined four principles of linked data in his Design Issues: Linked Data note, paraphrased along the following lines:
- Use URIs to identify things.
- Use HTTP URIs so that these things can be referred to and looked up ("dereferenced") by people and user agents.
- Provide useful information about the thing when its URI is dereferenced, using standard formats such as RDF/XML.
- Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web.
Tim Berners-Lee gave a presentation on linked data[3] at the TED 2009 conference. In it, he restated the linked data principles as three "extremely simple" rules:
- All kinds of conceptual things, they have names now that start with HTTP.
- I get important information back. I will get back some data in a standard format which is kind of useful data that somebody might like to know about that thing, about that event.
- I get back that information it's not just got somebody's height and weight and when they were born, it's got relationships. And when it has relationships, whenever it expresses a relationship then the other thing that it's related to is given one of those names that starts with HTTP.
Note that although the second rule mentions "standard formats", it does not require any specific standard, such as RDF/XML.
Components
- URIs (specifically, of the dereferenceable variety)
- HTTP
- Resource Description Framework (RDF)
- Serialization formats (RDFa, RDF/XML, N3, Turtle, and others)
Linking open-data community project
The goal of the W3C Semantic Web Education and Outreach group's Linking Open Data community project is to extend the Web with a data commons by publishing various open datasets as RDF on the Web and by setting RDF links between data items from different data sources. In October 2007, datasets consisted of over two billion RDF triples, which were interlinked by over two million RDF links. By September 2011 this had grown to 31 billion RDF triples, interlinked by around 504 million RDF links. There is also an interactive visualization of the linked data sets to browse through the cloud.
Dataset instance and class relationships
Clickable diagrams that show the individual datasets and their relationships within the DBpedia-spawned LOD cloud, as shown by the figures to the right, are:
See also
- Entity-attribute-value model
- Open data
- Record linkage
- Identity resolution
- Data deduplication
- Linked data page
External links
|
|
This article's use of external links may not follow Wikipedia's policies or guidelines. Please improve this article by removing excessive or inappropriate external links, and converting useful links where appropriate into footnote references. (April 2012) |
Datasets
- CKAN – registry of open data and content packages provided by the Open Knowledge Foundation
- DBpedia – a dataset containing extracted data from Wikipedia; it contains about 3.4 million concepts described by 1 billion triples, including abstracts in 11 different languages
- DBLP Bibliography – provides bibliographic information about scientific papers; it contains about 800,000 articles, 400,000 authors, and approx. 15 million triples
- GeoNames provides RDF descriptions of more than 7,500,000 geographical features worldwide.
- Revyu – a Review service consumes and publishes linked data, primarily from DBpedia.
- riese[dead link] – serving statistical data about 500 million Europeans (the first linked dataset deployed with XHTML+RDFa)
- UMBEL – a lightweight reference structure of 20,000 subject concept classes and their relationships derived from OpenCyc, which can act as binding classes to external data; also has links to 1.5 million named entities from DBpedia and YAGO
- Sensorpedia – A scientific initiative at Oak Ridge National Laboratory using a RESTful web architecture to link to sensor data and related sensing systems.
- FOAF – a dataset describing persons, their properties and relationships
- OpenPSI for the OpenPSI project a community effort to create UK government linked data service that supports research
- VIAF (Virtual International Authority File) – an aggregation of authority files (author names) from national libraries from around the world.
Projects
- Linked open data around the clock (LATC) – EU project
- PlanetData – EU project
- Linking Open Data 2 (LOD2) – EU project[4][5]
- TaxonConcept - Semantic Species Information (Biodiversity Informatics)
Use case demos
|
|
This article uses bare URLs for citations. Please consider adding full citations so that the article remains verifiable. Several templates and the Reflinks tool are available to assist in formatting. (Reflinks documentation) (April 2012) |
Further reading
- Linked data web architecture note by Tim Berners-Lee
- Linked Data: Evolving the Web into a Global Data Space (2011) by Tom Heath and Christian Bizer, Synthesis Lectures on the Semantic Web: Theory and Technology, Morgan & Claypool
- The Web Turns 20: Linked Data Gives People Power, part 1 of 4, by Mark Fischetti, Scientific American 2010 October 23
- Linked Data Is Merely More Data – Prateek Jain, Pascal Hitzler, Peter Z. Yeh, Kunal Verma, and Amit P. Sheth. In: Dan Brickley, Vinay K. Chaudhri, Harry Halpin, and Deborah McGuinness: Linked Data Meets Artificial Intelligence. Technical Report SS-10-07, AAAI Press, Menlo Park, California, 2010, pp. 82–86.
- LinkedData at the W3C Wiki
- LinkedData.org
- OpenLink Software white papers
- Linked Data on the Web – Chris Bizer, Tom Heath, Kingsley Uyi Idehen, Tim Berners-Lee. In Proceedings WWW2008, Beijing, China
- Interlinking Open Data on the Web – Chris Bizer, Tom Heath, Danny Ayers, Yves Raimond. In Proceedings Poster Track, ESWC2007, Innsbruck, Austria
- Ontology Alignment for Linked Open Data – Prateek Jain, Pascal Hitzler, Amit Sheth, Kunal Verma, Peter Z. Yeh. In proceedings of the 9th International Semantic Web Conference, ISWC 2010, Shanghai, China
- Interview with Sören Auer, head of the LOD2 project about the continuation of LOD2 in 2011, June 2011
- Linked Open Data: The Essentials - Florian Bauer and Martin Kaltenböck (January 2012)
- The Flap of a Butterfly Wing - semanticweb.com Richard Wallis (February 2012)
Browsers
- Explorator – a browser for exploring Sparql endpoints.
- Sig.ma – Browser for Linked data and embeddable mashup generator.
- The Tabulator – Generic data browser and editor.
- OpenLink Data Explorer (ODE)
- Zitgist DataViewer – linked data viewer
- Disco – Hyperdata Browser – A simple browser for navigating the semantic web.
- LENA – a Fresnel LEns based RDF/Linked data navigator with SPARQL selector support.
- RelFinder – Visual relationship discovery and exploration.
- VisiNav – Visual Data Navigation.
- Wandora – information mashup creator with numerous information extractors.
- Ontology Browser – An online OWL ontology and LOD browser.
- Falcons Explorer – Tabular and relational end-user programming for the web of data.
- MyView – Querying linked data by navigation.
Presentations
- Tim Berners Lee : The next Web of open, linked data at TED
- Linked Data Tutorial – Michael Hausenblas
- Sir Tim Berners Lee: Linked Open Data at LinkedData Planet
- Linked Data: Principles and State of the Art – Chris Bizer, Tom Heath, Tim Berners-Lee at WWW2008
- The Linking Open Data Project – Bootstrapping the Web of Data – Tom Heath
- Creating, Deploying and Exploiting Linked Data – Keynote by Kingsley Uyi Idehen at Linked Data Planet, 2008
- Deploying Linked Data using OpenLink Virtuoso
- Native to a Web of Data – Tom Coates
- How To Make Linked Data More than Data – Prateek Jain, Pascal Hitzler, Amit Sheth, Peter Yeh, Kunal Verma – Presented by Amit Sheth at Semantic Technology Conference 2010
- Biodiversity Informatics on the Semantic Web - Peter DeVries (December 2010)
Events
- Workshop on Consuming Linked Data (COLD2011) at ISWC 2011
- Workshop on Linked Data on the Web (LDOW2011) at WWW2011
- Workshop on Consuming Linked Data (COLD2010) at ISWC 2010
- Workshop on Linked Data on the Web (LDOW2010) at WWW2010
- Workshop on Linked Data on the Web (LDOW2009) at WWW2009
- Workshop on Linked Data on the Web (LDOW2008) at WWW2008
- LinkedData Planet 2008 conference in New York City
References
- ^ Bizer, Christian; Heath, Tom; Berners-Lee, Tim (2009). "Linked Data—The Story So Far". International Journal on Semantic Web and Information Systems 5 (3): 1-22. doi:10.4018/jswis.2009081901. ISSN 15526283. http://tomheath.com/papers/bizer-heath-berners-lee-ijswis-linked-data.pdf. Retrieved 2010-12-18.
- ^ Tim Berners-Lee (2006-07-27). "Linked Data—Design Issues". W3C. http://www.w3.org/DesignIssues/LinkedData.html. Retrieved 2010-12-18.
- ^ "Tim Berners-Lee on the next Web". http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html.
- ^ "CORDIS FP7 ICT Projects – LOD2". European Commission. 2010-04-20. http://cordis.europa.eu/fetch?CALLER=PROJ_ICT&ACTION=D&CAT=PROJ&RCN=95562.
- ^ "LOD2 Project Fact Sheet – Project Summary". 2010-09-01. http://static.lod2.eu/Deliverables/LOD2_D12.5.1_Project_Fact_Sheet_Version.pdf. Retrieved 2010-12-18.
|
||||||||||||||||||||||||||||