Linked Data: Present & Future
In this text, I give a short overview of Linked Data technologies, describing their main characteristics as well as their adoption. I also risk making a few predictions on the future of Linked Data.
What is Linked Data?
Linked Data can be seen as a simplified (and pragmatic) implementation of the Semantic Web vision. Sir Tim Berners-Lee, the inventor of the Web, coined the term Linked Data in 2006 to prescribe a simple method of publishing data using web standards. The method can be summarized in three points:
- All data items should have names that start with http.
- When looked up online, the http names should return some data in a standard format to describe the items.
- The description of the items should also contain relationships to other pieces of data.
In technical terms, this means that data items are identified by URIs, so that they can be dereferenced through HTTP, and can refer to other items using their HTTP URI-based identifiers. The data model generally used to express such data is the Resource Description Framework (RDF).
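The three points above can be illustrated with a small sketch. It is only a toy under stated assumptions: the "web" is an in-memory dictionary rather than real HTTP servers, and every example.org URI and property name is hypothetical. It shows items named by HTTP URIs, a lookup that returns a description of each item, and descriptions that point to other items by URI.

```python
# A toy, in-memory "web of data". Every item is named by an HTTP URI and
# each description links to other items through their URIs.
# All example.org URIs and property names are hypothetical.
WEB = {
    "http://example.org/people/alice": {
        "http://example.org/vocab/name": "Alice",
        "http://example.org/vocab/worksAt": "http://example.org/orgs/acme",
    },
    "http://example.org/orgs/acme": {
        "http://example.org/vocab/name": "ACME",
        "http://example.org/vocab/locatedIn": "http://example.org/places/fribourg",
    },
    "http://example.org/places/fribourg": {
        "http://example.org/vocab/name": "Fribourg",
    },
}


def dereference(uri):
    """Stand-in for an HTTP GET on the URI: return the item's description."""
    return WEB.get(uri, {})


def follow_links(start_uri):
    """Visit every item reachable from start_uri through URI-valued properties."""
    seen, todo = [], [start_uri]
    while todo:
        uri = todo.pop(0)
        if uri in seen:
            continue
        seen.append(uri)
        for value in dereference(uri).values():
            # A value that is itself an http name is a link to another item.
            if isinstance(value, str) and value.startswith("http"):
                todo.append(value)
    return seen


reachable = follow_links("http://example.org/people/alice")
```

Starting from the URI for Alice, the client discovers the organization and then the place simply by following links, which is the behaviour the third point is meant to enable.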
Linked Data & Me
I’ve been a close observer of the emergence of Linked Data. Publicly, I was involved in a number of forums and meetings dealing with Linked Data. I have co-organized the ISWC, the main research venue for Linked Data, a number of times since 2007 (I was, for instance, PC Chair of ISWC 2012 in Boston and will be In-Use Chair this year in Vienna). Privately, I regularly leverage Linked Data in my own research, either to better grasp content (e.g. to understand text better) or to serialize output data (e.g. to publish datasets).
Linked Data Today
The adoption of Linked Data has been phenomenal. Linked Data is used in two main ways today: i) to create webs of data that can be accessed and queried by anyone, and ii) to add metadata to Web pages.
The most prominent web of data created through Linked Data is called the Linked Open Data (LOD) cloud (see Figure 1). It is conceptually similar to the World Wide Web, but contains interlinked data instead of interlinked documents. The LOD cloud includes thousands of different datasets from a wide range of domains: from governmental data to geographic, life-science or bibliographic data. Each of these datasets contains a myriad of data items and links, is fully open, and can be queried using a standard query language (SPARQL). Other important webs of data exist besides the LOD Cloud, such as Wikidata or Google’s Knowledge Graph.
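To give a feel for what querying such a web of data involves, here is a minimal sketch of triple pattern matching, the basic building block of a SPARQL query. It is not a SPARQL engine: the `match` helper, the `ex:` names and the data are all hypothetical, and a real query would be sent to a dataset's SPARQL endpoint instead.

```python
# Toy data: (subject, predicate, object) triples. All names are hypothetical.
TRIPLES = [
    ("ex:alice", "ex:knows", "ex:bob"),
    ("ex:alice", "ex:knows", "ex:carol"),
    ("ex:bob", "ex:knows", "ex:carol"),
    ("ex:carol", "ex:name", "Carol"),
]


def match(pattern, triples):
    """Return one binding dict per triple that matches the pattern.

    Pattern terms starting with '?' are variables; other terms must be equal
    to the corresponding term of the triple.
    """
    results = []
    for triple in triples:
        binding = {}
        for term, value in zip(pattern, triple):
            if isinstance(term, str) and term.startswith("?"):
                # A variable used twice must bind to the same value.
                if binding.get(term, value) != value:
                    break
                binding[term] = value
            elif term != value:
                break
        else:
            results.append(binding)
    return results


# Roughly what "SELECT ?who WHERE { ex:alice ex:knows ?who }" asks for.
friends = match(("ex:alice", "ex:knows", "?who"), TRIPLES)
```

A full SPARQL query combines several such patterns and joins their bindings, but the idea of leaving some positions open as variables is the same.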
Linked Data is also used to add metadata to Web pages. The main vocabulary used for that purpose is schema.org, which is supported by a number of prominent companies including Google, Microsoft, Yahoo and Yandex. It allows all sorts of data to be added to a Web page, for example to describe the people, products, events, or reviews that the page contains. Those data can then be used to summarize, describe, or manipulate the Web page (e.g. to create rich snippets on a search engine). Today, millions of websites use schema.org to describe their pages.
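As an illustration, the sketch below builds a schema.org description of an event and wraps it for embedding in a page. It assumes the JSON-LD syntax, which is one of several ways to embed schema.org data; the helper names and the event details are hypothetical, while `Event`, `Place`, `name`, `startDate` and `location` are schema.org terms.

```python
import json


def event_jsonld(name, start_date, place):
    """Build a schema.org Event description as a JSON-LD dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "Event",
        "name": name,
        "startDate": start_date,
        "location": {"@type": "Place", "name": place},
    }


def as_script_tag(data):
    """Wrap JSON-LD in the script element used to embed it in an HTML page."""
    return '<script type="application/ld+json">' + json.dumps(data) + "</script>"


# Hypothetical event, for illustration only.
markup = as_script_tag(event_jsonld("Example Conference", "2030-01-01", "Example Hall"))
```

The resulting string can be placed in the HTML of the page describing the event, where a crawler that understands schema.org can read it alongside the human-readable content.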
Linked Data Tomorrow
Linked Data is widely available today, in the LOD cloud and on Web pages. However, the development of applications using Linked Data has been hampered by a series of technical issues, from data quality to complex standards. In the following paragraphs, I give my own vision of the evolution of Linked Data.
- Agile standards: RDF and its applications are governed by a monolithic and complex set of standards that are revamped every few years. In that context, agile and incremental efforts like schema.org will be increasingly popular and important as they correct, update or try out features on a continuous basis, akin to methodologies used for agile software.
- Smart clients: using Linked Data productively is often more complex than it seems, as one typically has to spend considerable time selecting, aligning and cleaning up data (which is a common issue in Big Data and Data Science). Increasingly, Machine Learning methods will be able to automate such processes to create smart Linked Data clients capable of ingesting, aligning and cleaning up raw Linked Data using sophisticated supervised models.
- Unification: Linked Data is available today from several distinct and heterogeneous platforms (the LOD cloud, Wikidata, HTML pages, etc.). In the future, bridges will be built to integrate those platforms and create more extensive webs of data. Fribourg’s VoldemortKG is a first effort in that direction, as it interlinks schema.org data with the LOD cloud.