Sunday, August 09, 2009

The Link in Linked Web

Kingsley Idehen posted a thoughtful article on URI, URL, and linked data this weekend. In the style of Q&A, the post concisely answers some of the most confusing questions about linked data. It explains the subtle distinction between URI and URL when dealing with the linked data. Moreover, the post implies that "a new level of Link Abstraction on the Web" is likely needed for us in order to efficiently consume the linked data Web.

After I left a comment for the post, however, I feel the issue deserve a second thinking. Before approaching forward, I pasted the main section of my original comment to Kingsley's post in the following.

Another thought I have, however, is that we may have three, in contrast to two, fundamental definitions on describing the Web. The two well-known ones are data and service; or in RDF we define Class and Property respectively. Until now, we assert the third one---link---to be nothing but a special form of data. The reality is, however, that this special form is so special that we may consider to give it a little bit honor so that it becomes the third member of the fundamental building block of the Web. That is, a link is not a data, and nor is it a service, but a link. Or with respect to your post, a URI is not a data, but a form of link, PERIOD.

I believe that this distinction, once it is made, could be important as well as valuable. A trick thing here is that, following this distinction we can start to think of other forms of links that is beyond URI (which is just a binary model). By contrast, we may start to invent the links in higher order, such as the link of links (metalink) or the thread of links (group link). Be honest, if the Web is moving towards a web of linked data (and I believe so since the Web data is more and more interconnected), we must breakthrough this traditional thinking of the link model. The key is, however, from today we start to think link to be link but not a data.

World Wide Web: from the dualistic view to the ternaristic view

The thought that Web link is a fundamental element of the Web that is independent to data and service was originated when I wrote a model of Web evolution. (Actually, it could be traced back to January 2007 when I first started to think of how the Web evolves.) By observing the evolution of the Web, more and more I felt that data, service, and Web link are three equivalently fundamental elements of the Web. This interpretation of the Web is different from the classic dualistic view of the Web in which the Web is said to be built upon two fundamental first-class entities: data (which expresses the static description) and service (which expresses the dynamic action). In this classic dualistic model, Web link is a special second-class member that is partially static description and partially implied by dynamic action.

RDF is a typical design according to this dualism philosophy of the Web. In RDF, relation (RDF:Property) is a first-class entity along with the normal object entity (RDF:Class). While class expresses the static fact of the Web, relation expresses how the static facts are interacted to each other. Both the elements are equally fundamental. Two models with the identical classes may not necessarily be equivalent to each other since the properties that are among the classes could be different.

Now we need to start discussing a few subtle implication of this philosophical view of the Web.

By the dualism philosophy, a relation is first of all a service and secondary a link. For example, suppose there are two statements: (1) Mary is a teacher of John, and (2) Mary is a friend of John. Mary and John are two objects. "teacher-of" and "friend-of" are two relations. Philosophically, however, the primary meaning of each of the relations is a typical service defined in between the two objects. In the first relation, Mary provides a teaching service for John, by which a teacher-of relation is established. In the second relation, Mary provides a friendship service for John, by which a friend-of relation is established. Be note each of the links is a consequence of the respective service (and there could be other consequences as well) in contrast to a prerequisite of the service. The dualism philosophy tells that service implies link and every link must be a consequence of a service. Moreover, no link actually makes sense if no services imply the link. Every link has a reason, which is a known service, conceptually (means that the service is unnecessarily implemented, however).

There is also another side of Web link according to this dualism philosophy. Once a link is implied by a service, it becomes a data. Unlike service that always leads to an action or a production, link describes certain static fact, which by the dualism philosophy is a data.

Therefore, link, which inherits the features from both of the first-class entities, is a special secondary entity in the dualistic view of the Web.

The ternarism (3 fundamental elements to model the world) philosophy to which I prefer may describe the same Web but in a different picture. By this philosophy, a link is not the consequence of a service and neither is it an unique type of data. A link is a link, which in the ternaristic view of the Web exists without the need of being implied by a service or being stored in the form of data.

In the dualism world, wherever there is a link, it must exist a data that represents the link and a service (implemented or not) that implies the link.

In the ternaristic world, however, when there is a link, it may or may not exist a data that represents the link, and it may or may not exist a service (let it alone implemented) that implies the link. A link is nothing but a pure connection among (could be more than between) the things.

In the dualism world, that one thing is linked to another thing is always due to some reason. In the ternaristic world, however, link is a matter of natural connection that does not require a reason to be existed. A link is as fundamental as a data or a service.

As we know, the Web is a world of information. Following this ternarism philosophy, the Web we understand becomes different world from what we normally think by the dualistic view. It tells that in the world of information, data reveals the encapsulation of information, service reveals the action and production of information, and link reveals the transportation (in contrast to connection) of information. Under this view, any Thing in the information Web is composed by three fundamental elements---data, service, and link. The data elements contains the information, the service element enables the production of the information as well as the interaction of the information to the other things, and the link element determines whether or not the information being able to be passed to another Thing.

By the ternaristic view of the Web, when we say there is a link from Thing A to Thing B, it means that the information carried by Thing A can be directly transported to Thing B without the help of any other information carrier.

By the ternaristic view of the Web, when there are no links between Thing A and Thing B, it means that unless there are additional information carrier participated in the transaction, the information carried by A cannot be passed to B. Once properly the additional information carriers joins the protocol (possibly in both sides), a link in higher order can be established between A and B.

By the ternaristic view of the Web, there is always a link (i.e. a direct link in the classic mean) between any two things though the link is often in higher order, i.e., it is often not a binary link that involves only the two designated things.

The regular Thinking Space readers may have found that this ternaristic view of the Web is also influenced by the quantum theory. Unlike that in the dualistic presentation of the Web we often need to perform an expensive computation to discover a link (concatenated by several direct binary links) between two objects in the Web, in the ternaristic presentation of the Web any two objects are directly linked, but possibly linked in varied orders. Moreover, I realize that we may directly apply many classic quantum theories to the Web if we start to think of the Web in the ternaristic view, which I will share later in the other posts.

Does the ternaristic view actually reveal the more intrinsic fact of the Web? I do not know. But there is one thing I feel certain. Link is not a simple issue. By better understanding the nature of link in the information world, we may eventually release the tremendous power of computation that we might not even imagine now. For the companies that aim to monetize linked data (such as Kingsley's OpenLink Software), it would be even more valuable for them to rethink the nature of the links that they are working against every day.

8 comments:

Kingsley Idehen said...

Nice post (as per usual)!

At OpenLink we see Links as the new medium of value exchange re. the World Wide Web.

Basically, Links are Information conductors (re. URLs) and Data Conductors (re. Generic HTTP URIs).

For instance, we are trying to get Media companies to comprehend the concept of "medium of value exchange shrinkage" where the Link (HTTP URI) becomes a powerful addition to their matrix of value exchange mediums. Our effort with the BBC [1] is a nice example.

Example: Newspapers have used large rectangular pulp to exchange value, and now struggle with the fact that their audience seeks a new digital medium of exchange, one that is more granular than an opaque URL i.e. a Link into their vast high quality databases on a data item by data item level.

What I describe above re. Newspapers also plays well with the Data is Wine and Code is Fish analogy. Basically, Newspapers by their very existence are high quality database curators, their data is more valuable over time, but they have chosen the equivalent of a paper cup (rectangular pulp) as the sole medium of value exchange :-(

Even when Newspapers have gone online, they opted to create a digital variant of the paper cup instead of powerful Linked Data Spaces where HTTP URIs deliver powerful branding and gateways to more powerful business models etc..

Links:

1. http://bbc.openlinksw.com

gregory said...

links ARE the information in the next stage of consideration ..

the interconnectivity is/will be the only thing that matters ..

this is the path from data to information to wisdom ..

Yihong Ding said...

@ Kingsley,

There is still a long way to go for the regular audience understand what linked data is and how they may benefit from it. It is probably equivalently hard for the linked data researchers and producers (such as you and your company) to invent so that the regular audience may be able to eventually get the point. The BBC example you show is surely a great step towards the goal.

I agree to you that Link is both of information conductor and data conductor. An addition I want to make is that the distinction between an information conductor and a data conductor could be more than the difference between URL and URI. In short, information is data in context, while a context itself could be another data. Therefore, theoretically an information conductor is (and probably should be) a higher-order data conductor. Or let's put it in another way: an information conductor can be computed by several data conductors and a data conductor can be derived from information conductors.

Anyway, things could be more and more interesting. But the right understanding of Link is certainly the center of all the issues.

best,

yihong

Anonymous said...

//The key is, however, from today we start to think link to be link but not a data.

So the difference between link and data is?? That needs to be defined... I guess data would have to qualify the usual standards of science and all that doesn't is link?? Then what is a service?? Is it a program that transmits,displays,explains(??) the data..This still doesn't define link per se.. just what it is not??


//The regular Thinking Space readers may have found that this ternaristic view of the Web is also influenced by the quantum theory.

Are you talking about the link (on the web) as the equivalent of opening the box and looking at the schroedinger's cat??

Yihong Ding said...

@ aangtce,

Data, service, and link defines the three essential aspect of a thing. In short, data defines the thing's static nature, service defines the thing's active nature, the link defines the thing's linkage nature. The three aspects are independent ot each other. For example, a man has his body which is part of its data nature. A man can perform certain action, such as read, which is its service nature. A man has relation to another man, which is its linkage nature. You cannot say that it is which aspect that causes another aspect. They are not in the relation of reason vs. consequence. They mutually decide each other and it is the spirit of natural evolution.

About your second question, I would recommend you read my sharing on the quantum theory and the information world first. A link is not a quantum. Please do not compare the two in the straightforward way. Moreover, the quantum theory is based on the philosophic view that quantum itself might be the single most essential component in the world. While the ternaristic view of the Web suggest THREE but not one single most essential component in its respective world. You must not omit this significant difference between the two. And I am only trying to find a way to express World Wide Web but not the real world.

yihong

Anonymous said...

1. hmm... ok that was a good example.. i get the idea, though would like to still get to a definition, guess it will take more time..
2. yep i have read that post.. i have been following ur blog for a year now..

//A link is not a quantum.

Nope definitely not.. i didn't mean it that way either. I was just proposing that the link part does not exist without an observer, who observes it.. thereby equating the creation of a link to the collapse of the probability wave function.

//Moreover, the quantum theory is based on the philosophic view that quantum itself might be the single most essential component in the world.

as far as i understand, quantum theory, there are different types of quarks(viz .leptons) though all of them can be reduced to probability waves..

//And I am only trying to find a way to express World Wide Web but not the real world.

True, but i was think the distinction is a lot more blurry enough to warrant such a comparison as i did..:P

Anonymous said...

hi this is aangtce.
well just a pseudo comment to get mailed , when u reply back..
can't do that with a openid :(

Kenneth Udut (free.naplesplus.us - Naples News, Info, Jobs) said...

What is relevent? Machines are notoriously awful at figuring that out.

I think linked data has a great promise to do so.

The Link is what makes things "hyper" and it is like having millions of interconnected books, but instead of linking one book to book (website to website) or a page to a page in another book (webpages), or word-to-word (search engines), linked data has a level of granularity that is more HUMAN: concept to concept, idea to idea. Memes.

It's hard to implement at this point in time, but it's getting there: an interconnected living BRAIN that we all contribue to and tap into.

Kenneth Udut, Naples, FL
http://free.naplesplus.us