ACM Computing Surveys
31(4), December 1999,
http://www.acm.org/surveys/Formatting.html. Copyright ©
1999 by the Association for Computing Machinery, Inc. See the permissions statement below.
Unlinking the LinkFlinders University of South Australia School of Information Science and Technology, Email: janet@ist.flinders.edu.au
In memorium AbstractThe aim of this review paper is to provide a "big picture snapshot" of the multiple facets of hypermedia linking. In providing this snapshot, the paper overviews key issues in both categorizing these facets and exploiting them to design effective implementations of links.The presentation is structured around the process of untying the perception of the link and its capabilities from the limitations of defining it in terms of the currently most pervasive implementation of the link, namely links in HTML. In the process, the paper draws on the work of Paul Thistlewaite on the linking issues for large volatile hyperbases. IntroductionNot unnaturally, the common current perception of the nature and limitations of hypermedia linking is defined by the properties of the <A> tag in HTML. The pervasiveness of this implementation has led to the HTML link becoming the de facto standard definition. The aim of this paper is to review how the link is potentially much more powerful (and consequently complex) than what this particular implementation indicates. It is also suggested that XML may provide the means to unlocking this potential for the Web.The following section briefly considers what constitutes the essence of linking. This will highlight the two key aspects of a link namely its specification (how) and its semantics (why). The first section after this will then outline the limitations of the specification of the HTML link and review potential alternatives. The next section will consider the semantics of the HTML link which given the limitations of its specification are surprisingly generic but also difficult to enforce. In the conclusion, the potential of XML to provide a more powerful (and complex) link will be briefly discussed.
A naive, but inherently true, definition of a link drawn from usage of the Web would be
"something that you select in one context that takes you to a related context" where
"context" typically refers to the contents of a graphical interface window displayed on a
screen. This definition captures the basic essence of a link, namely, that
Despite the impression created by the dominance of the HTML link, there are alternatives to the above that could apply. (Various aspects of specific alternatives have been considered by others particularly in the literature on "non-Web-based" hypermedia; see for instance [Ashman 1997], [Davis 1992], and, [Kappe 1993].) Overviewing these alternatives highlights how much more powerful and complex the link could be if unlinked from the limitations of this particular specification:
Because both source existence and destination identification can each be specified either by enumeration or by computation [Ashman 1997a], there are several different ways of implementing links. Links can be explicitly enumerated by hand or they can be automatically created by computation and this can be done at varying points in time: The crafted (or hand-made) links are nearly always created ahead of need although this is not necessary. Indeed in the case of reader annotation, it is conceivable that an annotation link could be created on the spot when first desired. The time of computation (or re-computation) can vary from the creation of the source document through to immediately prior to the navigation of the selected link. The later are often referred to as dynamic links [Ashman 1997]. The most appropriate time of computation depends on various factors including the semantics of the link (as discussed below), and, the volatility of the source document's content and of the range of destination documents. For the HTML link, its source existence is frequently pre-crafted although potentially pre-computed when the document is created. Its destination identification is similarly pre-determined although through such technology as CGI scripts, the contents of that destination may actually be computed when the link is followed. One of the great weaknesses of the HTML link is the absolute specification of the destination document. Given the limited average life-time of URLs, this leads to links regularly breaking. Although the HTML link makes it appear so, advertising the existence of a link source endpoint is not mandatory. Indeed to handle the existence of multiple link sources at the one endpoint, it could be more appropriate not to explicitly advertise at all and instead let the user enquire about link availability perhaps prompted by implicit indications in the context. The response could be presented as a selection list from which the user might choose a link (or not). Such a chooser is present in other hypermedia systems [Verbyla 1994]. The HTML link's approach to the source existence query of the navigating-from questions is one of its most significant limitations. In this approach, the source identification includes the destination identification with both embedded in-line in the document. Web will only really achieve fairly general hypermedia functionality when the physical representation of links is separated from the physical location of the anchors that they connect [DeRose 2000]. (This would be more consistent with HTML's (and SGML's) general principle of separating the meaning specification from the embedded tags.) This externalisation of the link specification is common to many other hypermedia models and systems [Davis 1992], [Kappe 1993], [Wiil 1997], [Verbyla 1994]. Such externalisation means that potentially:
The navigation-to queries are only relevant to bi-directional links. The necessity for which has not been clearly established; some hypermedia models assume such links but the majority does not [Parunak 1989]. As in object-oriented design, there needs to be a differentiation between an association, a relationship, which is inherently bi-directional and the need to provide navigability (permitted direction of movement) which is uni-directional. Specification only has to accommodate the necessary and sufficient semantics not the universality of semantics. For instance, linking references to a staff member to that member's personal page may be the desired navigability although the referential association between the two is bi-directional. The apparent bi-directionality of HTML links provided by the browser's "forward" and "backward" buttons is actually only a localised implementation of an undo or go-back navigation for the user's chosen path. Rarely in the destination context is there a representation of the reverse relationship (i.e. a link to go back). Before the navigation questions can be considered, and, before source existence and destination identification can be specified, it is necessary to resolve the issues relating to addressing, namely the "pointing" semantics and syntax. HTML links are limited because their semantics only allow whole documents or pre-embedded "#fragment identifiers" to be addressed. (To appreciate this limitation, compare it with the versatility of cross-referencing possible on paper.) Also, their syntax involves identification by location that has proven to be extremely fragile. The Web Consortium's XML Linking working group is developing specifications of Xpath, and Xpointer which deal with such issues as an extensive variety of fine-grained, fragment identifiers along with user-selected fragments [DeRose 2000]. Others have tackled the fragility via such efforts as URNs (Universal Resource Number) [Moats 1997] and DOI (Document Object Identifier) [Lynch 1997]. [DeRose 2000] further discusses these issues. SemanticsThere have been several link taxonomies proposed. Those based on intended functionality are to assist the user to conceptualise the result of following the link [Parunak 1991], [Trigg 1983]. Allen [Allen 1996] applies a similar purpose-oriented classification having first categorised links on the basis of how they could be detected i.e. pattern matching, information retrieval algorithms, and, natural language techniques. Others constrain link types in order to make them comprehensible to authors and users [Thistlewaite 1997]. This basically involves clustering the more specific purpose-oriented types together on the basis of the nature of the association depicted. This is illustrated below in an example that uses Thistlewaite's classification [Thistlewaite 1997].Despite the severe limitations of its specification mentioned above, the HTML link arguably has quite generic semantics. The reason for this is that the link source text (and possibly other media such as an image) that is placed between the and tags is fairly unrestricted in content. It is this link source representation which can capture the association between the link's source and destination i.e. tell the user why the link exists. To illustrate this consider the following examples of the four link types identified by Thistlewaite [Thistlewaite 1997]:
The validity of the link could arguably be better ensured if the link itself was determined
by computation that "captured" the semantics of the link. A simple example of this
(which can be done using Javascript) is where sections are in pages called
sec
Even such computation cannot ensure the validity of the link if after the link's
computation, the nature (contents) of the destination or the source document changes. For
instance consider a semantic link that determines its destination on the basis of an
aboutness measure applied to the contents of the two documents. Unless the computation
of the measure is carried out at the time of the navigation of the link, there is potential for
the content of either to change such that the link is actually no longer valid.
Thistlewaite [Thistlewaite 1997] highlights another instance of this "link (semantic) decay" that
actually arises from computation at the time of navigation. In this case references to the
specific position titles such as "The Prime Minister" were automatically linked to the
biographical details of the current incumbent. However as the referent page ages, the
validity of such a link is at some stage going to disappear if the link is dynamically
computed since the referent will no longer be the incumbent.
The HTML link has been employed as a baseline measure throughout the paper since it is
a de facto standard due to its popularity and pervasiveness. Although its syntax and
semantics are limited, the HTML link clearly has immense utility. It also has significant
disadvantages and limitations as in the automatic crafting and management of such links
on a large-scale, and, the coarse-grained nature and fragility of its addressing. The XPath,
XPointer and XLink initiatives associated with the XML link [DeRose 2000] may yet harness
for the Web the power of the unlinked link.
[Ashman 1997]
Helen Ashman, Alejandra Garrido, and Harri Oinas-Kukkonen. "Hand-made and Computed Links, Precomputed and Dynamic Links" in Proceedings of Multimedia '97 (HIM '97), Germany, 191-208, 1997.
[Ashman 1997a]
Helen Ashman. Theory and Practice of Large-Scale Hypermedia Systems, Ph.D. Thesis, Royal Melbourne Institute of Technology, Australia, 1997.
[Davis 1992]
Hugh Davis, Wendy Hall, Ian Heath, Gary J. Hill, and Rob J. Wilkins. "Towards an Integrated Information Environment with Open Hypermedia Systems" in Proceedings of the ACM Conference on Hypertext (ECHT '92), Milano, Italy, 181-190, [Online: http://acm.org/pubs/citations/proceedings/hypertext/168466/p181-davis/], December 1992.
[Davis 1995]
Hugh C. Davis. "To Embed or Not to Embed..." in Communications of the ACM (CACM), 38(8), 108-109, August 1995.
[DeRose 2000]
Steven J. DeRose. "XML Linking" in ACM Computing Surveys, Symposium on Hypertext and Hypermedia, 2000.
[Kappe 1993]
Frank Kappe and Hermann A. Maurer. "Hyper-G: A Large Universal Hypermedia System and Some Spin-offs" in Computer Graphics, special online issue, [Online: zgdv.igd.fhg.de/ papers/multimedia/Kappe.Maurer.PS.Z], 1993.
[Lynch 1997]
Clifford Lynch. "Identifiers and Their Roles In Networked Information Applications" in ARL: A Bimonthly Newsletter of Research Library Issues and Actions, 194,Washington, DC: Association of Research Libraries, [Online: http://www.arl.org/newsltr/194/identifier.html], October 1997.
[Moats 1997]
Ryan Moats. URN Syntax, IETF RFC 2141, [Online: ftp://ftp.isi.edu/in-notes/rfc2141.txt and http://www.ietf.org/html.charters/urn-charter.html], May 1997.
[Parunak 1989]
H. van Dyke Parunak. "Hypermedia Topologies and User Navigation" in Proceedings of ACM Hypertext '89, Pittsburgh, PA, 43-50, November 1989.
[Parunak 1991]
H. van Dyke Parunak. " Don't Link Me In: Set-based Hypermedia for Taxonomic Reasoning" in Proceedings of ACM Hypertext '91, San Antonio, TX, 233-242, [Online: http://acm.org/pubs/citations/proceedings/hypertext/122974/p233-van_dyke_parunak/], December 1991.
[R–scheisen 1995]
Martin R–scheisen, Christian Mogensen, and Terry Winograd. "Beyond Browsing: Shared Comments, SOAPS, Trails and On-line Communities" in Proceedings of the Third International World Wide Web Conference, Darmstadt, Germany, [Online: http://www.igd.fhg.de/www/www95/proceedings/papers/88/TR/WWW95.html], 1995.
[Thistlewaite 1997]
Paul Thistlewaite. "Automatic construction and management of large open webs" in Information Processing and Management, 33(2), 161-173, 1997.
[Trigg 1983]
Randall H. Trigg. A Networked Approach to Text Handling for the Online Scientific Community, Ph. D. thesis, University of Maryland, [Online: chapter 4 of the thesis - http://www.parc.xerox.com/spl/members/trigg/thesis/thesis-chap4.html], 1983.
[Verbyla 1994]
Janet Verbyla and Helen Ashman. "A User-Configurable Hypermedia-based Interface via the Functional Model of the Link" in Hypermedia 6(3), 193-208, 1994.
[Wiil 1997]
Uffe K. Wiil and John J. Leggett. "Workspaces: The HyperDisco Approach to Internet Distribution" in Proceedings of ACM Hypertext 97, Southampton, UK, 13-23, [Online: http://acm.org/pubs/citations/proceedings/hypertext/267437/p13-wiil/], April 1997.
|