Tagging, Metadata, Extended File Attributes
Trails you can add marks to (including your own) are marked with the pencil icon.

A trail of 11 pages, marked with comments, by rowanrook
About this trail:
This wiki trail will focus on tagging: technical discussion, applications, and implications. Tagging is a powerful information management tool, and a key component of Web 2.0 services such as the del.icio.us social bookmarking service. While tagging has become widespread in the internet, it has yet to be widely implemented in the personal computer desktop environment. I am particularly interested in having a synthesis of both local and internet tagging. I invite you to add relevant material to this trail.
11 marks in this trail
1
This wiki trail will focus on tagging: technical discussion, applications, and implications. Tagging is a powerful information management tool, and a key component of Web 2.0 services such as the del.icio.us social bookmarking service. While tagging has become widespread in the internet, it has yet to be widely implemented in the personal computer desktop environment. I am particularly interested in having a synthesis of both local and internet tagging. I invite you to add relevant material to this trail.
3

Extended file attributes is a file system feature that enables users to associate computer files with metadata not interpreted by the filesystem, whereas regular attributes have a purpose strictly defined by the filesystem (such as permissions or records of creation and modification times). Unlike forks, which can usually be as large as the maximum file size, extended attributes are usually limited in size to a value significantly smaller than the maximum file size. Typical uses can be storing the author of a document, the character encoding of a plain-text document, or a checksum.

Contents

[hide]
4
Multidimensional tagging, a key component in social sharing sites, can potentially help enterprises manage large stores of information. In this article, I'll examine the ways that multidimensional tagging will be implemented using Open Source tools [...]

Tools Exist, File System Hooks Don't
The tools for corporate tagging capabilities already exist in the Open Source community. Most of it is encapsulated in the tools used by social bookmarking sites, which are often based on the LAMP stack. They're typically written in common scripting languages, such as Perl or Python, or Java. One such Open Source tool is unalog. Ostensibly a social bookmarking system, it's written in Python and the source is readily available on SourceForge. While the core tools exist, the hooks into the file system are still mostly missing.

A somewhat different but innovative approach is evident with Flickrfs or the Flickr File System. Based on FUSE, it creates a virtual file system with tagging for the Flickr digital photo management service. A fusion of file system and service, Flickrfs lets Linux users access the Flickr service as if it were any other mounted Linux file system. Photos can be accessed through the same tags available on Flickr using standard Linux commands such as cp. Flickrfs represents another way that tagging may come to information management - as a specific application or service but integrated into the normal file system [...]
Multidimensional tagging provides an opportunity to let users manage information more in line with their natural way of thinking. By sharing tags across the enterprise, users will spend less time looking for information and more time making use of it. Unlike other collaborative systems, users do all the work without legions of editors making decisions that users find mystifying. The social sites on the Internet have shown this to be a viable information management model. It's a matter of how and when, not if, these features become available to the corporate enterprise.

References

5
microformat (sometimes abbreviated μF or uF) is a web-based[1] data formatting approach that seeks to re-use existing content as metadata, using only XHTML and HTML classes[2] and attributes.[3] This approach is intended to allow information intended for end-users (such as contact information, geographic coordinates, calendar events, and the like) to also be automatically processed by software.

Although the content of web pages is technically already capable of "automated processing," and has been since the inception of the web, there are certain limitations. This is because the traditional markup tags used to display information on the web do not describe what the information means.[4] Microformats are intended to bridge this gap by attaching semantics, and thereby obviate other, more complicated methods of automated processing, such as natural language processing or screen scraping. The use, adoption and processing of microformats enables data items to be indexed, searched for, saved or cross-referenced, so that information can be reused or combined.[4]

Current microformats allow the encoding and extraction of events, contact information, social relationships and so on. More are being developed. Version 3 of the Firefox browser,[5] as well as version 8 of Internet Explorer[6] are expected to include native support for microformats.

Contents

[hide]
6
The Dublin Core metadata element set is a standard for cross-domain information resource description. It provides a simple and standardised set of conventions for describing things online in ways that make them easier to find. Dublin Core is widely used to describe digital materials such as video, sound, image, text, and composite media like web pages. Implementations of Dublin Core typically make use of XML and are Resource Description Framework based. Dublin Core is defined by NISO Standard Z39.85-2007.

Contents

[hide]
8

Resource Description Framework (RDF) is a family of World Wide Web Consortium (W3C) specifications originally designed as a metadata model but which has come to be used as a general method of modeling information, through a variety of syntax formats.

The RDF metadata model is based upon the idea of making statements about resources in the form of subject-predicate-object expressions, called triples in RDF terminology. The subject denotes the resource, and the predicate denotes traits or aspects of the resource and expresses a relationship between the subject and the object. For example, one way to represent the notion "The sky has the color blue" in RDF is as a triple of specially formatted strings: a subject denoting "the sky", a predicate denoting "has the color", and an object denoting "blue".

This mechanism for describing resources is a major component in what is proposed by the W3C's Semantic Web activity: an evolutionary stage of the World Wide Web in which automated software can store, exchange, and use machine-readable information distributed throughout the web, in turn enabling users to deal with the information with greater efficiency and certainty. RDF's simple data model and ability to model disparate, abstract concepts has also led to its increasing use in knowledge management applications unrelated to Semantic Web activity.

Contents

[hide]
10
Desktop:Tagging, Metadata, Extended File Attributes. This WIKI trail will focus more specifically on software which enables tagging of desktop or local files rather than just Web2.0. Please feel free to add relevant material to this trail.
11
Jiglu is a super-smart engine that pieces your site together, intelligently tagging and linking your web content. Jiglu plugs into your site to automatically create intelligent tags and links for your web content. This means that people can dive straight in and read what they want - your most popular blog entries or even stuff buried deep within your site.

Add your comment: