Graphic Recording

I didn't know it, but all my life I've engaged in "graphic recording" when it came to exploring new ideas or learning. I never went as far as the artists who made a series of recordings for the sustainable agriculture and food conference, but my subjects were technical, and I was a technical kid growing up, so my "confections," as Tufte calls them, were more mathematical, graphical and textual in nature. I used them to illustrate things to myself, like working out visually how cycles represent waveforms in musical instruments. Now, I see them as graphic recordings. I was a bit ashamed of them, since I thought it meant that I wasn't a good learner and tried to suppress or limit them. That was a mistake.

The drawings are simply wonderful and I got put onto them by Brenda Dawson who tweeted about the graphical recordings made for the March 29 2009 conference
Inaugural National Symposium on Food Systems and Sustainability at the University of California, Davis. How much better a "presentation" these graphic recordings make than a PowerPoint presentation!

These drawings are a lot like my vision for an information system, called Strands, which would be as thick and filled with complexity as the Talmud and as visually expressive as these graphic recordings. If only the web could be like this. When I think of Twitter and Tabloo, if they could be combined, I think we'd be close. Tabloo enables users to create visual narratives (through the structure and relationship, size and aspects of images) and Twitter enables users to create conversations out of small fragments of thought flowing continually.

Labels: , , , , , , , , , , , , , , , , ,

Why Tag Clouds are Beating a Dead Horse

Tag clouds are dead. I don't want to mince words. I've been waiting for a long time for someone to say so, to let everyone see the elephant in the living room. What interests me is why tag clouds are dead.

About ten years ago I was working on a prototype web application. It never saw the light of day. But it was called Strands and consisted of a wiki-like content management system that allowed anyone (it was based on SoftSecurity) to create pages, to post and edit content. Any author could include single keywords in the text. These would be automatically scooped up and entered into an index. You could display the posts associated with (containing) any keyword listed on a page like search results. The idea was that content could be navigated in any number of ways according to keywords added by users. It's wasn't social. It didn't know the user who contributed the keyword. The idea was to destroy hierarchy and create a user centered order to information, something close to the folksonomy (but not quite because it didn't care about who submitted a keyword). One version did not allow linking between pages, no "wikiword" links, the idea being that all navigation was by keyword links, either in content or on the "strand" pages listing all content belonging to a keyword.

One of the other ways of navigating considered was by popularity of keyword. The system could generate a list of keywords based on how many posts contained or were associated with them. You may start to find the elements of this system familiar. "Strands" are posts listed by tag. Keywords are tags. Navigating by popular keywords is a tag cloud. The ideas for this system partly developed out of work I'd seen on the web where posts were ordered by single keyword. The other reason was I have a terrible time categorizing anything, I can't decide which category something could go in. I am incredibly bad at and hate categorizing anything, so I decided the wiki element would let visitors to my site categorize my junk for me.

If this were not a blog, I'd spare you all this personal history, but it does show you why I am interested in the question of why tag clouds suck.

When I visit a website with a tag cloud, I tend to pay close attention to it. I noticed that I never bothered clicking on them, never used them. When I thought about why, one of the things I noticed was that nearly every tag cloud consisted of a number of large tags I could count on my hand, and the rest were undifferntiated in size. One of the solutions that came to mind was displaying tags by popularity on a logarithmic scale, which could help increase the difference between the less popular tags. I'm not that great at math, so I would need to leave it to someone else to work this out. But the idea is to create greater differentiation visually among the less differentiated tags.

The other problem with this is there are only so many font sizes that are easily usable on the web. This worsens the differentiation problem.

The other concern I had devevloping the keyword based application was that chaos would ensue. People tend to prefer order. Would it help or hurt for people to be navigating by tag? Tags don't always apply to the subject. Their strength is freedom, freedom from controlled vocabularies and rigid meanings, but without those restrictions tag-chaos can reign. Wikis always had a kind of randomness to them and so do tag structured and navigated content.

I almost never click on tags in Wordpress blogs for this reason. It usually produces a result that widens not narrows my search. Nielson observed that clicking on a link has a penalty, and the trouble with tags is they have an uncertantity penalty.

The closest I've ever seen to a realization of the keyword based navigation idea is a photo gallery developed by Alex Wilson some years ago. You can see it still in operation here. It's a great idea and an excellent implementation, I don't know why I didn't go ahead with my own version instead of abandoning it (doubly, since the eventual goal was for organizing photographs). It makes the homepage a tag cloud and each detail page with a photograph displays a vertical row of thumbnails to photographs linked by tags, which is very similar to the way the Strands pages listed posts according to tag (like Flickr pages with the tags next to the image). Alex recently switched to a standard gallery system for this exact reason, that visitors and customers apparently found the tag-navigated album confusing.

I love tags. I use them like I feel they were supposed to be in this blog, I just write any significant word that comes into my head about the subject. I don't care that they create long lists of tags, since I only use them as a memory aid. They are terrible for people navigating the site and categories would probably be better. Tags aid memory, they aid discovery and exploration, but I'm uncertain that they are good finding aids.

I'm sure others have observed this before, but I've kept quiet about it, so I may be late the party, but still, it's a useful discussion, to dissect why tags ultimately fail to live up to the (strange to me) hype they received. Every new web technology seems to be annoucned like the second coming.

So, yes, tag clouds are beating a dead horse. Even the little sets of tags next to blog posts don't really do much for me, not even on my own site, or they don't seem to do much for visitors in my view.

The other thing that tortured me developing the keyword based navigation was whether to allow spaces in keywords, which would prevent combining keywords like chicken+soup and create confusion (sepearte keyword threads of navigation) between "farmers market" and "farmers_market." I worried a bit about misspellings, but not too much since I didn't like controlled vocabularies.

References: Tag Clouds_Rip and ZigTag supposed to solve these problems.

Labels: , , , , , , ,

Capturing and Refining User Expertise

One of my longtime interests has been how to create a system that captures the knowledge of experts and refines it into a single resource. I was attracted to wikis early on by their communal authorship, but found the lack of structure unsuitable for my needs. What I wanted, for two of my early efforts, one a site intended to help family photography historians answer questions about old photographs and the other a site for programmers to find help with coding questions, was a way to let users engage in a Q and A and then somehow capture and distill the expertise into a more traditional article format (like a wiki page), which could be maintained by everyone. I wanted to capture the expertise emerging from the group discussion through some mechanism.

I ended up developing a content management system for the coding site, which had the ability to "fold" a comment thread attached to an article back into the article for editing. I also developed a tool, which could take a forum thread and turn it into an article text for editing. These solutions required a lot of manual effort to whip the unruly comments into a coherent article.

All along I wanted to introduce the communal editing feature of a wiki to this process, but I faced the obstacle of how to overcome the distinction between communal content and content owned by the user posting it. I racked my brains to design the system to somehow enable a transition from personal content to communal content, so that question and answer sessions centered around a code example or problem, could be "folded" into a more communal source of information, refined and with conclusions. But never found a solution.

Originally, I had wanted to develop my coding help site as a Q and A site like Experts Exchange. This explains why I needed some way of converting the knowledge captured by the Q and A session, if there were a solution, into an article form. A QandA session usually results in exposing a lot of valuable knowledge from experts. I wanted a way to capture and refine this so people could learn to code better from it.

Stackoverflow.com a Q and A site for coders. It is simply excellent in design and execution. What fascinates me most is their concept of a "Community Post." When a post is edited by more than four users, it it promoted to a Community Post, which is editable by every user and no longer belongs to the original owner. Apparently, they use a wiki-like versioning system for their posts, so the original post is owned by the original posting user, subsequent versions I suppose are owned by their editors (the user who revised it), and after four unique edits becomes the property of the community.

This mechanism provides a smooth transition from traditional _authorship_ to the communal writing style of the wiki where the community is the author and authorship is anonymous. I wish I had thought of it, since the original idea for my site was a "code wiki" that would not just provide solutions to programming questions but help coders learn from the results and improve their skills. I don't want to rehash my failures with phphelp.com, but to highlight an innovative way of providing a smooth transition between individually owned and communal content.

One of the questions raised by this is authorship. People like attribution because it builds their reputation. So in a wiki environment, they lose their attribution. A user's post becomes a community post. So what happens to a user's credit? One solution is to create an indirect proxy for credit in a communal authorship environment, so that good authors get "badges" or "reputations" that they wear independently. Instead of a "byline" for your post, you get a badge representing the amount and effectiveness of your contributions.

Which is better? Everyone owning their own content or communal content? It really depends on the audience and goals of the site. Some people prefer to own their own content and share it. This is how most social media sharing sites work. You own your content and your friends own their content and the site provides a way of sharing it. Social bookmarking sites also enable users to keep their own content separate from others and then the content is mixed and matched through tag navigation. A wiki-style system generally views content as communal. Stackoverflow solved this problem with a novel mechanism for transitioning content from individual to communal status.

It occurred to me this mechanism might be valuable in a so-called bliki system, which is a blog and a wiki combined. In a bliki, users create quick, timely posts like blog entries connected to dates, but they can also edit the content of posts to create and reference wiki pages. This enables users to make quick sketchy entries like a blog, but then later, reflect on those entries with longer posts. This is called "quick-slow" in bliki terms. What if this process could be facilitated by automatically transitioning the "quick" blog post into a "slow" wiki page? Instead of making a blog post then creating a wiki page linked to it with extra information, the blog post would at some point transform itself into communal content, from blog post to wiki page. Authorship would still be retained because each post would still exist in the wiki history. Anyone could go back to the original blog post to see who posted it and what it was about.

Labels: , , , ,

The Wiki and the Farm

There has been a flurry recently inspired by Michael Pollan writing about a vision of people becoming producers and consumers in society. He argues that industrialization created a division in society between producer and consumer, with the consumer essentially at the mercy of producers.

I've thought about how once there were many producers of food, but as agriculture industrialized, we began to call the things we did "industries," a hog farm became the hog industry, wheat farming became the grain industry, raising beef became the cattle industry. All the little myriad farms producing our food were replaced by large commodity farms based on economy of scale through centralization and industrialization (the use of petrochemical fertilizer, mechanized harvesting).

Some of the changes introduced by industry have been useful, for example frozen peas are by and large much better quality than fresh or canned peas. Peas must be picked at the very peak of sweetness, which only lasts for a day or two. This requires a massive and quick harvesting effort. The peas must be quickly frozen to preserve their sweetness and quality. If the peas were sent to a market, if they were picked over a number of weeks, the quality would suffer. The frozen pea is picked at the peak of sweetness and frozen in one quick, mechanized operation.

On the other hand, we by creating industries out of the various aspects of farming, we have lost something in the translation. We lost the intertwined way plants, animals and the soil interact with each other on the farm. This interaction was replaced by massive inputs of petrochemical fertilizer and pesticides to feed and protect the weak, sickly hybrids raised in huge monoculture beds required by economies of scale. But we lost another thing, which Pollan touches upon, the intermingling of producer and consumer that existed before industrialization. It is easy to see the small farmer as a producer, but it takes a little more digging to see the web of producers and consumers. The farmer produced food that the blacksmith ate, but the blacksmith shod the horses the farmer pulled his plows and harvesters with. At every level, people were producers and consumers.

The blacksmith is a recurring figure in my thoughts. Without a blacksmith the community would grind to a halt. The blacksmith also represents the complex web of production and consumption in the community before everything became an "industry," demonstrates the interdependencies in the community. The blacksmith must eat. The farmer must shoe his horses. No one can escape the individual and direct relationships that sustain them by shifting responsibility to some distant industry. The blacksmith also represents the connection between culture and nature, through the implements he fabricates for the farmer to work the fields and reinforces the true meaning of cultivation, which means to cultivate the land and to cultivate the person through culture.

After industrialization there were only classes of producers and consumers. There is always an imbalance, whether in farming or the music industry between producer and consumer, with a small number of producers creating things and a large number of consumers consuming things. The producers dictate what is produced, how it is produced and the consumers are passive or only through large numbers do they influence what is produced.

What does this have to do with wiki? The moment the first wiki was born, it made everyone and anyone who came along into an author or a reader, a producer or consumer. The wiki by definition commingles production and consumption, producers and consumers. The wiki was way ahead of its time. The contribution of this idea may be more important and lasting than the wiki as a way to manage content. The wiki's greatest contribution was to awaken people to a new reality, that in a networked world of digital information, post-industrialization is possible, that people can become producers and consumers again.

Pollan argues we should start gardens to lessen the division in our society between producers and consumers. By gardening, we can become producers as well as consumers of food. It is worth noting that maintaining a wiki can be likened to gardening, so perhaps a wiki is a garden, where like the real garden, is a place of reconciliation.

Labels: , , , , , , ,

Citizendium: Multiple Truths Welcome?

I've been reading the Citizendium's approach to governing what goes into their content. This new attempt at a wiki encyclopedia favors an approach with less rules, greater oversight and tries to accommodate a multiplicity of views on truth.

http://en.citizendium.org/wiki/CZ:Neutrality_Policy

I wish they would not call it a neutrality policy. I strongly dislike the oxymoronic "neutral point of view."

I don't know how far the Citizendium will go, but I do believe multiple truths should be represented transparently. There is no reason not to accommodate multiple truths and no reason not to build information systems capable of accommodating multiple truths.

Contrary to popular belief, there does exist more than more than one truth. In genealogy, for example, the idea of multiple truths is necessary, since the same individual is frequently claimed by more than one family. Information about the past is sketchy and subject to interpretation. When the first online genealogies were being discussed on the GenWeb mailing list, it was ultimately concluded that there should not be a single unified global genealogy, since the "facts" could never be reconciled perfectly. There would by necessity be a need for representing multiple truths, based upon facts weighted by how much confidence we have in the sources (familiar to anyone who sources their genealogy).

Moreover, the truths we hold in our minds are imperfect, and emerge from our folk knowledge, through narrative and are based upon assumptions, which generally are made not on evidence (and probably can never be made upon anything else), but on the folk knowledge we absorb from our surroundings.

An absolute truth may exist and be determined by the physical universe, but there are many questions that arise about the human mind, society and the constructions of the human mind, which society is an example, which have no physical existence at all and it may never be able to determine what is true. There is also the nature of our knowledge existing only as sense perceptions, which makes science a kind of honorable delusion, as accurate as we can determine to agree on shared descriptions of phenomena. We have a reasonable idea of what we know is true or not through careful scientific inquiry, assisted by not guaranteed by reason, but in the end the only thing we know is: No one truly knows anything.

Labels: , , , , ,

Quick-slow: A way to give meaning to media?

I develop the platform for the folkstreams.net website, which is a non-profit archive for rare folklore documentary films. We transfer the films to video and then stream them to the web so they are not lost, molding in some archive never to be seen. Many of our films have not been seen for twenty years or more, one was rescued from a barn. As such, we are strong advocates of open access to archives (and I am happy to learn many other institutions in the folklore world also understand how important access is to a sustainable archive and are using the web in wonderful ways).

To the point. It has always been important for our films to be presented in context. I have always believed that media without context is meaningless, whether that is a family photograph or a documentary film. A photographic image is merely an interesting composition without the information necessary to understand it, to interpret it. All images must be read...oddly enough, since they are the seeming opposite of text, which everyone acknowledges must be read. We teach literacy, but we don't teach the equivalent for images. (What would that be imagacy, photoacy, videocy? That last one sounds too much like idiocy to be comfortable.). A photographic image may affect us as a work of art, or it may present an attractive composition, but beyond that it requires context. The same is true for moving pictures. This is why each film on our website is nestled in a set of contextual materials. I sometimes doubt that many people read them, or read the transcript, but they are there to give meaning to the films, to place them in context so that people may better understand the subjects and ideas presented in the film.

I've wanted for some to build a small content management system where

* Media is on an equal footing with text, but also where media is the centerpiece.
* Media is as easy to work with and place in context as working with text in a wiki.

There is a form, a mashup you might call it, between a Blog and a Wiki, called a Bliki. I had not paid too much attention to this development until I read on one of the advocate's websites that the idea of a Bliki involved a "quick-slow" process. The blog enables a user to quickly write a blog entry, something quick, potentially ephemeral and tied to time; At that moment or any time later, the user may also create a wiki page connected to the blog entry, for slower moving, more thoughtful and persistent content.

I think this would apply perfectly to such a media application, which would be useful in personal publishing and could help small archives (local history and genealogy societies, libraries and archives) manage and create access to their image collections. As volunteers scan images, they could upload them with descriptions as blog entries, then they or others could provide context through wiki pages associated with the image meta data in the blog entries. It would be upload, give a title and description, later come back, drop a wikiname in the description and then create a page. It could encourage local people to contribute memories to photographs, for example.

Today, I actually came across the first example in the wild of someone doing this, someone using the Bliki "quick-slow" philosophy to give meaning to images. "I added Wikipedia links to my Flickr photos..." (http://instones.org/archives/61 2007) This is exactly the kind of behavior I would like a web application to enable and encourage, which would facilitate quick image upload and meta data, but would also enable and encourage placing media in context, as well as supporting tagging for folksonomy. It's more a philosophy than a technology.

Labels: , , , , , , ,