Text Clouds: A New Form of Tag Cloud?

During 2006, tag clouds moved beyond their well-known role as navigation mechanisms and indicators of activity within social media experiences, emerging as a standard visualization technique for texts and textual data in general.
This use of tag clouds does not commonly involve tags, social networks, emergent architectures, folksonomies, or metadata.
"Text cloud" might be a more accurate label for these visualizations than tag cloud. In addition to recognizing fundamental differences - text clouds differ from tag clouds in composition (no tags at all) and purpose (predominantly comprehension, rather than access or navigation) - distinguishing the two types of clouds will make it much easier to assess their abilities to support user experience needs and business goals.
The emergence of this new form of text cloud looks like a good example of speciation in action (though it's too early to tell whether the end result will be cladogenesis or anagenesis).
Major and minor publications feature(d) text clouds as visualizations in 2006, both permanently and temporarily:

The Economist's Text cloud

In 2006, several free and public tools for generating text clouds locally on the desktop or via a service available through the Web were released. The increase in the number and variety of specific text cloud tools reflects embrace and enthusiasm for text clouds in communities of interest for information visualization, language processing, and semantics.
Some of the better known examples of text cloud tools include:

The Many Eyes Cloud

The text clouds created with these tools range across a wide spectrum of speeches and writing:

Text clouds are meant to facilitate rapid understanding and comprehension of a body of words, links, phrases, etc. Any block of information composed of text is open to analysis as a text cloud, as these screen captures of text clouds for restaurant menus, ingredients, wikipedia, magazine covers, and even poems demonstrate.
Tim O'Reilly uses text clouds for a number of purposes:

We used them a bunch to analyze the topics, companies and people at the last FOO Camp, and they were the most useful of the visualizations we did. They helped us see where we were under- and over-represented in terms of companies and particular technologies we were wanting to explore. ...So they have many uses beyond just showing what we normally think of as tags.

Non-linear Access
The emergence of text clouds shows continuing exploration and refinement of cloud style displays as a new form of user interface, adapted to specific contexts. Continued refinement of text clouds in this direction may indicate an expanding role for commonly available and sophisticated text visualization tools to support specialized goals for information display and understanding.
Remember that Google is busy right now scanning thousands of books per day from several of the world's major academic libraries, as part of it's self-appointed labor of organizing the world's information. That's a lot of new text. How will people work with effectively with such an overwhelming amount of text, of so many different kinds, from so many different sources?
Consider the following, from Ulysses' Without Guilt by Stacy Schiff (in the New York Times):
Recently Cathleen Black, president of Hearst Magazines, urged a group of publishing executives to think of their audience as consumers rather than readers. She's onto something: arguably the very definition of reading has changed. So Google asserts in defending its right to scan copyrighted materials. The process of digitizing books transforms them, the company contends, into something else; our engagement with a text is different when we call it up online. We are no longer reading. We're searching - a function that conveniently did not exist when the concept of copyright was established.
On a larger scale, the growing use of text clouds hints at a (potential) deeper cultural shift in the way we go about reading and comprehension: a shift from linear modes based on reading words and sentences, to nonlinear modes based on viewing summaries of content in aggregate as a way of discovering concepts and patterns. (Finally, a legitimate use for Twitter...) Experimenting with text clouds for non-linear reading and comprehension (now that's a sexy term...) is a natural evolution of the role cloud style displays play as an alternative / compliment / supplement to the list based navigation now dominant in user experiences.
A Text Cloud of Twitter Posts (A TwitterCloud?)

created at TagCrowd.com


I'm not predicting the end of reading as we know it, nor the end of navigation as we know it: both will be with us for a long, long time. But I do believe that text clouds might constitute an emerging method for augmenting comprehension and display of text, with broad potential uses.
Enterprising Clouds
What about someone lacking time to fully read a Shakespeare play, or a faddish business book, but who needs to understand something about that book's meaning and substance? A text cloud creation tool could extract the most commonly mentioned terms, and otherwise profile the words that make up the text. It would be risky to rely on a shallow text cloud (and Tim O'Reilly mentions this specifically) for deep comprehension, but it would be enough to understand the concepts that appear, and allow polite conversation at a networking event, or lunch with that certain manager who recommended the book.
If I were entrepreneurial, I'd source a set of free electronic versions of classic texts, process them with one of the free text cloud tools, apply some XSLT and other transformations to generate consistent readable formatting, and sell the results as a line of ebooks called "Cloud Notes". Of course, someone's beaten me to it already...
What's in store for the future?
In this fashion, text clouds may become a generally applied tool for managing growing information overload by using automated synthesis and summarization. In the information saturated future (or the information saturated present), text clouds are the common executive summary on steroids and acid simultaneously; assembled with muscular syntactical and semantic processing, and fed to reading-fatigued post-literates as swirling blobs of giant words in wild colors, it consists of signifiers for reified concepts that tweak the eye-brain-language conduit directly.

Share and Enjoy:
  • Print
  • Digg
  • del.icio.us
  • Facebook
  • Google Bookmarks
  • BlinkList
  • connotea
  • FriendFeed
  • LinkedIn
  • Netvibes
  • Posterous
  • Reddit
  • StumbleUpon
  • Suggest to Techmeme via Twitter
  • Technorati
  • Tumblr
  • Twitter
  • Twitthis
  • Yahoo! Buzz
  • Hyves
  • Live
  • MisterWong
  • Mixx
  • MySpace
  • NewsVine
  • Propeller
  • Simpy
  • Slashdot
  • Wikio
  • Current
  • email
  • Ping.fm

Related posts:

  1. Watching Ideas Bloom: Text Clouds of the Republican Debate At Democrats.org
  2. Text Clouds and Advertising: Microsoft's Community Buzz Project
  3. Text Clouds of the Democratic Debate
  4. 10 Best Practices For Displaying Tag Clouds
  5. Tag Clouds Evolve: Understanding Tag Clouds

Category: Tag Clouds
Tags: , , , , , 12 comments »

12 Responses to “Text Clouds: A New Form of Tag Cloud?”

  1. Stewart McKie

    Joe
    Another great post. Visu­al­iza­tion is clearly an impor­tant ben­e­fit of a textcloud but also con­sider the ana­lyt­i­cal value of hav­ing a lot of text clouds about a spe­cific topic. Tag pop­u­lar­ity is web 1.0. Applied text cloud ana­lyt­ics is web 2.0.
    Best
    Stewart

  2. Greg Weller

    Joe — very inter­est­ing post. You might be inter­ested in some of the tag clouds I’ve done on Many-Eyes, includ­ing 6 dif­fer­ent edi­tions of Whitman’s Leaves of Grass.(my stuff is located at the above URl) I’m work­ing on the idea now of a lit­er­ary mashup of tag clouds: take two or more works with sim­i­lar themes (Say Spenser’s The Faerie Queen and Shakespeare’s Mid­sum­mer Nights Dream, com­bine the two texts and tag cloud (or text cloud) them. It’s some­thing like William S. Bur­roughs’ cut up tech­nique meets Web 2.0.

  3. Jonah Keegan

    Joe, I won’t repeat the praise but great sur­vey of the progress this com­mu­ni­ca­tion has made. Also wanted to point you to my own small con­tri­bu­tion to tag/text/word cloud­ing. Not the deep­est exam­ple of the meme, but not the shal­low­est either I think. :o )
    http://snapshirts.com/

  4. joe lamantia

    Jonah: I love all the new busi­nesses pop­ping up around clouds these days — it’s encour­ag­ing to see so much entre­pre­neurism (?) in a new space. Good luck with snap­shirt!
    Greg: Nice work — I learned about ManyEyes at IDEA2006, and am glad to see peo­ple putting it to inter­est­ing and artisitic uses right away
    Stew­art: Could you share some of the more inter­est­ing exam­ples of clouds from Script­cloud that you’ve seen?

  5. Warren Apel

    I like the idea of using text clouds to dis­play non-tag data. In this ManyEyes graph (URL below) I used font size to dis­play the rel­a­tive sizes of uni­ver­sity endow­ments. I’m work­ing on some sim­i­lar data dis­plays that show world pop­u­la­tion, etc. I think it’s a great way to put lots of infor­ma­tion in an easy-to-read but com­pact space. Much more data dense than a bar graph!
    http://services.alphaworks.ibm.com/manyeyes/view/Sh3S9FsOtha6Qj-yrrjGF2–

  6. Arnab Ganguly

    The ques­tion arises is the usabil­ity of Tag Clouds. Do they really help vis­i­tors. What does it mean to a nor­mal vis­i­tor, how often they are clicked through.

  7. andrew

    i don’t really believe this kind of clouds are use­ful from a SEO point of view…too many unre­lated links on a page
    just my opin­ion though..

  8. Tsu Dho Nimh

    Joe -
    Text clouds would be good for show­ing novice writ­ers that they are not focused on what they think they are focused on, but with­out the mechan­i­cal “key­word den­sity” cal­cu­la­tions.
    For exam­ple, if you are writ­ing a prod­uct review, the prod­uct type, the name of the man­u­fac­turer, the product’s name, and some verbs and nouns relat­ing to that the prod­uct is used for should show up promi­nently in the cloud.
    You need to use a gen­er­a­tor that has a stop list, of course. I’m using the gen­er­a­tor at tagcrowd.com and it’s really nice.

  9. sasha

    Check out http://www.cloudtuner.com

  10. Figment Engine

    I’ve cre­ated a tag cloud that is ani­mated and shows the rela­tion­ships between tags based on co-occurance… check it out at
    Ani­mated Tag Cloud

  11. Eamon

    Fas­ci­nat­ing read­ing. Do text clouds have any­thing in com­mon with Grounded The­ory http://en.wikipedia.org/wiki/Grounded_theory — a form of research where text is analysed so that impor­tant words iden­ti­fied and cod­ing of key­words takes place. I don’t know a lot about grounded the­ory but it sounds like some­thing that could ben­e­fit from text clouds or vice versa?

  12. joe lamantia

    Hi Eamon. I didn’t have Grounded The­ory specif­i­cally in mind while I was writ­ing about text clouds, but that doesn’t mean there’s no con­nec­tion. There are some text cloud analy­sis tools avail­able, but I’ve not heard any of the authors men­tion grounded the­ory in rela­tion.
    Maybe you’ve found a good thread to fol­low up on :)
    Think­ing broadly, almost all of the research / dis­cov­ery / insight meth­ods cur­rent in man­age­ment con­sult­ing and user expe­ri­ence are bor­rowed from dis­ci­plines like social sci­ences and cog­ni­tive sci­ences. If you look under the hood at many of these meth­ods I think it’s appar­ent that GT is an impor­tant com­po­nent and / or sig­nif­i­cant influ­encer. For exam­ple, I worked in an IT strat­egy and man­age­ment con­sult­ing group that relied on a method directly derived from GT (thought I’m sure this escaped most of the peo­ple involved in defin­ing it, iron­i­cally…)
    And on a more tech­ni­cal side, some of the con­cepts that GT relies on (cod­ing, cat­e­gories, etc.) are present in search, index­ing, and seman­tic tools.
    What sort of con­nec­tions do you see?

Back to top