Off the Top: Content Management Entries

Showing posts: 16-30 of 52 total posts


November 1, 2003

iPIM and Chandler have a chair at the Personal Info Cloud

There are two articles that are direct hits on managing information for the individual and allowing the individual to use the information when they needed it and share it as needed. Yes, this is in line with the Personal Information Cloud.

The first article, The inter-personal information manager (iPim) by Mark Sigal about the problem with users finding information and how the can or should be able to then manage that information. There are many problems with applications (as well as the information format itself) that inhibit users reuse of information. In the comments of the article there is a link to products that are moving forward with information clients, which also fit into the Personal Information Cloud or iPIM concept. (The Personal Information Cloud tools should be easily portable or mobile device enabled or have the ability to be retrieved from anywhere sent to any device.

The second article is from the MIT Technology Review (registration required) titled Trash Your Desktop about Mitch Kapor (of founding Lotus Development fame) and his Open Source project to build Chandler. Chandler is not only a personal information manager (PIM), but the tool is a general information manager that is contextually aware. The article not only focusses on Mitch and the product (due late 2004), but the open and honest development practices of those that are building Chandler at the Open Source Application Foundation for Windows, Mac, Linux, etc. distribution.



Why page numbers fail us

I keep running into a deep information habit that has never worked well for its intended purpose, the page number has been an information curse. Printed documents use page numbers, which are intended as a reference point (not bragging rights often referenced in Harry Potter and Neal Stephenson books - I am on page 674 and you are on page 233). All of us are familiar with this problem from high school and college if you happened to have a different printed copy of a classic text. Page 75 of Hemmingway's Old Man and the Sea was not the same in everybody's copy.

Even modern books fail when trying to reference pages, just look at the mass market edition of Crypnomicon with 1168 pages and the hardcopy version of Crypnomicon with 928 pages of the same text. Trying to use a page number as a reference does absolutely no good.

Now we try and reference information on the Web, which should not be chunked up by page count, but by logical information breaks. These breaks are often done by chapter or headings and rightly so as it most often helps the reader with context. Documents that are placed on the Internet, many times for two purposes - the ability to print and to keep the page numbers. Having information that is broken logically for a print presentation makes some sense if it is going to be printed and read in that manner, but more and more electronic information is being read on electronic devices and not printed. The Adobe reader does not easily flow from page to page, which is a complaint I often hear when readers are trying to read page delimited PDF files.

So if page numbers fail us in the printed world and are even more abysmal in the realm of the electronic medium, what do we use? One option is to use natural information breaks, which are chapters, headers, and paragraphs. These breaks in the information occur in every medium and would cause problems for readers and the information's structure if they are missing.

If we use remove page numbers, essentially going native as books and documents did not havepage numbers originally (Gutenberg's Bible did not rely on page numbers, actually page numbers in any Bible are almost never used Biblical reference), then we can easily place small paragraph numbers in the margins to the left and right. In books, journals, and periodicals with tables of contents the page or article jumps the page numbers can remain as the documents self-reference. The external reference could have a solid means of reference that actually worked.

Electronic media do not necessarily needs the page numbers for self-references within the document as the medium uses hyper-linking to perform the same task appropriately. To reference externally from a document one would use the chapter, header, and paragraph to point the reader to the exact location of text or microcontent. In (X)HTML each paragraph tag could use an incremented "id" attribute. This could be scripted to display in the presentation as well as be used as hyperlink directly to the content using the "id" as an anchor.

I guess the next question is what to do about "blockquote" and "table" tags, etc., which are block level elements? One option is to not use an id attributes in these tags as they are not paragraphs and may be placed in different locations in various presentation mediums the document is published in. The other option is to include the id tag, but then the ease of creating the reference information for each document type is eliminated.

We need references in our documents that are not failures from the beginning.

Other ideas?



October 19, 2003

RSS on PDAs and information reuse

Three times the past week I have run across folks mentioning Hand/RSS for Palm. This seems to fill the hole that AvantGo does not completely fill. Many of the information resources I find to be helpful/insightful have RSS feeds, but do not have a "mobile" version (more importantly the content is not made with standard (X)HTML validating markup with a malleable page layout that will work for desktop/laptop web browsers and smaller mobile screens).

I currently pull to scan then read content from 125 RSS feeds. Having these some of these feeds pulled and stored in my PDA would be a great help.

Another idea I have been playing with is to pull and convert RSS feeds for mobile browser access and use. This can be readily done with PHP. It seems that MobileRSS already does something like this.

Content, make that information in general, stored and presented in a format that is only usable in one device type or application is very short sighted. Information should be reusable to be more useful. Users copy and paste information into documents, todo lists, calendars, PDAs, e-mail, weblogs, text searchable data stores (databases, XML respositories, etc.), etc. Digital information from the early creation was about reusing the information. Putting text only in a graphic is foolish (AIGA websites need to learn this lesson) as is locking the information in a proprietary application or proprietary format.

The whole of the Personal Information Cloud, the rough cloud of information that the user has chosen to follow them so that it is available when they need that information is only usable if information is in an open format.



September 9, 2003

Getting Site maps and Site indexes right

Chiara Fox provides and excellent overview site maps and site indexes in her Sitemaps and Site Indexes: what they are and why you should have them. This overview and is very insightful. Many experienced users find well developed site maps very helpful.

The odd thing is that for the great assistance site maps and site indexes provide, new users and even general users rarely turn to these assistive tools. In the past five years I have only seen one or two users click on the site map or index in user testing sessions. When questioned why the user often states they do not find the tools helpful (read Chiara's article to build better tools) or they did not know to look for the links.



May 21, 2003

CMS goes over 1,001

This is post 1,001 in my homebuilt weblog CMS. Yes there have been other odometer parties on this site, but turning over 1,000 was one that really has pleased me. My tool is missing some elements that others now have, but I still have fun tinkering with mine and extending it. There are some plans for the summer to add functionality by embracing the categories and using them in conjuction with the links page as well as tying the links and entries together, sort of like a Web strawberry banana shake. I have a book review repository planned too, but it has not moved out of the ERD phase yet (uh, yes I plan, chart, and document my apps and changes for the site, it eases making changes in the future -- I just need to get all the docs on one machine and in one directory).

Lift you glass to 1,001 entries and no digi-barfing



April 26, 2003

Tantek on handrolling weblogs and hand built CMS

Tantek discusses Jeffrey Zeldman handrolling his own RSS feeds (as well as his own site). Tantek also discusses those who still handroll their own weblogs as well as those that have built their own CMS to run their blogs. This was good to see that there are many other that still build their own and handroll (I stopped handrolling October 2001 when I implimented my own CMS that took advantage of a travel CMS I had built for myself).



Great news for Anil as he joins Type Pad and Movable Type company

There was great news this week from Anil who has recently become a member of Six Apart, which was recently funded (yes, great products do still get funded and money is still around for great products). Six Apart are the makers of Movable Type, and just introduced, Type Pad. This was the best string of news I have heard in a long time.



March 27, 2003

Powells Books Booty

Okay, here is the list of booty from Powells Books... Metarepresentations: a multidisciplinary prespective edited by Dan Sperber, a description of this Cog Sci overview book help understand it better. Kunstler's The City in Mind. Feynman's Six Easy Pieces, which I started this morning on the train and really enjoy. William Gibson's Pattern Recognition that I started reading on the plane and has really pulled me in. A string of tech books, MySQL Cookbook, Perl and XML, and Java and XML, and based on Peter's recommendation Managing Enterprise Content: A Unified Content Strategy. This and yesterday's mentioned Hofstadler book should about cover it. I really wish there were a Powells Books where I lived, but my wallet does not wish the same. It is great to be able to see the books and evaluate how helpful the book will actually be to you before buying.

I also added Information Architecture: An Emerging 21st Century Profession by Earl Morrogh while at the IA Summit. It seems to be a very good overview on first pass and it comes very highly recommended. I met Earl at the Summit and he is purely delightful and very much a part of the IA community.



February 9, 2003

Content management interview with Bob Bioko

Elearningpost interviews CMS wiz Bob Bioko. The article has an e-learning bent to it, but it is a great interview discussing content management. I really enjoy Bob's approach as it separates information from technology and stresses the importance of understanding the information and its needs prior to digging into the technology. This is brought out in this quote from the interview:

Firstly, technology changes a lot and focusing on technology is not the right thing to do. What's really important to me is figuring out what exactly you want from your learning system or information system. From my point of view, I have certain information resources that I would like to deliver to certain people in a certain way. That's not a technology question. That's not about what system I have. Rather its about what information do I have, who wants it and how do I deliver that information in the best possible way. Now obviously I would need a system to do that, but the infrastructure follows from the need I have, not the other way around.

When someone makes the decision the other way around, focusing on the platform, the software, the features etc., they limit and constrain themselves to what the system allows them to do. This is not to say that technology wont be a determining factor in what you actually have to, its just that technology should be a response to the problem not a definer of the problem.

Two other bon mots in this article include discussions of "context management" and the perenial question of "build versus buy".



January 5, 2003

More future proofing information

Speaking of future proofing your information, Mark discusses CMS and information reuse. One quote that brings this to light is:

This ties you to your content management system. The further removed your raw data is from your published form, the harder it will be to migrate away from the tools that convert one to the other.

Mark also discusses how using HTML he then created PDF files of his Dive into Accessibility essays. HTML has much of the semantic tools needed and the structure to provide a reusable information repository.



December 14, 2002

Text to HTML tool

Dean is building a text to HTML tool to add structural and typographical features easily. This would provide proper tagging and encoding of typographical characters without the user having to know what these elements are, just what they do.



December 10, 2002

Model of Attraction Outline - Version 1

The Model of Attraction ouline version 1 is now posted. The outline has been structured to set up a structure for filling in the blanks and providing a better strucutre for understanding the MoA. Outlines are my foundations for writing more serious works. Outlines help me find holes and provide a structure to rest content upon. This verion is largely attributed a train ride to Philly that allowed me time and untethered space to think, order, and write.

Please comment if you are so inclined. Find holes are areas that do not seem fully fleshed out enough. Thank you in advance.



December 5, 2002

Content Management, what is in it for me

Content Management, what is in it for me? is answered and explained in this Intranet Journal article. This article asks and, in tern, answers many of the questions relating to content management (CM) and CM systems (CMS). The term Content Management has come to include document management and other elements that used to be loosely related. CM used to focus just on Website content management, but as organizations grew their electronic information repositories many of the piles of digital bits became tied to or extensions of the content management system (CMS). I have had similar discussions with the electronic document management folks I work with and the lines are very blurry these days.


Snowday and lessons on not caching and using static pages

It is a personal snowday as the U.S. Federal Government is open (so far), but on unscheduled leave. Oddly enough the OPM (Office of Personnel Management) status site does not have a no-cache setting in the header. This means you browsers, even your modern browser, will cache the page and require you the user to manually refresh the page. These pages may even be cached locally by ISPs and connectivity providers, meaning the information is not up to date. The page is rather light, with few graphics, but it is not a static page, meaning each page is generated dynamically, which requires more horsepower to serve the page. One days like today every Washington, DC area federal employee and contractor is hitting this page, as well as many DC area private sector employees (most DC companies are open if the Federal Government is open).

I thought most IT folks learned their lesson with the need for "dynamic" pages a year or two ago, but I guess not. Many previously dynamic pages are not dynamic on the back end with content management systems doing the same work they always have done, but building static pages. If the content on the pages is not changing often (this is a subjective term and can roughly defined as content changing ever few hours with heavy site loads are usually candidates for static pages) or are not serving parsable datapoints (tables of data that can have subsets of data selected for viewing). Much of the "dynamic page" hype was generated by marketing folks to non-technical types who made decisions on cool or manly mindsets. Many folks started actually thinking about the need for dynamic pages a couple years ago. Those that decided they did not need dynamic pages for all or even most of their site began to realize they could save on the heavier hardware needed to churn out everypage. These static output folks also found they could withstand much higher hit rates with ease. I have been in meetings with folks that were asking about stress testing site that were static running on boxes that were formerly running dynamic sites. Those of us that understood the processor savings knew this was a foreign concept and they did not understand the server would be able to handle twice to eight times the load previously handled.



November 23, 2002

Get your RSS feed

Yes, I finally got up to speed with the rest of the world and added an RSS feed and have added a new page that will track available vanderwal.net XML documents and RSS feeds. I may make a couple category specific RSS feeds as there is interest. Use the (now working again) comments or the contact to let me know what you would like.

I have only put out the first RSS feed in 0.91 at the moment. I may upgrade it in the near future as I now have it relatively easy to build from my end. I have been getting a decent amount of pestering and bother from folks asking for the feed. You see I still build my own CMS for the site and it takes time and priority to get around to some of these things.

Why not move to Movable Type or Drupal (the only two I would currently consider)? I enjoy building my own system, but it does require that I build my own APIs or even my own applications to mirror functionality. I like building CMS and this one is one of six that I have designed and/or fully built since 1997. It is like a hobby for me as well as a job.



This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License.