Off the Top: XML Entries

Showing posts: 31-45 of 68 total posts


January 27, 2003

Apple Word Replacement Rumor and Information Structure Dreams

Rumor has it Apple is working on MS Word replacement. This would be a great thing if it would read native Word files seemlessly, but even better would be turning out valid HTML/XHTML. MS Word has always made a huge mess of our information with its conversion to something it "calls" HTML, it is not even passable HTML. One could not get a job using what Microsoft outputs as HTML as a work sample, heck, it would not even pass the laugh test and it may get somebody fired.

One of the downsides of MS Office products is that they are created for styling of information not marking up information with structure, to which style can hang. MS Word allows people (if the turn on or keep the options turned on) to create information sculptures with structure and formatting of the information. What Word outputs to non-Word formats is an information blob that has lost nearly all of its structure and functionality in any other format. It does not really have the format the Word document to begin with. What Web developers do is put the structure back into the information blob to recreate an information sculpture again.

You ask why is structure important? Structure provides the insight to know what is a header and sub-header. Structure provides the ability to discern bulleted lists and outlines. Structure makes it script-kiddie easy to create a table of contents. Structure makes micro-content accessible and easier to find with search. Structure provides better context. Structure provides the ability to know what is a quote from an external document and point to it easily. Structure provides ease of information portability and mobile access easier. These just name a few uses of structure.

Does MS Word have this structure capability? Yes, do people use it? No really. If people use it does MS Word keep the structure? Rarely, as it usually turns the structure into style. This is much like a somebody who spent months in the gym to build a well defined physique only to have the muscles removed to stuff their own shirt with tissue paper to give it the look of being in shape. Does the person with the tissue paper muscles have the ability to perform the same as the person who is really in shape? Not even close.

Structure is important not only for the attributes listed above, but also for those people that have disabilities and depend on the information being structured to get the same understanding as a person with out disabilities. You say MS Word is an accessible application, you are mostly correct. Does it create accessible information documents? Barely at best. The best format for information structure lay in HTML/XHTML/XML not in styles.

One current place that structure is greatly valuable is Internet search. Google is the top search engine on the Internet. Google uses the text in hyperlinks, the information in title tags, and information in the heading tags to improve the findability of a Web page. What are these tagged elements? Structure.

One of the nice things about a valid HTML/XHTML Web document is I can see it aqnd use it on my cell phone or other mobile devices. You can navigate without buttons and read the page in chunks. Some systems preparse the pages and offer the ability to jump between headings to more quickly get to the information desired.

These are just a few reasons I am intrigued with the Apple rumor. There is hope for well structured documents that can output information in a structured form that can validate to the W3C standards, which browsers now use to properly render the information on the page. I have very little hope in the stories that MS is working toward an XML storage capability for Office documents, because we have heard this same story with the last few Office releases and all were functional lies.



January 23, 2003

Facets made usable with XFML

Introduction to XFML by Peter Van Dijck is a great first step to understanding the eXtreme Football Markup Language eXchangeable Faceted Metadata Language. This article lays out facets, topics, metadata, taxonomy, and provides a framework based on these understandings to build a usable structure to describe objects.



Parse RSS even if it is wrong

Parse RSS at All Cost by Mark Pilgrim what is required to parse RSS properly. More importantly Mark points out that as more RSS feeds are created the feeds are being poorly created. Then Mark instructs how to build a parser that will be a little more forgiving of poor markup.



December 11, 2002

RSS feeds are very Clue Train friendly it seems

Not long after I posted my RSS disconnecting the creator and the user comments it started sinking in that it really does not matter. We it does to some part, but from a user's perspective the RSS allows a quicker more efficient method of scanning for information they have an interest in and easily see from one interface when new content has been written. I use other's blogs and digests to find information to post for my own reflection and to use as jumping boards to new ideas.

Yes, the interaction between creator and user is important, but it is not as important as getting informtion out. I began thinking that the whining about the lack of interaction on my part was rather selfish and very contrary to the focus I have for most information, which is having the abiltiy to access, digest, add to, or reformulate the information into another medium or presentation that will offer possibly better understanding.

I was self-taught in the values of the Clue Train so when I heard about it for the first time I was supprised so some large degree that the manifesto had resonance and turned on a light for many people, for myself and some others, I guess we drank the cool-aide early, as we thought this was the way things were or should be from the beginning of electronic information and a truely open community where information flows freely. Yes, the RSS/RDF/XML feed is a freer flow of information and puts the choice of the information consuption in the user's hands.



December 9, 2002

RSS and interconnections

Since I added the vanderwal.net RSS feed I have been picking up other RSS and RDF feeds. I have been using Ranchero's NetNewsWire Lite to pull many feeds of sites I read on a regular basis. I have become a convert to RSS/RDF extracts. They are a time saver for seeing only updated sites. I have read feeds of many of the news sites from MacReporter for quite sometime, but having personal content and blogs pulled in is quite a timesaver and allows me to get through more information.

I do see a downside of the XML feeds, in the disconnection of the creator from the users. The Web has given us the ability to have digital ghosts that we know come to our sites and possibly read content. This is much like Plato's cave shadow people, in that we do not see the actual people that come to the sites, but we surmise what these visitors are like and what they come to read. Occasionally we receive comments on the site, e-mails from visitors, or best meet folks in person that read/experience your work. It is very much a disconnected work that is built from guesses, for those that try and care (some just build for themselves resources to be used remotely and all others are welcome "free riders", like here). The XML feeds seem to take away another level of the "interaction" between the creator and the users. This relationship is important in communication as the feedback helps shape the message as well as offer paths for both parties to learn and grow.

The XML feeds offer the consumers of the information easier and more efficient means of getting, filtering, and digesting information, but the return path to the creator is diminished. The feeds are a consumer oriented communication channel and not so much an interactive communiction channel. The down side is a lack of true interactive communication, which becomes more of a consuming produced products, much like frozen dinners that get popped in the microwave. The interaction provides the creator with an understanding of how the user consumes the information and what the consumer of the information is finding usable and how the consumer is being drawn to the information. When one cooks their own meals or is being cooked for the meal can be spiced and seasoned appropriately for consumption. The presentation of the food can be modified to enhance pleasure. The live cooking process allows for feedback and modification. Much like the interaction of information in a communication scenario the creator and the consumer have a relationship, as the creator finds the structure and the preferred means of consuming the information the presentation and structure of the information can be altered appropriately.

In a sense the XML feed could be seen as one type of information structure of presentation. There are other options available that can be used to bring back the interaction between the creator and consumer. Relationships and connections are built over this expansive medium of the Web through information and experience. These connections should be respected and provided a place to survive.



December 4, 2002

Mark dives into XFML

Mark Pilgim dives into XFML and provides an excellent set of references to learn more (as always).


November 28, 2002

W3C RDF Primer

The W3C RDF Primer is something to come back to soon. The Resource Description Framework is a solid foundation to sharing information and is getting used more. It is a grown-up's version of RSS (the weblogger's resource sharing XML tool). This information relies on well structured information and helps keep the information structured for reuse.


November 23, 2002

Get your RSS feed

Yes, I finally got up to speed with the rest of the world and added an RSS feed and have added a new page that will track available vanderwal.net XML documents and RSS feeds. I may make a couple category specific RSS feeds as there is interest. Use the (now working again) comments or the contact to let me know what you would like.

I have only put out the first RSS feed in 0.91 at the moment. I may upgrade it in the near future as I now have it relatively easy to build from my end. I have been getting a decent amount of pestering and bother from folks asking for the feed. You see I still build my own CMS for the site and it takes time and priority to get around to some of these things.

Why not move to Movable Type or Drupal (the only two I would currently consider)? I enjoy building my own system, but it does require that I build my own APIs or even my own applications to mirror functionality. I like building CMS and this one is one of six that I have designed and/or fully built since 1997. It is like a hobby for me as well as a job.



November 14, 2002

RDF is more than three letters

You need to buy a vowel to make a word with the letters RDF, but RDF can be used to share informaiton and communicate. Kendall Grant Clark explains the uses for the Resource Description Framework.


October 28, 2002

RSS in the near future

The RSS feed is in the works here and should be ready by the anniversary of the tool, which is October 31st. There have been many upgrades to the application that runs this site in the past year, but one that I really wanted was RSS. I have it running on my laptop and it needs only one tweak and then to get dropped into my main posting page. Did I mention I am loving having my a my development tools and full webserver with all the application elements running on my TiBook laptop again?


October 21, 2002

Microcontent browser

Anil's Dashes Magazine discusses microcontent browsers, which is a great idea that is on the cusp of being here. Anil outlines what is needed and where it will all end up, hopefully.


September 12, 2002


August 21, 2002

Business Maps

Over at O'Reilly Net Marc De Graauw puts forth Business Maps: Topic Maps Go B2B, which seems to be an IAs dream (that would be an enterprise IA or a Macro IA, which are the ones that perform extra enterprise IA). The article also points to an overview of Topic Maps. Good stuff.


May 30, 2002

Standard Data Vocabularies Unquestionably Harmful

Must come back to this when my mind is fresh, Standard Data Vocabularies Unquestionably Harmful over at O'Reilly Net. This seems right up the alley for an IA.


May 16, 2002

Emerging Tech Conference coverage

There is a lot of amazing things that have happened at the Emerging Tech Conference 2002. Great minds discussing great things, what could be better.


This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License.