![]() |
status privacy about contact |
|
Welcome to Orcmid's Lair, the playground for family connections, pastimes, and scholarly vocation -- the collected professional and recreational work of Dennis E. Hamilton
Archives
![]() Atom Feed Associated Blogs ![]() Recent Items |
2007-02-16Blogger: Sudden-Death Updating
Previously, I have expressed my dismay for the risks that live updating of web-based applications expose to users. While I’ve appreciated the support efforts, I have not passed uninjured through the gauntlet of Blogger glitches and deployed defects. It taught me to be thankful that I have Blogger FTP my page updates to my own site, for which I learned to introduce incidence response and recovery procedures. I’ve known there’s been a “new” Blogger for some time. Today, I find that I am being forced to update. The template adjustment that I just made will be the last time I can enter my account without updating to the new Blogger. I don’t even know whether this post will be accepted. If I hadn’t logged on the old-fashioned way in order to update a template, I wouldn’t have known what was happening. We’ll see how BlogJet manages to connect. Fortunately, I had backed up Orcmid’s Lair just yesterday while performing some of my continuing repaving of web sites. I think I’ll back up the others right now, and then we’ll see what Blogger has in store for me in the latest and newest. update 2007-02-17T19:17Z Well, it went so smoothly that I thought I was home free. That is, until I wanted to update this post as part of a logo-image adjustment that I'll talk about separately. Even though I have given BlogJet the new User Id and Password that "New Blogger" requires, it is unable to retrieve previous posts through the Blogger API. I see that this page where I am editing the update is at a different web address than I have seen for Blogger before. Hmm, I'll look into that after I see if the logo adjustments to the page template are working properly. update 2007–02–18T02:05Z I updated to BlogJet 2.0.0.9 but that didn’t cure my problem. I fiddled around in an effort to change the BlogJet settings for this Blogger account. After considerable fooling around, I discovered that the developer documentation on Blogger hasn’t been updated. The correct update 2007–02–18T02:13Z I also learned that the new-Blogger labels “feature” doesn’t work with my blog, probably because Orcmid’s Lair it is hosted on my own site and not BlogSpot. But BlogJet supports tags and I will have to see what happens when I set them. update 2007–02–18T02:28 OK, OK, I was a little careless with the actual settings that I figured out. It helps to have someone else double-check, huh? Comments: Post a Comment 2007-02-15OOX-ODF: Artillery Duels and Document Sleuthing
I read my RSS feeds in reverse chronological order, just like reading through someone’s blog. That’s how I organize them. That’s how Brent Phillips’ post was the first to lead me to the Microsoft open letter on Interoperability, Choice and Open XML . Today brought many other references to this public letter from General Managers Tom Robertson and Jean Paoli, and I won’t be rehashing those. {tags: orcmid OOX ODF ISO ECMA OASIS open document standards} The press coverage I’ve seen simply summarizes the points of the letter and attempts to provide context on the OOX-ODF contest. As I expected, there are also those who are not amused and even embarrassed. For my part, I’m not thrilled about the continuous speculations about motives. I’m also disappointed in the adversarial approach of some prominent ODF boosters, when the energy might be better spent making ODF complete and robust, having multiple implementations that have demonstrably-reliable interchange in interoperability settings. My greater concern is the credulity with which claims of ill-intent are accepted and repeated. I’m disappointed by the lack of much effort to fact check and provide objective analysis. The Microsoft open letter is something to be taken quite seriously. It is a public, official statement. My approach to this letter is to ignore the claims about IBM (whether they have any foundation or not) and extract from it Microsoft’s affirmative commitment to interoperability and acceptance of heterogeneity. I want to see continuing demonstration of that, and I will take this as part of the pledge that is being molded into action at Microsoft. Adding sauce to the fire fight, I find the IBM Open Client offering and this assertion to be quite perplexing: “Alternative to Vendor Lock-In Includes Lotus Software Running With Linux or Windows and Macintosh” Since “standards-based” Lotus Software is a sole-source deal, exactly which vendor’s lock-in should we be paying attention to here? Looking at the official press release, I can see how Forbes writer Daniel Lyons managed to have so much fun with it. This is a discouraging use of “Open” in light of the high-minded “open standards” criteria articulated by IBM’s Bob Sutor. In other news: A document purported to be the Australian response to the ISO JTC1 30–day “contradiction” review of ECMA-376 was leaked onto the Internet for a short time on February 14. The PDF file, created on February 6, appears to replicate a two-page faxed letter from Alistair Tegart, Standards Development Program Manager of Standards Australia. The letter is addressed to Lisa Rajchel, ISO JTC1 Secretary at ANSI in New York City (though not to the official address). The file was deep-linked in an exposed web-site subdirectory of an Open Document Format advocacy organization in Queensland, Australia. The file was directly linked from a U.S. blog post entitled “Australia’s comments on OOXML during the Contradiction Period” along with a suggestion of OOX’s negative reception based on the concerns expressed in the response. I found the linked document to be quite even-handed and temperate, although the uproar about contradictions was acknowledged to be a factor in the recommendation being made. It was startling to encounter this, since the Microsoft open letter was fresh in my mind, along with Martin LaMonica’s observation that Microsoft’s Tom Robertson “declined to offer more details on IBM's activities because the ISO standardization process is closed.” The blog post had disappeared from the U.S. site by late afternoon, gmt-0800. The document PDF disappeared from the Australian site some time thereafter. My RSS feed outlived the first or I would have missed witnessing the delayed triumph of prudence over righteous indignation. In researching this incident (mostly to find out who Lisa Rajchel is and who occupies the 13th floor of the former Salmon Tower across 42d Street from the New York City Public Library), I learned a lot about the “ISO/IEC JTC 1 N8455 30–day review ECMA-376|ISO/IEC DIS 29500 Office Open XML File Formats.” I learned that ODF advocacy organizations seem to have all of the JTC1 communications concerning ECMA-376, such as those here. I learned that ECMA-376 is ISO/IEC DIS 29500 and you can track its progress at ISO with the 29500 identifier. I learned that there was indeed a single-file ECMA-376.pdf that was available for a time from the ECMA site. (I learned that search engine cache pages are a wonderful thing). I also learned that some ODF advocates knew there was also a five-part set that was easier to handle. I am relearning that as long as your opponent is held to be evil incarnate, any excess is justifiable. I had forgotten that I have been in a standards war or three, and the last time people who supported one specification activity insisted that the other one should not exist, and vice versa. I couldn’t see the conflict, myself, seeing the efforts as complementary and both welcome. I worked hard to accomplish one of them (DMA), now I’m engaged in preserving the other (ODMA). Comments: "I’m disappointed by the lack of much effort to fact check and provide objective analysis." Really. Not only I have fact-checked Brian Jones and Doug Mahugh the last few weeks, I have also fact-checked your ass on the many inconsistencies you have dared post (in hope that it would help Microsoft cause; those little guys need you I guess). You just cannot even accept basic facts as truth. You might be interested in learning that I know this area very well (I am the vendor behind diffopc and xlsgen), and should you visit sites like openxmldeveloper.org, it might come to you that of all you are the one participating in adding FUD, while I am an authority in this area. Man, it took me three blog comments to get you to to password-protect an OOXML document and make you realize you are backing liars (claiming OOXML is 100% ZIP). Keep up the good work... 2007-02-11OOX-ODF and the Mystery Document Flavors
I just posted an update to the previous post where I puzzled over the republication of the 5 separate parts of the ECMA-376 specification on the ECMA site. I was particularly baffled about the agitation about them because those parts are closely related to the original October Final Draft parts that were submitted for ECMA approval. There appear to be no changes of substance or even of observable form, other than making them into the official December 2006 ECMA-376 documents. {tags: orcmid OOX ODF ISO ECMA OASIS open document standards} What I didn’t realize is that a different form of this material appears to have been used in the submission to ISO. Here’s what I found, for the benefit of those who won’t see any RSS feed for the updated earlier post: update 2007-02-12T01:50Z: I got it! update 2007-02-12T01:59Z tweaking the prose a little more; and 2007-02-12T07:32T for smoothing between this and the previous post: First, there is no question in my mind that the current downloads constitute the official December 2006 ECMA-376 documents, with all of the October 2006 Final Draft title pages corrected and an official, overall cover (two sheets) attached to the front of Part 1. I am satisfied that this is precisely the same content that was approved by ECMA and also used to form the ISO submission.
Since I had been downloading the drafts as they were announced as available from ECMA TC45, I was not particularly concerned about obtaining replacements for my copies of the final drafts. I lucked out: I had no idea they were ever in any final form but the five parts, each in two formats (.docx and .pdf). When I submitted comments to ECMA, I was able to use those far-more-manageable drafts from October and earlier. Although Stephen Johnson has provided a similar collation of the official December 2006 downloads, I don’t recommend using it. The five separate parts are superior in every respect. They are far more suitable for review, analysis, and exchange of comments. And if you identify the part you are referring to, using section numbers or the printed page numbers, it won’t matter whether you are using the mother-of-all-documents or its five handier constituents. I have no idea what procedural requirements led to the submission of a single PDF file. I hope that people were advised of the availability of the separated parts in their easier-to-review form. I can certainly understand how relying on the PDF-provided page-count positions would be confusing if used instead of the printed page numbers. Comments: Post a Comment How Publicly-Owned Document Standards Matter
Thanks to Bob Sutor’s morning link post, I was led to Shane Schick’s valuable Globe and Mail column, “Will today’s text files be tomorrow’s stone tablets?” This is a very handy 30,000-foot account of current activities with regard to obtaining some sort of ISO imprimatur for document-format specifications. Some details are a little off (e.g., companies don’t submit specifications directly to ISO), but I don’t see anything that seriously detracts from this worthy overview. My title is a little off too, since having document-format specifications promulgated by various standards-ratifying bodies is about as directly public as the U.S. Electoral College system. But it’s what we have (that is, ISO and the various nation-representing bodies). It is always good to ask “who owns my documents” and “who owns my ability to access and preserve my documents,” and also “who owns the format I am counting on for stability and continued usability,” whether they are in the custody of Google or of some application (let’s say Mind Manager for variety) on your desktop computer. Comments: Post a Comment OOX-ODF: The Danger of Finding Only What You're Looking For
One problem that scientific investigators must always be cautious about is finding what we are looking for because that is all we are able to see. This has tripped up many an “objective” person. I confess to getting that particular pie in my face often enough to be very careful. I’ll not be surprised when it happens again. I do promise to correct whatever is necessary. This human condition is amply demonstrated when we are making abstract attributions and speculations about the actions of others. It is easy to see all conduct as evidence that justifies the attitude we already bring to the party. Our spectacles are already smeared with the stain of our own prejudgments. It is easy to see how this arises in the way everything Microsoft does or doesn’t do is constantly explained by the corporation’s malevolent intentions. This applies to the simplest acts and some silly and sometimes completely stupid moves. But the indignant proclamations of further evidence of evil intent continues. They are not even “convicted” of being a monopoly — that’s not illegal, but that is often said in justifying every suspicion. Similarly, to consider that the behavior of IBM as a corporation and of a few visible IBM employees in opposing the promulgation of Office Open XML (OOX) to be a considered, calculated business maneuver is, to my mind, giving IBM far too much credit. There is considerable risk of potential embarrassment (and worse) were such machinations substantiated by non-repudiatable facts. There are too many other explanations that fit the observed behavior, especially for all of the years the IBM internal echo chamber has had to establish the received wisdom that Microsoft did them wrong (out IBM-ing IBM, more-or-less, for those of us with long memories). I’m willing to believe that it is really personal. I am not prepared to go farther than that. There are lots of agendas here. I doubt that one can isolate and confirm a single one to fit all of the conduct that we see. The same goes for Microsoft when one employee or another lashes out in some particular way. The so-called evidence for contradiction in OOX that has been compiled at Groklaw is another example of the lengths we can go when we are too happy with our findings and are careless with over-reaching interpretations of facts that are open for anyone to inspect more carefully. There may be a pony in there, but the readiness to accept blatant nonsense tends to smear the pony with manure. I’ll pick the first example that came to my attention, because it is so clear-cut. (It is not my purpose to engage in an extensive analysis of complaints about OOX, I’m looking at where attitude leads to credulity.) After that I’ll turn to a more-recent sequence of extrapolations that are even sillier, especially considering how easy it is to check. {tags: orcmid OOX ODF ISO ECMA OASIS open document standards} You Say Bright Green, I say ChartreuseOn January 28, Sam Hiser blogged about a problem with the way colors are handled in OOX. The summary statement in his blog feed was pretty direct:
On the blog page there’s a little table with two different sets of color mappings for a single set of color names (dark blue, dark cyan, etc.). I first thought the difference might have to do with the choice of standard web color codes, but the OOX choices don’t completely line up with those either. After looking at Yoon Kit Hasan Saidin’s article, I concluded two things (in a long-winded comment), and Yoon Kit has replied. What seems to have gotten lost in the discussion, and I am certainly at fault for not writing more crisply, is this:
Well, where did those widely different color mappings in the blog-post table come from? They arose by taking the codes for oranges and lining them up with codes for apples (sorry). The codes shown for OOX mappings have nothing to do with the SVG names nor the use of colors in DrawingML. The OOX ones being compared with the identified SVG colors are the OOX colors for highlighting over text. OOX has a limited set of highlighting colors for use in Wordprocessing ML and only in WordprocessingML. They are also named by fixed, rigidly-defined English-language text strings in attribute values. The red-green-blue levels that those attribute values correspond to are also rigorously and clearly defined. Some happen to be different mappings than ones used for similarly-spelled (but different-)attribute values in DrawingML. The highlight-color attributes are different attributes with different use, and there is no apparent intention to correlate them with the same names when used for the SVG and DrawingML colors. These never show up in DrawingML. Similarly, implementing them correctly is trivial when following the specification. And, in fact, not even Microsoft Office Word 2007 uses all of the same names in its English (US) interface for selection of highlight colors. The attribute values are for technical coding of colors. They are not about what users see or what the colors might be called by different users. OOX doesn’t prescribe any of that. You might have done this differently, and I might have also, given a blank sheet of paper to start with. It doesn’t matter. This is not rocket science. Yet the declaration of Yoon Kit, parroted by Sam is “MSOOXML contradicts W3C SVG Colour definitions” and the comparison in the table between SVG colors and the highlight colors in OOX WordProcessingML is simply bogus. What I say: No harm, no foul. Interesting way to learn more about how OOX is specified. The Mysterious Document Updates[updated 2007-02-12T07:15Z to smooth out some bumps and account for a third flavor, the single 5-parts-in-one PDF that was apparently submitted to ISO.] On February 8, Rob Weir posted about “Here Today, Gone Tomorrow.” Here;s the gist of it:
Of course there are hilariously speculative comments to go with the full post. And Microsoft’s Brian Jones is now archly pestered by some messenger of joy leaving innuendos on his blog and challenging him to explain what happened. Let’s take the section number issue first. It is true that the ECMA specification is in 5 I have a different beef with the section numbering in the OOX specification. The tables of content Now, about the repagination. Rob is a smart guy, and he has gone through the big I’m going out on a limb here, because I have no idea what documents were physically delivered to ISO JTC1 as the ECMA submission. What I have in my possession are the files for the December 2006 ECMA-376 as they were on February 9, 2007 when I downloaded them. Of these, the Part 4 DOCX (in a Zip file with other material) version is corrupted (a problem I sometimes have with some documents from some sites), so I couldn’t use Word 2007 to compare it with any earlier edition. I also have the final TC45 Drafts that were created in October for comparison. Here is the result of my explorations.
Differences in the Part 4, Markup Language Reference PDF. This is the big honker that you dive into when you want to see the precise details of all of the attributes and formats of OOX. Here’s where those who are looking for it find buried treasure and smoking guns. So what’s the difference? Well, the PDF that was created on February 1 has one more physical page in it than the PDF that was created on October 6, 2006. That is the blank page after the front title page. The page is counted in the page numbering, it just wasn’t physically present in the October 6 version. I can’t do a line by line or word for word comparison, but I can tell you that the numberings on the pages are identical and the table of contents (and the pages referenced in the table of contents) are identical. When I obtain the DOCX version of the latest downloads, I will have Word 2007 compare them and show me the changes. Meanwhile, if page references are to the numbers on the pages (and not the sequential page-count positions displayed by PDF), there seems to be no problem. And even if the PDF page-count numbers were used, the new document’s PDF page-count numbers for the main section are simply greater by 1 for the same numbered page. [updated 2007-02-11T11:33Z in a poor attempt to distinguish between the numberings PDF shows by counting the pages that are there, in sequence, and the numbers that are printed on the pages.] Differences in Part 1, Fundamentals. I know that older versions of Word can compare two documents and synthesize what are the additions and deletions between one and the other. Versions since at least Office 2003 also provide sidebar annotations that explain changes, making it easy to scroll through all of them. There are other ways to navigate from difference to difference as well. I’d seen a post that suggested that Office 2007 does this even better. So what a great test: doing some document forensics on the OOX specification itself using the current closest implementation. I started with Part 1 for no good reason other than it was first. Using the .docx files now, the final committee draft is dated October 9 and it has 173 pages in the file. These are pages i (un-numbered cover) through viii (end of the Introduction), followed by 166 pages of the main document text. The latest download (dated January 25, 2007 in the file itself) has 178 pages in the file. These are page i (un-numbered cover) through xii (end of Introduction), followed by 166 pages of the main document content. The tables of contents are the same, and the numbered pages of the main text are the same. I mean the same. There is no material difference in content. The pagination on the pages themselves has not changed at all. (These are all formatted for 8.5” by 11” paper, as are the PDFs, by the way.) The difference in the front matter is the addition of two new cover sheets (with blank backs) and modification of the original part 1 title to simply introduce the part and remove text that applied only in the TC45 draft. There are other differences. They are immaterial. They seem to be entirely involved with styles in the tables of content, fields that produce titles over the pages, and formatting of bulleted lists. The result seems to be indistinguishable, but Word 2007 says there was a change. I saw nothing different outside of the 4 new pages of front matter and the edits to the original title page. This is not an exhaustive comparison. Anyone who thinks they might find something significant is welcome to do more. Maybe ECMA could use stronger document controls and account for re-issues better. But there appears to be no difference and the material is on the same-numbered pages that it has been since October. I suspect ECMA might feel they are insulted, though. And rightly so. My provisional assessment: No harm. No foul. Waste of time. On further reflection: some lessons about how to cross-reference documents and also on how to provide some document engineering controls so people know what’s what. Comments: Post a Comment |
![]() |
You are navigating Orcmid's Lair. |
template
created 2002-10-28-07:25 -0800 (pst)
by orcmid |