ECMA Open XML Approved Despite IBM Objection

As was expected, the ECMA approved Microsoft's Office Open XML file format today, despite the lone objection of IBM.

Approval of the specification, however, was not unanimous. IBM said it voted against Open XML, saying OpenDocument, which is the default format in OpenOffice, is a "vastly superior" format, and another standard was unnecessary.

"It (OpenDocument) is an example of a real open standard versus a vendor-dictated spec that documents proprietary products via XML," Bob Sutor, vice president for open source and standards for IBM, said in his blog. "ODF is about the future, Open XML is about the past. We voted for the future."

IBM, according to Microsoft, was the only Ecma member to give the thumbs down. Microsoft's public relations firm also circulated via e-mail a statement from the Initiative for Software Choice praising the approval.

In a blog entry, Andrew Shebanow also asks whether Open XML is a "One Way Standard", echoing earlier concerns about whether anyone other than Microsoft can implement Open XML in full, or only as a subset providing limited interoperability. On his Shebanation blog, Shebanow notes the gargantuan effort that Microsoft's own Mac Office team must undertake to implement the company's standard:

Today, though, a couple of interesting things happened that made me want to write about this. The first is that ECMA approved the Office XML standard over IBM’s objections. That got me thinking about Bob’s piece again. The other is that Rick Schaut of Microsoft’s Mac BU wrote an article explaining very eloquently why the Mac version of Office won’t support the Open XML file format until sometime next year. What struck me when I read the latter piece is that Rick absolutely, positively proves Bob Sutor’s point when he explains what it would take to create a file converter from scratch for Mac Word:

[…] a team of 5 developers will implement 25 handlers a week, which means that we’d have all the XML handlers written in 44 weeks. […] Nevertheless, we’ve taken a little less than a year to get the converters reading the new file format. We still aren’t writing the new file format, we have the RTF side of things to worry about, which is actually more complex than the XML side, and I’ve completely left out all of the design and coding for the intermediate representation of the file. The intermediate representation, itself, is at least 6 to 8 months worth of work.

Got that? It would take 5 developers a year to do a quarter of the work. That means the whole job is roughly 20 man-years of development time. That doesn’t include testing, documentation, or localization. That would probably double the number of man-years, at least. But it gets worse...

Much worse, since these figures are just for Word. Taking into account the other products in the suite, by Shebanow's calculations, it would take Microsoft 120 man-years to implement the format itself. In fact, Microsoft is instead porting the Windows version of the converter to the Mac, since that will take less time (which explains why Mac Office users must wait for Windows Office to be finished before the port can happen). Shebanow's estimate for a competing personal productivity application to fully implement ECMA Open XML: 150 man-years!
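
For readers who want to check the arithmetic, here is a minimal sketch in Python of the Word-only estimate quoted above. The variable names are mine, not Schaut's or Shebanow's, and rounding "a little less than a year" up to one year is an assumption that follows Shebanow's reading. The 120 and 150 man-year totals are not derived here, because the quoted passages do not show how they were reached.

```python
# Figures taken from the Rick Schaut and Andrew Shebanow quotes above.
DEVELOPERS = 5
HANDLERS_PER_WEEK = 25       # output of the whole team of 5, per Schaut
WEEKS_FOR_HANDLERS = 44

# Implied number of XML handlers for Word alone.
handlers = HANDLERS_PER_WEEK * WEEKS_FOR_HANDLERS

# Shebanow's reading: 5 developers for about a year covers a quarter of the job.
YEARS_SPENT = 1.0            # assumption: "a little less than a year", rounded up
FRACTION_DONE = 0.25
dev_man_years = DEVELOPERS * YEARS_SPENT / FRACTION_DONE

# Testing, documentation and localization "would probably double" that, at least.
total_man_years = dev_man_years * 2

print(f"XML handlers for Word: {handlers}")
print(f"Development only: {dev_man_years:.0f} man-years")
print(f"With testing, docs, localization: at least {total_man_years:.0f} man-years")
```

Changing `FRACTION_DONE` or `YEARS_SPENT` shows how sensitive the Word-only figure is to those two rough inputs.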

Apparently, Open XML is overreaching by design: the ECMA sees it as a strength that those who implement the format can provide varying functionality and levels of interoperability (emphasis mine):

At this point, maintenance of the Ecma Open XML standard moves from Microsoft to Technical Committee 45 of Ecma International (no longer all-caps). While supporting vendors remain free to innovate their own functionality, changes to the standard itself must now be approved by TC45.

[...]

"Thanks to the depth of the technical resources the TC45 created, the Open XML standard covers the full set of features used in the existing corpus of billions of documents," reads an Ecma statement this afternoon. "Developers have the flexibility to decide whether they want to take advantage of subsets or the full feature set of the Office Open XML formats. In addition, the format enables organizations to integrate productivity applications with information systems that manage business processes by enabling the use of custom schemas within Open XML documents."

As IBM's Bob Sutor pointed out some time ago, Open XML is a Microsoft marketing tactic: a standard in name only, designed to keep the company's Office products at the center of the IT universe by limiting interoperability with competing products:

Fully and correctly implementing Open XML will require the cloning of a large portion of Microsoft’s product. Best of luck doing that, especially since they have over a decade head start. Also, since they have avoided using industry standards like SVG and MathML, you’ll have to reimplement Microsoft’s flavor of many things. You had better start now. So therefore I conclude that while Microsoft may end up supporting most of Open XML (and we’ll have to see the final products to see how much and how correctly), other products will likely only end up supporting a subset.

That means that other products and software, in practice, will NOT be able to understand arbitrary Open XML that might be thrown at them. There is just too much. Therefore they will only create a bit that they need and send that off. Send it off to whom? The only software that might understand it, namely Microsoft Office.

So this is how I see this playing out: Open XML will be nearly fully read and written by Microsoft products, but only written in subset form by other software. This means that data in Open XML form will be largely sucked into the Microsoft ecosystem but very little will escape for full and practical use elsewhere.

Not all "standards" are equal.
