Binary vs. ordinary XML

January 11th, 2011

There have been plenty of attempts to set a specification for a binary format of XML. Proponents of such a standard are drawn in by the less-wordy nature and easy parsing of binary XML. The format also supports features lacking in standard XML documentsfor instance, random access and indexing. The first makes it possible to access an arbitrary entry in a sequence in an equal amount of time versus having to access all the data in sequencethe difference between fast-forwarding through songs on a cassette and selecting a specific song on a record. Similarly, indexing defines an array of data (here, XML documents) that can be easily looked up.However, the lack of a standard for binary XML means that it’s difficult to find an editor that will properly view the documentsimple text editors as well as more advanced third-party tools often run into problems when trying to read a binary XML document. A developer may use binary XML when the standard format is unavailable, such as when memory is limited. In these cases, there must be an easy way to translate the document to a format which can be easily edited.Some features of binary XML can be reproduced in the standard format. The gzip software application for file compression can help make XML workable on even a lower-performance machine, while the Abstract Syntax Notation One (ASN.1) standard can be used to represent, encode, decode, and transmit data more efficiently. In fact, ASN.1 is the basis for the proposed Fast Infoset binary XML standard. Fast Infoset is being developed by both the Telecommunication Standardization Sector of the International Telecommunication Union (ITU-T) as well as the International Organization for Standardization (ISO). ITU-T published the standard in 2005, and ISO published it in 2007. Currently, Fast Infoset is popular in mobile device XML applications (which have a lower bandwidth connection) and when dealing with a large volume of persisting data. However, Fast Infoset is not the only proposed binary XML standard.The ISO’s MPEG working group developed the Binary MPEG format for XML (BiM), while W3C selected AgileDelta’s Efficient XML as the basis for its own Standard for Binary XML (EXI). And that’s just the tip of the iceberg. The lack of consensus in this area has prevented binary XML’s widespread adoption, but features of it remain highly useful, and help spur its advocates toward creating a universal definition.

Comments are closed.