An adaptive, fast, and safe XML parser based on byte sequences memorization

Toshiro Takase, Hisashi Miyashita, Toyotaro Suzumura, Michiaki Tatsubori

Open Access

Publication year - 2005

Publication title - CiteSeerX (The Pennsylvania State University)

Language(s) - English

Resource type - Conference proceedings

ISBN - 1-59593-046-9

DOI - 10.1145/1060745.1060845

Subject(s) - computer science, byte, parsing, Simple API for XML, XML, programming language, natural language processing, memorization, artificial intelligence, information retrieval, Efficient XML Interchange, World Wide Web, XML Signature, linguistics, philosophy

XML (Extensible Markup Language) processing can incur significant runtime overhead in XML-based infrastructural middleware such as Web service application servers. This paper proposes a novel mechanism for efficiently processing similar XML documents. Given a new XML document as a byte sequence, the proposed parser usually skips syntactic analysis and instead matches the document against previously processed ones, reusing their results. The parser is adaptive: it partially parses, and then remembers, XML document fragments that it has not met before. It is also safe: its partial parsing correctly checks the well-formedness of documents. Our implementation of the proposed parser complies with the JSR 63 standard of the Java API for XML Processing (JAXP) 1.1 specification. We evaluated the performance of the parser, Deltarser, with messages using Google Web services. Compared with Piccolo (and Apache Xerces), it effectively parses 35% (106%) faster in a server-side use-case scenario, and 73% (126%) faster in a client-side use-case scenario.
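
The core idea of the abstract, reusing earlier parse results when the incoming bytes have been seen before, can be illustrated with a minimal sketch. This is not Deltarser's algorithm: the abstract describes matching and partially re-parsing document fragments, whereas the sketch below is a deliberately simplified whole-document cache written in Python with the standard `xml.sax` module. The names `RecordingHandler`, `MemoizingParser`, and `full_parses` are hypothetical and introduced only for illustration.

```python
import xml.sax
from xml.sax.handler import ContentHandler


class RecordingHandler(ContentHandler):
    """Records SAX events as plain tuples so they can be reused later."""

    def __init__(self):
        super().__init__()
        self.events = []

    def startElement(self, name, attrs):
        self.events.append(("start", name, dict(attrs.items())))

    def endElement(self, name):
        self.events.append(("end", name))

    def characters(self, content):
        self.events.append(("chars", content))


class MemoizingParser:
    """Hypothetical whole-document cache keyed by the raw byte sequence.

    Simplifying assumption: only exact byte-for-byte repeats are reused.
    The paper's parser also reuses results for partially matching documents.
    """

    def __init__(self):
        self._cache = {}       # bytes -> tuple of recorded SAX events
        self.full_parses = 0   # how many times a real parse was needed

    def parse(self, document: bytes):
        cached = self._cache.get(document)
        if cached is not None:
            # Seen before: skip syntactic analysis and reuse the result.
            return cached
        handler = RecordingHandler()
        # Unseen bytes: run a real parse, which raises SAXParseException
        # if the document is not well-formed, so nothing invalid is cached.
        xml.sax.parseString(document, handler)
        self.full_parses += 1
        events = tuple(handler.events)
        self._cache[document] = events
        return events
```

Calling `parse` twice with the same bytes runs the underlying SAX parser only once; the second call returns the remembered event sequence. A document that is not well-formed raises an exception on its first parse and is never stored, which mirrors, in a much weaker form, the safety property the abstract claims.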
