A Parsing Cache for Highly Efficient XML Data Processing

H. Homayounfar and F. Wang (Canada)

Keywords

XML, Parsing, Querying, Databases, Operating Systems, Kernel, Filesystems, Caching.

Abstract

XML (eXtensible Markup Language) documents are rel atively large. Therefore processing such documents can be very costly. Meanwhile traditional filesystem buffer ing and caching mechanisms are not designed particularly to support XML parsing. Hence, redundant parsing prob lem causes the current XML parsers and query processors not to be efficient when processing large documents. Also, due to the data structure and algorithms used in the current XML tools, parsing is not fast enough or even providing required resource (i.e. memory) may not be feasible. Fur thermore, XML tools are not designed to provide a sharable parse tree for multiple users, and users have to limit them selves to specific tools to access the parse tree. To over come these limitations, we introduce a “Parsing Cache”, which uses a well-structured parse tree, an efficient parsing algorithm and a user-cache interface. It provides a resi dent parsing storage specially for XML documents. Due to small memory usage, parse trees can be resident for fur ther processing. The user-cache interface enables multiple users to share the parse tree. The parsing and query time are significantly improved compared with the current XML parsers and query managers.

Important Links:



Go Back