A Parsing Cache for Highly Efficient XML Data Processing

H. Homayounfar; F. Wang

A Parsing Cache for Highly Efficient XML Data Processing

H. Homayounfar and F. Wang (Canada)

Keywords

XML, Parsing, Querying, Databases, Operating Systems, Kernel, Filesystems, Caching.

Abstract

XML (eXtensible Markup Language) documents are rel atively large. Therefore processing such documents can be very costly. Meanwhile traditional ﬁlesystem buffer ing and caching mechanisms are not designed particularly to support XML parsing. Hence, redundant parsing prob lem causes the current XML parsers and query processors not to be efﬁcient when processing large documents. Also, due to the data structure and algorithms used in the current XML tools, parsing is not fast enough or even providing required resource (i.e. memory) may not be feasible. Fur thermore, XML tools are not designed to provide a sharable parse tree for multiple users, and users have to limit them selves to speciﬁc tools to access the parse tree. To over come these limitations, we introduce a “Parsing Cache”, which uses a well-structured parse tree, an efﬁcient parsing algorithm and a user-cache interface. It provides a resi dent parsing storage specially for XML documents. Due to small memory usage, parse trees can be resident for fur ther processing. The user-cache interface enables multiple users to share the parse tree. The parsing and query time are signiﬁcantly improved compared with the current XML parsers and query managers.

Important Links:

DOI:
From Proceeding (524) Web Technologies, Applications, and Services - 2006

Go Back