TY - JOUR
T1 - A new approach to modeling personal office documents
AU - Mhlanga, Fortune S.
AU - Zhu, Zhijian
AU - Wang, Jason T.L.
AU - Ng, Peter A.
N1 - Funding Information: This work was supported in part by the National Science Foundation under Grant No. IRI-9224602, by the New Jersey Institute of Technology under Grant No. 421280, and by a grant from the AT&T Foundation.
PY - 1995/11
Y1 - 1995/11
N2 - This paper proposes a new approach to modeling documents in a personal office environment. The proposed document model (called the D_model) combines the relational and object-oriented paradigms and adopts a very natural view for describing the office documents. Documents are grouped into classes. Each class is characterized by a frame template, which describes the type for the class of documents. A frame template is instantiated by providing it with values to form a frame instance, representing a synopsis of a particular document associated with the template. Based on the nature of their contents, different frame instances can be grouped into a folder. Thus, a folder is a set of frame instances which may or may not be associated with the same template. The D_model describes documents using dual hierarchies: a document type hierarchy, depicting the structural organization of the documents, and a folder organization, representing the user's logical file structure. The document type hierarchy exploits structural commonalities between frame templates. Such a hierarchy helps to classify various documents. The folder organization mimics the user's real-world document filing system and provides the user with an intuitively clear view of his/her file structure. Such a view facilitates document retrieval and filing activities. We also present an algebra (called the D_algebra) for manipulating frame instances contained in folders. In contrast to existing algebraic languages, the D_algebra provides operators for manipulating heterogeneous sets (i.e. sets with elements of different types). The proposed document model and algebraic language have been implemented as part of TEXPROS, a personal document processing system currently running in our laboratory.
KW - Data modeling
KW - Document databases
KW - Office information systems
KW - Query algebras
KW - Relational and object-oriented technology
UR - http://www.scopus.com/inward/record.url?scp=0029408279&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0029408279&partnerID=8YFLogxK
U2 - 10.1016/0169-023X(95)00019-O
DO - 10.1016/0169-023X(95)00019-O
M3 - Article
AN - SCOPUS:0029408279
SN - 0169-023X
VL - 17
SP - 127
EP - 158
JO - Data and Knowledge Engineering
JF - Data and Knowledge Engineering
IS - 2
ER -