Department of Software Technology
Vienna University of Technology
Creating an Order in Distributed Digital Libraries by
Integrating Independent Self-Organizing Maps
Abstract:
Digital document libraries are an almost perfect application arena for
unsupervised neural networks. This is because many of the operations
computers have to perform on text documents are classification tasks
based on "noisy" input patterns. The "noise" arises because of
the known inaccuracy of mapping natural language to an indexing
vocabulary representing the contents of the documents.
A growing number of papers are dedicated to the use of self-organizing
maps to organize the contents of such digital libraries. These papers
assume the central availability of the data, an assumption that is
questionable given the massive amount of available information. In this
paper we describe an approach for organizing distributed digital libraries
based on a system of independent self-organizing maps, each of which
represents just a portion of the complete digital library.
Furthermore, we argue in favor of integrating these independent maps
in a hierarchical fashion, again by means of self-organizing maps.
The integration is based on the trained low-level maps.
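
The two-level arrangement described above can be illustrated with a minimal
sketch. It is not the implementation from the paper: the toy term-weight
vectors, the map sizes, the training schedule, and the helper names
train_som and best_unit are all assumptions made for illustration. In
particular, the sketch assumes that "based on the trained low-level maps"
means the higher-level map is trained on the weight vectors of the
low-level units, so that no document has to leave its own site.

```python
import math
import random


def best_unit(weights, x):
    """Index of the unit whose weight vector is closest to x (squared Euclidean)."""
    return min(
        range(len(weights)),
        key=lambda u: sum((w - v) ** 2 for w, v in zip(weights[u], x)),
    )


def train_som(data, rows, cols, epochs=50, lr0=0.5, seed=0):
    """Train a small rectangular SOM; returns the weight vectors in row-major order."""
    rng = random.Random(seed)
    dim = len(data[0])
    weights = [[rng.random() for _ in range(dim)] for _ in range(rows * cols)]
    sigma0 = max(rows, cols) / 2.0
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # shrinking neighborhood radius
        for x in rng.sample(data, len(data)):    # present inputs in random order
            br, bc = divmod(best_unit(weights, x), cols)
            for u, w in enumerate(weights):
                ur, uc = divmod(u, cols)
                d2 = (ur - br) ** 2 + (uc - bc) ** 2
                h = math.exp(-d2 / (2.0 * sigma * sigma))  # Gaussian neighborhood
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
    return weights


# Hypothetical toy data: two sites, each holding its own documents as
# 3-term weight vectors. Real document vectors would be far larger.
site_a = [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1], [0.1, 0.9, 0.0], [0.2, 0.8, 0.1]]
site_b = [[0.0, 0.1, 0.9], [0.1, 0.0, 0.8], [0.9, 0.0, 0.1], [0.8, 0.1, 0.2]]

# Low level: one independent map per site, trained only on local documents.
low_maps = [train_som(site, 2, 2, seed=k) for k, site in enumerate([site_a, site_b])]

# High level (assumption): train the integrating map on the low-level
# weight vectors rather than on the documents themselves.
top_input = [w for som in low_maps for w in som]
top_map = train_som(top_input, 2, 2, seed=99)

# Each low-level unit can then be located on the integrating map.
placement = [best_unit(top_map, w) for w in top_input]
```

Under this assumption only the weight vectors of the trained maps are
exchanged between sites, which is considerably less data than the document
collections they summarize.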
Comments: rauber@ifs.tuwien.ac.at