Creating a SOMLib Digital Library

Department of Software Technology
Vienna University of Technology

En Route to Data Mining in Legal Text Corpora: Clustering, Neural Computation, and International Treaties The huge amount of data in legal information systems requires a new generation of techniques and tools to assist lawyers in analyzing data and finding critical nuggets of useful knowledge. A promising approach for data mining in legal text corpora is classification. What we are looking for are powerful methods for the exploration of such libraries whereby the detection of similarities between documents is the overall goal. These methods may be used to gain insight in the inherent structure of the various items contained in a text archive. In this paper we present the results from a case study in legal document classification based on an experimental document archive comprising important treaties in public international law. The essentials of our approach are the usage of a vector space document representation and the utilization of an unsupervised artificial neural network for document classification.

Up

Comments: rauber@ifs.tuwien.ac.at