In order to simulate a distributed digital library, the documents
of the CIA Worldfactbook have been split randomly into five parts
consisting of 50 documents each.
Note that five documents are thus assigned to two parts at a time.
Each part of the library was then used for training of a separate
self-organizing map.
Each of these maps represents a topologically ordered portion of its
subset of the document library.
Due to space restrictions we can only show one of these maps in this
paper, see Figure 2.
The other four maps, however, are rather similar in spirit.
In a second step, we integrated the five independent maps
into one single
self-organizing map which now represents the
complete document library.
The map that results from this integration is shown in
Figure 3.
Note that the main clusters are clearly visible from the map representation,
with the information from the low level maps being arranged according
to the organizational principles of the high level map.
As examples consider the area containing African countries in the left
upper part of the map, the area of oil producing countries,
the area of countries from Latin America, or
the area of countries usually referred to as the
first world in the right upper part.
With the latter area, please note the explicit distinction into countries
belonging to the Western hemisphere and those belonging to the
Eastern hemisphere1.
Finally, it is worth noting that the countries that are contained twice
in the library are pairwise assigned to the same unit in the
integrated map.
These countries are Austria, Comoros,
Iceland, Japan, and Mozambique.
In a nutshell, the higher level map forms an orderly mapping based on the information contained in several low level maps, each of which representing a portion of the library. The higher level map thus is a convenient starting point for a user trying to find her orientation in a distributed digital library.