Seminar Kosice - Vienna _____

(link to homepage of first seminar: WDA 2000)

Overview

Kurzbeschreibung

Overview


General Information

Following the great success of the first turn of this international seminar on data mining and clustering algorithms in Kosice in 2000 (WDA 2000), we will again offer this seminar this year.

The seminar will be organized as a Student Workshop with participants from the Vienna University of Technology, Austria, and the Technical University of Kosice, Slovakia as a cooperation between the Department of Software Technology (IfS) at VUT Vienna and the Department of Artificial Intelligence at TUKE, Kosice. The main goal of this seminar is to bring together students who are interested in the field of data mining, to discuss and exchange ideas and experiences.
We will analyze and compare a set of data analysis techniques based on some reference data set. We'll then make a two-day trip to Budapest, Hungary, where the individual results of the various approaches will be presented and discussed in an inspiring atmosphere. Thus, every participant will gain a good knowledge and overview of the strengths, weaknesses and applicabilities of the various approaches.

Apart from that, we will defintely also have time for some 'social program' apart from the seminar itself, as one of the central ideas of this seminar is to get people together and have fun while doing some reasonable and interesting work :)

For details on last year's seminar as well as for some pictures, see the WDA 2000 Homepage.

Goal of the Seminar

The goal is to analyze and compare a set of text mining techniques based on some reference data set. The individual results of the various approaches will be presented at this seminar, followed by a comparison of these results. Thus, every pasrticipant will gain a good knowledge and overview of the strengths, weaknesses and applicabilities of the various approaches.

Methods Used

A set of different methods will be used for analysis, namely

Experiments Data

We will use 3 different data sets for our experiments, each of which has different characteristics. Thus, we should be able to analyze the strengths and weaknesses of the various approaches with respect to different types of data to be analyzed. The 3 datasets are as follows:

The Paper

Each participant shall write a paper to be presented at our Workshop meeting in May/June. Basically, the paper shall comprise the following: The length of the paper shall be between 5 to 12 pages in the ACM style. Style files for MS Word and LaTeX and other word processors can be downloaded from http://www.acm.org/pubs/submitting_accepted_articles/au_dl.htm

R E S U L T S

Preliminary Schedule

Comments / Questions

In case you have questions, please contact:


BACK