

Table 5.2: probability values assigned to a category depending on where a catchword occurs
value  occurrence
0.7    name
0.7    value
0.5    text before a field, has a colon after catchword
0.3    text before a field
0.3    text after a field
0.1    text after a field, has a colon after catchword
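
The weighting in Table 5.2 can be read as a simple lookup, sketched below in Python. The function name, the location labels and the boolean colon flag are hypothetical names chosen for illustration; only the six numeric values come from the table. The sketch also assumes that a colon makes no difference for the "name" and "value" occurrences, since the table lists no colon variant for them.

```python
# Values transcribed from Table 5.2.
# Keys are (location, colon_after_catchword); the labels are illustrative names.
OCCURRENCE_WEIGHTS = {
    ("name", False): 0.7,
    ("value", False): 0.7,
    ("text_before", True): 0.5,
    ("text_before", False): 0.3,
    ("text_after", False): 0.3,
    ("text_after", True): 0.1,
}


def occurrence_weight(location: str, colon_after: bool = False) -> float:
    """Return the Table 5.2 value for a catchword found at `location`."""
    if location in ("name", "value"):
        # Assumption: the table has no colon variant for these occurrences.
        colon_after = False
    try:
        return OCCURRENCE_WEIGHTS[(location, colon_after)]
    except KeyError:
        raise ValueError(f"unknown occurrence location: {location!r}") from None
```

For example, `occurrence_weight("text_before", colon_after=True)` yields 0.5, while the same catchword followed by a colon after a field yields only 0.1, presumably because such a label is more likely to describe the next field.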



Table 5.3: example categories and the catchwords that point to each category
category    catchwords
password    pwd pass
URL         url domain
e-mail      e-?mail
date        datum
time        zeit time
street      strasse add?ress
land        land bundesl staat
city        [^s,sh,w]ort stadt gemeinde city
number      zahl nummer
keyword     key stichw suche wor[d,t] search query
company     firma
first-name  vorname
name        name
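
The catchwords in Table 5.3 look like regular-expression fragments, so matching a piece of form text against them can be sketched as below. This is a minimal illustration under stated assumptions, not the thesis implementation: the patterns are copied unchanged from the table, while the function name and the choice of case-insensitive substring search with Python's `re` module are assumptions.

```python
import re

# Catchword patterns transcribed unchanged from Table 5.3.
CATCHWORDS = {
    "password": ["pwd", "pass"],
    "URL": ["url", "domain"],
    "e-mail": ["e-?mail"],
    "date": ["datum"],
    "time": ["zeit", "time"],
    "street": ["strasse", "add?ress"],
    "land": ["land", "bundesl", "staat"],
    "city": ["[^s,sh,w]ort", "stadt", "gemeinde", "city"],
    "number": ["zahl", "nummer"],
    "keyword": ["key", "stichw", "suche", "wor[d,t]", "search", "query"],
    "company": ["firma"],
    "first-name": ["vorname"],
    "name": ["name"],
}


def matching_categories(text):
    """Return every category with at least one catchword found in `text`."""
    lowered = text.lower()  # assumption: matching ignores case
    return [
        category
        for category, patterns in CATCHWORDS.items()
        if any(re.search(pattern, lowered) for pattern in patterns)
    ]
```

Under these assumptions a label such as "Vorname" matches both first-name and name, because "name" is a substring of "vorname"; how such overlaps are resolved is not stated in the tables.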



Figure 5.1: samples of keyword-search variation


Figure 5.2: summarised results


Table 5.4: the test examples used
index  URL                                                                  meaning
1      http://www.lzk.ac.at/cgi-bin/sides-such-boku.pl                      keyword-search
2      http://www.nextra.at/german/Suche/default.asp                        keyword-search
3      http://www.liberale.at/index.php3                                    newsletter
4      http://search.atomz.com/search/                                      keyword-search
5      http://go4it.servus.at/cgi-bin/texis/webinator/servussearch/         keyword-search
6      http://www.eva.ac.at/pop/result.htm                                  keyword-search
7      http://mywhois.domainsave.at/whois.pl                                domain-search
8      http://novsrv3.ub.tuwien.ac.at/cgi-bin/search.pl                     advanced-keyword-search
9      http://www.kv.avalon.at/cgi-bin/subscribe.pl                         newsletter
10     http://www.eva.ac.at/pop/dank.rxml                                   newsletter
11     http://order.reddothost.com/whois.pl                                 domain-search
12     http://www.adis.at/cgi-bin/htsearch                                  keyword-search
13     http://www.tuwien.ac.at/pr/cgi-bin/forum.pl                          posting
14     http://www.viennaairport.com/scripts/samples/search/vie_suche_d.idq  keyword-search
15     http://energytech.at/(de)/kontakt.html                               contact
16     http://www.spar.co.at/cgi-bin/htsearch                               (advanced-)keyword-search
17     http://www.sil.at/search-scripts/htsearch                            (advanced-)keyword-search
18     http://www.kosmos.at/_vti_bin/shtml.exe/informationen.htm            request
19     http://www.oeffentlicherdienst.at/cgi-bin/mail.cgi                   prize draw (Gewinnspiel)
20     http://www.oebv.com/cgi-bin/mail.cgi                                 question/contact



Andreas Aschenbrenner