Information source database based searching system on the Internet


Nagy Tamás <bigtom@avalon.aut.bme.hu>

BME, Automatizálási és Alk. Informatikai Tanszék



Indexed database based searching systems used recently have a number of problems. They are imprecise, not up to date, and they can’t keep up with the growing of Internet. The most important reason for these problems is that the application doing the searching part of the job (the web robot) hasn’t got connection with the user asking for information, thus the robot wants to catalogue the entire web instead of searching for a specific (a precisely defined) information.


The central part of the system, I am working on now, is a database, which contains the addresses of information sources. The information source is a web page, which helps to reach a specific information, because via this page, the web robot can find the information in fewer steps than via another page. It means that if you want to find a piece of information you should start browsing from that page, thus the web robot is starting searching of the web from that web page, too. It also means that the indexed database based search-pages are good information sources in many subjects, but we believe, that for a specific information we can find better information sources (for example because search pages uses these web pages also). The aim is to create a database where the web robot can find these information sources for a given information type, and the robot can use them to find pages, which are more relevant for a given query.