
A Review On Web Search Engines Retrieval Methods

Extracts from this document...


A Review On Web Search Engines' Retrieval Methods: Best match as principal and Boolean searching as auxiliary

21st December 2001
JIN, JIEJUN
MSc in Information Management
INF 6060 Information Storage and Retrieval

Abstract

This essay analyses why most World Wide Web search engines provide best match searching as their principal retrieval method, with Boolean searching playing an auxiliary role. The World Wide Web has revolutionised the way in which people access information, and search engines are widely used to find useful information on the Web. Both best match and Boolean retrieval systems have advantages and disadvantages. Comparisons of the two retrieval methods show that, for general use in an online environment, best match searching generally performs better than Boolean searching. Nevertheless, as long as Boolean searching is more effective and accurate in some circumstances, it is unlikely to be replaced entirely.

Main Contents

The Importance of Searching The World Wide Web

Seeking information is an activity fundamental to all human beings. Throughout history, one of man's primary concerns has been to satisfy his information needs. With the development of new technologies, the way people access information has changed greatly. Since the Internet was invented in 1969, it has grown rapidly and now extends to every corner of the globe (Poutler (1997)). The World Wide Web (WWW) has revolutionised the way in which people access information, and has opened up new possibilities in areas such as digital libraries and the dissemination and retrieval of scientific information. The Internet is proving to have important implications in areas as diverse as education, commerce, entertainment, and medicine and health care.


... say that users in all types of information retrieval systems face the central difficulty of effective, interactive formulation (and reformulation) of the queries that represent their information problems. Web search engines are no exception. Belkin and Croft (1987) state that experimental studies have shown for some time that, in terms of the recall and precision performance measures, best-match, ranked output retrieval techniques are in general superior in non-interactive settings to exact-match systems, such as commercial Boolean information retrieval systems.

The Boolean retrieval system has been used for several decades, but it has always been subject to certain criticisms. Cooper (1983) says that Boolean systems have some serious drawbacks. The first drawback is that the Boolean language is confusing to the novice. This may be true. Since a Boolean search system accepts only Boolean queries as input, a user is required to construct query formulations using the Boolean logic operators (AND, OR, and NOT). Constructing a query in Boolean form poses evident difficulties for inexperienced users and for those who have not been trained in Boolean logic. Salton et al. (1983) write:

"In operational information retrieval, Boolean query formulations are used to express the customers' information needs. The standard rules of Boolean logic may not, however, provide an ideal environment for the formulation of effective search requests... Unfortunately there exists much evidence to show that ordinary users are unable to master the complications of Boolean logic, and even professional indexers and searchers find it difficult to construct consistently effective index representations and search statement."

Similarly, Willett (1988) argues that although the great majority of current retrieval systems are based upon Boolean searching, there are severe problems associated with the use of such a retrieval model.
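The contrast drawn above between exact-match Boolean retrieval and best-match ranked retrieval can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the three-document collection `DOCS`, the helper names, and in particular the scoring rule in `best_match`, which simply counts occurrences of query terms. It is not the ranking method of any search engine or of any system cited in the essay.

```python
from collections import Counter

# Hypothetical toy collection, invented for this illustration only.
DOCS = {
    1: "boolean retrieval systems use and or not operators",
    2: "best match retrieval ranks documents by similarity to the query",
    3: "web search engines rank documents and also offer boolean operators",
}


def tokenize(text):
    """Lower-case the text and split it on whitespace."""
    return text.lower().split()


def build_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for term in set(tokenize(text)):
            index.setdefault(term, set()).add(doc_id)
    return index


def boolean_and(index, terms):
    """Exact match: documents containing ALL of the terms."""
    postings = [index.get(t, set()) for t in terms]
    return set.intersection(*postings) if postings else set()


def boolean_or(index, terms):
    """Exact match: documents containing ANY of the terms."""
    return set().union(*(index.get(t, set()) for t in terms))


def boolean_not(index, all_ids, term):
    """Exact match: documents that do NOT contain the term."""
    return set(all_ids) - index.get(term, set())


def best_match(docs, query):
    """Rank documents by how many query-term occurrences they contain.

    Assumed scoring rule for illustration; ties are broken by document id.
    Documents with no query term are left out of the ranking.
    """
    q_terms = set(tokenize(query))
    scores = {}
    for doc_id, text in docs.items():
        counts = Counter(tokenize(text))
        score = sum(counts[t] for t in q_terms)
        if score > 0:
            scores[doc_id] = score
    return sorted(scores, key=lambda d: (-scores[d], d))


if __name__ == "__main__":
    index = build_index(DOCS)
    # The Boolean query must be formulated with explicit operators.
    print(boolean_and(index, ["boolean", "similarity"]))
    # The best-match query is a plain list of words; output is ranked.
    print(best_match(DOCS, "boolean similarity retrieval"))
```

In this toy collection the Boolean query `boolean AND similarity` retrieves nothing, because no single document contains both terms, whereas the best-match query over the same words still returns a ranked list of partially matching documents. This is the practical difference for an untrained user: the exact-match system demands a correctly formulated query, while the ranked system degrades gracefully.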


... (Pritchard-Schoch (1993))

Conclusion

Market forces dominate the design and evolution of the major commercial search engines. It is unsurprising that about 85% of Internet users surveyed claim to use search engines and search services to find specific information. Commercial considerations play an important role in the widespread adoption of best match searching as the default search mode. Since the majority of users of a large, general-use search engine will be untrained, casual users, it makes sense that the default setting offers the type of search that best matches the needs of these users. Here, best match searches display clear advantages over Boolean systems: most importantly, the simplicity and user-friendliness of the user interface (especially when "natural language" searches are offered) and the ranking of results according to relevance. However, as long as there remain circumstances in which a Boolean search is more effective, and a customer base that is prepared to use it, it makes sense for large search engines to make this option available as an auxiliary search method.

Furthermore, the same survey that showed the impressive 85% preference for search engines as the method of data retrieval also indicated that users are not satisfied with the performance of the current generation of search engines; slow retrieval speed, communication delays, and poor quality of retrieved results (e.g. noise and broken links) are commonly cited problems. In such a setting, the current trend towards increasing computer literacy, together with increasing reliance on the Internet for many types of serious research (as opposed to casual domestic "web-surfing"), might even mean an increase in the number of people who are willing to learn how to use Boolean searching in order to make such research more effective.
