St. Mary's University
Blume Library
Searching for Information by Subject on the World Wide
Web
Table of Contents
Steps:
1. Define what you are seeking.
- What are some keywords you might use?
- What are some synonyms or related terms you might use?
2. Learn about and choose an appropriate database(s)
or search engine.
3. Learn about tips in searching the chosen database.
- Read the search tips and documentation provided by the database!
4. Examine the results and determine relevancy.
- If there are abstracts, examine them. Do the terms look relevant?
- Do many of the results link to different files at the same Web site?
- Is there another way you can refine or expand this search?
- Are there other terms you might try?
- Is there another database or search engine you might try?
5. Once you find relevant information, save the information and where
you located it. See Documenting Electronic Information.
Note: Some addresses or sites may not be accessible
at all times due to factors such as Internet traffic, mechanical breakdowns,
or relocation or removal of information.
Learning about appropriate databases (search
engines)
If you are trying to find information about a specific topic on the
World Wide Web (WWW), search engines are the tools that can help
you find that information. Search engines are databases that search World
Wide Web sites by various access points--some search words in titles, in
headings, in documents and some will rank results by relevancy according
various criteria, such as how frequently and how prominently the term(s)
searched appear in WWW documents.
With most of the search engines, you type your search term(s) in the
box or on the line provided, highlight "search" or "submit
search," and the engine will search for all occurrences of the
search term(s) and retrieve a list of sites that contain the term(s). Some
of the services will include an extract or annotation of that portion of
the Web site that contains the search term(s). The use of upper and lower
case letters and certain punctuation marks can influence the results you
retrieve with many of the search engines.
Search Tips
Everyone has their favorite search engine(s). I like to start with a
multiple or meta-search engine, because it searches several
search engines simultaneously. I also prefer engines that provide relevancy
ranking, because the items that should be about the topic searched,
should appear near the top of the list.
I frequently start with Metacrawler
(http://www.go2net.com/search.html/) because it simultaneously searches popular
search engines: Altavista, Excite, InfoSeek, OpenText, Yahoo!, WebCrawler.
It also provides relevancy ranking, so that the items that pertain most
closely to what I'm seeking often appear near the top of the list.
With many of the search engines, capitalization and punctuation are
important.
An
excellent site that describes the search process on the Web, and various
feature and tips about searching the Web is the Finding
Information tutorial (http://www.lib.berkeley.edu/TeachingLib/Guides/Internet/FindInfo.html)
from the University of California, Berkeley.
Selected WWW Search Engine sites
- Alta Vista (http://altavista.digital.com/)
- developed in Fall 1995 at Digital Research Laboratories. Indexes full-text
of over 30 million Web pages. Provides word count. Good response time.
Very useful if looking for a very obscure or specific topic.
- Excite (http://www.excite.com/)
- searches 1.5 million of the most popular Web sites, also searches Usenet
newsgroups; includes browsable subject tree. Displays results by ranked
relevancy.
- InfoSeek (http://infoseek.go.com/)
- allows free and fee-based searches; results displayed by ranked relevancy;
displays term(s) in context; 12 browsable subject trees. Useful if looking
for broader topics or terms likely to produce many hits.
- Yahoo! (http://www.yahoo.com/)
(stands for Yet Another Hierarchical Officious
Oracle) - more of a searchable catalog than a search engine; good
response time; searchable by field; includes links to other search engines;
also browsable subject tree of 14 broad categories.
- Lycos (http://www.lycos.com/)
- very good documentation; displays term(s) in context; sometimes difficult
to access.
- WebCrawler (http://www.webcrawler.com/)
- indexes HTML full text; no annotations; displays results by ranked relevancy.
- Harvest
(http://www.town.hall.org/brokers/www-home-pages/query.html) - small database;
very good documentation; displays term(s) in context; results displayed
by relevancy
- EINet Galaxy
(http://galaxy.einet.net/galaxy.html) - includes browsable, hierarchical
subject list
Multiple Search Engines
The following databases search multiple search engines. The advantage
of using meta-search engines such as these is that one can search a number
of databases at once. The disadvantage is that these sites may be slower
in retrieving results. Also not all databases searched may use the same
search protocols, and therefore one may not retrieve as the same hits as
if one had searched each database separately.
- Dogpile (http://www.dogpile.com/)
- searches the following Web databases: Yahoo!, Lycos' A2Z, Excite Guide,
World Wide Web Worm, WWW Yellow Pages , PlanetSearch, What U Seek, Magellan,
Lycos, WebCrawler, InfoSeek, AltaVista, Excite and HotBot, as well as Usenet
databases, ftp searchable databases, and news wire databases.
- Internet Sleuth (http://www.isleuth.com/)
- links to hundreds of searchable databases including search engines, Usenet
searchers, and categorizes topics. Choose from over 2,000 databases. One
can also specify the length of desired search time.
- Metacrawler (http://www.go2net.com/search.html/)
- Seach.Com (http://search.com/)
- All-In-One
Search Page (http://www.albany.net/~wcross/all1srch.html) - provides
access to over 120 searchable indexes; sometimes not accessible; sometimes
slow response time
More information about search engines
and search tips
- Search Engine Watch
(http://www.searchenginewatch.com) - updates and statistics about new and
tried search engines.
- Vaughan-Nichols, Steven J. "Find It Faster." Internet
World 8.6 (June 1997): 64-66.
Comparison studies of search engines
The following articles compare features such as the database design,
precision or relevancy of items retrieved, order of items retrieved, retrieval
time, and interface design of various search engines:
- Courtois, Martin P., William M. Baer, and Marcella Stark. "Cool
Tools for Searching the Web: A Performance Evaluation." Online
19.6 (Nov./Dec. 1995): 15-32. Compares CUI W3 Catalog, Harvest,
Lycos, OpenText, WebCrawler, WWW Worm, and Yahoo based on database documentation
and sample searches, database construction, database size, fields indexed,
search capabilities, manner in which results are displayed, update frequency,
and accessibility and response time.
- Feldman, Susan. "Web Search Services in 1998: Trends and Challenges."
Searcher 6.6 (June 1998). Accessed: 9 June 1998 <http://www.infotoday.com/searcher/jun/story2.htm>.
Compares and examines updated features in AltaVista, Ask
Jeeves, Excite, HotBot, Infind and Savvy Search, InfoSeek, Lycos, and Northern
Light. Also offers general tips in using search engines.
- Randall, Neil. "Hide and Go Seek." PC Computing Online.
Ziff-Davis Publishing Company. 1995. Accessed: 29 July 1997 <http://www.zdnet.com/~pccomp/features/internet/search/sub1.html>.
Compares Infoseek, Lycos, Webcrawler, WWW Worm, All-in-One
Search Page, Internet Exploration Page, Opentext, Yahoo, AliWeb, CUSI,
EINet Galaxy, JumpStation II, SavvySearch, and CUI W3 Catalog. Describes
desirable features in a search engine.
- Venditto, Gus. "Search Engine Showdown: IW Labs Tests Seven Internet
Search Tools." Internet World 7.6 (May 1996): 79-86. Accessed:
9 June 1998 <http://www.internetworld.com/print/monthly/1996/05/showdown.html>.
Compared Alta Vista, Excite, InfoSeek, Lycos, Open Text,
Web Crawler, and WWW Worm. InfoSeek produced the most relevant results,
while Alta Vista produced the most comprehensive results.
updated June 10, 1998 by Diane M. Duesterhoeft (dduesterhoeft@stmarytx.edu).
The URL for this page is http://library.stmarytx.edu/acadlib/subject/misc/searengi.htm