Search Tools
Overview Search Engines
Directory Browsing Metasearch Engines
More metasearch engines can be found, with reviews, at the Search Engine Watch website. That website is also useful for finding various specialty search engines. Copernic Agent Specialized search engines search for databases by topic and help eliminate the noise associated with general search engines. In the "Case Studies" section of this presentation "Case Study #2" is an example showing how to use one of these specialty search engines. In the "Data Mining" section of this presentation many other "Specialized Search Engines" are listed that assist in finding websites with databases. Recall there are over 200,000 databases on the Web. This specialized search engines are a big help in finding databases of interest to your research. Deep Web Search Tools If you do nothing else with the deep Web, learn how to use the three websites described below. CompletePlanetTM uses a query based engine to index 70,000+ deep Web databases and surface Web sites. Appendix A lists 60 of the largest deep Web databases which contain 10% of the information in the deep Web, or 40 times the content of the entire surface Web. These 60 databases are included in CompletePlanets indexes. CompletePlanet is sponsored by BrightPlanet® Corporation, a leader in deep Web searches. The interface is intuitive and easy to use. You can do a keyword search on all 70,000+ databases to find which databases to use for your search. You can also browse by category, and then search databases of interest. ProFusion is a combination of query based engine and a deep Web directory portal. The directory structure is accessed by clicking on Specialized Searches. With an account, you can setup custom My Search Groups to search customized lists of websites and/or databases of your choice. For example, you could create a group called Technology and add all the databases and websites of interest to you. This group is saved to your profile. You could then, at any future time, search this group on a research topic with keywords. This is a great time saver. Their query based engine is called SmartDiscovery®. SurfWax also uses a site's existing search capability as part of the meta-search process to tap the deep Web. They use proprietary algorithms to interpret the site's search criteria (Boolean, etc). With an account, you can also setup custom SearchSets to search customized lists of websites and/or databases of your choice. Surfwax also has a news accumulator feature with over 50,000 news topics in 84 categories. This news accumulator feature is a godsend providing high quality results. These are some useful news accumulator categories: all topics, networking, technology, telecommunication, and web services. In addition this site has WikiWax which takes the online encyclopedia Wikipedia to the next level. WikiWax does advanced look-aheads on Wikipedia searches to speed your keyword choices. Finding Deep Web Resources Pre-emptive search: to find deep Web databases, use a search engine or search a site containing both surface and deep Web content. For example, to find a database containing information on viruses use this search term (exact syntax may vary among search engines): On Google or InfoMine search for: virus (database OR repository OR archive) Hock (2004) has this additional method specific for the Teoma search engine: On Teoma search for: virus (resources OR meta site OR portal OR pathfinder) Reverse-Link Searching: Find out which pages link to a database you already find useful and see if those sites have further recommendations. To do this, use the link operator in the search engine. For example, Google uses link:yourURL. If you want to find out what sites link to NTIS, type this in the Google search bar: link:http://www.ntis.gov Find Experts: When you do a search with Teoma, experts and enthusiasts for your keywords are listed to the right of the results column. Go to these sites and see what resources are recommended to help you mine for deep Web resources. Search by document type: Search engines are now indexing heretofore deep files, like PDF files. In Google, by preceding your search terms with "filetype:ext" (where ext is the 3 character file extension), only those files will appear in the results. These are some examples of searches done in the Google search bar:
More about Google: When you do a search, the results are not only in the window you are viewing, but also simultaneously in the associated windows under the topics listed at the top of the search page, namely, Web, Images, Groups, News, Froogle, Local, etc. For example, if you search for the word virus, under Web are the websites found for virus, under Images are the graphics found for virus, under Groups are the discussion groups on virus, etc. - all of this is available without you doing anything extra on your part other than click each topic link in succession. Calishain (2005) gives these tips on Boolean modifiers using Google:
Go to the Google help section, for many more features. EndNote RSS Feeds Numerous other RSS readers are available. Wikipedia lists the websites of 130 readers and Earthweb provides the % market share of each of the most popular readers. If you want to search for feeds rather than getting them from your favorite known websites or from the "Find Feed" button in Pluck, try these. Chordata allows you to drill-down a hierarchical directory structure to find quality-rated feeds. Feedster is a search engine for locating feeds by keywords. LISTSERV Newsgroups 1) Open control panel Internet Options >Programs tab Primary Research and Reference Librarians Reference librarians at your local library or college can be of tremendous help. They are skilled in accessing information from a wide variety of resources including the deep Web. |