Search Engine Precision and Recall

Oracle Database Tips by Donald Burleson

Google leaped to the front of the search engine products for several reasons, and many researchers are attempting to quantify those success factors with metrics defined as "precision" and "recall".  Yeah, OK . . .

If it were really that simple (and measurable), then there would not be such a discrepancy between the traffic (and the resulting billions of dollars in revenue) of each search engine.  In a world where every search engine company invests millions of dollars in its technology, why does Google command more usage than almost all of its competitors combined?  Of course, a savvy web user is far more likely to use Google than Grandma, who uses whatever appears on her toolbar.

See my notes on Search Engine word stemming and synonym expansion to learn more about the need to expand queries to include stems and synonyms.

Watch out for what's "relevant"

Let's explore precision and recall, and examine whether the definition of "relevant" might negate this research method and indicate a need to look at search engine relevancy from a different point of view.

Precision

Precision is a metric that measures how many of the pages returned by a query actually match it (i.e. no junk results).  In other words, precision is the percentage of the returned results that are relevant.  One example is the scholarly study titled "Precision and Recall of Five Search Engines for Retrieval of Scholarly Information in the Field of Biotechnology", which presents interesting academic research on the relative precision and recall of several internet search engines.  It notes:

"Precision is the fraction of a search output that is relevant for a particular query. Its calculation, hence, requires knowledge of the relevant and non-relevant hits in the evaluated set of documents (Clarke & Willet, 1997)."

"In the context of the present study precision is defined as:

Precision = (Sum of the scores of scholarly documents retrieved by a search engine) / (Total no. of results evaluated)"
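The quoted formula can be sketched in a few lines of Python. This is only an illustration: the function name `mean_precision` and the 1/0 scoring of each hit are assumptions made here, and the study's actual scoring scale is not reproduced in this article.

```python
def mean_precision(scores):
    """Sum of the per-result scores divided by the number of results evaluated."""
    if not scores:
        raise ValueError("no results were evaluated")
    return sum(scores) / len(scores)


# Hypothetical scores for the first 10 hits of a single query:
# 1 = judged to be a scholarly document, 0 = not scholarly.
hits = [1, 0, 0, 1, 0, 0, 0, 1, 0, 0]

print(mean_precision(hits))  # 0.3
```

Note that the result depends entirely on who assigns the scores, which is exactly where the word "relevant" sneaks in.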

Recall

Recall is the metric that checks whether the query returns ALL matching pages (i.e. no lost results).  It is defined far more loosely, because it depends on the highly variable word "relevant", a loosely defined term that lies at the heart of the success of any search engine.  Notes from Precision and Recall of Five Search Engines for Retrieval of Scholarly Information in the Field of Biotechnology define recall as a metric that is impossible to measure accurately:

"Thus it [Recall] requires knowledge not just of the relevant and retrieved but also those not retrieved (Clarke & Willet, 1997). There is no proper method of calculating absolute recall of search engines as it is impossible to know the total number of relevant in huge databases."
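To see why recall is the harder number, here is a minimal sketch of the textbook set-based definitions. The document IDs and the function name `precision_recall` are made up for illustration. Precision needs only the retrieved list and a relevance judgment for each hit, while recall also needs the complete set of relevant documents, which nobody has for the whole web.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query, using set-based definitions."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant  # documents that are both retrieved and relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical toy collection where the full relevant set IS known.
relevant = {"d1", "d2", "d3", "d4", "d5"}
retrieved = ["d1", "d2", "d9", "d8"]

p, r = precision_recall(retrieved, relevant)
print(p, r)  # 0.5 0.4
```

On a small test collection the `relevant` set can be built by hand; on a web-scale index it cannot, which is why studies fall back on a "relative" recall.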

Notes from Precision and Recall show their results, with Google not coming up #1 on either precision or recall (as defined by "relevance"):

Table 1. Mean Precision and Relative Recall of search engines during 2004

             AltaVista   Google   HotBot   Scirus   Bioweb
Precision    0.27        0.29     0.28     0.57     0.14
Recall       0.18        0.20     0.29     0.32     0.05
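As a quick check of the Table 1 figures, the short sketch below ranks the five engines on each metric. The values are copied from the table; the dictionary layout and the helper name `rank_by` are just illustrative choices.

```python
# (precision, relative recall) per engine, copied from Table 1.
table = {
    "AltaVista": (0.27, 0.18),
    "Google": (0.29, 0.20),
    "HotBot": (0.28, 0.29),
    "Scirus": (0.57, 0.32),
    "Bioweb": (0.14, 0.05),
}


def rank_by(metric_index):
    """Return engine names sorted best-first on the chosen metric column."""
    return sorted(table, key=lambda engine: table[engine][metric_index], reverse=True)


by_precision = rank_by(0)
by_recall = rank_by(1)

print(by_precision)  # ['Scirus', 'Google', 'HotBot', 'AltaVista', 'Bioweb']
print(by_recall)     # ['Scirus', 'HotBot', 'Google', 'AltaVista', 'Bioweb']
```

Google lands second on precision and third on recall, behind the science-specific Scirus in both cases.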

Ancient Precision and Recall References

Older research from the early days, when the field of search engine metrics was in its infancy, provides a must-read foundation for the problem of defining the "best" web search mechanism.  Notes from this paper also contain a nice list of other scholarly studies on search engine precision and recall, from back when people thought that search engine ranking analysis would never morph into a multi-billion-dollar-a-year question.


Copyright © 1996 -  2020

All rights reserved by Burleson

Oracle ® is the registered trademark of Oracle Corporation.