Quantile Binning

Data warehouse tips by Burleson Consulting

This is an excerpt from Dr. Ham's premier book "Oracle Data Mining: Mining Gold from your Warehouse".

We see that ODMr recognized that the two numerical data attributes AGE and YRS_RESIDENCE should be binned, and discretized the data so that these fields were categorized into 3 different bins: 1, 2, and 3.

When we click on the Options button in the Discretize section, we will find that our options are Quantile Binning, Equal Width Binning, and None. 

We can illustrate the difference between the quantile and equal width binning by using the discretize wizard.  Below is the histogram for the attribute AGE in the MINING_DATA_BUILD_V_US case dataset using the equal width binning strategy. 

Each group in the histogram view covers an age range 7.3 years wide.  As age increases, the share of customers in each bin decreases, from a maximum of 19.49% in group 3 to 0.22% in group 9.  A distribution that "tails off" like this is not a good choice for data mining analysis.  You want a more uniform distribution of ages across all groups, as in the quantile binning shown below. 
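The contrast between the two strategies can be sketched outside ODMr in a few lines of Python.  This is only an illustration under stated assumptions: the ages are synthetic (not the MINING_DATA_BUILD_V_US data), the function names are hypothetical, and the cut-point rules are simple textbook versions rather than the ones ODMr uses internally.  The synthetic range of 17 to 90 is chosen so that ten equal width bins come out 7.3 years wide.

```python
import bisect
from collections import Counter

# Synthetic, skewed ages (an assumption for illustration only):
# many young customers, progressively fewer older ones.
ages = []
for age in range(17, 91):
    ages.extend([age] * max(1, 80 - age))

def equal_width_cuts(values, n_bins):
    """Interior cut points that split [min, max] into equal-width intervals."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    return [lo + width * i for i in range(1, n_bins)]

def quantile_cuts(values, n_bins):
    """Interior cut points chosen so each bin holds roughly the same number of values."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(n * i) // n_bins] for i in range(1, n_bins)]

def bin_counts(values, cuts):
    """Count values per bin; a value equal to a cut point goes to the upper bin."""
    counts = Counter(bisect.bisect_right(cuts, v) for v in values)
    return [counts.get(i, 0) for i in range(len(cuts) + 1)]

ew_counts = bin_counts(ages, equal_width_cuts(ages, 10))
q_counts = bin_counts(ages, quantile_cuts(ages, 10))

print("equal width:", ew_counts)  # counts shrink toward the older bins
print("quantile:   ", q_counts)   # counts stay roughly level
```

With skewed data like this, the equal width counts fall away sharply in the upper bins, while the quantile counts differ only because tied ages cannot be split across bins.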

Using the Discretize Transform Wizard

You can view the histograms of data using different binning methods in the discretize wizard.  We will illustrate the use of the discretize wizard by customizing the age ranges in a new binning strategy. 

In Step 3 of the wizard, note that there are 3 attributes identified as numerical mining types:  AGE, CUST_ID, and YRS_RESIDENCE.  In this step you can change the mining type if necessary by clicking "numerical" or "categorical" and choosing the appropriate type. 

Numerical data can be binned.  Click next to continue the wizard. 

In Step 4 of the wizard, the numerical attributes are listed with their average, maximum, and minimum values.  The Defaults option brings up a menu listing a choice of mining algorithms.  Choosing one of these applies a binning strategy tailored to that algorithm. 
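The per-attribute summary the wizard displays is easy to reproduce when checking data by hand.  The sketch below assumes a handful of made-up rows and a hypothetical summarize helper; the values are not taken from the case dataset.

```python
from statistics import mean

# Made-up sample rows (assumption); None stands in for a missing value.
rows = [
    {"AGE": 41, "YRS_RESIDENCE": 4},
    {"AGE": 27, "YRS_RESIDENCE": 3},
    {"AGE": 20, "YRS_RESIDENCE": None},
    {"AGE": 45, "YRS_RESIDENCE": 5},
    {"AGE": 34, "YRS_RESIDENCE": 5},
]

def summarize(rows, column):
    """Return the average, maximum, and minimum of a column, skipping missing values."""
    values = [row[column] for row in rows if row[column] is not None]
    return {"avg": mean(values), "max": max(values), "min": min(values)}

for column in ("AGE", "YRS_RESIDENCE"):
    print(column, summarize(rows, column))
```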

For example, if you pick Adaptive Bayes Network for AGE, the binning wizard bins the data into 5 groups:  <26, 26-34, 34-42, 42-51, and >51.  Naive Bayes defaults to 3 groups:  <32, 32-44, and >44. 
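To make the two default strategies concrete, the sketch below maps an age to a bin number using the boundaries quoted above.  The excerpt does not say which bin a value sitting exactly on a boundary (such as 26) belongs to, so this sketch assumes it goes to the higher bin; the constant and function names are hypothetical.

```python
import bisect

# Boundaries quoted in the text for the AGE attribute.
ABN_AGE_CUTS = [26, 34, 42, 51]  # Adaptive Bayes Network default: 5 bins
NB_AGE_CUTS = [32, 44]           # Naive Bayes default: 3 bins

def assign_bin(value, cuts):
    """Return the 1-based bin number for value.

    Assumption: a value equal to a boundary falls in the higher bin.
    """
    return bisect.bisect_right(cuts, value) + 1

for age in (19, 30, 40, 47, 63):
    print(age, assign_bin(age, ABN_AGE_CUTS), assign_bin(age, NB_AGE_CUTS))
```

Replacing either list with your own sorted boundaries gives a custom binning of the kind the wizard lets you define.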

 

For more tips and tricks for Oracle data warehouse analysis, see Dr. Ham's premier book "Oracle Data Mining: Mining Gold from your Warehouse"

You can buy it directly from the publisher at 30% off:

http://www.rampant-books.com/book_2006_1_oracle_data_mining.htm


 

Copyright © 1996 -  2017

All rights reserved by Burleson

Oracle ® is the registered trademark of Oracle Corporation.
