Introduction to using DBI and Oracle

Oracle Tips by Mladen Gogala

Mladen Gogala is the author of "Easy Oracle PHP".

See the whole article: Perl DBI and Oracle

Introduction

DBI is a Perl module that is well documented, both in online sources and in the book "Programming the Perl DBI" by Alligator Descartes, Alistair Carty, Tim Bunce and Linda Mui (O'Reilly, ISBN: 1565926994). Still, I keep seeing questions about using DBI with Oracle on the Oracle USENET groups, and that motivated me to write this article.

Oracle has many features and data types, and using DBI with each particular feature is not always entirely trivial. The purpose of this article is to cover many of those uses from a Perl practitioner's point of view.

The sole purpose of this article is to serve as introductory reading for those who have never used the two together. Therefore, some knowledge of both Oracle and Perl is assumed, and although I will not try to show off my obfuscation abilities, this article still assumes that the reader has read "Learning Perl" and has some experience with the Oracle RDBMS.

This article will not cover the basic features of either Perl or Oracle. It will also not cover the installation of Perl, Oracle, DBI or DBD::Oracle. Furthermore, it makes no claims of completeness. This is simply an introduction to using Oracle with Perl DBI.

This article was created on Red Hat Fedora Core 4, with Oracle 10.2.0.2 and Perl 5.8.6. The decision not to cover installation was made because installation is different for each operating system, while I'd like to concentrate on the common features, the ones that can be used across the whole range of supported systems. Describing installation on Unix or Linux and omitting Windows or VMS would open me to accusations of being OS-biased. As I really am biased toward one type of OS, I wanted to hide that fact and cover just the common parts.

In writing this article I relied mostly on experience, on the scripts and sins of the past, and on the online documentation. The most accurate and up-to-date online DBI documentation can always be found on CPAN (http://search.cpan.org). In particular, the syntax of all DBI methods can be found on the DBI page. I will occasionally quote the DBI documentation where appropriate, and will always warn the reader when I'm doing so. Quotes from the online documentation will also be marked by a different font (smaller and italicized). The online documentation for DBI and DBD::Oracle can be found at:

http://search.cpan.org/~timb/DBI-1.50/DBI.pm

http://search.cpan.org/~timb/DBD-Oracle-1.16/Oracle.pm

This is, of course, the documentation for the versions used in this article. The documentation is likely to change between versions. If your DBI and DBD::Oracle versions are different from those used to produce this article and its example scripts, you should check the documentation for your version.

So, let's start with the article. In order to execute the examples, the reader will need access to an Oracle database and to a Perl interpreter with DBI and DBD::Oracle installed. Finding Perl isn't a problem. Verifying that all the necessary modules are installed is also very easy. So, how do we tell that DBI and DBD::Oracle are installed, and how do we tell which versions are installed? The answer is really simple:

$ perl -e 'use DBI; print $DBI::VERSION,"\n";'

1.50

$ perl -e 'use DBD::Oracle; print $DBD::Oracle::VERSION,"\n";'

1.17

Therefore, we have DBI 1.50 and DBD::Oracle 1.17. At the moment, these are the latest and greatest versions. Here is the Oracle version:

$ sqlplus scott/tiger

SQL*Plus: Release 10.2.0.2.0 - Production on Sun Apr 16 23:09:35 2006

Connected to:

Oracle Database 10g Enterprise Edition Release 10.2.0.2.0 - Production

With the Partitioning, OLAP and Data Mining options

SQL>

This username and password (SCOTT/TIGER) will be used throughout this article, as well as the accompanying EMP and DEPT tables which will be used for almost all of the examples. EMP and DEPT tables do not have LOB columns, so I will have to create my own infrastructure for dealing with LOB data types.
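Before moving on, the one-line module checks shown earlier can also be scripted, so that a missing module is reported instead of aborting the script with a compile error. This is only a minimal sketch: module_version() is a hypothetical helper name introduced here for illustration, not a part of DBI.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Return the version of an installed module, 'unknown' if the module loads
# but declares no version, or undef if it cannot be loaded at all.
# module_version() is a hypothetical helper, not a DBI function.
sub module_version {
    my ($module) = @_;

    # Accept only plain package names, because the name goes into a string eval.
    return unless $module =~ /\A\w+(?:::\w+)*\z/;

    # require dies if the module is missing; the eval turns that into "false".
    return unless eval "require $module; 1";

    my $version = $module->VERSION;
    return defined $version ? $version : 'unknown';
}

for my $module (qw(DBI DBD::Oracle)) {
    my $version = module_version($module);
    print "$module: ", ( defined $version ? $version : 'not installed' ), "\n";
}
```

The script runs on any Perl installation; on a machine without DBI or DBD::Oracle it simply prints "not installed" for the missing module.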

Connecting to Oracle with DBI

DBI uses the following syntax to connect to an Oracle instance:

my $db=DBI->connect("dbi:Oracle:local", "scott", "tiger");

The string at the beginning contains the following elements, separated by colons: the string "dbi", the driver type (in this case "Oracle") and the database name, also known as the TNS descriptor. In the line above, the database name was "local". The first argument, therefore, defines the database to connect to. The second and third arguments are, of course, the username and password. Everyone who has ever worked with databases knows that connection errors happen from time to time, so it is prudent to check for errors:

my $dbh = DBI->connect( "dbi:Oracle:$db", $username, $passwd ) ||
    die( $DBI::errstr . "\n" );

In case of a connection error, the database handle $dbh is not created, and the error string comes from the DBI class itself, as $DBI::errstr. If, on the other hand, the connection attempt is successful and the database handle is created, there are several handle properties that can be very helpful in further programming:

AutoCommit: when set to 1, DBI issues a commit after each successful SQL statement. This is very dangerous and is on by default. Setting it to 0 is a good idea. This handle property is not Oracle specific; it is available for any database.
RaiseError: when turned on (it is off by default), a DBI error raises an exception (die), which terminates the script unless the exception is caught. It is a good idea to kill the script if an unhandled Oracle error happens. This handle property is not Oracle specific; it is available for any database.
ora_check_sql: Oracle has a performance enhancing trick called "deferred parse". When used, it decreases the number of database calls needed, by bundling the "parse" and "execute" phases of the SQL execution. Unfortunately, this feature is not turned on by default. To turn it on, you have to set ora_check_sql to 0. This is Oracle specific.
RowCacheSize: this instructs the driver to create a local pre-fetch cache and defines its size in rows.

Both DBI database handles and statement handles also provide errstr and err, which return the error message and the error code respectively. In case of a failed connection attempt, when the database handle is not created, errstr and err can be used in class context, as $DBI::errstr and $DBI::err. That is what the examples throughout the article do.

This list is by no means complete; I mentioned only the properties most frequently used in scripts. For the complete list, please consult the online DBI documentation. Later in the text, I'll introduce two more handle properties, used to deal with LOB data types.

A handle behaves like a reference to an associative array (a hash), and these properties are its keys. Here is how they're set:

$dbh->{AutoCommit} = 0;

$dbh->{RaiseError} = 1;

$dbh->{ora_check_sql} = 0;

$dbh->{RowCacheSize} = 16;

So, this turns off auto-commit, instructs DBI to terminate the script if an SQL error occurs, and instructs the Oracle driver to use deferred parse, that is, to wait with parsing the SQL until execution is attempted for the first time. It also creates a local pre-fetch cache with room for 16 rows.
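The standard DBI attributes can also be passed to connect as a fourth argument, a hash reference, so that they are in effect from the moment the handle is created. The following is a minimal sketch, not the only way to do it: connect_args() is a hypothetical helper name, and the script connects only when a TNS alias, username and password are given on the command line. The driver-specific ora_check_sql is left out of this sketch; consult the DBD::Oracle documentation for your version to see where it expects that attribute to be set.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Build the argument list for DBI->connect.
# connect_args() is a hypothetical helper, not a DBI function.
sub connect_args {
    my ( $db, $user, $passwd ) = @_;
    my %attr = (
        AutoCommit   => 0,     # commit explicitly
        RaiseError   => 1,     # die on errors, including a failed connect
        RowCacheSize => 16,    # local pre-fetch cache of 16 rows
    );
    return ( "dbi:Oracle:$db", $user, $passwd, \%attr );
}

# Connect only when three arguments are supplied, so that the sketch can be
# run without a database. DBI is loaded at run time for the same reason.
if ( @ARGV == 3 ) {
    require DBI;
    my $dbh = DBI->connect( connect_args(@ARGV) );
    print "AutoCommit is ", ( $dbh->{AutoCommit} ? "on" : "off" ), "\n";
    $dbh->disconnect;
}
```

Because RaiseError is part of the attribute hash, a failed connection attempt makes connect die with the error message, so no separate check of the return value is needed in this variant.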

Unfortunately, DBD::Oracle, in the version used here, does not support the Oracle array interface, so the effects of this local cache are not as great as one would expect. As a matter of fact, this lack of support for the Oracle array interface is what severely limits the use of Perl with Oracle and makes it unsuitable for large data loads or massive transactions. Other features that are not supported are TAF (Transparent Application Failover) and direct loads. In other words, Perl with DBI and DBD::Oracle is not an industrial-strength tool for a production environment.

Executing SQL with DBI

Now that we have a database handle properly created and configured, we can use it to execute SQL commands. Oracle SQL executes in several phases:

Parse phase: the SQL statement is checked for syntactical validity, and all objects are checked to see whether they exist and whether the user has the appropriate access privileges. It is during this phase that the optimizer is invoked, statistics are examined and the optimal access path is determined. This parsing operation can be excruciatingly expensive and is best avoided, if at all possible.

Bind phase: placeholders in the SQL statement are "connected" to program variables. During this process the addresses of the program variables are "made known" to Oracle, so that it can read values from them or write values to them. The same SQL command can then be executed over and over again for various values of the program variables, without the need for re-parsing.

Execute phase: in this phase, the SQL statement in question is executed and the appropriate areas within the program are updated to reflect that. Oracle can postpone parsing until the statement is actually executed, thereby decreasing the number of calls to Oracle and the number of round trips over the network needed to reach the database.

Define phase: the define phase exists only for queries. During this phase we define the variables that receive the output. This phase is not really necessary with Perl DBI, as the DBI calls also create the variables that receive the data.

Fetch phase: during the fetch phase, data is retrieved from an Oracle cursor and stored into program variables. Fetch is not bidirectional: a cursor can only be read sequentially, and it is closed when no longer needed.
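As a rough sketch of how the phases above line up with DBI calls, the following helper prepares a statement once and then executes it for several bind values. names_by_dept() is a hypothetical name introduced for illustration; the "?" placeholder and the prepare, execute and fetchrow_array methods are standard DBI. The connection is attempted only when a TNS alias, username and password are given on the command line.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Collect employee names per department, parsing the statement only once.
# names_by_dept() is a hypothetical helper, not a DBI method.
sub names_by_dept {
    my ( $dbh, @deptnos ) = @_;

    # Parse: one prepare call, with a placeholder instead of a literal value.
    my $sth = $dbh->prepare('SELECT ename FROM emp WHERE deptno = ? ORDER BY ename');

    my %names;
    for my $deptno (@deptnos) {
        # Bind and execute: the value is bound to the placeholder on each call.
        $sth->execute($deptno);

        # Fetch: DBI creates the receiving variables, so no define step is needed.
        while ( my ($ename) = $sth->fetchrow_array ) {
            push @{ $names{$deptno} }, $ename;
        }
    }
    return \%names;
}

# Connect only when three arguments are supplied; RaiseError makes a failed
# connect die, so the return value is not checked separately.
if ( @ARGV == 3 ) {
    require DBI;
    my ( $db, $user, $passwd ) = @ARGV;
    my $dbh = DBI->connect( "dbi:Oracle:$db", $user, $passwd,
        { AutoCommit => 0, RaiseError => 1 } );
    my $names = names_by_dept( $dbh, 10, 20, 30 );
    for my $deptno ( sort { $a <=> $b } keys %$names ) {
        print "$deptno: ", join( ', ', @{ $names->{$deptno} } ), "\n";
    }
    $dbh->disconnect;
}
```

The point of the design is that prepare sits outside the loop: the statement is parsed once, and only the bind value changes between executions.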

These phases of execution correspond to DBI calls. In particular, the "parse" phase corresponds to the prepare DBI call. Now is the right time to demonstrate the fabled "deferred parse":

Example 1.

#!/usr/bin/perl -w
use strict;
use DBI;
my $db = DBI->connect( "dbi:Oracle:Local", "scott", "tiger" )
    || die( $DBI::errstr . "\n" );
$db->{AutoCommit}    = 0;
$db->{RaiseError}    = 1;
$db->{ora_check_sql} = 0;
$db->{RowCacheSize}  = 16;
my $SEL = "invalid SQL statement";
my $sth = $db->prepare($SEL);
print "If you see this, parse phase succeeded without a problem.\n";
$sth->execute();
print "If you see this, execute phase succeeded without a problem.\n";
END {
    $db->disconnect if defined($db);
}

The result of this script is the following:

bash-3.00$ /tmp/ttt

If you see this, parse phase succeeded without a problem.

DBD::Oracle::st execute failed: ORA-00900: invalid SQL statement (DBD ERROR: OCIStmtExecute) [for Statement "invalid SQL statement"] at /tmp/ttt line 13.

As you can see, parse succeeded without a problem, although the SQL statement in the variable $SEL was clearly invalid. There is one more thing in this script that needs to be explained, namely the END block. The END block gets executed when the script exits, regardless of the way it exits. So, let's comment it out and see what happens:

bash-3.00$ /tmp/ttt

If you see this, parse phase succeeded without a problem.

DBD::Oracle::st execute failed: ORA-00900: invalid SQL statement (DBD ERROR: OCIStmtExecute) [for Statement "invalid SQL statement"] at /tmp/ttt line 13.

Issuing rollback() for database handle being DESTROY'd without explicit disconnect().

bash-3.00$

The last message was issued by DBI, because the script exited without a disconnect. To avoid that, it is good practice to include an END block like the one in Example 1. An END block can also be used to issue a rollback after a failed transaction. To summarize Example 1: a successful connect call established a database handle, $db. The database handle has a prepare method, used to create a statement handle, $sth. The statement handle has many methods; execute, bind and fetch are all methods of a statement handle.
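With RaiseError turned on and AutoCommit turned off, a rollback after a failed transaction can also be issued right where the failure happens, by catching the exception with eval. The following is a minimal sketch under those two assumptions; in_transaction() is a hypothetical helper name, while commit and rollback are standard methods of a DBI database handle.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Run a piece of work as one transaction: commit on success, roll back and
# re-throw the error on failure. Assumes AutoCommit is off and RaiseError is on.
# in_transaction() is a hypothetical helper, not a DBI method.
sub in_transaction {
    my ( $dbh, $work ) = @_;

    my $ok = eval {
        $work->($dbh);    # any DBI error dies here because of RaiseError
        $dbh->commit;
        1;
    };
    return 1 if $ok;

    my $error = $@ || 'unknown error';
    eval { $dbh->rollback };    # a failing rollback must not hide the original error
    die $error;
}

# Possible use with a real handle (not executed here):
#   in_transaction( $dbh, sub {
#       $_[0]->do( 'UPDATE emp SET sal = sal + 100 WHERE deptno = ?', undef, 10 );
#   } );
```

The error is saved before the rollback is attempted, because the second eval resets $@.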

Now, let's replace the invalid SQL with a valid one, namely "select * from emp". Our script will need more than cosmetic changes. It will now look like this:

Example 2.

#!/usr/bin/perl -w
use strict;
use DBI;
my $db = DBI->connect( "dbi:Oracle:Local", "scott", "tiger" )
    || die( $DBI::errstr . "\n" );
$db->{AutoCommit}    = 0;
$db->{RaiseError}    = 1;
$db->{ora_check_sql} = 0;
$db->{RowCacheSize}  = 16;
my $SEL = "SELECT * FROM EMP";
my $sth = $db->prepare($SEL);
$sth->execute();
while ( my @row = $sth->fetchrow_array() ) {
    foreach (@row) {
        $_ = "\t" if !defined($_);
        print "$_\t";
    }
    print "\n";
}
END {
    $db->disconnect if defined($db);
}

The output looks exactly as expected:

bash-3.00$ ./ttt

7369 SMITH CLERK 7902 17-DEC-80 800 20

7499 ALLEN SALESMAN 7698 20-FEB-81 1600 300 30

7521 WARD SALESMAN 7698 22-FEB-81 1250 500 30

7566 JONES MANAGER 7839 02-APR-81 2975 20

7654 MARTIN SALESMAN 7698 28-SEP-81 1250 1400 30

7698 BLAKE MANAGER 7839 01-MAY-81 2850 30

7782 CLARK MANAGER 7839 09-JUN-81 2450 10

7788 SCOTT ANALYST 7566 09-DEC-82 3000 20

7839 KING PRESIDENT 17-NOV-81 5000 10

7844 TURNER SALESMAN 7698 08-SEP-81 1500 0 30

7876 ADAMS CLERK 7788 12-JAN-83 1100 20

7900 JAMES CLERK 7698 03-DEC-81 950 30

7902 FORD ANALYST 7566 03-DEC-81 3000 20

7934 MILLER CLERK 7782 23-JAN-82 1300 10

bash-3.00$

So, now we not only have execute, we also have fetch, implemented by fetchrow_array method of the statement handle. The array @row is created on the fly, for each row, by the fetchrow_array method. Therefore, we do not have to do "define" as we would have had, had we been programming an OCI program instead of a Perl script.
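Example 2 overwrites undefined columns in @row before printing them. As an alternative sketch, a row can be formatted without modifying it, by mapping undefined values to an empty field. format_row() is a hypothetical helper name, and the sample row is KING's row from the output above, typed in by hand so that the code runs without a database. Unlike Example 2, this prints an empty field for a NULL rather than an extra tab.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Join the columns of one row with tabs, showing NULL (undef) as an empty
# field. The caller's data is left untouched.
# format_row() is a hypothetical helper, not a DBI method.
sub format_row {
    my @columns = @_;
    return join "\t", map { defined $_ ? $_ : '' } @columns;
}

# KING has neither a manager nor a commission, so two columns are undef.
my @king = ( 7839, 'KING', 'PRESIDENT', undef, '17-NOV-81', 5000, undef, 10 );
print format_row(@king), "\n";
```

In a fetch loop the call would be format_row(@row). The ternary operator is used instead of the newer defined-or operator so that the sketch also runs on Perl 5.8.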

There are several more methods for fetching data from an Oracle cursor:

fetchrow_arrayref
fetchrow_hashref
fetchall_arrayref
fetchall_hashref

These methods differ in what they return (a hash reference or an array reference) and in how many rows they return (one or all). Methods returning a hash reference are convenient when we want to pick a column value by column name rather than by its sequence number. For methods that return a reference to a hash, retrieval works like this:

my $row = $sth->fetchrow_hashref();

my $ename = $row->{ENAME};

Methods that fetch all rows are convenient when the underlying query returns relatively few rows that are frequently referenced within a script.
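To make the difference between these return values concrete, the sketch below builds by hand the structures the four methods would return for the first two EMP rows, so that the access syntax can be tried without a database. The column names assume the standard SCOTT.EMP layout (EMPNO, ENAME, JOB, MGR, HIREDATE, SAL, COMM, DEPTNO), and sample_shapes() is a hypothetical helper name; no DBI call is made here.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hand-built copies of what each fetch method would return for the first two
# EMP rows. sample_shapes() is a hypothetical helper, not part of DBI.
sub sample_shapes {
    my @names     = qw(EMPNO ENAME JOB MGR HIREDATE SAL COMM DEPTNO);
    my @smith_row = ( 7369, 'SMITH', 'CLERK',    7902, '17-DEC-80', 800,  undef, 20 );
    my @allen_row = ( 7499, 'ALLEN', 'SALESMAN', 7698, '20-FEB-81', 1600, 300,   30 );

    # Hash slices pair each column name with the value in the same position.
    my ( %smith, %allen );
    @smith{@names} = @smith_row;
    @allen{@names} = @allen_row;

    return {
        fetchrow_arrayref => [@smith_row],                        # one row, by position
        fetchrow_hashref  => {%smith},                            # one row, by column name
        fetchall_arrayref => [ [@smith_row], [@allen_row] ],      # all rows, by position
        fetchall_hashref  => { 7369 => {%smith}, 7499 => {%allen} },    # all rows, keyed by EMPNO
    };
}

my $shape = sample_shapes();
print $shape->{fetchrow_arrayref}[1],        "\n";    # SMITH
print $shape->{fetchrow_hashref}{ENAME},     "\n";    # SMITH
print $shape->{fetchall_arrayref}[1][1],     "\n";    # ALLEN
print $shape->{fetchall_hashref}{7499}{JOB}, "\n";    # SALESMAN
```

With a real statement handle, fetchall_hashref takes the name of the key column as its argument, for example EMPNO, which is what the last structure imitates.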

All those methods fetch a relational NULL as the "undef" value, which is a problem when warnings are enabled, as they are by the -w switch in our scripts: printing an undefined value triggers a warning. That is the reason for having the line which reads:

$_ = "\t" if !defined($_);

in the script. If that line is commented out, we get an ugly output, laced with warning messages, which looks like this:

bash-3.00$ ./ttt