How to kill performance with 16MB table

If you think 16MB tables are so small that can’t be root cause of significant performance problems with SELECT queries, then read about my today’s headache – a story of 16MB table worth something like 270 seconds of response time.

It began mostly as usual: an application web page didn’t show up within 5 minutes. The application submitted a query to the database; really big query, 300+ lines. It didn’t work well (no surprise here). So I’ve used usual technique of gathering run-time execution statistics to find the culprit step(s) in the execution plan. Apparently there were no such a step in the plan of the top-level cursor (a bit of surprise here). According to the run-time statistics the query has finished within a second. Considering it was executed over VPN connection to the remote site on different continent this would mean instant reply in case of the same network (as it is for the application) – and this was obvious untrue. I noticed that query results were fetched slowly, row after row – 17 of them. A quick look over the select list confirmed a hypothesis: there were a number of PL/SQL function calls which were dispatched at the fetch and, apparently, weren’t accounted in the execution plan run-time statistics.

I’ve traced the query using extended SQL trace to see more information for function calls. It appeared that most of the time was spent in a single function call, with a single query – 100+ lines this time. It was called 17 times (no surprise here – the function was executed for each row returned by the outer query) and it took 337 seconds of pure CPU time (and nearly the same elapsed time). Per run-time execution statistic it was clear that a single step of the plan accounted for 277 seconds of response time. Here it is:

Rows     Row Source Operation
-------  ---------------------------------------------------
     17  COUNT STOPKEY (cr=36992 pr=0 pw=0 time=337065522 us)
... (snipped)
     51             NESTED LOOPS  (cr=35496 pr=0 pw=0 time=277024431 us)
     17              NESTED LOOPS  (cr=102 pr=0 pw=0 time=3004 us)
... (snipped)
     51              TABLE ACCESS FULL A_16MB_TABLE (cr=35394 pr=0 pw=0 time=277022293 us)
... (snipped)

The A_16MB_TABLE is a heap-organized table. It has no indexes. “Normally” I would expect it to be performance inhibitor as an inner table in a tight nested loop with a big number of rows returned from the outer row source. This was not the case: it was scanned just 17 times, which is ~16 sec/scan or ~1 MB/sec.

It was right time to have a look at the query and it’s plan – with predicate section of course. The table was joined in rather strange way, something like this:

select ...
  from (...) v,
         a_16mb_table t
 where t.vc_column like v.column1 || 'a pattern' || v.column2 || v.column3 || 'a pattern' escape '\'
 ...

The vew v (which was merged to the main block) in the query and it’s columns are not columns – they are derived from some character data with a bunch of TRANSLATE / SUBSTR / INSTR calls in different places. And the predicate section of the execution plan contained the answer why FTS was so slow: due to abilities to merge inline view, to move predicates and transitive closure, the actual predicate used to filter rows of the A_16MB_TABLE was more than just ugly. I couldn’t even understand the exact way Oracle generated such a huge filter. And this filter predicate was the root cause of the slow (yet fully in-memory) table scan. The hot-fix to this issue was easy: I materialized results of the view with rownum > 0 predicate, which has prevented Oracle using it’s intelligent techniques for modifying query predicates. However, I don’t consider this as an Oracle bug – in my view it was a designer’s fault. The data was stored in this table as crap, which is unacceptable if you want data to be accessed.

5 Comments

Timur Akhmadeev

Posted September 2, 2010 at 14:17 | Permalink | Reply

A more “common” example of something similar are filter subqueries. Here is an excellent example in the Oracle OTN forum that I’ve just encountered. Just imagine this: OP has an over 2000 seconds response time query spinning on the CPU with just two tables of 7 & 15 MB in size!
Roman

Posted August 19, 2013 at 18:52 | Permalink | Reply

Hi Timur,

What do you mean by saying “I materialized results of the view with rownum > 0 predicate, which has prevented Oracle using it’s intelligent techniques for modifying query predicates” ?

So, you created materialized view on that view v ( from (…) v ) ? Am I getting it right ? Why and how did you use “with rownum > 0 predicate” ?
Could you please explain a little bit more ?

Thanks much!
- Timur Akhmadeev
  
  Posted August 19, 2013 at 21:02 | Permalink | Reply
  
  Hi Roman
  
  No, I didn’t create materialized view. I just added a predicate “rownum > 0” into subquery, and the effect was: Oracle had to process this query block first and return (some) results. That’s why it is sometimes called “materialized”.
  Have a look at this post too.
Roman

Posted August 19, 2013 at 23:19 | Permalink | Reply

Thank you for the explanation and a useful link, Timur

Now I think we’re on the same page…
Performance tuning stuff unveils gradually.

Let me get this straight..
You added a “rownum > 0” predicate to the inline view v to make it non-mergeable and also to eliminate the possibility for CBO to push its predicates to the main query..(and to sub-optimal plan in the end – in this case)
Moreover, adding “rownum > 0” predicate doesn’t return all the values at a time. It just produces intermediate results for the next Optimizer operation (for example, to hash join)
and those values can be then transferred to the operation in a cascade fashion (partially) as needed. Eventually, this is the reason of improved performance..

Is that correct?!

P.S. Sorry if my question sounded lame… Sometimes it is good when someone point a direction which way to go
Especially to the one just entering the Oracle world.. ))

Thanks,
Roman
- Timur Akhmadeev
  
  Posted August 20, 2013 at 19:42 | Permalink | Reply
  
  You’re welcome.
  I think that’s mostly correct. I just can’t say for sure what kind of transformations CBO may/will disable when you add such a predicate to a subquery. It’s possible that some of them are still applied in such case, but view merging is definitely not.

Timur Akhmadeev's blog