15 January 2013

29190 views

20 0

Compression in Oracle – Part 1: Basic Table Compression

In this short series on compression in Oracle, we will be looking at the various types of compression used by traditional Oracle systems, this means: (i) basic table compression, (ii) OLTP table compression, and (iii) index compression. I won’t be discussing the hybrid columnar compression (HCC) that appeared with Exadata (though I may write up a mini-series on that topic

Of the three mechanisms, index compression and basic table compression are part of the core product, while OLTP compression is part of the separately licensed “Advanced Compression Option (ACO)”. In this first article we will restrict ourselves to creating data using basic table compression, leaving the question of updating and deleting compressed data to the second article, which will lead us into “OLTP” compression in the third article. We’ll leave index compression to stand alone in the fourth and fifth articles.

Our aim in this first article is to answer a few of the more common questions that people ask about compression by examining the mechanisms that Oracle uses for basic table compression.

When does (basic) compression work

The usual questions people ask about compression are: “how do I create compressed data?”, “how does Oracle decompress the blocks?”, “what impact does compression have on performance?” and the question you should ask before using any feature “are there any side effects I need to worry about?”

The easiest way to answer the first question is through a demonstration showing some of the possibilities. Here are 5 pieces of SQL that will create and populate a table with a copy of the first 50,000 rows of the view all_objects on a database running 11.2.0.3; after running each one I collected stats on the table and ran a query to report the number of blocks in the table, and a few other details about the table.

-- 1. Baseline CTAS

create table t1

select * from all_objects where rownum <= 50000;

-- 2. CTAS with basic compression enabled

create table t1 compress basic

select * from all_objects where rownum <= 50000;

-- 3. Normal insert into empty table defined as compressed

create table t1 compress basic

select * from all_objects where rownum = 0;

insert into t1 select * from all_objects where rownum <= 50000

-- 4. Direct path insert into empty table defined as compressed

create table t1 compress basic

select * from all_objects where rownum = 0;

insert /*+ append */ into t1 select * from all_objects where rownum <= 50000

-- 5. CTAS without compression, then change to compressed

create table t1

select * from all_objects where rownum <= 50000;

alter table t1 compress basic;

alter table t1 move

The query I ran after each test looked like this:

select blocks, pct_free , compression, compress_for

from user_tables

where table_name = 'T1';

There are other possibilities, of course; we could define a tablespace so that any table created in that tablespace was, by default, compressed; we can arrange for a single partition or subpartition of a partitioned table to be compressed; we can even change the default compression on partitioned tables so that all new partitions or subpartitions are created with compression.

Sticking with our five samples, though, I’ve summarised the results of these code fragments in the following table – although you’ll notice that I’ve reported two results from test 5, one from before the move, one after:

Test	BLOCKS	PCT_FREE	COMPRESSION	COMPRESS_FOR
1 (CTAS)	714	10	DISABLED
2 (CTAS compress)	189	0	ENABLED	BASIC
3 Insert	644	0	ENABLED	BASIC
4 Insert append	189	0	ENABLED	BASIC
5a Compress	714	10	ENABLED	BASIC
5b Move	189	0	ENABLED	BASIC

The first test is just a baseline to give you an idea of how much space the data needs by default – as you can see I let the table used the default value of pctfree (reported in view user_tables as pct_free) and needed 714 blocks to hold the data.

When I included the compress option with the CTAS, Oracle automatically set the pctfree to zero – and in this case managed to squeeze the data down to just 189 blocks. The zero pctfree is a hint that Oracle thinks this table is going to be read-only; but it is possible to set a non-zero pctfree and, as we shall see in the next article in the series, there may be cases where you want to take advantage of that option. Notice that in this, and all subsequent tests, the two columns reporting compression show that basic compression is enabled.

In the third and fourth tests I created an empty table with the compress option (which is why the pct_free is 0), and then inserted the data. As you can see, the data is only compressed when the insert is a direct path insert (you may recall that “compress basic” was originally “compress for direct_load operations”); the normal insert, even though it’s an array insert, doesn’t compress the data. (The drop from 714 blocks to 644 blocks in test 3 is just the side effect of pctfree dropping from 10 to zero – allowing for small variations 90% of 714 is very close to 644).

The final test then warns us that simply changing a table from uncompressed to compressed does nothing to the data. If you want to convert uncompressed data to compressed data you first need to change the table definition, and then you need to move the table to compress its contents. If you do this, of course, you’d also have to rebuild any indexes on the table immediately afterwards.

It’s compression, Jim, but not as we know it.

The rest of the questions I’ve asked above can best be addressed by seeing how Oracle does compression, and the answer is that (for basic and OLTP compression) Oracle doesn’t really do compression. What it does is “de-duplication” at the block level. Imagine you had three rows in a block containing the following data:

(‘XXXX’, ‘abcdef’, 254.32, ‘CLOSED’)

(‘XXXX’, ‘pqrstu’, 17.12, ‘CLOSED’)

(‘AAAA’, ‘abcdef’, 99.99, ‘CLOSED’)

Oracle could notice that the value ‘XXXX’ appears twice, that the value ‘abcdef’ appears twice, and that the value ‘CLOSED’ appears three times. So it can create a table of repeated values in the block, and insert tokens into the rows to make them shorter, so our block might start to look like this:

T1 (‘XXXX’)

T2 (‘abcdef’)

T3 (‘CLOSED’)

(T1, T2, 254.32, T3)

(T1, ‘pqrstu’, 17.12, T3)

(‘AAAA’, T2, 99.99, T3)

In fact, Oracle can get smarter than that, because it can rearrange the column order for each individual block to maximize the possibility of multiple columns turning into a single token. Notice, in my example, that token T1 and token T3 both appear in all three rows. Oracle can rearrange the order that the columns are stored in this block to put those tokens side by side, and create a new token the represents the combination of the two individual tokens, so our block becomes:

T1 (‘XXXX’, T2) -- a token made from a value and a token

T2 (‘CLOSED’)

T3 (‘abcdef’)

(T1, T3, 254.32) -- notice how this row is now only 3 “columns”

(T1, ‘pqrstu’, 17.12) -- ditto

(‘AAAA’, T2, T3, 99.99)

Let’s take a closer look at this by examining a few lines from the symbolic block dump of a block. Here’s the first fragment I want to highlight – it appears only for compressed table blocks:

1	perm_9ir2[4]={ 2 0 1 3 }

This is a table with 4 columns but for this block Oracle has rearranged (permuted) the order the columns are stored so that the thing stored at column 2 in this block is column 0 in the table definition, column 0 is really column 1, column 1 is column 2 and column 3 is (still) column 3

1 2	0x24:pti[0] nrow=65 offs=0 0x28:pti[1] nrow=400 offs=65

There are two “tables” in this block, the first is the token table which holds the 65 tokens which will be used in the block, starting at offset zero in the block’s row directory, the second holds 400 “proper” rows, starting at offset 65 in the block’s row directory. (This means, by the way, that the block’s row directory currently has 465 entries – there are some interesting side effects that can appear with large row directories.)

If we hunt through the block dump to find the first row in table 1 (i.e. the first “real” row) we find the following, which looks pretty much like an ordinary row dump from a typical heap table block dump for a row with 4 columns. But there are a few special points to notice:

tab 1, row 0, @0x1b28

tl: 5 fb: --H-FL-- lb: 0x0 cc: 4

col 0: [ 4] 41 41 41 41

col 1: [10] 41 41 41 41 41 41 41 41 41 41

col 2: [ 2] c1 02

col 3: [10] 20 20 20 20 20 20 20 20 20 31

bindmp: 2c 00 01 04 31

According to the column lengths (the number in square brackets) the row length should be 26 bytes, plus the 4 bytes for the 4 column lengths, plus one byte each for the flag byte (fb:), lock byte (lb:) and column count (cc:) – but the total length (tl:) is only 5 bytes. And the last line of this extract shows us the actual values (bind map [ed: see note below from Flado, “binary dump” is a much better interpretation of bindmp]) of those five bytes. These five bytes are the flag byte (0x2c = ‘–H-FL’), the lock byte, the “stored” column count – i.e. the number of columns stored at this location, which is just one – and the next two bytes tell us that that one “column” is a token representing 4 consecutive and we need to look at token 0x31 of the token table (interestingly, different versions of Oracle managed to order the token table differently even though the environment seemed to be the same – my test results are all from 11.2.0.3). Let’s look at row 49 (0x31) in table 0:

tab 0, row 49, @0x1ed0

tl: 19 fb: --H-FL-- lb: 0x0 cc: 4

col 0: [ 4] 41 41 41 41

col 1: [10] 41 41 41 41 41 41 41 41 41 41

col 2: [ 2] c1 02

col 3: [10] 20 20 20 20 20 20 20 20 20 31

bindmp: 00 08 04 36 40 ca c1 02 d2 20 20 20 20 20 20 20 20 20 31

The token looks almost identical to the row – but the total length of the token is 19 bytes. So let’s look at the binary dump for the token. The first two bytes of the map tell us that this token is used 8 times in the block. The next byte tells us that there are 4 columns in the token and, through some cunning encoding, the next two bytes tell us that the first two columns of this token are actually tokens 0x36 (decimal 54) and 0x40 (64). You can then see the actual values preceded by “200 + column length” for the last two columns of the token.

So let’s look at tokens 54 and 64 – which allow us to see that they are, indeed, single column tokens, with values that match the values we saw in token zero and the actual row. (You might note that token 54 is used in a total of 10 places in this block (the 0x0a in the second place in the binary dump, and token 64 is used in 5 places.)

tab 0, row 54, @0x1f74

tl: 7 fb: --H-FL-- lb: 0x0 cc: 1

col 0: [ 4] 41 41 41 41

bindmp: 00 0a cc 41 41 41 41

tab 0, row 64, @0x1f7b

tl: 13 fb: --H-FL-- lb: 0x0 cc: 1

col 0: [10] 41 41 41 41 41 41 41 41 41 41

bindmp: 00 05 d2 41 41 41 41 41 41 41 41 41 41

So, by tracking our way from row directory to row, to token (by way of row directory), to two more tokens (by way of the row directory) we see that we can expand an entry that started out as 5 bytes into a full row of 4 columns with 26 bytes of data.

There are several lessons to be learned from the work we did tracking through the block dump. The first is that Oracle doesn’t decompress the table blocks, it simply re-constructs the row you need by hopping back and forth between the row directory and the row pieces (the code may even avoid visiting some tokens – the single column ones – if the column values aren’t needed for the SQL statement). The second is that it’s possible to spend a significant amount of extra CPU time bouncing around a block to reconstruct a row – this could have a noticeable impact on CPU usage if you are doing lots of tablescans. As a side effect, because Oracle will have to hold (pin) the block for some time to reconstruct rows, you may find that your code will do fewer “consistent gets – examination” which means more activity on the “cache buffers chains” latch. Of course, we hope that the extra CPU time will be offset by the smaller number of physical reads we may have to do because we’ve packed our rows into a smaller number of blocks, which may allow us to keep more data cached for longer.

Summary

There’s still a lot to say about the side effects of compression, in particular what happens when you delete or update rows, and this will lead us on to the implementation of compression for OLTP (or ‘for all operations’ as it used to be) – but those are topics for future articles.

What we have seen from this first article is that basic compression is used only with direct path inserts, not with ordinary DML, and Oracle will by default set pctfree to zero on the table, which is a good indication that you are not supposed to modify the data once you’ve created it – basic compression is really only appropriate for data that is virtually read-only.

We have also seen that basic compression is just de-duplication of repeated values – but Oracle can get very clever about minimizing the amount of space used. In particular the stored column order can change from block to block, allowing Oracle to maximize the options for creating tokens that represent multiple adjacent columns. This deduplication mechanism means that Oracle doesn’t have to decompress blocks, it keeps the blocks in the buffer cache just like any other block, but reconstructs rows (in the PGA) by following pointers to tokens – and chasing pointers is a CPU intensive process; the better your compression the more CPU intensive your queries (particularly your tablescans) are likely to be.

There is a catalogue of all five items in this series (and a few others) at this URL

Compression in Oracle – Part 1: Basic Table Compression

When does (basic) compression work

It’s compression, Jim, but not as we know it.

Summary

Subscribe for more articles

Rate this article

Jonathan Lewis

Related articles

A Guide to INSERT, UPDATE, and DELETE Statements in Oracle