ORACLE CHECKPOINTS

December 16, 2012 | Database, Uncategorized | Anju Garg

In this post, I will explain checkpoints: their purpose and the different types of checkpoints.

PURPOSE OF CHECKPOINTS

Database blocks are cached in the database buffer cache. As blocks are read from disk, they are kept in the buffer cache so that any user who needs them later finds them in memory and does not have to read them from disk again. When we update a row, the buffer holding the block that contains that row is changed in memory, and a record of the change is written to the redo log buffer. On commit, the changes are written to disk, which makes them permanent. But where are those changes written? To the datafiles containing the data blocks? No! The changes are recorded in the online redo log files by flushing the contents of the redo log buffer to them. Because the redo describing a change always reaches disk before the changed block itself does, this is called write-ahead logging. If the instance crashed right now, the buffer cache would be wiped out, but on restart Oracle would apply the changes recorded in the redo log files to the datafiles.
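To make the idea concrete, here is a minimal Python sketch of write-ahead logging. It is a toy model, not how Oracle is implemented: the class MiniDatabase and all of its attribute names are invented for illustration, and it ignores undo and rollback entirely (only committed redo ever reaches the simulated disk).

```python
class MiniDatabase:
    """Toy model: buffer cache and redo log buffer are in memory,
    the online redo log and the datafile are on 'disk'."""

    def __init__(self):
        self.datafile = {}       # on disk: block id -> value
        self.online_redo = []    # on disk: (block id, new value) records
        self.buffer_cache = {}   # in memory: block id -> value
        self.redo_buffer = []    # in memory: change records not yet flushed

    def update(self, block_id, value):
        # Change the buffer in memory and record the change in the redo log buffer.
        self.buffer_cache[block_id] = value
        self.redo_buffer.append((block_id, value))

    def commit(self):
        # Commit flushes the redo log buffer to the online redo log.
        # The datafile is not touched here.
        self.online_redo.extend(self.redo_buffer)
        self.redo_buffer.clear()

    def crash(self):
        # Everything held in memory is lost.
        self.buffer_cache.clear()
        self.redo_buffer.clear()

    def recover(self):
        # On restart, replay the online redo log against the datafile.
        for block_id, value in self.online_redo:
            self.datafile[block_id] = value


db = MiniDatabase()
db.update("block_7", "salary=5000")
db.commit()
db.update("block_9", "salary=100")   # never committed
db.crash()
db.recover()
print(db.datafile)  # {'block_7': 'salary=5000'}
```

The committed change survives the crash even though the datafile was never written before the crash; only the redo log was.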

Why doesn’t Oracle write the changes to the datafiles right away when we commit a transaction? The reason is simple. If it wrote directly to the datafiles, it would have to locate each data block in its datafile and then update it, so after committing the user would have to wait until DBWR had found and written every changed block before issuing the next command. Moreover, writes to datafiles are performed in units of Oracle data blocks, and each block may contain many rows. Modifying even one column of one row would require writing the whole block, which would degrade performance drastically. That is where the redo logs come in. Writes to the redo logs are sequential: LGWR simply appends the contents of the redo log buffer to the online redo log files, synchronously with the commit, so the user does not have to wait long. Moreover, in contrast to DBWR, which writes whole data blocks, LGWR writes only the change vectors. Hence write-ahead logging also improves performance by reducing the amount of data that must be written synchronously. When are the changes applied to the data blocks in the datafiles? DBWR updates them asynchronously in response to certain triggers. These triggers are called checkpoints.

A checkpoint is a synchronization event at a specific point in time that causes some or all dirty blocks to be written to disk, thereby guaranteeing that blocks dirtied before that point in time have been written.

Writing dirty blocks to the datafiles allows Oracle

- to reuse a redo log: A redo log can’t be reused until DBWR has written all the dirty blocks protected by that log file to disk. If Oracle attempts to reuse it before DBWR has finished its checkpoint, the following message appears in the alert log: Checkpoint not complete.

- to reduce instance recovery time: As the memory available to a database instance increases, buffer caches of several million buffers become possible. The database checkpoint must then advance frequently to limit recovery time, since infrequent checkpoints combined with a large buffer cache can lengthen crash recovery significantly.

- to free buffers for reads: A dirty buffer can’t be reused to read new data until it has been written to disk. DBWR therefore writes dirty blocks from the buffer cache to make room in the cache.
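The first point, redo log reuse, can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: the class RedoLogRing and its methods are hypothetical names, each dirty block is tracked only by the log that holds its earliest unwritten change, and a failed switch simply records the alert log message instead of making sessions wait.

```python
class RedoLogRing:
    """Toy model of online redo logs that are reused in a circular fashion."""

    def __init__(self, n_logs):
        self.n_logs = n_logs
        self.current = 0
        # For each log: dirty blocks whose earliest unwritten change lives there.
        self.protected = [set() for _ in range(n_logs)]
        self.alert_log = []

    def dirty(self, block_id):
        # Only the first change since the block was last written matters.
        if not any(block_id in blocks for blocks in self.protected):
            self.protected[self.current].add(block_id)

    def dbwr_checkpoint(self, log_no):
        # Stand-in for DBWR writing every dirty block protected by this log.
        self.protected[log_no].clear()

    def switch(self):
        nxt = (self.current + 1) % self.n_logs
        if self.protected[nxt]:
            # The next log still protects dirty blocks, so it cannot be reused.
            self.alert_log.append("Checkpoint not complete")
            return False
        self.current = nxt
        return True


ring = RedoLogRing(2)
ring.dirty("A")            # protected by log 0
first = ring.switch()      # log 1 is unused, so the switch succeeds
ring.dirty("B")            # protected by log 1
second = ring.switch()     # log 0 still protects block A
ring.dbwr_checkpoint(0)    # DBWR writes block A
third = ring.switch()      # now log 0 can be reused
print(first, second, third)  # True False True
print(ring.alert_log)        # ['Checkpoint not complete']
```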

Various types of checkpoints in Oracle:

- Full checkpoint

- Thread checkpoint

- File checkpoint

- Parallel Query checkpoint

- Object checkpoint

- Log switch checkpoint

- Incremental checkpoint

Whenever a checkpoint is triggered :

- DBWR writes some/all dirty blocks to datafiles

- CKPT process updates the control file and datafile headers

FULL CHECKPOINT

- Writes block images to the database for all dirty buffers from all instances.

- Statistics updated

. DBWR checkpoints

. DBWR checkpoint buffers written

. DBWR thread checkpoint buffers written

- Caused by :

. Alter system checkpoint [global]

. Alter database begin backup

. Alter database close

. Shutdown [immediate]

- Controlfile and datafile headers are updated

. Checkpoint_change#

THREAD CHECKPOINT

- Writes block images to the database for all dirty buffers from one instance

- Statistics updated

. DBWR checkpoints

. DBWR checkpoint buffers written

. DBWR thread checkpoint buffers written

- Caused by :

. Alter system checkpoint local

- Controlfile and datafile headers are updated

. Checkpoint_change#

FILE CHECKPOINT

When a tablespace is put into backup mode, taken offline, or made read only, Oracle writes all the dirty blocks of that tablespace to disk before changing the state of the tablespace.

- Writes block images to the database for all dirty buffers for all files of a tablespace from all instances

- Statistics updated

. DBWR checkpoints

. DBWR tablespace checkpoint buffers written

. DBWR checkpoint buffers written

- Caused by :

. Alter tablespace xxx offline

. Alter tablespace xxx begin backup

. Alter tablespace xxx read only

- Controlfile and datafile headers are updated

. Checkpoint_change#
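The full, thread and file checkpoints described so far differ mainly in which dirty buffers they cover. The following Python sketch shows that difference as a simple filter. The DirtyBuffer record, the scope names and the function buffers_to_write are all made up for this illustration and do not correspond to any Oracle structure.

```python
from dataclasses import dataclass


@dataclass
class DirtyBuffer:
    block_id: str
    instance: int      # instance (redo thread) that holds the dirty buffer
    tablespace: str    # tablespace the block belongs to


def buffers_to_write(dirty, scope, instance=None, tablespace=None):
    """Return the dirty buffers that a checkpoint of the given scope must write."""
    if scope == "full":      # all dirty buffers from all instances
        return list(dirty)
    if scope == "thread":    # all dirty buffers from one instance
        return [b for b in dirty if b.instance == instance]
    if scope == "file":      # all dirty buffers of one tablespace, all instances
        return [b for b in dirty if b.tablespace == tablespace]
    raise ValueError(f"unknown checkpoint scope: {scope}")


dirty = [
    DirtyBuffer("b1", 1, "USERS"),
    DirtyBuffer("b2", 2, "USERS"),
    DirtyBuffer("b3", 1, "SALES"),
]
full = [b.block_id for b in buffers_to_write(dirty, "full")]
thread = [b.block_id for b in buffers_to_write(dirty, "thread", instance=1)]
file_ = [b.block_id for b in buffers_to_write(dirty, "file", tablespace="USERS")]
print(full)    # ['b1', 'b2', 'b3']
print(thread)  # ['b1', 'b3']
print(file_)   # ['b1', 'b2']
```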

PARALLEL QUERY CHECKPOINT

Parallel query often results in direct path reads (full table scan or index fast full scan). This means that blocks are read straight into the session’s PGA, bypassing the buffer cache. If the buffer cache holds dirty buffers for those blocks, the session will not see the most recent versions unless they are copied to disk before the query starts, so parallel queries start with a checkpoint.

- Writes block images to the database for all dirty buffers belonging to objects accessed by the query from all instances.

- Statistics updated

. DBWR checkpoints

. DBWR checkpoint buffers written

- Caused by :

. Parallel Query

. Parallel Query component of Parallel DML (PDML) or Parallel DDL (PDDL)

- Mandatory for consistency

- Controlfile and datafile headers are updated

. Checkpoint_change#
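Why a direct path read needs this checkpoint can be shown with a tiny Python sketch. The dictionaries and function names below are hypothetical stand-ins for the buffer cache, the datafile and the two operations; nothing here is Oracle's actual code.

```python
# The buffer cache holds a newer, dirty version of the block than the datafile.
buffer_cache = {"emp_block_1": {"value": "salary=6000", "dirty": True}}
datafile = {"emp_block_1": "salary=5000"}


def checkpoint_blocks(block_ids):
    # Copy dirty buffers for the blocks the query will read down to disk.
    for block_id in block_ids:
        buf = buffer_cache.get(block_id)
        if buf is not None and buf["dirty"]:
            datafile[block_id] = buf["value"]
            buf["dirty"] = False


def direct_path_read(block_id):
    # Reads from disk into private memory and never looks at the buffer cache.
    return datafile[block_id]


stale = direct_path_read("emp_block_1")   # without a checkpoint first
checkpoint_blocks(["emp_block_1"])
fresh = direct_path_read("emp_block_1")   # after the checkpoint
print(stale, fresh)  # salary=5000 salary=6000
```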

OBJECT CHECKPOINT

When an object is dropped or truncated, the session initiates an object checkpoint, telling DBWR to copy any dirty buffers for that object to disk; the state of those buffers is then changed to free.

- Writes block images to the database for all dirty buffers belonging to an object from all instances.

- Statistics updated

. DBWR checkpoints

. DBWR object drop buffers written

- Caused by dropping or truncating a segment:

. Drop table XXX

. Drop table XXX Purge

. Truncate table xxx

. Drop index xxx

- Mandatory for media recovery purposes

- Controlfile and datafile headers are updated

. Checkpoint_change#

LOG SWITCH CHECKPOINT

- Writes to the database the contents of the dirty buffers whose changes are protected by a redo log.

- Statistics updated

. DBWR checkpoints

. DBWR checkpoint buffers written

. background checkpoints started

. background checkpoints completed

- Caused by log switch

- Controlfile and datafile headers are updated

. Checkpoint_change#

INCREMENTAL CHECKPOINT

Prior to Oracle 8i, the only well-known checkpoint was the log switch checkpoint. Whenever LGWR filled an online log file, DBWR would go into a frenzy writing data blocks to disk, and when it had finished, Oracle would update each datafile header block with the SCN to show that the file was up to date as of that point in time.

Oracle 8i introduced incremental checkpointing, which triggers DBWR to write some dirty blocks from time to time so as to advance the checkpoint and reduce instance recovery time.

Incremental checkpointing has been implemented using two algorithms:

- Ageing algorithm

- LRU/TCH algorithm

AGEING ALGORITHM

This strategy writes the changed blocks that have been dirty for the longest time, and is called aging writes. The algorithm relies on the checkpoint queue (CKPT Q) running through the cache, with buffers being linked to the end of this list the first time they are made dirty.

The LRU list contains all the buffers: free, pinned and dirty. Whenever a buffer in the LRU list is dirtied, it is placed in the CKPT Q as well, i.e. a buffer can simultaneously have pointers in both the LRU list and the CKPT Q, but the buffers in the CKPT Q are arranged in the order in which they were dirtied. Thus the checkpoint queue contains dirty blocks ordered by the SCN at which they were first dirtied.

Every 3 seconds DBWR wakes up and checks whether the CKPT Q holds enough dirty buffers that need to be written to satisfy the instance recovery requirement.

If that many dirty buffers are not found,

  DBWR goes back to sleep

else (enough dirty buffers found)

  . The CKPT target RBA (redo byte address, a position in the redo stream) is calculated based on

    – The most recent RBA

    – log_checkpoint_interval

    – log_checkpoint_timeout

    – fast_start_mttr_target

    – fast_start_io_target

    – 90% of the size of the smallest redo log file

  . DBWR walks the CKPT Q from its low end (the buffers dirtied earliest), collecting buffers for writing to disk until it reaches a buffer that was dirtied more recently than the target RBA. These buffers are placed in the write-main list.

  . DBWR walks the write-main list and checks each buffer

    – If the changes made to the buffer have already been written to the redo log files

      . Move the buffer to the write-aux list

    else

      . Trigger LGWR to write the changes to that buffer to the redo logs

      . Move the buffer to the write-aux list

  . Write the buffers from the write-aux list to disk

  . Update the checkpoint RBA in the SGA

  . Delink those buffers from the CKPT Q

  . Delink those buffers from the write-aux list

- Statistics Updated :

. DBWR checkpoint buffers written

- Controlfile updated every 3 secs by CKPT

. Checkpoint progress record

As sessions link buffers to one end of the list, DBWR can effectively unlink buffers from the other end and copy them to disk. To reduce contention between DBWR and foreground sessions, there are two linked lists in each working set so that foreground sessions can link buffers to one while DBWR is unlinking them from the other.
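The ageing algorithm can be sketched roughly as follows in Python. Please treat it as a simplified illustration only: the function incremental_checkpoint and its arguments are hypothetical, the target RBA is passed in rather than derived from the parameters listed above, RBAs are plain integers, and the write-main and write-aux lists are collapsed into one list. Returning the first-dirty RBA of the oldest buffer still queued as the new checkpoint position is also my assumption.

```python
from collections import deque


def incremental_checkpoint(ckpt_queue, buffer_cache, datafile,
                           target_rba, redo_flushed_upto):
    """One DBWR pass of the ageing algorithm over a toy checkpoint queue.

    ckpt_queue holds (first_dirty_rba, block_id) pairs, oldest first.
    """
    # Collect buffers from the old end until one is newer than the target RBA.
    write_list = []
    while ckpt_queue and ckpt_queue[0][0] <= target_rba:
        write_list.append(ckpt_queue.popleft())   # also delinks from the queue

    for _first_dirty_rba, block_id in write_list:
        buf = buffer_cache[block_id]
        # Write-ahead rule: the redo for a buffer must be on disk before the buffer.
        if buf["last_rba"] > redo_flushed_upto:
            redo_flushed_upto = buf["last_rba"]   # stand-in for triggering LGWR
        datafile[block_id] = buf["value"]         # stand-in for the disk write

    # Assumption: the checkpoint position is the oldest buffer still queued.
    checkpoint_rba = ckpt_queue[0][0] if ckpt_queue else target_rba
    return checkpoint_rba, redo_flushed_upto


buffer_cache = {
    "A": {"value": "a2", "last_rba": 10},
    "B": {"value": "b1", "last_rba": 25},
    "C": {"value": "c1", "last_rba": 30},
}
ckpt_queue = deque([(10, "A"), (20, "B"), (30, "C")])
datafile = {}
ckpt_rba, flushed = incremental_checkpoint(
    ckpt_queue, buffer_cache, datafile, target_rba=20, redo_flushed_upto=15)
print(ckpt_rba, flushed, datafile)  # 30 25 {'A': 'a2', 'B': 'b1'}
```

Blocks A and B are written because they were first dirtied at or before the target RBA; block C stays in the queue and marks the new checkpoint position.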

LRU/TCH ALGORITHM

The LRU/TCH algorithm writes to disk the cold dirty blocks that are on the point of being pushed out of the cache.

As per ageing algorithm, DBWR will wake up every 3 seconds to flush dirty blocks to disk. But if blocks get dirtied at a fast pace during those 3 seconds and a server process needs some free buffers, some buffers need to be flushed to the disk to make room. That’s when LRU/TCH algorithm is used to write those dirty buffers which are on the cold end of the LRU list.

Whenever a server process needs free buffers to read data, it scans the LRU list from its cold end looking for them.

While searching,

  if it finds unused buffers,

    Read blocks from disk into the buffers and link them to the corresponding hash bucket

  if it finds clean buffers (they contain data but were either never dirtied or were dirtied and have since been flushed to disk),

    if they are candidates to be aged out (low touch count)

      Read blocks from disk into the buffers and link them to the corresponding hash bucket

    else (they have been accessed recently and should not be aged out)

      Move them towards the MRU end depending upon their touch count

  if it finds dirty buffers (they are already in the CKPT Q),

    Delink them from the LRU list

    Link them to the write-main list (now these buffers are in both the CKPT Q and the write-main list)

The server process scans up to a threshold number of buffers (_db_block_max_scan_pct = 40 by default). If it does not find the required number of free buffers,

  It triggers DBWR to write the dirty blocks in the write-main list to disk

  . DBWR walks the write-main list and checks each buffer

    – If the changes made to the buffer have already been written to the redo log files

      . Move the buffer to the write-aux list

    else

      . Trigger LGWR to write the changes to that buffer to the redo logs

      . Move the buffer to the write-aux list

  . Write the buffers from the write-aux list to disk

  . Delink those buffers from the CKPT Q and the write-aux list

  . Link those buffers to the LRU list as free buffers

Note that

- In this algorithm, the dirty blocks are delinked from the LRU list before being linked to the write-main list, in contrast to the ageing algorithm, where a block can be in both the CKPT Q and the LRU list simultaneously.

– In this algorithm, checkpoint is not advanced because it may be possible that the dirty blocks on the LRU end may actually not be the ones which were dirtied earliest. They may be there because the server process did not move them to the MRU end earlier. There might be blocks present in CKPT Q which were dirtied earlier than the blocks in question.
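The LRU/TCH scan can also be sketched in Python. Again this is only a rough model under my own assumptions: the function names, the hot_touch threshold and the buffer dictionaries are invented, the value 40 is interpreted as a percentage of the list (as the parameter name suggests), the redo check before writing is left out, and flushed buffers are relinked at the cold end of the list.

```python
from collections import deque


def scan_for_free_buffer(lru, write_list, scan_pct=40, hot_touch=2):
    """Scan the LRU list from its cold end (left) for a reusable buffer.

    Returns the buffer, or None if the scan limit is reached first.
    """
    scan_limit = max(1, len(lru) * scan_pct // 100)
    for _ in range(scan_limit):
        if not lru:
            break
        buf = lru[0]
        if buf["state"] == "free":
            return buf                    # unused buffer: use it
        if buf["state"] == "clean" and buf["touch"] < hot_touch:
            return buf                    # cold clean buffer: age it out and reuse
        lru.popleft()
        if buf["state"] == "clean":
            lru.append(buf)               # recently used: move to the MRU end
        else:
            write_list.append(buf)        # dirty: delink and hand over to DBWR
    return None


def dbwr_flush(write_list, lru):
    # Stand-in for DBWR writing the buffers to disk and freeing them.
    for buf in write_list:
        buf["state"] = "free"
        lru.appendleft(buf)               # assumption: relinked at the cold end
    write_list.clear()


lru = deque([
    {"name": "A", "state": "dirty", "touch": 0},
    {"name": "B", "state": "clean", "touch": 5},
    {"name": "C", "state": "dirty", "touch": 1},
    {"name": "D", "state": "clean", "touch": 0},
    {"name": "E", "state": "clean", "touch": 0},
])
write_list = []
first = scan_for_free_buffer(lru, write_list)    # scans A and B, finds nothing
dbwr_flush(write_list, lru)                      # A is written and freed
second = scan_for_free_buffer(lru, write_list)   # A is now free
print(first, second["name"])  # None A
```

Note that the sketch never touches a checkpoint position, which matches the point above: these writes free buffers but do not advance the checkpoint.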

I hope the information was useful. Thanks for your time.

Keep visiting the blog…

References:

Oracle Core: Essential Internals for DBAs and Developers by Jonathan Lewis

http://www.dbafree.net/wp-content/uploads/2011/05/CheckPoints.pdf

http://jonathanlewis.wordpress.com/2007/04/12/log-file-switch/

https://saruamit4.wordpress.com/2014/11/01/checkpoint-scn/

https://community.oracle.com/thread/886580

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.105.2184&rep=rep1&type=pdf

How many checkpoints in Oracle Database


49 thoughts on “ORACLE CHECKPOINTS”

Rahul Gupta says:

April 8, 2013 at 12:58 pm

Thanks maa’m for writing such a nice post. cleared my lot of doubts on checkpointing.

1. Anju Garg says:
  
  April 9, 2013 at 7:28 am
  
  Thanks Rahul for your time to read my post!
  
JAMSHER KHAN says:

April 9, 2013 at 7:35 am

Hi Maam,

Your blog is really wonderful I am looking forward to read each post. Thanks for sharing such a wonderful knowledge about Oracle.

1. Anju Garg says:
  
  April 9, 2013 at 8:20 am
  
  Thanks Jamsher!
  I don’t know much but whatever I know I like to share. It’s nice to know that this post was useful .
  
Jim W from Merritt Island says:

May 22, 2013 at 1:15 pm

This is one of the clearest and easiest to understand descriptions of the process herein. Thank you for sharing.

Rami says:

May 30, 2013 at 1:37 am

It is very clear and worth description, thank you so much for sharing this information

Shyam Kishor Agarwal says:

June 23, 2013 at 7:38 am

Thank You very much for writing such a great blog. It’s helpful for me.

praveen says:

June 27, 2013 at 3:11 am

good post! but what is RBA?

K.P says:

July 4, 2013 at 10:56 pm

Hi Anju,

“CKPT target RBA is calculated based on
– The most recent RBA
– log_checkpoint_interval
– log_checkpoint_timeout
– fast_start_mttr_target
– fast_start_io_target
– 90% of the size of the smallest redo log file”

is that means: (step by step)

1. Oracle find the most recent RBA (high RBA)
2. If i use fast_start_mttr_target = 600 and size of smallest redo log file equals 512M – and when one of two condition satisfied, checkpoint will happen. And redo entries will be written to online redo log file from low RBA (dirtied earliest) to high RBA before DBWR working.

Is this right ?

Thanks
K.P

Nouman Rashid says:

August 26, 2013 at 5:55 am

Excellent Post !! To the mark !! I have read oracle 11g Certification book and knows all info about check pointing but you summed it all in a one great post ! Thanks for sharing

1. Anju Garg says:
  
  August 26, 2013 at 10:07 pm
  
  Thanks!
  
  Anju Garg
  
Stephen T. says:

January 20, 2014 at 4:57 pm

Amazing post! thank you!

1. Anju Garg says:
  
  January 20, 2014 at 9:55 pm
  
  Thanx Stephen!
  
  Your comments and suggestions are always welcome.
  
  Regards
  ANju Garg
  
Atul Markan says:

May 2, 2014 at 4:29 am

Really like it…. Thanks

1. Anju Garg says:
  
  May 2, 2014 at 5:40 am
  
  Thanks for your time Atul!!
  
  Regards
  Anju Garg
  
  1. Rohit says:
    
    August 20, 2014 at 3:01 am
    
    Gr8..Keep it up
    
    1. Anju Garg says:
      
      August 20, 2014 at 5:26 am
      
      Thanx Rohit.
      
      Regards
      Anju Garg
      
TNK says:

August 23, 2014 at 8:39 am

well done …keep it up bro….!!!!

susi says:

September 22, 2014 at 1:28 am

hello Anju

its really usefull.i cleared many of my doubts.
but what is RBA
and can u plz explain those two algorithms in detail plz

supriyo Dey says:

December 1, 2014 at 4:28 am

can you confirm that incremental checkpoint happens in every 3 seconds

1. Anju Garg says:
  
  December 1, 2014 at 5:09 am
  
  INcremental checkpoint has to take place at least every 3 seconds. It might happen earlier if a server process requires some free buffers and there are not any.
  
  Regards
  Anju
  
2. Anju Garg says:
  
  December 7, 2014 at 10:20 pm
  
  Hi Supriyo,
  
  I would like to add to my earlier reply that, DBWR wakes up every 3 seconds and checks if enough dirty blocks have accumulated to be written to disk. If not, DBWR might not write any dirty blocks to disk even after 3 seconds. It implies that DBWR will write whenever there is need for free buffers. It might be earlier than 3 seconds or later than 3 seconds even.
  
  Regards
  Anju Garg
  
mohan says:

December 7, 2014 at 10:45 pm

Very well explained !!

regards from a storage admin.

1. Anju Garg says:
  
  December 8, 2014 at 4:35 am
  
  Thanks for your time Mohan.
  
  Your comments and suggestions are always welcome.
  
  Regards
  Anju Garg
  
Jijo says:

January 13, 2015 at 10:49 am

Thanks maa’m for this nice post,

1. Anju Garg says:
  
  January 13, 2015 at 10:51 pm
  
  Thanks Jijo for your time.
  Your comments and suggestions are always welcome.
  
  regards
  Anju Garg
  
nitishanandsrivastava says:

March 11, 2015 at 9:15 am

Another excellent post.

However in the 2nd paragraph

“Why doesn’t Oracle write the changes to datafiles right away when we commit the transaction? The reason is simple. If it chose to write directly to the datafiles, it will have to physically locate the data block in the datafile first and then update it which means that after committing, user has to wait until “”””””DBWR””””” searches for the block and then writes it before he can issue next command.”

As per my understanding “Server Process searches for the block and then DBWR writes it before ….” Isn’t the Server Process is responsible for scanning the datafile for the required block and then fetching the copy to buffer cache ??

Once again great post specially the explanation of the algorithms

Regards
Junior DBA

1. Anju Garg says:
  
  March 24, 2015 at 3:00 am
  
  Thanks Nitish for your time and feedback.
  
  Here, I imply that the dirty buffer in buffer cache will have to be written to the corresponding block in the datafile which requires positioning of the head.
  
  Hope it helps.
  
  regards
  Anju
  
karthik garrepalli says:

May 6, 2015 at 10:03 pm

Thanks Anju for the Valuable post on checkpoints and background processes, worth spending time

1. Anju Garg says:
  
  May 6, 2015 at 10:50 pm
  
  Thanks Karthik for your time and feedback.
  
  Your comments and suggestions are always welcome.
  
  Regards
  Anju
  
Partha Chakraborty says:

June 8, 2015 at 1:57 am

Ma’am, Its so complete of the related process. Explaining the complexity in real simple words. I am thankful to you.

1. Anju Garg says:
  
  June 8, 2015 at 3:32 am
  
  Thanks Partha for your time and feedback.
  
  Your comments and suggestions are always welcome.
  
  regards
  Anju
  
udayjampaniUday says:

August 9, 2015 at 3:07 pm

Great Anju , the way it is explained made understanding very easy

1. Anju Garg says:
  
  August 9, 2015 at 11:14 pm
  
  Thanks Uday for your time and feedback.
  
  Your comments and suggestions are always welcome!
  
  regards
  Anju
  
javi says:

August 28, 2015 at 7:51 pm

The most comprehensible and educational text on checkpoints I ever read. Cleared all my doubts.

regards,
Javi

1. Anju Garg says:
  
  August 29, 2015 at 10:51 pm
  
  Thanks Javi for your time and feedback!
  
  regards
  Anju
  
Ganesh says:

September 1, 2015 at 6:38 pm

Very clear explanation

1. Anju Garg says:
  
  September 1, 2015 at 11:31 pm
  
  Thanks Ganesh for your time and feedback.
  
  Your comments and suggestions are always welcome.
  
  Regards
  Anju
  
Manoj says:

September 14, 2015 at 4:33 am

thanks anuj , very clear explaination

1. Anju Garg says:
  
  September 14, 2015 at 5:02 am
  
  Thanks Manoj for your time and feedback!
  
  I think you have mis-spelt my name. My name is ANJU.
  
  regards
  Anju
  
Tom says:

September 16, 2015 at 9:42 am

Superb Statement Easy to Understand

1. Anju Garg says:
  
  September 17, 2015 at 7:55 am
  
  Thanks Tom for your time and feedback.
  
  Your comments and suggestions are always welcome.
  
  regards
  Anju
  
karthik says:

January 19, 2016 at 8:34 pm

Really nice article

1. Anju Garg says:
  
  January 19, 2016 at 11:01 pm
  
  Thanks KArtik for your time and feedback.
  
  regards
  Anju
  
karthik says:

January 20, 2016 at 7:28 pm

I was really struggling to understand how the logbuffer and database buffer cache work in the oracle database. This article cleared all my doubts related to that.Nice work

Thank you

Karthik

Alec says:

June 25, 2016 at 10:14 am

Hi, Anju,
Thank you for very detailed explanation. Could you please clarify what is RDA?

Thank you.

1. Anju Garg says:
  
  June 25, 2016 at 11:50 pm
  
  Hi Alec,
  
  Thanks for your time and feedback.
  Regarding RBA, please refer to following link:
  http://www.ixora.com.au/notes/rba.htm
  
  Your comments and suggestions are always welcome.
  
  Regards
  Anju
  
ram ram says:

September 15, 2016 at 1:54 pm

what is checkpoint in simple way

1. Anju Garg says:
  
  September 15, 2016 at 11:22 pm
  
  Hi
  
  Checkpoint is a synchronization event at a specific point in time which causes some / all dirty blocks to be written to disk thereby guaranteeing that blocks dirtied prior to that point in time get written.
  
  Hope it helps
  
  Regards
  Anju
  