Subscribe to Posts by Email

Subscriber Count

    703

Disclaimer

All information is offered in good faith and in the hope that it may be of use for educational purpose and for Database community purpose, but is not guaranteed to be correct, up to date or suitable for any particular purpose. db.geeksinsight.com accepts no liability in respect of this information or its use. This site is independent of and does not represent Oracle Corporation in any way. Oracle does not officially sponsor, approve, or endorse this site or its content and if notify any such I am happy to remove. Product and company names mentioned in this website may be the trademarks of their respective owners and published here for informational purpose only. This is my personal blog. The views expressed on these pages are mine and learnt from other blogs and bloggers and to enhance and support the DBA community and this web blog does not represent the thoughts, intentions, plans or strategies of my current employer nor the Oracle and its affiliates or any other companies. And this website does not offer or take profit for providing these content and this is purely non-profit and for educational purpose only. If you see any issues with Content and copy write issues, I am happy to remove if you notify me. Contact Geek DBA Team, via geeksinsights@gmail.com

Pages

11gR2 RAC: Important Trace Directories & Log Locations

All,

Please find the structured 11gR2 GI Clusterware log locations and its relevancy.

Directory structured (Source: Oracle Documentation)

image

Log Locations

Continue reading 11gR2 RAC: Important Trace Directories & Log Locations

Replication: does Streams works after role transition i.e switchover or failover

Hello,

Today I have got a call containing couple of questions on streams replication in a standby environment especially when the streams is on downstream capture database.

Here are the questions & my answers to it.

Environment:-

Primary or Source: -10GR2 RAC with 2 node

Standby: Physical Standby

Streams: Downstreams capture database in different server

Behavior of streams at the time of shifting from Primary to Standby and vice versa :

a ) In case if Primary is shifted to Secondary(stand by) DB, the streams DB is to be replicated fully (not from point where it stopped)

This is called switchover/Failover or the role transitions. In your streams environment as the changes captured in downstream database (different database) with minimal changes or the pre-requisite this transitions can be performed seamlessly and the streams continue to work.

Procedure:-

Pre-requisities:- Continue reading Replication: does Streams works after role transition i.e switchover or failover

11g Grid Infrastructure (RAC): CRS Startup Sequence

 

This is about to understand the startup sequence of Grid Infrastructure daemons and its resources in 11gR2 RAC.

 

In 11g RAC aka Grid Infrastructure we all know there are additional background daemons and agents, and the Oracle documentation is not so clear nor the other blog.

 

For example:- I have found below diagram from metalink which is very confusing.

Continue reading 11g Grid Infrastructure (RAC): CRS Startup Sequence

11g Feature: Audit Trail Purging, No more custom scripts

Dear All,

Prior to 11g, in order to purge the audit trails we have to write a custom script and run in a cron or control M scheduler job or a DBMS_SCHEDULER, where in

From 11g 11.1.0.7 onwards Oracle provides a package DBMS_AUDIT_MGMT to manage the audit trails.

Additionally, This package can also be deployed in 10.2.0.3 databases via patch as mentioned in note 731908.1.

Example:-

Automated Audit trail Purging

Continue reading 11g Feature: Audit Trail Purging, No more custom scripts

Quiz Post #1 : SQL Magic

Q: is it my possible to make my query to change from "select 'what is my name' from dual" to "select 'you'r Geek DBA' from dual" during execution.

A: this feature is called "query equivalence".

query equivalence is declared using the dbms_advanced_rewrite.declare_rewrite_equivalence procedure, and uses the syntax:

dbms_advanced_rewrite.declare_rewrite_equivalence(declaration_name,source_statement,target_statement);

Scripts: Find out a expensive sql statements from AWR Top Events with just three small scripts

A typical day starts like this.

1) Application team complained about slowness in their job

2) DBA looking into this

No clues or details from App team saying this particular sql is that we are running but the response time of the database server is bit slow, so obvious that app team reported slowness.

This time I have chosen event based troubleshooting approach from AWR whether to see am I able to drill down the issue.

What I started first is, lets take a closer look at top classes in the database that is at the moment.

 

set lines 1000
set pages 1000
select wait_class_id, wait_class, count(*) cnt
from dba_hist_active_sess_history
where snap_id between 5205 and 5208
group by wait_class_id, wait_class
order by 3;

  WAIT_CLASS_ID WAIT_CLASS                CNT
------------- ------------------ ----------
Continue reading Scripts: Find out a expensive sql statements from AWR Top Events with just three small scripts

ORA-15036: disk ‘string’ is truncated, ASM disks group DATA is not mounting

Hello,

Another issue came up today!!!

Issue:- ORA-15036: disk 'string' is truncated The size of the disk, as reported by the operating system, was smaller than the size of the disk as recorded in the disk header block on the disk.

Apparently the concerned disk group is not mounting, unfortunately this contains our OCR Disks (okay, bigger problem)

Sequence of actions or the issues encountered:-

1) DBA found that disk partitions are not visible, passed to unix to fix the issue

2) Unix tried many alternatives and involved Novell (Suse Linux) as well , and Novell wisely rewritten the disk headers and they are visible now

3) Passed to DBA, now the DBA turn to mount the diskgroups

4) Got hit with ORA-15036, and CRSD failing to come up , parallely its resources as well.

5) Raised SR, Oracle suggested to recreate and restore the diskgroups data

6) Novell, said they can't do anything.

Strucked ....... and Corner end..... what else?

Continue reading ORA-15036: disk ‘string’ is truncated, ASM disks group DATA is not mounting

Basics: What is RAID?

Hello,

A Simple readable format about "What is RAID?"

RAID (Redundant Array of Independent Disks). A collection of disk drives that offers increased performance and fault tolerance. There are a number of different RAID levels. The three most commonly used are 0, 1, and 5:

  • Level 0: striping without parity (spreading out blocks of each file across multiple disks).
  • Level 1: disk mirroring or duplexing.
  • Level 2: bit-level striping with parity
  • Level 3: byte-level striping with dedicated parity. Same as Level 0, but also reserves one dedicated disk for error correction data. It provides good performance and some level of fault tolerance.
  • Level 4: block-level striping with dedicated parity
  • Level 5: block-level striping with distributed parity
  • Level 6: block-level striping with two sets of distributed parity for extra fault tolerance
  • Level 7: Asynchronous, cached striping with dedicated parity

Hope this helps!!!!

ORA-06512: at “SYS.KUPV$FT” – impdp failing

This time, impdp failing to create a job for import and failing with following error:-

ORA-06512: at "SYS.KUPV$FT",

In many cases you sometimes stop data pump job or in case of an abnormal end of the Data Pump job (we call it the orphaned job) or using undocumented parameter KEEP_MASTER=Y, the master table remain in the database.

So lets start digging out for orphaned jobs

Script to check Orphaned Pump Jobs

SET lines 140
COL owner_name FORMAT a10;
COL job_name FORMAT a20
COL state FORMAT a12
COL operation LIKE owner_name
COL job_mode LIKE owner_name
SELECT owner_name, job_name, operation, job_mode,
state, attached_sessions
FROM dba_datapump_jobs;

 

By running above script if any rows returned in “not running “ mode that means you will be having orphaned jobs. Typically orphaned jobs state will be “Not Running” state and but in my case some are even in Executing mode which they shouldn’t

As I got the following jobs running at my nose.

 

DBABATCH SYS_EXPORT_FULL_22 Executing

DBABATCH SYS_EXPORT_FULL_22 Not Running
DBABATCH SYS_EXPORT_FULL_22 Not Running
DBABATCH SYS_EXPORT_FULL_22 Not Running
DBABATCH SYS_EXPORT_FULL_22 Not Running
DBABATCH SYS_EXPORT_FULL_22 Not Running

 

Verify that any data pump sessions are running in the database

Continue reading ORA-06512: at “SYS.KUPV$FT” – impdp failing

PMON (ospid: nnnn): terminating the instance due to error 481

 

We got a known issue with ASM not coming up in second and subsequently failed to start crs and other resources.

Review grid alert log & os Logs

  • $GRID_HOME/log/<nodename>/alert<nodename>.log

oifcfg shows

$oifcfg –getif

eth0 192.168.2.10 global_clusterinterconnect

eth1 192.168.10.2 global

usb0 169.254.95.0

eth0:2 169.254.96.0

eth0:3 169.254.95.0

 

From 11g R2 (I believe 11.2.0.2 onwards) there is a cluster resource called HAIP which used to manage the cluster interconnects high availability. Prior to 11gr2 if cluster interconnect goes down there will be hang/node evictions depends on the situation. Where in from 11gr2 onwards we can specify up to 3 (as I known) cluster interconnects for a cluster which internally manages with this non-routable IP’s, Essentially, even if one of the physical interface is offline, private interconnect traffic can be routed through the other available physical interface. This leads to highly available architecture for private interconnect traffic.

 

Nice explanation from Riyaz’s Note:-

HAIP, High Availability IP, is the Oracle based solution for load balancing and failover for private interconnect traffic. Typically, Host based solutions such as Bonding (Linux), Trunking (Solaris) etc is used to implement high availability solutions for private interconnect traffic. But, HAIP is an Oracle solution for high availability. During initial start of clusterware, a non-routeable IP address is plumbed on the private subnet specified. That non-routable IP is used by the clusterware and the database for private interconnect traffic.

 

Now back to the issue:-

As you can see the usb0 is attached to the the internal IP (169.254.X.X is an non routable IP’s range internal to OS) , clusterware will confused with this and unable to start the crs resources promptly.

Continue reading PMON (ospid: nnnn): terminating the instance due to error 481