HAIP related Node Eviction Bugs:
Table of Contents
Bug 16876500 – GI HAIP AGENT DROPS A ROUTE FREQUENTLY AND THAT LEADS TO THE INSTANCE EVICTION
Rediscovery: - The netstat -rn shows only three haip addresses during one of the incident when it should report 4 haip adresses ( nxge2:1 was missing ) 169.254.64.0 169.254.105.29 U 1 408 nxge5:1 169.254.128.0 169.254.171.182 U 1 47 nxge6:1 169.254.192.0 169.254.250.239 U 1 45 nxge1:1 - Agent Log reports: 2013-05-22 12:32:43.834: [ USRTHRD][18] {0:0:2} (:CLSN00037:) Removed unused HAIP route 169.254.0.0 / 255.255.192.0 / 169.254.46.44 / nxge2:1 Root Cause : orarootagent sometime deleted the HAIP routing data Fix : orarootagent will not delete any in-used HAIP routing data after applying patch 16876500 Fixed in : 11.2.0.3.4GIPSU, 12.1.0.1,
Bug 14385860 – SOL.SPARC64 : CLSRSC-257: CLUSTER TIME SYNCHRONIZATION SERVICE START IN EXCLUSIV
Duplicate Bugs: Bug 17571466 - EVICTION AFTER IPC SEND TIMEOUT AFTER REMOVED UNUSED HAIP ROUTE 169.254.0.0 Bug 17410892 - HAIP FAILURE CAUSING ASM INSTANCE EVICTIONS Bug 16985519 - HAIP FAILURE CAUSING INSTANCE EVICTIONS Bug 17457316 - RAC INSTANCE EVICTION WITH IPC SEND TIMEOUT DETECTED Bug 17299910 - RAC INSTANCE GETTING EVICTED AFTER IPC SEND TIMEOUT DETECTED Rediscovery: - empty ip address seen in the orarootagent logs 2013-06-18 07:20:07.090: [ GIPCNET][16] gipcmodNetworkAttrAddrOsd: slos info: empty ip address - OSWatcher traceroute is clean, OS Messages files are clean, NETSTAT data is clean - Alert.log: IPC Send timeout detected. Receiver ospid 474 [oracle@bmblrac01 (LMS6)] Root Cause : Used ARP calls returning wrong information Fix : Avoid the HAIP from failover by applying patch 14385860 Fixed in : 11.2.0.3.7GIPSU , 12.1.0.1