网络故障原因,超乎想象

510阅读 0评论2022-04-08 brjl
分类:Oracle

dba对网络知识一般比较薄弱,但影响巨大,如何诊断网络故障呢?


诊断方向:
配置了防火墙
1.网线/网卡/交换机问题
2.UDP receive(rx) buffer sizes设置过小/UDP buffer socket溢出
3.私网性能差并且CPU使用率高,`netstat -s` 出现packet reassembly failures。
4.网络传输坏块(corruption)导致的UDP checksum errors 和/或 send (tx) / receive (rx) transmission errors
5.在通信通道中设置了不匹配的MTU的值

共有26个

  1. SELECT sysdate snap_time,a.inst_ID "INSTANCE", A.VALUE "GC BLOCKS LOST",
  2.  B.VALUE "GC CUR BLOCKS SERVED",
  3.  C.VALUE "GC CR BLOCKS SERVED",
  4.  round(A.VALUE/(B.VALUE+C.VALUE+0.01),2) RATIO
  5.  FROM GV$SYSSTAT A, GV$SYSSTAT B, GV$SYSSTAT C
  6.  WHERE A.NAME='gc blocks lost' AND
  7.  B.NAME='gc current blocks served' AND
  8.  C.NAME='gc cr blocks served' and
  9.  B.INST_ID=a.inst_id AND
  10.   C.INST_ID = a.inst_id
  11.  order by 2;
  12.  
  13. select a.inst_id "instance", a.value "gc blocks lost",
  14. b.value "gc cur blocks served",
  15. c.value "gc cr blocks served",
  16. a.value/(b.value+c.value+0.01) ratio
  17. from gv$sysstat a, gv$sysstat b, gv$sysstat c
  18. where a.name='gc blocks lost' and
  19. b.name='gc current blocks served' and
  20. c.name='gc cr blocks served' and
  21. b.inst_id=a.inst_id and
  22. c.inst_id = a.inst_id ;

  23. select inst_id, event, total_waits,time_waited
  24. from gv$system_event
  25. where event in ('gc current block lost','gc cr block lost')
  26. order by inst_id, total_waits desc;
参考:
RAC 环境中 gc block lost 和私网通信性能问题的诊断 (Doc ID 1674865.1)
上一篇:dbms_backup_restore version target database is not current
下一篇:aix 压缩及解压目录