昨天安装的测试环境的rac--2节点
CentOS release 6.8 (Final)
SQL*Plus: Release 11.2.0.4.0 Production
今天测试突然出现问题
在rac01节点上面操作报错
Connected.
SQL> select * from v$instance;
select * from v$instance
*
ERROR at line 1:
ORA-01012: not logged on
Process ID: 0
Session ID: 0 Serial number: 0
这个时候创建表空间,用户均报错
用toad工具连接vip 209,进行创建表空间,用户均可以成功,于是怀疑rac01节点有问题
果然
执行就报错
检查节点1的权限和路径
[root@rac11 ~]# chown -R grid:oinstall /u01/app/11.2.0/grid/
[root@rac11 ~]# ll /u01/app/11.2.0/grid/
这时候asmcmd可以执行
然后登陆sqlplus
[oracle@rac11 ~]$ sqlplus / as sysdba SQL*Plus: Release 11.2.0.4.0 Production on Wed Jun 5 15:48:44 2019 Copyright (c) 1982, 2013, Oracle. All rights reserved. Connected.
SQL> select * from v$log;
select * from v$log
*
ERROR at line 1:
ORA-01012: not logged on
Process ID: 0
Session ID: 0 Serial number: 0
查看错误日志
[oracle@rac11 trace]$ tail -n 100 /u01/app/oracle/diag/rdbms/bol/bol1/trace/alert_bol1.log
Fatal NI connect error 12547, connecting to:
(DESCRIPTION=(ADDRESS=(PROTOCOL=beq)(PROGRAM=/u01/app/11.2.0/grid/bin/oracle)(ARGV0=oracle+ASM1_o000_bol1)(ENVS='ORACLE_HOME=/u01/app/11.2.0/grid,ORACLE_SID=+ASM1')(ARGS='(DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))'))(enable=setuser)(CONNECT_DATA=(CID=(PROGRAM=oracle@rac11)(HOST=rac11)(USER=oracle)))) VERSION INFORMATION:
TNS for Linux: Version 11.2.0.4.0 - Production
Oracle Bequeath NT Protocol Adapter for Linux: Version 11.2.0.4.0 - Production
Time: 05-JUN-2019 15:51:55
Tracing not turned on.
Tns error struct:
ns main err code: 12547 TNS-12547: TNS:lost contact
ns secondary err code: 12560
nt main err code: 517 TNS-00517: Lost contact
nt secondary err code: 32
nt OS err code: 0
ERROR: Failed to connect with connect string: (DESCRIPTION=(ADDRESS=(PROTOCOL=beq)(PROGRAM=/u01/app/11.2.0/grid/bin/oracle)(ARGV0=oracle+ASM1_o000_bol1)(ENVS='ORACLE_HOME=/u01/app/11.2.0/grid,ORACLE_SID=+ASM1')(ARGS='(DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))'))(enable=setuser))
WARNING: ASM communication error: op 18 state 0x40 (12547)
ERROR: slave communication error with ASM
NOTE: Deferred communication with ASM instance
Errors in file /u01/app/oracle/diag/rdbms/bol/bol1/trace/bol1_pmon_840.trc:
ORA-12547: TNS:lost contact
[oracle@rac11 trace]$ more /u01/app/oracle/diag/rdbms/bol/bol1/trace/bol1_pmon_840.trc
WARNING: ASM communication error: op 18 state 0x40 (1034)
ERROR: slave communication error with ASM
NOTE: Deferred communication with ASM instance
ORA-01034: ORACLE not available
ORA-27123: unable to attach to shared memory segment
Linux-x86_64 Error: 13: Permission denied
Additional information: 26
Additional information: 360454
WARNING: ASM communication error: op 18 state 0x40 (1034)
ERROR: slave communication error with ASM
NOTE: Deferred communication with ASM instance
ORA-01034: ORACLE not available
ORA-27123: unable to attach to shared memory segment
Linux-x86_64 Error: 13: Permission denied
Additional information: 26
Additional information: 360454
查看db状态
[grid@rac11 ~]$ srvctl status asm -a
ASM is running on rac11,rac12
ASM is enabled.
[grid@rac11 ~]$ srvctl status database -d bol
Instance bol1 is not running on node rac11
Instance bol2 is running on node rac12
节点1没有启动
尝试启动报错
[grid@rac11 ~]$ srvctl start instance -d bol -i bol1
PRCR-1013 : Failed to start resource ora.bol.db
PRCR-1064 : Failed to start resource ora.bol.db on node rac11
CRS-5017: The resource action "ora.bol.db start" encountered the following error:
ORA-01078: failure in processing system parameters
ORA-01565: error in identifying file '+DATA/bol/spfilebol.ora'
ORA-17503: ksfdopn:2 Failed to open file +DATA/bol/spfilebol.ora
ORA-12547: TNS:lost contact
. For details refer to "(:CLSN00107:)" in "/u01/app/11.2.0/grid/log/rac11/agent/crsd/oraagent_oracle//oraagent_oracle.log". CRS-2674: Start of 'ora.bol.db' on 'rac11' failed
没遇到这样的问题,一阵baidu,google骚操作,尝试执行
[grid@rac11 ~]$ ls -l /u01/app/11.2.0/grid/bin/oracle
-rwxr-x--x 1 grid oinstall 209914479 Jun 4 15:32 /u01/app/11.2.0/grid/bin/oracle
[grid@rac11 ~]$ chmod 6751 /u01/app/11.2.0/grid/bin/oracle
[grid@rac11 ~]$ ls -l /u01/app/11.2.0/grid/bin/oracle
-rwsr-s--x 1 grid oinstall 209914479 Jun 4 15:32 /u01/app/11.2.0/grid/bin/oracle
[oracle@rac11 ~]$ ls -l /u01/app/oracle/product/11.2.0/db_1/bin/oracle
-rwxr-s--x 1 oracle asmadmin 239626641 Jun 4 16:00 /u01/app/oracle/product/11.2.0/db_1/bin/oracle
---rac02
[grid@rac12 ~]$ ls -l /u01/app/11.2.0/grid/bin/oracle
-rwsr-s--x 1 grid oinstall 209914479 Jun 4 15:33 /u01/app/11.2.0/grid/bin/oracle
[oracle@rac12 ~]$ ls -l /u01/app/oracle/product/11.2.0/db_1/bin/oracle
-rwsr-s--x 1 oracle asmadmin 239626641 Jun 4 16:03 /u01/app/oracle/product/11.2.0/db_1/bin/oracle
--rac01
[grid@rac11 ~]$ cd $ORACLE_HOME/bin/
[grid@rac11 bin]$ ll setasmgid
-rwxr-x--x 1 grid oinstall 2279833 Jun 4 15:32 setasmgid
[grid@rac11 bin]$ set setasmgid setasmgidwrap setfattr setkeycodes setmetamode setsid setup setarch setasmgid0 setfacl setfont setleds setserial setterm setxkbmap
[grid@rac11 bin]$ setasmgidwrap o=/u01/app/oracle/product/11.2.0/db_1/bin/oracle
[grid@rac11 bin]$ ls -l /u01/app/oracle/product/11.2.0/db_1/bin/oracle
-rwxr-s--x 1 oracle asmadmin 239626641 Jun 4 16:00 /u01/app/oracle/product/11.2.0/db_1/bin/oracle
[启动实例bol1][grid@rac11 bin]$ srvctl status database -d bol
Instance bol1 is running on node rac11
Instance bol2 is running on node rac12
节点01可以启动
--很费劲为什么会出现这样的问题,看了一下history,01,02节点的命令都是一样的,执行的时候也没出什么错误
~~~~~