Original work. Reposting is permitted, provided the repost cites the original source, the author information, and this notice via hyperlink; otherwise legal liability will be pursued. http://chenhao6.blog.51cto.com/6228054/1323192
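CentOS 6.2 + Nginx + Nagios, with SMS and QQ Mail alerts
Note: 192.168.0.21 is the server
192.168.0.22 is the client
Environment: two CentOS 6.0 64-bit systems, both with a source-built LNMP stack already set up
The required software packages are linked at the end
1. Installing Nagios (Chinese edition)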
tar xvf tar .bz2
cd nagios-cn-3.2.3
useradd -m -s /bin/bash nagios
usermod -a
./configure --prefix=/usr/local/nagios --with-command-group=nagcmd
make
make all
make install
make install-init
make install-config
make install-commandmode
chmod o+rwx /usr/local/nagios/var/rw
2. Installing nagios-plugins
wget //prdownloads.sourceforge.net/sourceforge/nagiosplug/nagios-plugins
tar zxvf tar .gz
cd nagios-plugins-1.4.16
yum install make apr* openssl kernel cloog-ppl krb5-devel
./configure --prefix=/usr/local/nagios --with-mysql=/home/mysql/
make
make install
3. Installing NRPE
tar xzvf tar .gz
cd nrpe-2.12
./configure
make
./configure
make all
make install-plugin
make install-daemon
make install-daemon-config
\cp src/check_nrpe /usr/local/nagios/libexec/
/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg
echo '/usr/local/nagios/bin/nrpe …' >> /etc/rc.local
To restart NRPE, kill the process first and then start it again:
kill `ps aux | grep nrpe | grep -v grep | awk '{print $2}'`
/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg
Test locally:
/usr/local/nagios/libexec/check_nrpe -H
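The kill step for NRPE can also be wrapped in a small helper. This is only a sketch: the function name pids_from_ps is made up for illustration, and it assumes the PID is the second column, as it is in `ps aux` output.

```shell
#!/bin/sh
# pids_from_ps NAME: read `ps aux`-style lines on stdin and print the PID
# (second column) of every line that mentions NAME, skipping lines that
# belong to a grep process. The function name is hypothetical.
pids_from_ps() {
    awk -v name="$1" 'index($0, name) && !index($0, "grep") { print $2 }'
}

# Intended use on a live system (not executed here):
#   kill $(ps aux | pids_from_ps nrpe)
```

Keeping the filter in one awk call avoids the chain of grep processes and makes the matching rule easy to adjust.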
Adding to system services
Add to system services and enable at boot:
chkconfig
chkconfig
chown nagios.nagios /usr/local/nagios/var/rw
/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg
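Add an alias to make testing the configuration file easier
vi ~/.bashrc
In it, use alias to define a shorthand for the command; here I use check
alias check='/usr/local/nagios/bin/nagios …'
source ~/.bashrc
The check command can now be used to verify the configuration file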
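As an alternative to an alias (use one or the other, not both), the same shortcut can be written as a shell function in ~/.bashrc. The sketch below assumes the verification command shown earlier in this article (`nagios -v` followed by the main configuration file); NAGIOS_BIN and NAGIOS_CFG are hypothetical override variables introduced only for this example.

```shell
#!/bin/sh
# check: run the Nagios configuration verification command.
# NAGIOS_BIN and NAGIOS_CFG are hypothetical override variables added for
# this sketch; the defaults are the paths used throughout this article.
check() {
    "${NAGIOS_BIN:-/usr/local/nagios/bin/nagios}" -v \
        "${NAGIOS_CFG:-/usr/local/nagios/etc/nagios.cfg}"
}
```

A function makes it easy to point the check at a different configuration file, for example `NAGIOS_CFG=/tmp/nagios.cfg check`.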
Change the contact email to the address that should receive alerts
vi /usr/local/nagios/etc/objects/contacts.cfg
define contact {
    contact_name
    use
    alias           Nagios
    email
}
define contactgroup {
    contactgroup_name
    alias           Nagios
    members
}
Define the check_nrpe command
vi /usr/local/nagios/etc/objects/commands.cfg
define command {
    command_name
    command_line    /usr/local/nagios/libexec/check_nrpe -H
}
Check the configuration file for errors
check
nginx
Install the FCGI module
cd
tar zxvf tar .gz
cd FCGI-0.70
perl
make
make install
cd
Install the IO and IO-All modules
tar zxvf tar .gz
cd IO-1.25
perl
make
make install
cd
tar zxvf tar .gz
cd IO-All-0.41
perl
make
make install
cd
unzip
cp perl-fcgi.pl /usr/local/nginx/
chmod 755 /usr/local/nginx/perl-fcgi.pl
vi /usr/local/nginx/start_perl_cgi.sh
#!/bin/bash
#set
dir=/usr/local/nginx/
stop ()
{
#pkill
kill $(cat $dir/logs/perl-fcgi.pid)
# remove the stale pid and socket files; ignore errors if they do not exist
rm $dir/logs/perl-fcgi.pid 2>/dev/null
rm $dir/logs/perl-fcgi.sock 2>/dev/null
echo "stop"
}
start ()
{
rm $dir/now_start_perl_fcgi.sh 2>/dev/null
chown nobody.root $dir/logs
# write a one-line launcher script, then run it as the unprivileged user
echo "$dir/perl-fcgi.pl" >>$dir/now_start_perl_fcgi.sh
chown nobody.nobody $dir/now_start_perl_fcgi.sh
chmod u+x $dir/now_start_perl_fcgi.sh
sudo -u nobody $dir/now_start_perl_fcgi.sh
echo "start"
}
case $1 in
stop)
stop
;;
start)
start
;;
restart)
stop
start
;;
esac
Replace every occurrence of nobody in start_perl_cgi.sh with nagios, the user on the nginx directory
sed -i 's@nobody@nagios@g' /usr/local/nginx/start_perl_cgi.sh
chmod 755 /usr/local/nginx/start_perl_cgi.sh
/usr/local/nginx/start_perl_cgi.sh
vi /usr/local/nagios/etc/cgi.cfg
Find use_authentication=1 and change the value to 0
Change the contact email to the address that should receive alerts
vi /usr/local/nagios/etc/objects/contacts.cfg
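The use_authentication change in cgi.cfg can also be made without opening an editor. The following is a minimal sketch, assuming GNU sed (the same `sed -i` already used in this article); the function name disable_cgi_auth is hypothetical.

```shell
#!/bin/sh
# disable_cgi_auth FILE: switch use_authentication from 1 to 0 in a
# cgi.cfg-style file, editing it in place. The function name is hypothetical.
disable_cgi_auth() {
    sed -i 's/^use_authentication=1[[:space:]]*$/use_authentication=0/' "$1"
}

# Intended use (not executed here):
#   disable_cgi_auth /usr/local/nagios/etc/cgi.cfg
```

The pattern is anchored to the whole line, so commented-out copies of the directive are left alone.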
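Up to this point everything should be working.
Next comes the nginx configuration.
I changed the listen port to 80.
Then start the service, after which the web interface is reachable. The client installation follows, and screenshots of the result come at the end.
service nagios start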
Installing the monitored Nagios host (client)
yum install openssl-devel
1.
groupadd
useradd nagios -s /sbin/nologin -g
tar xvf tar .gz
cd nagios-plugins-1.4.16
./configure --prefix=/usr/local/nagios --with-nagios-user=nagios --with-mysql=/usr/local/mysql && make && make install
cd
2.
tar zxvf tar .gz
cd nrpe-2.13
./configure
make all
make install-plugin
make install-daemon
make install-daemon-config
Start NRPE
/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg
echo '/usr/local/nagios/bin/nrpe …' >> /etc/rc.local
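Monitoring the server itself: a host monitoring itself does not need NRPE configured. NRPE on the server side is only used to fetch the data sent back by NRPE on the clients. Because the Chinese edition of Nagios already ships with some default configuration, we will adjust it shortly and use it directly.
Monitoring the client: the monitored services are MySQL, nginx, memory, IP connection count, zombie processes, disk space, disk I/O, logged-in users, total processes, CPU load, PING, and SSH.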
unzip
\cp libexec/* /usr/local/nagios/libexec
chmod -R /usr/local/nagios/libexec
Install the plugins
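Create an empty database named nagios, grant the nagios user access to the nagios database from any host, flush the privileges, and check that the nagios user was created
create database nagios;
grant select on nagios.* to nagios@'%' identified by '123456';
flush privileges;
select User,Password,Host from mysql.user;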
Add the MySQL libraries to the system library search path
vim /etc/ld.so.conf
/usr/local/mysql/lib
ldconfig
To monitor disk I/O, the sysstat package must also be installed
yum install sysstat
Configure NRPE on the client
vim /usr/local/nagios/etc/nrpe.cfg
command[check_users]=/usr/local/nagios/libexec/check_users -w
command[check_load]=/usr/local/nagios/libexec/check_cpu.sh
command[check_sda1]=/usr/local/nagios/libexec/check_disk -w /dev/sda1
command[check_sda2]=/usr/local/nagios/libexec/check_disk -w /dev/sda2
command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w
command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w
command[check_swap]=/usr/local/nagios/libexec/check_swap -w
command[check_iostat]=/usr/local/nagios/libexec/check_iostat.sh
command[check_mysql]=/usr/local/nagios/libexec/check_mysql -H
command[check_nginx]=/usr/local/nagios/libexec/check_nginx.sh /status -w
command[check_mem]=/usr/local/nagios/libexec/check_memory.pl
command[check_ip_conn]=/usr/local/nagios/libexec/ip_conn.sh
command[check_ssh]=/usr/local/nagios/libexec/check_tcp -p
After configuring, restart NRPE:
kill `ps aux | grep nrpe | grep -v grep | awk '{print $2}'`
/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg
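After editing nrpe.cfg it can help to list the command names it defines, since these are the names the server asks for through check_nrpe. The helper below is a sketch only; the function name list_nrpe_commands is hypothetical and it simply pattern-matches `command[...]=` lines.

```shell
#!/bin/sh
# list_nrpe_commands FILE: print the name inside each command[...]= line
# of an nrpe.cfg-style file, one per line. The function name is hypothetical.
list_nrpe_commands() {
    sed -n 's/^command\[\([^]]*\)]=.*/\1/p' "$1"
}

# Intended use (not executed here):
#   list_nrpe_commands /usr/local/nagios/etc/nrpe.cfg
```

Comparing this list against the check_command entries on the server is a quick way to spot a misspelled command name.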
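Server-side configuration:
Configuration for monitoring the server itself:
vim /usr/local/nagios/etc/objects/localhost.cfg
Edit the configuration in it; the finished configuration looks like this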
define host {
    use
    host_name
    alias                   localhost
    address
    icon_image
    statusmap_image
    2d_coords
    3d_coords
}
define hostgroup {
    hostgroup_name
    alias                   Linux
    members
}
define servicegroup {
    servicegroup_name
    alias                   联通性检查
    members
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
    notifications_enabled
}
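Configuration for the server to monitor the client:
After saving and exiting, make a copy of this file to serve as the monitoring template for nagios-client
cp /usr/local/nagios/etc/objects/localhost.cfg /usr/local/nagios/etc/objects/nagios-client.cfg
vim /usr/local/nagios/etc/objects/nagios-client.cfg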
define host {
    use
    host_name
    alias                   nagios-client
    address
    icon_image
    statusmap_image
    2d_coords
    3d_coords
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
    notifications_enabled
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
define service {
    use                     local-service
    host_name
    service_description
    check_command
}
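Simply change /bin/mail to /usr/bin/mutt in the two existing email notification commands, as shown in the figure below.
Speeding up Nagios alerting:
1. Edit the template file:
vim /usr/local/nagios/etc/objects/templates.cfg
Set every normal_check_interval to 1, so that an alert is raised 1 minute after a fault is detected
Set every check_interval to 1, so that checks run once a minute under normal conditions
Adjust every notification_interval # the time after which Nagios notifies users again when a host problem remains unresolved
service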
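The two interval changes for templates.cfg can be applied in one pass with sed. This is a hedged sketch rather than the author's method: the function name set_check_intervals is hypothetical, it assumes GNU sed for `-i`, and it leaves notification_interval alone because the article does not state a value for it.

```shell
#!/bin/sh
# set_check_intervals FILE: set every normal_check_interval and
# check_interval directive in a templates.cfg-style file to 1, in place.
# Other directives (for example retry_check_interval) are left untouched.
# The function name is hypothetical.
set_check_intervals() {
    # In the replacement, \1 is the captured "directive + whitespace"
    # prefix and the digit after it is the new literal value 1.
    sed -i \
        -e 's/^\([[:space:]]*normal_check_interval[[:space:]][[:space:]]*\)[0-9][0-9]*/\11/' \
        -e 's/^\([[:space:]]*check_interval[[:space:]][[:space:]]*\)[0-9][0-9]*/\11/' \
        "$1"
}

# Intended use (not executed here):
#   set_check_intervals /usr/local/nagios/etc/objects/templates.cfg
```

Running the configuration check afterwards and restarting Nagios makes the new intervals take effect.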
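Testing the alerts:
Experiment complete!
Download link for the required software packages:
If any software is missing, just ask me for it!
http://down.51cto.com/data/1007210
This article comes from the blog “浩子的▁运维笔录ヽ”; please keep this source attribution: http://chenhao6.blog.51cto.com/6228054/1323192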