大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)

参考官方文档:https://github.com/dianping/cat/wiki/readme_server

1.数据库相关

(1)创建数据库cat,并执行以下sql创建相关表:
CREATE TABLE dailyreport (
id int(11) NOT NULL AUTO_INCREMENT,
name varchar(20) NOT NULL COMMENT ‘报表名称, transaction, problem…’,
ip varchar(50) NOT NULL COMMENT ‘报表来自于哪台cat-consumer机器’,
domain varchar(50) NOT NULL COMMENT ‘报表处理的Domain信息’,
period datetime NOT NULL COMMENT ‘报表时间段’,
type tinyint(4) NOT NULL COMMENT ‘报表数据格式, 1/xml, 2/json, 默认1’,
creation_date datetime NOT NULL COMMENT ‘报表创建时间’,
PRIMARY KEY (id),
UNIQUE KEY period (period,domain,name)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘天报表’;

CREATE TABLE weeklyreport (
id int(11) NOT NULL AUTO_INCREMENT,
name varchar(20) NOT NULL COMMENT ‘报表名称, transaction, problem…’,
ip varchar(50) NOT NULL COMMENT ‘报表来自于哪台cat-consumer机器’,
domain varchar(50) NOT NULL COMMENT ‘报表处理的Domain信息’,
period datetime NOT NULL COMMENT ‘报表时间段’,
type tinyint(4) NOT NULL COMMENT ‘报表数据格式, 1/xml, 2/json, 默认1’,
creation_date datetime NOT NULL COMMENT ‘报表创建时间’,
PRIMARY KEY (id),
UNIQUE KEY period (period,domain,name)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘周报表’;

CREATE TABLE monthreport (
id int(11) NOT NULL AUTO_INCREMENT,
name varchar(20) NOT NULL COMMENT ‘报表名称, transaction, problem…’,
ip varchar(50) NOT NULL COMMENT ‘报表来自于哪台cat-consumer机器’,
domain varchar(50) NOT NULL COMMENT ‘报表处理的Domain信息’,
period datetime NOT NULL COMMENT ‘报表时间段’,
type tinyint(4) NOT NULL COMMENT ‘报表数据格式, 1/xml, 2/json, 默认1’,
creation_date datetime NOT NULL COMMENT ‘报表创建时间’,
PRIMARY KEY (id),
UNIQUE KEY period (period,domain,name)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘月报表’;

CREATE TABLE hostinfo (
id int(11) NOT NULL AUTO_INCREMENT,
ip varchar(50) NOT NULL COMMENT ‘部署机器IP’,
domain varchar(200) NOT NULL COMMENT ‘部署机器对应的项目名’,
hostname varchar(200) DEFAULT NULL COMMENT ‘机器域名’,
creation_date datetime NOT NULL,
last_modified_date datetime NOT NULL,
PRIMARY KEY (id),
UNIQUE KEY ip_index (ip)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘IP和项目名的对应关系’;

CREATE TABLE hourlyreport (
id int(11) NOT NULL AUTO_INCREMENT,
type tinyint(4) NOT NULL COMMENT ‘报表类型, 1/xml, 9/binary 默认1’,
name varchar(20) NOT NULL COMMENT ‘报表名称’,
ip varchar(50) DEFAULT NULL COMMENT ‘报表来自于哪台机器’,
domain varchar(50) NOT NULL COMMENT ‘报表项目’,
period datetime NOT NULL COMMENT ‘报表时间段’,
creation_date datetime NOT NULL COMMENT ‘报表创建时间’,
PRIMARY KEY (id),
KEY IX_Domain_Name_Period (domain,name,period),
KEY IX_Name_Period (name,period),
KEY IX_Period (period)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 ROW_FORMAT=COMPRESSED COMMENT=‘用于存放实时报表信息,处理之后的结果’;

CREATE TABLE hourly_report_content (
report_id int(11) NOT NULL COMMENT ‘报表ID’,
content longblob NOT NULL COMMENT ‘二进制报表内容’,
period datetime NOT NULL COMMENT ‘报表时间段’,
creation_date datetime NOT NULL COMMENT ‘创建时间’,
PRIMARY KEY (report_id),
KEY IX_Period (period)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 ROW_FORMAT=COMPRESSED COMMENT=‘小时报表二进制内容’;

CREATE TABLE daily_report_content (
report_id int(11) NOT NULL COMMENT ‘报表ID’,
content longblob NOT NULL COMMENT ‘二进制报表内容’,
period datetime COMMENT ‘报表时间段’,
creation_date datetime NOT NULL COMMENT ‘创建时间’,
PRIMARY KEY (report_id),
KEY IX_Period (period)
)ENGINE=InnoDB DEFAULT CHARSET=utf8 ROW_FORMAT=COMPRESSED COMMENT=‘天报表二进制内容’;

CREATE TABLE weekly_report_content (
report_id int(11) NOT NULL COMMENT ‘报表ID’,
content longblob NOT NULL COMMENT ‘二进制报表内容’,
period datetime COMMENT ‘报表时间段’,
creation_date datetime NOT NULL COMMENT ‘创建时间’,
PRIMARY KEY (report_id),
KEY IX_Period (period)
)ENGINE=InnoDB DEFAULT CHARSET=utf8 ROW_FORMAT=COMPRESSED COMMENT=‘周报表二进制内容’;

CREATE TABLE monthly_report_content (
report_id int(11) NOT NULL COMMENT ‘报表ID’,
content longblob NOT NULL COMMENT ‘二进制报表内容’,
period datetime COMMENT ‘报表时间段’,
creation_date datetime NOT NULL COMMENT ‘创建时间’,
PRIMARY KEY (report_id),
KEY IX_Period (period)
)ENGINE=InnoDB DEFAULT CHARSET=utf8 ROW_FORMAT=COMPRESSED COMMENT=‘月报表二进制内容’;

CREATE TABLE businessReport (
id int(11) NOT NULL AUTO_INCREMENT,
type tinyint(4) NOT NULL COMMENT ‘报表类型 报表数据格式, 1/Binary, 2/xml , 3/json’,
name varchar(20) NOT NULL COMMENT ‘报表名称’,
ip varchar(50) NOT NULL COMMENT ‘报表来自于哪台机器’,
productLine varchar(50) NOT NULL COMMENT ‘指标来源于哪个产品组’,
period datetime NOT NULL COMMENT ‘报表时间段’,
content longblob COMMENT ‘用于存放报表的具体内容’,
creation_date datetime NOT NULL COMMENT ‘报表创建时间’,
PRIMARY KEY (id),
KEY IX_Period_productLine_name (period,productLine,name)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 ROW_FORMAT=COMPRESSED COMMENT=‘用于存放业务监控实时报表信息,处理之后的结果’;

CREATE TABLE task (
id int(11) NOT NULL AUTO_INCREMENT,
producer varchar(20) NOT NULL COMMENT ‘任务创建者ip’,
consumer varchar(20) NULL COMMENT ‘任务执行者ip’,
failure_count tinyint(4) NOT NULL COMMENT ‘任务失败次数’,
report_name varchar(20) NOT NULL COMMENT ‘报表名称, transaction, problem…’,
report_domain varchar(50) NOT NULL COMMENT ‘报表处理的Domain信息’,
report_period datetime NOT NULL COMMENT ‘报表时间’,
status tinyint(4) NOT NULL COMMENT ‘执行状态: 1/todo, 2/doing, 3/done 4/failed’,
task_type tinyint(4) NOT NULL DEFAULT ‘1’ COMMENT ‘0表示小时任务,1表示天任务’,
creation_date datetime NOT NULL COMMENT ‘任务创建时间’,
start_date datetime NULL COMMENT ‘开始时间, 这次执行开始时间’,
end_date datetime NULL COMMENT ‘结束时间, 这次执行结束时间’,
PRIMARY KEY (id),
UNIQUE KEY task_period_domain_name_type (report_period,report_domain,report_name,task_type)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘后台任务’;

CREATE TABLE project (
id int(11) NOT NULL AUTO_INCREMENT,
domain varchar(200) NOT NULL COMMENT ‘项目名称’,
cmdb_domain varchar(200) DEFAULT NULL COMMENT ‘cmdb项目名称’,
level int(5) DEFAULT NULL COMMENT ‘项目级别’,
bu varchar(50) DEFAULT NULL COMMENT ‘CMDB事业部’,
cmdb_productline varchar(50) DEFAULT NULL COMMENT ‘CMDB产品线’,
owner varchar(50) DEFAULT NULL COMMENT ‘项目负责人’,
email longtext DEFAULT NULL COMMENT ‘项目组邮件’,
phone longtext DEFAULT NULL COMMENT ‘联系电话’,
creation_date datetime DEFAULT NULL COMMENT ‘创建时间’,
modify_date datetime DEFAULT NULL COMMENT ‘修改时间’,
PRIMARY KEY (id),
UNIQUE KEY domain (domain)
)ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘项目基本信息’;

CREATE TABLE topologyGraph (
id int(11) NOT NULL AUTO_INCREMENT,
ip varchar(50) NOT NULL COMMENT ‘报表来自于哪台cat-client机器ip’,
period datetime NOT NULL COMMENT ‘报表时间段,精确到分钟’,
type tinyint(4) NOT NULL COMMENT ‘报表数据格式, 1/xml, 2/json, 3/binary’,
content longblob COMMENT ‘用于存放报表的具体内容’,
creation_date datetime NOT NULL COMMENT ‘报表创建时间’,
PRIMARY KEY (id),
KEY period (period)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘用于存储历史的拓扑图曲线’;

CREATE TABLE config (
id int(11) NOT NULL AUTO_INCREMENT,
name varchar(50) NOT NULL COMMENT ‘配置名称’,
content longtext COMMENT ‘配置的具体内容’,
creation_date datetime NOT NULL COMMENT ‘配置创建时间’,
modify_date datetime NOT NULL COMMENT ‘配置修改时间’,
PRIMARY KEY (id),
UNIQUE KEY name (name)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘用于存储系统的全局配置信息’;

CREATE TABLE baseline (
id int(11) NOT NULL AUTO_INCREMENT,
report_name varchar(100) DEFAULT NULL,
index_key varchar(100) DEFAULT NULL,
report_period datetime DEFAULT NULL,
data blob,
creation_date datetime DEFAULT NULL,
PRIMARY KEY (id),
KEY period_name_key (report_period,report_name,index_key)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;

CREATE TABLE alteration (
id int(11) NOT NULL AUTO_INCREMENT COMMENT ‘自增长ID’,
type varchar(64) NOT NULL COMMENT ‘分类’,
title varchar(128) NOT NULL COMMENT ‘变更标题’,
domain varchar(128) NOT NULL COMMENT ‘变更项目’,
hostname varchar(128) NOT NULL COMMENT ‘变更机器名’,
ip varchar(128) DEFAULT NULL COMMENT ‘变更机器IP’,
date datetime NOT NULL COMMENT ‘变更时间’,
user varchar(45) NOT NULL COMMENT ‘变更用户’,
alt_group varchar(45) DEFAULT NULL COMMENT ‘变更组别’,
content longtext NOT NULL COMMENT ‘变更内容’,
url varchar(200) DEFAULT NULL COMMENT ‘变更链接’,
status tinyint(4) DEFAULT ‘0’ COMMENT ‘变更状态’,
creation_date datetime NOT NULL COMMENT ‘数据库创建时间’,
PRIMARY KEY (id),
KEY ind_date_domain_host (date,domain,hostname)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘变更表’;

CREATE TABLE alert (
id int(11) NOT NULL AUTO_INCREMENT COMMENT ‘自增长ID’,
domain varchar(128) NOT NULL COMMENT ‘告警项目’,
alert_time datetime NOT NULL COMMENT ‘告警时间’,
category varchar(64) NOT NULL COMMENT ‘告警分类:network/business/system/exception -alert’,
type varchar(64) NOT NULL COMMENT ‘告警类型:error/warning’,
content longtext NOT NULL COMMENT ‘告警内容’,
metric varchar(128) NOT NULL COMMENT ‘告警指标’,
creation_date datetime NOT NULL COMMENT ‘数据插入时间’,
PRIMARY KEY (id)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘存储告警信息’;

CREATE TABLE alert_summary (
id int(11) NOT NULL AUTO_INCREMENT COMMENT ‘自增长ID’,
domain varchar(128) NOT NULL COMMENT ‘告警项目’,
alert_time datetime NOT NULL COMMENT ‘告警时间’,
content longtext NOT NULL COMMENT ‘统一告警内容’,
creation_date datetime NOT NULL COMMENT ‘数据插入时间’,
PRIMARY KEY (id)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘统一告警信息’;

CREATE TABLE operation (
id int(11) NOT NULL AUTO_INCREMENT COMMENT ‘自增长ID’,
user varchar(128) NOT NULL COMMENT ‘用户名’,
module varchar(128) NOT NULL COMMENT ‘模块’,
operation varchar(128) NOT NULL COMMENT ‘操作’,
time datetime NOT NULL COMMENT ‘修改时间’,
content longtext NOT NULL COMMENT ‘修改内容’,
creation_date datetime NOT NULL COMMENT ‘数据插入时间’,
PRIMARY KEY (id)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘用户操作日志’;

CREATE TABLE overload (
id int(11) NOT NULL AUTO_INCREMENT COMMENT ‘自增长ID’,
report_id int(11) NOT NULL COMMENT ‘报告id’,
report_type tinyint(4) NOT NULL COMMENT ‘报告类型 1:hourly 2:daily 3:weekly 4:monthly’,
report_size double NOT NULL COMMENT ‘报告大小 单位MB’,
period datetime NOT NULL COMMENT ‘报表时间’,
creation_date datetime NOT NULL COMMENT ‘创建时间’,
PRIMARY KEY (id),
KEY period (period)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘过大容量表’;

CREATE TABLE config_modification (
id int(11) NOT NULL AUTO_INCREMENT COMMENT ‘自增长ID’,
user_name varchar(64) NOT NULL COMMENT ‘用户名’,
account_name varchar(64) NOT NULL COMMENT ‘账户名’,
action_name varchar(64) NOT NULL COMMENT ‘action名’,
argument longtext COMMENT ‘参数内容’,
date datetime NOT NULL COMMENT ‘修改时间’,
creation_date datetime NOT NULL COMMENT ‘创建时间’,
PRIMARY KEY (id)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘配置修改记录表’;

CREATE TABLE user_define_rule (
id int(11) NOT NULL AUTO_INCREMENT COMMENT ‘自增长ID’,
content text NOT NULL COMMENT ‘用户定义规则’,
creation_date datetime NOT NULL COMMENT ‘创建时间’,
PRIMARY KEY (id)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘用户定义规则表’;

CREATE TABLE business_config (
id int(11) NOT NULL AUTO_INCREMENT,
name varchar(20) NOT NULL DEFAULT ‘’ COMMENT ‘配置名称’,
domain varchar(50) NOT NULL DEFAULT ‘’ COMMENT ‘项目’,
content longtext COMMENT ‘配置内容’,
updatetime datetime NOT NULL,
PRIMARY KEY (id),
KEY updatetime (updatetime),
KEY name_domain (name,domain)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;

CREATE TABLE metric_screen (
id int(11) NOT NULL AUTO_INCREMENT,
name varchar(50) NOT NULL COMMENT ‘配置名称’,
graph_name varchar(50) NOT NULL DEFAULT ‘’ COMMENT ‘Graph名称’,
view varchar(50) NOT NULL DEFAULT ‘’ COMMENT ‘视角’,
endPoints longtext NOT NULL,
measurements longtext NOT NULL COMMENT ‘配置的指标’,
content longtext NOT NULL COMMENT ‘配置的具体内容’,
creation_date datetime NOT NULL COMMENT ‘配置创建时间’,
updatetime datetime NOT NULL COMMENT ‘配置修改时间’,
PRIMARY KEY (id),
UNIQUE KEY name_graph (name,graph_name)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘系统监控的screen配置’;

CREATE TABLE metric_graph (
id int(11) NOT NULL AUTO_INCREMENT,
graph_id int(11) NOT NULL COMMENT ‘大盘ID’,
name varchar(50) NOT NULL COMMENT ‘配置ID’,
content longtext COMMENT ‘配置的具体内容’,
creation_date datetime NOT NULL COMMENT ‘配置创建时间’,
updatetime datetime NOT NULL COMMENT ‘配置修改时间’,
PRIMARY KEY (id),
UNIQUE name (name)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘系统监控的graph配置’;

CREATE TABLE server_alarm_rule (
id int(11) NOT NULL AUTO_INCREMENT,
category varchar(50) NOT NULL COMMENT ‘监控分类’,
endPoint varchar(200) NOT NULL COMMENT ‘监控对象ID’,
measurement varchar(200) NOT NULL COMMENT ‘监控指标’,
tags varchar(200) NOT NULL DEFAULT ‘’ COMMENT ‘监控指标标签’,
content longtext NOT NULL COMMENT ‘配置的具体内容’,
type varchar(20) NOT NULL DEFAULT ‘’ COMMENT ‘数据聚合方式’,
creator varchar(100) DEFAULT ‘’ COMMENT ‘创建人’,
creation_date datetime NOT NULL COMMENT ‘配置创建时间’,
updatetime datetime NOT NULL COMMENT ‘配置修改时间’,
PRIMARY KEY (id),
KEY updatetime (updatetime)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT=‘系统告警的配置’;
(2)一套独立的CAT集群只需要一个数据库;

(3)数据库编码使用utf8mb4,否则可能造成中文乱码等问题;

2.服务端部署

(1)下载Tomcat服务器,如:https://tomcat.apache.org/download-80.cgi

(2)下载cat.war包,由于依赖问题,个人建议直接下载官网打好包的cat-home-3.0.0.war,然后重命名为cat.war,下载地址:http://unidal.org/nexus/service/local/repositories/releases/content/com/dianping/cat/cat-home/3.0.0/cat-home-3.0.0.war

(3)将cat.war放到解压后的Tomcat的webapps目录下

3.服务端配置

(1)创建相关目录并赋权限,并在/data/appdatas/cat目录下新建client.xml,server.xml,datasources.xml配置文件

mkdir -p /data/{appdatas,applogs}/cat

chmod -R 777 /data

(2)datasources.xml

<?xml version="1.0" encoding="utf-8"?> 3 1s 10m 1000 com.mysql.jdbc.Driver <![CDATA[jdbc:mysql://192.168.0.1:3306/cat]]> test test <![CDATA[useUnicode=true&characterEncoding=UTF-8&autoReconnect=true&socketTimeout=120000]]> (3)server.xml <?xml version="1.0" encoding="utf-8"?> 10.223.123.221:8080 (4)client.xml <?xml version="1.0" encoding="utf-8"?>

4.重算客户端路由

(1)启动Tomcat,访问http://10.223.123.221:8080/cat/s/config?op=routerConfigUpdate
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)

<?xml version="1.0" encoding="utf-8"?> 点击提交按钮,然后点击重算路由按钮,就绪后重启Tomcat即可(测试不重启Tomcat也能生效)

访问http://10.223.123.221:8080/cat/r,如果显示下图则表示服务端部署成功
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)
如果显示“有问题的CAT服务器[ip]”,请核查一下配置文件以及客户端路由中,都修改为内网ip。

5.客户端springboot集成

(1)在项目的同一个盘符下创建client.xml配置文件和日志目录,文件结构与服务端一致即可,ip改为cat服务端对应的IP,内容如下:

<?xml version="1.0" encoding="utf-8"?> (2)下载cat-client依赖包,安装到本地maven仓库:

先打开该网址:https://mvnrepository.com/artifact/com.dianping.cat/cat-client/3.0.0===》http://unidal.org/nexus/content/repositories/releases/然后即可下载需要的jar包
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)

本人在测试过程中,发现maven依赖的方式下载不下来,固直接下载jar包下来安装到本地仓库:http://unidal.org/nexus/content/repositories/releases/com/dianping/cat/cat-client/3.0.0/cat-client-3.0.0.jar

需要下载如下图cat-client-3.0.0.jar,cat-core-3.0.0.jar按理说可以不要,我做测试的时候安装了,应该没有影响:
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)
执行命令安装:mvn install:install-file -Dfile=cat-client-3.0.0.jar -DgroupId=com.dianping.cat -DartifactId=cat-client -Dversion=3.0.0 -Dpackaging=jar
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)

这样就能在springboot项目中进行依赖了。

(3)项目配置和埋点

大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)
public List fetchUrls(int limitCount, Set taskIds) {
Transaction t = Cat.newTransaction(“DistributedSpiderApiImpl”, “fetchUrls”);
Event event = Cat.newEvent(“fetchUrlsEvent”, “fetchUrlsQPS”);
List urlModelList = null;
try {
urlModelList = businessCode(limitCount, taskIds);
event.setSuccessStatus();
t.setStatus(Transaction.SUCCESS);
} catch (Exception e) {
event.setStatus(“False”);
t.setStatus(e);
Cat.logError(e);
} finally {
event.complete();
t.complete();
}
return urlModelList;
}
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)

看看效果:
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)
6.集群搭建

(1)配置文件都改为集群的配置,共用一个mysql,datasources.xml文件每个集群机器都要放一份;

(2)server.xml配置

<?xml version="1.0" encoding="utf-8"?> 10.126.0.119:8080,10.126.0.120:8080,10.126.0.130:8080

(3)client.xml

<?xml version="1.0" encoding="utf-8"?>








(4)客户端路由配置

<?xml version="1.0" encoding="utf-8"?>

(5)服务端配置,主要是配置remote-servers : 定义HTTP服务列表,(远程监听端同步更新服务端信息即取此值)

http://10.126.0.130:8080/cat/s/config?op=serverConfigUpdate
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)

<?xml version="1.0" encoding="utf-8"?>

集群效果图如下:
大众点评cat监控平台搭建(实测可用,很多自己打包做docker镜像的都500错误)

上一篇:记一次 发布net5.0 报500错误经历


下一篇:真服了!大牛自我总结500页“Java成长笔记”