.NetCore使用skywalking实现实时性能监控

一、简介

很久之前写了一篇 《.Net Core 2.0+ InfluxDB+Grafana+App Metrics 实现跨平台的实时性能监控》关于NetCore性能监控的文章,使用Influxdb+AppMetrics进行项目性能监控,由于技术有限,在正式环境使用一段时间后,莫名的AppMetrics就没办法往influxdb中插入数据了,后来我也在App Metrics作者的github上留言了,并且作者也根据我阐述的情况做了测试,没有复现我的问题,最后这个问题就不了了知了,然后项目性能监控这个事搁置了一段时间,直到2018年参加上海.net线下技术沙龙,在会场首次听到skywalking,那时候skywalking正在做NetCore的支持,会后回到公司便开始关注skywalking,知道skywalking支持NetCore后,第一时间在公司的项目中运用了skywalking。

二、安装环境

要想使用skywalking,首先得安装相关环境。本文以windows为例。

1、安装java sdk(如果不会配置java环境的话,请参考百度百科:https://jingyan.baidu.com/article/08b6a591bdb18314a80922a0.html

2、java环境安装完成后,下载Elasticsearch进行安装 https://www.elastic.co/downloads/elasticsearch (本文使用skywalking 6.x版本,6.x版本对应使用ES 6.x版本,请自行下载对应版本)

3、下载完Elasticsearch 后将Elasticsearch解压到安装位置,以我电脑为例,我安装在D:\Program Files

4、修改ES配置,进入ES文件下的:\config,找到elasticsearch.yml,打开后修改如下配置:

.NetCore使用skywalking实现实时性能监控
 1 # ======================== Elasticsearch Configuration =========================
 2 #
 3 # NOTE: Elasticsearch comes with reasonable defaults for most settings.
 4 #       Before you set out to tweak and tune the configuration, make sure you
 5 #       understand what are you trying to accomplish and the consequences.
 6 #
 7 # The primary way of configuring a node is via this file. This template lists
 8 # the most important settings you may want to configure for a production cluster.
 9 #
10 # Please consult the documentation for further information on configuration options:
11 # https://www.elastic.co/guide/en/elasticsearch/reference/index.html
12 #
13 # ---------------------------------- Cluster -----------------------------------
14 #
15 # Use a descriptive name for your cluster:
16 #
17 cluster.name: myskywalking
18 #
19 # ------------------------------------ Node ------------------------------------
20 #
21 # Use a descriptive name for the node:
22 #
23 node.name: node-1
24 #
25 # Add custom attributes to the node:
26 #
27 #node.attr.rack: r1
28 #
29 # ----------------------------------- Paths ------------------------------------
30 #
31 # Path to directory where to store the data (separate multiple locations by comma):
32 #
33 path.data: D:/Program Files/elasticsearch-6.6.2/path/to/data
34 #
35 # Path to log files:
36 #
37 path.logs: D:/Program Files/elasticsearch-6.6.2/path/to/logs
38 #
39 # ----------------------------------- Memory -----------------------------------
40 #
41 # Lock the memory on startup:
42 #
43 bootstrap.memory_lock: false
44 #
45 # Make sure that the heap size is set to about half the memory available
46 # on the system and that the owner of the process is allowed to use this
47 # limit.
48 #
49 # Elasticsearch performs poorly when the system is swapping the memory.
50 #
51 # ---------------------------------- Network -----------------------------------
52 #
53 # Set the bind address to a specific IP (IPv4 or IPv6):
54 #
55 network.host: 0.0.0.0
56 http.port: 9200
57 http.cors.enabled: true 
58 http.cors.allow-origin: "*" 
59 http.cors.allow-methods: OPTIONS,HEAD,GET,POST,PUT,DELETE
60 http.cors.allow-headers: "X-Requested-With, Content-Type, Content-Length, X-Users"
61 
62 #
63 # For more information, consult the network module documentation.
64 #
65 # --------------------------------- Discovery ----------------------------------
66 #
67 # Pass an initial list of hosts to perform discovery when new node is started:
68 # The default list of hosts is ["127.0.0.1", "[::1]"]
69 #
70 #discovery.zen.ping.unicast.hosts: ["host1", "host2"]
71 #
72 # Prevent the "split brain" by configuring the majority of nodes (total number of master-eligible nodes / 2 + 1):
73 #
74 #discovery.zen.minimum_master_nodes: 
75 #
76 # For more information, consult the zen discovery module documentation.
77 #
78 # ---------------------------------- Gateway -----------------------------------
79 #
80 # Block initial recovery after a full cluster restart until N nodes are started:
81 #
82 #gateway.recover_after_nodes: 3
83 #
84 # For more information, consult the gateway module documentation.
85 #
86 # ---------------------------------- Various -----------------------------------
87 #
88 # Require explicit names when deleting indices:
89 #
90 #action.destructive_requires_name: true
View Code

修改好elasticsearch.yml文件后,打开cmd命令,进入到D:\Program Files\elasticsearch-6.6.2\bin,bin文件夹下,输入如下命令:  elasticsearch-service.bat install  将ES安装成windows,这样就可以方便系统重启后自动启动

然后将服务启动后即可

5、接下来下载skywalking,http://skywalking.apache.org/downloads/

选择版本为 :6.0.0-GA 的下载

三、配置和效果

1、在本地电脑中创建一个文件夹(注意:本人亲自躺过的坑,skywalking服务必须放在无空格的文件夹,比如:Program Files这个文件是绝对不能放的,不然服务运行的时候只会一闪而过,连log日志都不会生成,切记!切记!切记!)

我在D盘下创建了一个叫skyworkingService文件,路径如下:D:\skyworkingService

将下好的skywalking解压到该目录下,命名为skywalking-apm-GA,路径如下:D:\skyworkingService\skywalking-apm-GA

接着,打开config文件,找到application.yml文件,修改其配置如下:

.NetCore使用skywalking实现实时性能监控
 1 # Licensed to the Apache Software Foundation (ASF) under one
 2 # or more contributor license agreements.  See the NOTICE file
 3 # distributed with this work for additional information
 4 # regarding copyright ownership.  The ASF licenses this file
 5 # to you under the Apache License, Version 2.0 (the
 6 # "License"); you may not use this file except in compliance
 7 # with the License.  You may obtain a copy of the License at
 8 #
 9 #     http://www.apache.org/licenses/LICENSE-2.0
10 #
11 # Unless required by applicable law or agreed to in writing, software
12 # distributed under the License is distributed on an "AS IS" BASIS,
13 # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
14 # See the License for the specific language governing permissions and
15 # limitations under the License.
16 
17 cluster:
18   standalone:
19   # Please check your ZooKeeper is 3.5+, However, it is also compatible with ZooKeeper 3.4.x. Replace the ZooKeeper 3.5+
20   # library the oap-libs folder with your ZooKeeper 3.4.x library.
21 #  zookeeper:
22 #    nameSpace: ${SW_NAMESPACE:""}
23 #    hostPort: ${SW_CLUSTER_ZK_HOST_PORT:localhost:2181}
24 #    #Retry Policy
25 #    baseSleepTimeMs: ${SW_CLUSTER_ZK_SLEEP_TIME:1000} # initial amount of time to wait between retries
26 #    maxRetries: ${SW_CLUSTER_ZK_MAX_RETRIES:3} # max number of times to retry
27 #  kubernetes:
28 #    watchTimeoutSeconds: ${SW_CLUSTER_K8S_WATCH_TIMEOUT:60}
29 #    namespace: ${SW_CLUSTER_K8S_NAMESPACE:default}
30 #    labelSelector: ${SW_CLUSTER_K8S_LABEL:app=collector,release=skywalking}
31 #    uidEnvName: ${SW_CLUSTER_K8S_UID:SKYWALKING_COLLECTOR_UID}
32 #  consul:
33 #    serviceName: ${SW_SERVICE_NAME:"SkyWalking_OAP_Cluster"}
34 #     Consul cluster nodes, example: 10.0.0.1:8500,10.0.0.2:8500,10.0.0.3:8500
35 #    hostPort: ${SW_CLUSTER_CONSUL_HOST_PORT:localhost:8500}
36 core:
37   default:
38     restHost: ${SW_CORE_REST_HOST:0.0.0.0}
39     restPort: ${SW_CORE_REST_PORT:12800}
40     restContextPath: ${SW_CORE_REST_CONTEXT_PATH:/}
41     gRPCHost: ${SW_CORE_GRPC_HOST:0.0.0.0}
42     gRPCPort: ${SW_CORE_GRPC_PORT:11800}
43     downsampling:
44     - Hour
45     - Day
46     - Month
47     # Set a timeout on metric data. After the timeout has expired, the metric data will automatically be deleted.
48     recordDataTTL: ${SW_CORE_RECORD_DATA_TTL:90} # Unit is minute
49     minuteMetricsDataTTL: ${SW_CORE_MINUTE_METRIC_DATA_TTL:90} # Unit is minute
50     hourMetricsDataTTL: ${SW_CORE_HOUR_METRIC_DATA_TTL:36} # Unit is hour
51     dayMetricsDataTTL: ${SW_CORE_DAY_METRIC_DATA_TTL:45} # Unit is day
52     monthMetricsDataTTL: ${SW_CORE_MONTH_METRIC_DATA_TTL:18} # Unit is month
53 storage:
54   # h2:
55     # driver: ${SW_STORAGE_H2_DRIVER:org.h2.jdbcx.JdbcDataSource}
56     # url: ${SW_STORAGE_H2_URL:jdbc:h2:mem:skywalking-oap-db}
57     # user: ${SW_STORAGE_H2_USER:sa}
58  elasticsearch:
59    nameSpace: ${SW_NAMESPACE:"myskywalking"}
60    clusterNodes: ${SW_STORAGE_ES_CLUSTER_NODES:localhost:9200}
61    indexShardsNumber: ${SW_STORAGE_ES_INDEX_SHARDS_NUMBER:2}
62    indexReplicasNumber: ${SW_STORAGE_ES_INDEX_REPLICAS_NUMBER:0}
63    # Batch process setting, refer to https://www.elastic.co/guide/en/elasticsearch/client/java-api/5.5/java-docs-bulk-processor.html
64    bulkActions: ${SW_STORAGE_ES_BULK_ACTIONS:2000} # Execute the bulk every 2000 requests
65    bulkSize: ${SW_STORAGE_ES_BULK_SIZE:20} # flush the bulk every 20mb
66    flushInterval: ${SW_STORAGE_ES_FLUSH_INTERVAL:10} # flush the bulk every 10 seconds whatever the number of requests
67    concurrentRequests: ${SW_STORAGE_ES_CONCURRENT_REQUESTS:2} # the number of concurrent requests
68 receiver-register:
69   default:
70 receiver-trace:
71   default:
72     bufferPath: ${SW_RECEIVER_BUFFER_PATH:../trace-buffer/}  # Path to trace buffer files, suggest to use absolute path
73     bufferOffsetMaxFileSize: ${SW_RECEIVER_BUFFER_OFFSET_MAX_FILE_SIZE:100} # Unit is MB
74     bufferDataMaxFileSize: ${SW_RECEIVER_BUFFER_DATA_MAX_FILE_SIZE:500} # Unit is MB
75     bufferFileCleanWhenRestart: ${SW_RECEIVER_BUFFER_FILE_CLEAN_WHEN_RESTART:false}
76     sampleRate: ${SW_TRACE_SAMPLE_RATE:10000} # The sample rate precision is 1/10000. 10000 means 100% sample in default.
77 receiver-jvm:
78   default:
79 #service-mesh:
80 #  default:
81 #    bufferPath: ${SW_SERVICE_MESH_BUFFER_PATH:../mesh-buffer/}  # Path to trace buffer files, suggest to use absolute path
82 #    bufferOffsetMaxFileSize: ${SW_SERVICE_MESH_OFFSET_MAX_FILE_SIZE:100} # Unit is MB
83 #    bufferDataMaxFileSize: ${SW_SERVICE_MESH_BUFFER_DATA_MAX_FILE_SIZE:500} # Unit is MB
84 #    bufferFileCleanWhenRestart: ${SW_SERVICE_MESH_BUFFER_FILE_CLEAN_WHEN_RESTART:false}
85 #istio-telemetry:
86 #  default:
87 #receiver_zipkin:
88 #  default:
89 #    host: ${SW_RECEIVER_ZIPKIN_HOST:0.0.0.0}
90 #    port: ${SW_RECEIVER_ZIPKIN_PORT:9411}
91 #    contextPath: ${SW_RECEIVER_ZIPKIN_CONTEXT_PATH:/}
92 query:
93   graphql:
94     path: ${SW_QUERY_GRAPHQL_PATH:/graphql}
95 alarm:
96   default:
97 telemetry:
98   none:
View Code

 修改完成后,进入到bin文件中,右键单击startup.bat,以管理员权限运行,即可看到如下弹框

.NetCore使用skywalking实现实时性能监控

弹出这两个框说明服务已经启动了

这个时候访问http://localhost:8080,即可看到如下界面:

.NetCore使用skywalking实现实时性能监控

默认账号admin,密码admin,登录后看看到想要的监控数据和各服务直接的拓扑图,因为我的服务跑了一段时间,所以下面的界面是有数据的:

.NetCore使用skywalking实现实时性能监控

.NetCore使用skywalking实现实时性能监控

2、由于启动skywalking后会弹出两个命令窗口,所以如果运维人员不小心关了窗口的话服务自然就停掉了,所以为了避免这种问题,我们还可以将bin文件夹下的oapService.bat和webappService.bat进行配置,如下:

.NetCore使用skywalking实现实时性能监控
 1 @REM
 2 @REM  Licensed to the Apache Software Foundation (ASF) under one or more
 3 @REM  contributor license agreements.  See the NOTICE file distributed with
 4 @REM  this work for additional information regarding copyright ownership.
 5 @REM  The ASF licenses this file to You under the Apache License, Version 2.0
 6 @REM  (the "License"); you may not use this file except in compliance with
 7 @REM  the License.  You may obtain a copy of the License at
 8 @REM
 9 @REM      http://www.apache.org/licenses/LICENSE-2.0
10 @REM
11 @REM  Unless required by applicable law or agreed to in writing, software
12 @REM  distributed under the License is distributed on an "AS IS" BASIS,
13 @REM  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
14 @REM  See the License for the specific language governing permissions and
15 @REM  limitations under the License.
16 
17 @echo off
18 
19 setlocal
20 set OAP_PROCESS_TITLE=Skywalking-Collector
21 set OAP_HOME=%~dp0%..
22 set OAP_OPTS="-Xms256M -Xmx512M -Doap.logDir=%OAP_HOME%\logs"
23 
24 set CLASSPATH=%OAP_HOME%\config;.;
25 set CLASSPATH=%OAP_HOME%\oap-libs\*;%CLASSPATH%
26 
27 if defined JAVA_HOME (
28  set _EXECJAVA="%JAVA_HOME%\bin\javaw"
29 )
30 
31 if not defined JAVA_HOME (
32  echo "JAVA_HOME not set."
33  set _EXECJAVA=javaw
34 )
35 
36 start "%OAP_PROCESS_TITLE%" %_EXECJAVA% "%OAP_OPTS%" -cp "%CLASSPATH%" org.apache.skywalking.oap.server.starter.OAPServerStartUp
37 endlocal
oapService.bat .NetCore使用skywalking实现实时性能监控
 1 @REM
 2 @REM  Licensed to the Apache Software Foundation (ASF) under one or more
 3 @REM  contributor license agreements.  See the NOTICE file distributed with
 4 @REM  this work for additional information regarding copyright ownership.
 5 @REM  The ASF licenses this file to You under the Apache License, Version 2.0
 6 @REM  (the "License"); you may not use this file except in compliance with
 7 @REM  the License.  You may obtain a copy of the License at
 8 @REM
 9 @REM      http://www.apache.org/licenses/LICENSE-2.0
10 @REM
11 @REM  Unless required by applicable law or agreed to in writing, software
12 @REM  distributed under the License is distributed on an "AS IS" BASIS,
13 @REM  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
14 @REM  See the License for the specific language governing permissions and
15 @REM  limitations under the License.
16 
17 @echo off
18 
19 setlocal
20 set WEBAPP_PROCESS_TITLE=Skywalking-Webapp
21 set WEBAPP_HOME=%~dp0%..
22 set JARPATH=%WEBAPP_HOME%\webapp
23 set WEBAPP_LOG_DIR=%WEBAPP_HOME%\logs
24 
25 if exist "%WEBAPP_LOG_DIR%" (
26     mkdir "%WEBAPP_LOG_DIR%"
27 )
28 
29 set LOG_FILE_LOCATION=%WEBAPP_LOG_DIR%\webapp.log
30 
31 if defined JAVA_HOME (
32  set _EXECJAVA="%JAVA_HOME%\bin\javaw"
33 )
34 
35 if not defined JAVA_HOME (
36  echo "JAVA_HOME not set."
37  set _EXECJAVA=javaw
38 )
39 
40 start "%WEBAPP_PROCESS_TITLE%" %_EXECJAVA%  -jar %JARPATH%/skywalking-webapp.jar --spring.config.location=%JARPATH%/webapp.yml --logging.file=%LOG_FILE_LOCATION%
41 endlocal
webappService.bat

其实只是将文件里的java改成了javaw,这样就可以在后台运行了,保存后再次运行startup.bat文件,这个时候界面上会有个cmd命令界面一闪而过,不要慌,我们打开资源管理器看看,会发现进程中多了两个名为“javaw.exe”的进程

这个时候访问:http://localhost:8080 一样可以看到上面的ui界面!

至此,skywalking的所有环境皆搭建完毕,接下来,在我们项目中添加skywalking的探针,方便skywalking收集我们项目中的数据

四、项目引用skywalking探针

新建一个NetCore的webapi,然后在引用中引用SkyWalking.AspNetCore,如图:

.NetCore使用skywalking实现实时性能监控

项目引用后,在Startup.cs中注入skywalking。

在头部引用:using SkyWalking.AspNetCore;

然后找到public void ConfigureServices(IServiceCollection services)下输入一下代码即可:

.NetCore使用skywalking实现实时性能监控
1 services.AddSkyWalking(option =>
2 {
3       option.ApplicationCode = "AreaServer";
4       option.DirectServers = 127.0.0.1:11800;
5  });
View Code

运行代码后,控制台内每隔几秒就会有以下信息输出

.NetCore使用skywalking实现实时性能监控

证明skywalking探针已经成功,接下来请求一下你的接口,然后进入skywalking的ui中看看你的成果吧!

上一篇:iphone – RAR解压缩的最小内存


下一篇:Linux下安装SkyWalking 6.x版本 以及.NETCore项目集成