Yarn metrics参数说明
获取Yarn jmx信息:
curl -i http://xxx:8088/jmx
Hadoop:service=ResourceManager,name=FSOpDurations
Hadoop:service=ResourceManager,name=JvmMetrics
Hadoop:service=ResourceManager,name=ClusterMetrics
Hadoop:service=ResourceManager,name=RpcActivityForPort8033
Hadoop:service=ResourceManager,name=MetricsSystem,sub=Stats
Hadoop:service=ResourceManager,name=QueueMetrics,q0=root,q1=user02
Hadoop:service=ResourceManager,name=QueueMetrics,q0=root,q1=A,q2=C
Hadoop:service=ResourceManager,name=QueueMetrics,q0=root
Hadoop:service=ResourceManager,name=QueueMetrics,q0=root,q1=A
Hadoop:service=ResourceManager,name=QueueMetrics,q0=root,q1=A,q2=B
Hadoop:service=ResourceManager,name=QueueMetrics,q0=root,q1=default
Hadoop:service=ResourceManager,name=QueueMetrics,q0=root,q1=user01
#QueueMetrics
running_0
running_60
running_300
running_1440
FairShareMB Fair share of memory in MB
FairShareVCores Fair share of CPU in vcores
SteadyFairShareMB Steady fair share of memory in MB
SteadyFairShareVCores Steady fair share of CPU in vcores
MinShareMB Minimum share of memory in MB
MinShareVCores Minimum share of CPU in vcores
MaxShareMB Maximum share of memory in MB
MaxShareVCores Maximum share of CPU in vcores
AppsSubmitted application提交个数
AppsRunning 正在运行的application个数
AppsPending 挂起的application个数
AppsCompleted 完成的application个数
AppsKilled 被杀死的application个数
AppsFailed 失败的application个数
AllocatedMB 已分配的内存量
AllocatedVCores 分配给正在运行的应用的虚拟core
AllocatedContainers 已分配容器数
AggregateContainersAllocated Aggregate # of allocated containers
AggregateContainersReleased Aggregate # of released containers
AvailableMB 可用内存量
AvailableVCores 可用虚拟Core数
PendingMB Pending memory allocation in MB
PendingVCores Pending CPU allocation in virtual cores
PendingContainers # of pending containers
ReservedMB 预留内存量
ReservedVCores 预留虚拟Core数
ReservedContainers 预留容器数
ActiveUsers 激活用户数
ActiveApplications 激活的应用数
AppAttemptFirstContainerAllocationDelayNumOps 为应用分配容器延迟数
AppAttemptFirstContainerAllocationDelayAvgTime 为应用分配容器延迟时间
#FSOpDurations
ContinuousSchedulingRunNumOps Duration for a continuous scheduling run
ContinuousSchedulingRunAvgTime
ContinuousSchedulingRunStdevTime
ContinuousSchedulingRunIMinTime
ContinuousSchedulingRunIMaxTime
ContinuousSchedulingRunMinTime
ContinuousSchedulingRunMaxTime
NodeUpdateCallNumOps Duration to handle a node update
NodeUpdateCallAvgTime
NodeUpdateCallStdevTime
NodeUpdateCallIMinTime
NodeUpdateCallIMaxTime
NodeUpdateCallMinTime
NodeUpdateCallMaxTime
UpdateThreadRunNumOps Duration for a update thread run
UpdateThreadRunAvgTime
UpdateThreadRunStdevTime
UpdateThreadRunIMinTime
UpdateThreadRunIMaxTime
UpdateThreadRunMinTime
UpdateThreadRunMaxTime
UpdateCallNumOps Duration for an update call
UpdateCallAvgTime
UpdateCallStdevTime
UpdateCallIMinTime
UpdateCallIMaxTime
UpdateCallMinTime
UpdateCallMaxTime
PreemptCallNumOps Duration for a preempt call
PreemptCallAvgTime
PreemptCallStdevTime
PreemptCallIMinTime
PreemptCallIMaxTime
PreemptCallMinTime
PreemptCallMaxTime
#ClusterMetrics
NumActiveNMs active NMs
NumDecommissionedNMs decommissioned NMs
NumLostNMs lost NMs
NumUnhealthyNMs unhealthy NMs
NumRebootedNMs Rebooted NMs
AMLaunchDelayNumOps AM container launch delay
AMLaunchDelayAvgTime
AMRegisterDelayNumOps AM register delay
AMRegisterDelayAvgTime
#RpcActivityForPort8033
ReceivedBytes Total number of received bytes
SentBytes Total number of sent bytes
RpcQueueTimeNumOps Total number of RPC calls
RpcQueueTimeAvgTime Average queue time in milliseconds
RpcProcessingTimeNumOps Total number of RPC calls (same to RpcQueueTimeNumOps)
RpcProcessingTimeAvgTime Average Processing time in milliseconds
RpcAuthenticationFailures Total number of authentication failures
RpcAuthenticationSuccesses Total number of authentication successes
RpcAuthorizationFailures Total number of authorization failures
RpcAuthorizationSuccesses Total number of authorization successes
RpcClientBackoff
NumOpenConnections NumOpenConnections
CallQueueLength Current length of the call queue
jmx hadoop部分参数
curl http://10.116.100.2:50070/jmx?qry=Hadoop:service=NameNode,name=RpcActivityForPort8020
NameNode
NameNode:50070
Hadoop:service=NameNode,name=JvmMetrics
- MemHeapMaxM
- MemMaxM
Hadoop:service=NameNode,name=FSNamesystem
- CapacityTotal
- CapacityTotalGB
- CapacityRemaining
- CapacityRemainingGB
- TotalLoad
- FilesTotal
Hadoop:service=NameNode,name=FSNamesystemState
- NumLiveDataNodes
Hadoop:service=NameNode,name=NameNodeInfo
- LiveNodes
java.lang:type=Runtime
- StartTime
Hadoop:service=NameNode,name=FSNamesystemState
- TopUserOpCounts:timestamp
DataNode
DataNode:50075
Hadoop:service=DataNode,name=DataNodeActivity-slave-50010
- BytesWritten
- BytesRead
- BlocksWritten
- BlocksRead
- ReadsFromLocalClient
- ReadsFromRemoteClient
- WritesFromLocalClient
- WritesFromRemoteClient
- BlocksGetLocalPathInfo
HDFS
NameNode:50070
Hadoop:service=NameNode,name=NameNodeActivity
- CreateFileOps
- FilesCreated
- FilesAppended
- FilesRenamed
- GetListingOps
- DeleteFileOps
- FilesDeleted
Hadoop:service=NameNode,name=FSNamesystem
- CapacityTotal
- CapacityTotalGB
- CapacityUsed
- CapacityUsedGB
- CapacityRemaining
- CapacityRemainingGB
- CapacityUsedNonDFS
Yarn
ResourceManager:8088
Hadoop:service=ResourceManager,name=ClusterMetrics
- NumActiveNMs
- NumDecommissionedNMs
Hadoop:service=ResourceManager,name=JvmMetrics
- MemHeapUsedM