./start-cluster.sh
启动后,页面上除了jobmanager上面的taskmanager其他三个节点都没有显示
查看日志:
2021-03-03 15:50:48,544 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:51:08,584 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:51:28,624 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:51:48,664 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:52:08,704 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:52:18,145 ERROR org.apache.flink.runtime.taskexecutor.TaskExecutor - Fatal error occurred in TaskExecutor akka.tcp://flink@localhost:35834/user/taskmanager_0.
org.apache.flink.runtime.taskexecutor.exceptions.RegistrationTimeoutException: Could not register at the ResourceManager within the specified maximum registration duration 300000 ms. This indicates a problem with this instance. Terminating now.
at org.apache.flink.runtime.taskexecutor.TaskExecutor.registrationTimeout(TaskExecutor.java:1111)
at org.apache.flink.runtime.taskexecutor.TaskExecutor.lambda$startRegistrationTimeout$8(TaskExecutor.java:1097)
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRunAsync(AkkaRpcActor.java:397)
2021-03-03 15:50:48,544 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:51:08,584 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:51:28,624 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:51:48,664 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:52:08,704 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@host9:6123/user/resourcemanager, retrying in 10000 ms: Ask timed out on [ActorSelection[Anchor(akka.tcp://flink@host9:6123/), Path(/user/resourcemanager)]] after [10000 ms]. Message of type [akka.actor.Identify]. A typical reason for `AskTimeoutException` is that the recipient actor didn't send a reply..
2021-03-03 15:52:18,145 ERROR org.apache.flink.runtime.taskexecutor.TaskExecutor - Fatal error occurred in TaskExecutor akka.tcp://flink@localhost:35834/user/taskmanager_0.
org.apache.flink.runtime.taskexecutor.exceptions.RegistrationTimeoutException: Could not register at the ResourceManager within the specified maximum registration duration 300000 ms. This indicates a problem with this instance. Terminating now.
at org.apache.flink.runtime.taskexecutor.TaskExecutor.registrationTimeout(TaskExecutor.java:1111)
at org.apache.flink.runtime.taskexecutor.TaskExecutor.lambda$startRegistrationTimeout$8(TaskExecutor.java:1097)
at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRunAsync(AkkaRpcActor.java:397)
发现是因为主节点 flink-conf.yaml 配置的问题,不应该使用localhost,应该使用对应的hostname
# the conf/masters file, this will be taken care of automatically. Yarn/Mesos
# automatically configure the host name based on the hostname of the node where the
# JobManager runs.
jobmanager.rpc.address: localhost(应该改为host9)
# The RPC port where the JobManager is reachable.
jobmanager.rpc.port: 6123