Apache Drill 在午餐时立即崩溃

Apache Drill crash immediately on lunch

我的 drill 集群有问题,我的 drill 集群有 1 个主节点和 4 个节点。

我已经停止了每一个钻孔节点

sudo -i /home/hadoop/apache-drill-1.10.0/bin/drillbit.sh stop

并开始了它

sudo -i /home/hadoop/apache-drill-1.10.0/bin/drillbit.sh start

我不知道该怎么做,尝试过在线搜索,但似乎没有任何相关内容 link 没有解决我的问题

    2017-07-04 13:09:10,454 [main] INFO  o.apache.drill.exec.server.Drillbit - Construction completed (2928 ms).
    2017-07-04 13:09:10,864 [main] INFO  o.a.drill.exec.rpc.user.UserServer - User Error Occurred: Drillbit could not bind to port 31010. (Address already in use)
    org.apache.drill.common.exceptions.UserException: RESOURCE ERROR: Drillbit could not bind to port 31010.

    Server type UserServer

    [Error Id: a75dd2ec-a3b6-4fcb-b60b-0a1a63354943 ]

................
................
org.apache.drill.common.AutoCloseables.close(AutoCloseables.java:64) ~[drill-common-1.10.0.jar:1.10.0]
            at org.apache.drill.exec.server.Drillbit.close(Drillbit.java:159) [drill-java-exec-1.10.0.jar:1.10.0]
            at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:294) [drill-java-exec-1.10.0.jar:1.10.0]
            at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:271) [drill-java-exec-1.10.0.jar:1.10.0]
            at org.apache.drill.exec.server.Drillbit.main(Drillbit.java:267) [drill-java-exec-1.10.0.jar:1.10.0]
    2017-07-04 13:09:11,919 [main] INFO  o.apache.drill.exec.server.Drillbit - Shutdown completed (1053 ms).

很想知道如何解决它

如何找到原因: Drillbit 无法绑定到端口 31010。(地址已被使用

我的 ec2 实例锁定端口 31010 上似乎有僵尸任务

所以我杀死了那个 pid 任务 通过这个:

netstat -tulp | grep LIST | grep 31010

return

tcp        0      0 *:31010                     *:*                         LISTEN      16449/java

现在是终结者

kill -9 16449