Mounting "cgroup" to rootfs "/var/lib/docker/overlay2" at "/sys/**" caused "no subsystem for mount" when deploying Apache Spark on minidcos

I am trying to deploy Apache Spark on minidcos using the DC/OS catalog.

Elastic, Kafka, and Prometheus all installed successfully.

But installing Spark failed. The error shown in the service's Debug view is "TASK_FAILED : Container exited with status 125".

The full error trace in the logs is:

    WARNING: Your kernel does not support swap limit capabilities or the cgroup is not mounted. Memory limited without swap.
    container_linux.go:247: starting container process caused "process_linux.go:359: container init caused \"rootfs_linux.go:54: mounting \\"cgroup\\" to rootfs \\"/var/lib/docker/overlay2/319fc54c21ddaf3c611e69437342b0a40c70e76bfbd9b0c88175cb611449fa3e/merged\\" at \\"/sys/fs/cgroup\\" caused \\"no subsystem for mount\\"\""
    docker: Error response from daemon: oci runtime error: container_linux.go:247: starting container process caused "process_linux.go:359: container init caused \"rootfs_linux.go:54: mounting \\"cgroup\\" to rootfs \\"/var/lib/docker/overlay2/319fc54c21ddaf3c611e69437342b0a40c70e76bfbd9b0c88175cb611449fa3e/merged\\" at \\"/sys/fs/cgroup\\" caused \\"no subsystem for mount\\"\"".
    time="2019-02-23T01:15:36Z" level=error msg="error getting events from daemon: net/http: request canceled" 

    mounting "cgroup" to rootfs "/var/lib/docker/overlay2/merged" at "/sys/fs/cgroup" caused "no subsystem for mount""

Additional details about the components installed on my system:

    $sudo minidcos --version
    minidcos, version 2019.03.13.0

    $lsb_release -a
    No LSB modules are available.
    Distributor ID: Ubuntu
    Description:    Ubuntu 18.04.2 LTS
    Release:    18.04
    Codename:   bionic

    $docker version
    Client:
    Version:           18.09.4
    API version:       1.39
    Go version:        go1.10.8
    Git commit:        d14af54266
    Built:             Wed Mar 27 18:35:44 2019
    OS/Arch:           linux/amd64
    Experimental:      false

    Server: Docker Engine - Community
    Engine:
    Version:          18.09.4
    API version:      1.39 (minimum version 1.12)
    Go version:       go1.10.8
    Git commit:       d14af54
    Built:            Wed Mar 27 18:01:48 2019
    OS/Arch:          linux/amd64
    Experimental:     false

How can I resolve this error?

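Find the Docker container of the node the task was launched on: minidcos docker inspect
Enter that node: docker exec -it dcos-e2e-default-6af12-agent-1 bash (substitute the container name you found)
List all containers: docker ps -a
Show the logs of the failed container: docker logs mesos-a500e80d-d694-4fb5-8baa-0b238f472f1f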
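
The steps above can be strung together as a small shell sketch. The names `run`, `show_task_logs`, and `DRY_RUN` are hypothetical and exist only for illustration; the node and task container names must be replaced with the ones found on your own cluster. It assumes that running `docker exec <node> docker ...` non-interactively is an acceptable substitute for opening a shell on the node first.

```shell
#!/bin/sh
# Sketch only: run, show_task_logs and DRY_RUN are illustrative names,
# not part of minidcos or Docker.

# Run a command, or just print it when DRY_RUN is set.
run() {
    if [ -n "${DRY_RUN:-}" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# $1 = node container name (as found via `minidcos docker inspect`)
# $2 = name of the failed task container inside that node (optional)
show_task_logs() {
    node="$1"
    task="${2:-}"
    # List every container on the node, including exited ones.
    run docker exec "$node" docker ps -a
    # Print the logs of the failed task container, if one was given.
    if [ -n "$task" ]; then
        run docker exec "$node" docker logs "$task"
    fi
}
```

For example, `show_task_logs dcos-e2e-default-6af12-agent-1` lists the containers on that node, and passing a task container name as the second argument also prints its logs. Setting `DRY_RUN=1` only prints the commands that would be run.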

Which version of minidcos are you using?

I think this is an issue that was fixed in version 2019.02.16.0.

Upgrade to the latest version, run minidcos docker doctor, and then try again.
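
Before upgrading, it may help to check whether the installed version already includes that release. minidcos version strings appear to be date-based, so a version-aware sort can compare them. This is a minimal sketch, assuming GNU `sort -V` is available; `version_ge` is a hypothetical helper name, and the two version strings are the ones quoted in this thread.

```shell
#!/bin/sh
# version_ge A B: succeed if version A is the same as or newer than version B.
# Relies on GNU sort's -V (version sort) option.
version_ge() {
    newest="$(printf '%s\n%s\n' "$1" "$2" | sort -V | tail -n 1)"
    [ "$newest" = "$1" ]
}

installed="2019.03.13.0"   # from `minidcos --version` in the question
fixed_in="2019.02.16.0"    # release mentioned in this answer

if version_ge "$installed" "$fixed_in"; then
    echo "installed version is at or past $fixed_in"
else
    echo "installed version predates $fixed_in; upgrade first"
fi
```

With the version quoted in the question, the first branch is taken, so the output of `minidcos docker doctor` is likely the more useful thing to look at next.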