Running mlflow as a systemd service - gunicorn not found

I'm trying to run an mlflow tracking server, installed in a virtualenv, as a systemd service on Ubuntu 20.04, but I get an error saying that it cannot find gunicorn. Here is my journal:

nov 27 10:37:17 Atrium-Power mlflow[81375]: Traceback (most recent call last):
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/bin/mlflow", line 8, in <module>
nov 27 10:37:17 Atrium-Power mlflow[81375]:     sys.exit(cli())
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 829, in __call__
nov 27 10:37:17 Atrium-Power mlflow[81375]:     return self.main(*args, **kwargs)
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 782, in main
nov 27 10:37:17 Atrium-Power mlflow[81375]:     rv = self.invoke(ctx)
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 1259, in invoke
nov 27 10:37:17 Atrium-Power mlflow[81375]:     return _process_result(sub_ctx.command.invoke(sub_ctx))
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 1066, in invoke
nov 27 10:37:17 Atrium-Power mlflow[81375]:     return ctx.invoke(self.callback, **ctx.params)
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 610, in invoke
nov 27 10:37:17 Atrium-Power mlflow[81375]:     return callback(*args, **kwargs)
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/mlflow/cli.py", line 392, in server
nov 27 10:37:17 Atrium-Power mlflow[81375]:     _run_server(
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/mlflow/server/__init__.py", line 138, in _run_server
nov 27 10:37:17 Atrium-Power mlflow[81375]:     exec_cmd(full_command, env=env_map, stream_output=True)
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/mlflow/utils/process.py", line 34, in exec_cmd
nov 27 10:37:17 Atrium-Power mlflow[81375]:     child = subprocess.Popen(
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/subprocess.py", line 947, in __init__
nov 27 10:37:17 Atrium-Power mlflow[81375]:     self._execute_child(args, executable, preexec_fn, close_fds,
nov 27 10:37:17 Atrium-Power mlflow[81375]:   File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/subprocess.py", line 1819, in _execute_child
nov 27 10:37:17 Atrium-Power mlflow[81375]:     raise child_exception_type(errno_num, err_msg, err_filename)
nov 27 10:37:17 Atrium-Power mlflow[81375]: FileNotFoundError: [Errno 2] No such file or directory: 'gunicorn'

My systemd unit looks like this:

[Unit]
StartLimitBurst=5
StartLimitIntervalSec=33

[Service]
User=praxasense
WorkingDirectory=/home/praxasense
Restart=always
RestartSec=5
ExecStart=/home/praxasense/.miniconda3/envs/mlflow-server/bin/mlflow server --port 3569 --backend-store-uri .mlruns

[Install]
WantedBy=multi-user.target

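The odd thing is that if I run the command from ExecStart in my terminal, it works fine in fish shell but not in bash. In bash, if I first run conda activate mlflow-server and then run mlflow ..., it works. As far as I know, the Python interpreter should be aware of its virtual environment, so it should work the way I'm trying it, but apparently I'm missing something that keeps it from finding the gunicorn package, which is clearly there.

Any ideas?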

Try adding the venv's bin path to the environment that systemd runs the service with:

[Service]
...
Environment="PATH=/home/praxasense/.miniconda3/envs/mlflow-server/bin"
...
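
The traceback ends in subprocess.Popen failing on the bare name 'gunicorn', which suggests that gunicorn is started as a separate program and looked up through PATH, not through the interpreter's own environment. Here is a minimal Python sketch of that behaviour, assuming a POSIX system with /bin/sh. The fake-gunicorn script and the temporary directory are hypothetical stand-ins and are not part of mlflow:

```python
import os
import shutil
import subprocess
import tempfile

# Hypothetical stand-in for gunicorn: a tiny script in a temporary "env/bin" directory.
env_bin = tempfile.mkdtemp()
fake = os.path.join(env_bin, "fake-gunicorn")
with open(fake, "w") as fh:
    fh.write("#!/bin/sh\necho started\n")
os.chmod(fake, 0o755)

SYSTEM_PATH = "/usr/bin:/bin"                  # PATH without the env's bin directory
ENV_PATH = env_bin + os.pathsep + SYSTEM_PATH  # PATH with the env's bin directory first


def try_launch(path):
    """Start fake-gunicorn by bare name, the way the traceback shows for gunicorn."""
    try:
        result = subprocess.run(
            ["fake-gunicorn"],
            env={"PATH": path},
            capture_output=True,
            text=True,
            check=True,
        )
        return result.stdout.strip()
    except FileNotFoundError:
        # Same exception type as in the journal: the name is not on PATH.
        return "not found"


print(shutil.which("fake-gunicorn", path=SYSTEM_PATH))  # lookup without the env's bin directory
print(shutil.which("fake-gunicorn", path=ENV_PATH))     # lookup with the env's bin directory
print(try_launch(SYSTEM_PATH))
print(try_launch(ENV_PATH))
```

Note that Environment="PATH=..." sets the whole PATH for the service. If the service also needs system tools, the env's bin directory can go first, followed by the usual system directories, for example PATH=/home/praxasense/.miniconda3/envs/mlflow-server/bin:/usr/bin:/bin.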

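I'd also suggest setting KillMode=mixed, because MLflow spawns gunicorn instances that will not be terminated if you kill the service some other way. With mixed, systemd sends SIGTERM to the main process and then SIGKILL to any remaining processes in the unit's control group, so the child processes are terminated as well.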