在 Yarn 中找不到 spark-submit 完成的工作

Can't find spark-submit finished job in Yarn

我使用 spark-submit 提交了作业,作业正确完成但未在 Yarn 中显示。

如何在 Yarn 中直接查看进程?我已经指定了 yarn 客户端模式,但在 yarn web 控制台中我什么也看不到。

基于 yarn application -list -appStates FINISHED 可以胜任。同样在同一个线程中,您会发现另一个可能的原因:

It turns out that I had enabled Log aggregation in YARN but had set the yarn.nodemanager.remote-app-log-dir to a custom hdfs directory (/tmp/yarnlogs), So logs were actually getting aggregated at /tmp/yarnlogs in HDFS, but the yarn command was still searching for logs at the default location on HDFS (/tmp/logs). So changing the property to its default value fixed it for me.

希望对您有所帮助!