如何查看我的 mapreduce 作业是否在多个节点上执行?

How to see if my mapreduce job is executed on multiple nodes?

我在集群中有 运行 个 Map Reduce 作业。我在 HDInsight 上用 2 个名称节点和 4 个数据节点创建了一个作业。

我没有设置map任务和reduce任务的数量。执行我的 mapreduce 作业后,我得到如下结果。

我注意到启动的 reduce 任务数是 1.Dose 这意味着我的作业只在一个节点上执行?如何查看此作业中使用了多少个节点?

File System Counters
                FILE: Number of bytes read=2209390166
                FILE: Number of bytes written=3314494070
                FILE: Number of read operations=0
                FILE: Number of large read operations=0
                FILE: Number of write operations=0
                WASB: Number of bytes read=1084887535
                WASB: Number of bytes written=1205106549
                WASB: Number of read operations=0
                WASB: Number of large read operations=0
                WASB: Number of write operations=0
        Job Counters
                Launched map tasks=2
                Launched reduce tasks=1
                Rack-local map tasks=2
                Total time spent by all maps in occupied slots (ms)=148221
                Total time spent by all reduces in occupied slots (ms)=302038
                Total time spent by all map tasks (ms)=148221
                Total time spent by all reduce tasks (ms)=151019
                Total vcore-seconds taken by all map tasks=148221
                Total vcore-seconds taken by all reduce tasks=151019
                Total megabyte-seconds taken by all map tasks=113833728
                Total megabyte-seconds taken by all reduce tasks=231965184
        Map-Reduce Framework
                Map input records=3820642
                Map output records=3820642
                Map output bytes=1092815223
                Map output materialized bytes=1104695065
                Input split bytes=286
                Combine input records=0
                Combine output records=0
                Reduce input groups=200
                Reduce shuffle bytes=1104695065
                Reduce input records=3820642
                Reduce output records=3820642
                Spilled Records=11461926
                Shuffled Maps =2
                Failed Shuffles=0
                Merged Map outputs=2
                GC time elapsed (ms)=16229
                CPU time spent (ms)=225140
                Physical memory (bytes) snapshot=1377296384
                Virtual memory (bytes) snapshot=5068787712
                Total committed heap usage (bytes)=1175453696
        Shuffle Errors
                BAD_ID=0
                CONNECTION=0
                IO_ERROR=0
                WRONG_LENGTH=0
                WRONG_MAP=0
                WRONG_REDUCE=0
        File Input Format Counters
                Bytes Read=1084887106
        File Output Format Counters
                Bytes Written=68771556

是的,你将reduce数设置为1,输出reduce为1,你可以看到它Map input records=3820642 地图输出记录=3820642 地图输出字节数=1092815223 映射输出物化字节=1104695065 输入拆分字节=286 合并输入记录=0 合并输出记录=0 减少输入组=200 减少随机播放字节数=1104695065 减少输入记录=3820642 减少输出记录=3820642