mapreduce 中的 NoSuchElementException
NoSuchElementException in mapreduce
我是 map reduce 的新手,遇到 NoSuchElementException,请帮忙。
在文本下方输入文件容器:
this is a hadoop program
i am writing it for first time
映射器class:
public class Mappers extends MapReduceBase implements Mapper<LongWritable, Text, IntWritable, IntWritable>{
private Text word = new Text();
private IntWritable singleWordCount = new IntWritable();
private IntWritable one = new IntWritable(1);
@Override
public void map(LongWritable key, Text value, OutputCollector<IntWritable, IntWritable> output, Reporter reporter) throws IOException {
StringTokenizer wordList = new StringTokenizer(value.toString());
while (wordList.hasMoreTokens()) {
int wordSize = wordList.nextToken().length();
singleWordCount.set(wordSize);
if(word != null && wordList != null && wordList.nextToken() != null){
word.set(wordList.nextToken());
output.collect(singleWordCount, one);
}
}
}
}
This is the error I am getting
您在循环中为每次迭代调用了三次 wordList.nextToken()
。每次你调用它 StringTokenizer
都会 return 下一个标记,当你的程序在你的文本中遇到单词 first
时会导致异常,因为你检索 first
然后 time
然后尝试检索下一个不存在的词,导致异常。
您需要做的是在每次迭代中检索一次并将其存储在变量中。或者,如果您确实需要在一次迭代中检索两个单词,请始终调用 hasMoreTokens()
以检查在实际调用 nextToken()
.
之前是否确实还有另一个单词要处理
我是 map reduce 的新手,遇到 NoSuchElementException,请帮忙。
在文本下方输入文件容器:
this is a hadoop program
i am writing it for first time
映射器class:
public class Mappers extends MapReduceBase implements Mapper<LongWritable, Text, IntWritable, IntWritable>{
private Text word = new Text();
private IntWritable singleWordCount = new IntWritable();
private IntWritable one = new IntWritable(1);
@Override
public void map(LongWritable key, Text value, OutputCollector<IntWritable, IntWritable> output, Reporter reporter) throws IOException {
StringTokenizer wordList = new StringTokenizer(value.toString());
while (wordList.hasMoreTokens()) {
int wordSize = wordList.nextToken().length();
singleWordCount.set(wordSize);
if(word != null && wordList != null && wordList.nextToken() != null){
word.set(wordList.nextToken());
output.collect(singleWordCount, one);
}
}
}
}
This is the error I am getting
您在循环中为每次迭代调用了三次 wordList.nextToken()
。每次你调用它 StringTokenizer
都会 return 下一个标记,当你的程序在你的文本中遇到单词 first
时会导致异常,因为你检索 first
然后 time
然后尝试检索下一个不存在的词,导致异常。
您需要做的是在每次迭代中检索一次并将其存储在变量中。或者,如果您确实需要在一次迭代中检索两个单词,请始终调用 hasMoreTokens()
以检查在实际调用 nextToken()
.