在 GATE 中多行注释句子
Annotating sentence in multiple lines in GATE
我对 GATE 中的 Sentence Splitter 模块有疑问。我的文字是这样的:
Social history. He drank a lot in his young age. He did
not attend a school. He was depressed of his condition.
虽然我们确定句子应该像这样拆分
Sentence 1: Social history.
Sentence 2: He drank a lot in his young age.
Sentence 3: He did not attend a school.
Sentence 4: He was depressed of his condition.
ANNIE Sentence Splitter 认识到不同行中的文本应该分组在不同的句子中,因此结果如下:
Sentence 1: Social history.
Sentence 2: He drank a lot in his young age.
Sentence 3: He did
Sentence 4: not attend a school.
Sentence 5: He was depressed of his condition.
那是因为句子分多行。有没有办法告诉分句器这个句子可能不止一行?或者有没有更好的方法来识别此类文本中的句子?
谢谢:)
尝试使用 RegEx Sentence Splitter 而不是 Annie。
使用 ANNIE Sentence Splitter,您的参数 TransducerURL 默认指向如下内容:
/PATH-TO-GATE/plugins/ANNIE/resources/sentenceSplitter/grammar/main-single-nl.jape
在此文件夹中还有一个名为:
的 jape 文件
/PATH-TO-GATE/plugins/ANNIE/resources/sentenceSplitter/grammar/main.jape
如果您更改它,它应该会起作用。
我对 GATE 中的 Sentence Splitter 模块有疑问。我的文字是这样的:
Social history. He drank a lot in his young age. He did
not attend a school. He was depressed of his condition.
虽然我们确定句子应该像这样拆分
Sentence 1: Social history.
Sentence 2: He drank a lot in his young age.
Sentence 3: He did not attend a school.
Sentence 4: He was depressed of his condition.
ANNIE Sentence Splitter 认识到不同行中的文本应该分组在不同的句子中,因此结果如下:
Sentence 1: Social history.
Sentence 2: He drank a lot in his young age.
Sentence 3: He did
Sentence 4: not attend a school.
Sentence 5: He was depressed of his condition.
那是因为句子分多行。有没有办法告诉分句器这个句子可能不止一行?或者有没有更好的方法来识别此类文本中的句子?
谢谢:)
尝试使用 RegEx Sentence Splitter 而不是 Annie。
使用 ANNIE Sentence Splitter,您的参数 TransducerURL 默认指向如下内容:
/PATH-TO-GATE/plugins/ANNIE/resources/sentenceSplitter/grammar/main-single-nl.jape
在此文件夹中还有一个名为:
的 jape 文件/PATH-TO-GATE/plugins/ANNIE/resources/sentenceSplitter/grammar/main.jape
如果您更改它,它应该会起作用。