WordNet 3.1 和 WordNet 3.0 有什么区别?
What's the difference between WordNet 3.1 and WordNet 3.0?
wordnet.princeton.edu
似乎没有更新日志或类似的东西
如果您在 WordNet's Current Version section 下查看,您会发现:
The most recent Windows version of WordNet is 2.1, released in March
2005. Version 3.0 for Unix/Linux/Solaris/etc. was released in December, 2006. Version 3.1 is currently available only online.
另外,说说3.0和3.1版本的区别你可以阅读:
WordNet 3.1 DATABASE FILES ONLY
You can download the WordNet 3.1 database files from here. Note that
this is not a full package as those above, nor does it contain any
code for running WordNet. However, you can replace the files in the
database directory of your 3.0 local installation with these files and
the WordNet interface will run, returning entries from the 3.1
database. This is simply a compressed tar file of the WordNet 3.1
database files.
所以区别在于WordNet 3.1是在线的,但是你可以更换3.0版本的数据库,使用本地安装。
您可以找到有关版本 3.0 的文档here。
添加到@abarisone 的回答中,WordNet 3.0 和 WordNet 3.1 之间的实际同义词集 ID 本身可能不同:(
例如,在 WordNet 3.1 中 chair 是 103005231-n.
但是,在 WordNet 3.0 中它 是 103001627-n。但是你不能在 http://wordnet-rdf.princeton.edu/wn31/103001627-n nor http://wordnet-rdf.princeton.edu/wn30/103001627-n, but instead you need to use http://wordnet-rdf.princeton.edu/wn30/03001627-n which incorrectly redirects to 102992974-n 中查找它。
我认为 WordNet RDF 3.1 online app, because 102992974-n 中的错误并不正式存在。您甚至无法搜索它(在线和离线)。如果您在该页面上获得 RDF/JSON-LD 文件,它会为您提供 103005231-n.
在wn3.1.dict/dict/index.noun
中:
chair n 5 4 @ ~ %p + 5 2 03005231 00599171 10488547 03275941 03005700
该文件中的任何地方都没有提及 02992974
。
这两个问题都令人困惑。我想知道为什么他们在小版本中更改了同义词集 ID。
关于 WordNet 同义词集 ID 的状态:
结论是,目前使用 WordNet 3.0 同义词集 ID 是最安全的。
为了以后的工作,可以考虑使用来自Global Wordnet Association的Inter-Lingual Index(即将推出)。它将具有与 Wordnet 3.0 兼容的 ID。
来自 wn-users mailing list, 30 Oct 2015 的引用:
From: Raphael, Nicholas
The URI is built from the “dblocation” field, which is a byte offset
from the beginning of the relevant character-based database file (I’m
not sure which). This will change from release to release as items are
removed and added and moved around.
.
From: Peter Clark
To the best of my knowledge…. FYI a little known fact is that the
sense keys (e.g., “ability%1:07:00::”) are stable between releases,
except when senses are split or merged. This provides a stable way to
refer to synsets across releases, rather than use synset numbers. Also
you can find the mappings between synset numbers in different releases
by looking for the same sense keys. (sensekey->synset is a many-to-1
mapping: A synset may have multiple sense keys, one for each
word+sense in the synset. But a sense key maps to exactly one synset).
Best wishes, Pete
.
From: John McCrae
Hello Hendy,
Yes WordNet synset Identifiers are based on the byte offset of the
descriptor in a given release of WordNet, as such they are far from
stable across versions of WordNets. The sense identifiers are more
stable but still can be unreliable as sense do get split and merged.
Also, there are two slightly different versions of WordNet 3.1 and the
WordNet RDF version accepts synset identifiers from either... this is
of course, as others have commented, all very confusing.
For this reason, the Global WordNet Association has started work on an
Inter-Lingual Index, which we expect to be online soon (i.e., in time
for the Global WordNet Conference in January), and will give each
synset a single unchanging URI.
Piek Vossen gave a good talk about this recently and this slides are
online here: http://ldl2014.org/slides/Vossen-LOD-CILI.pdf
For the moment, I would recommend using WN 3.0 identifiers to link
synsets, which the WordNet Interlingual Index will also be based on.
Regards, John
wordnet.princeton.edu
似乎没有更新日志或类似的东西如果您在 WordNet's Current Version section 下查看,您会发现:
The most recent Windows version of WordNet is 2.1, released in March 2005. Version 3.0 for Unix/Linux/Solaris/etc. was released in December, 2006. Version 3.1 is currently available only online.
另外,说说3.0和3.1版本的区别你可以阅读:
WordNet 3.1 DATABASE FILES ONLY
You can download the WordNet 3.1 database files from here. Note that this is not a full package as those above, nor does it contain any code for running WordNet. However, you can replace the files in the database directory of your 3.0 local installation with these files and the WordNet interface will run, returning entries from the 3.1 database. This is simply a compressed tar file of the WordNet 3.1 database files.
所以区别在于WordNet 3.1是在线的,但是你可以更换3.0版本的数据库,使用本地安装。
您可以找到有关版本 3.0 的文档here。
添加到@abarisone 的回答中,WordNet 3.0 和 WordNet 3.1 之间的实际同义词集 ID 本身可能不同:(
例如,在 WordNet 3.1 中 chair 是 103005231-n.
但是,在 WordNet 3.0 中它 是 103001627-n。但是你不能在 http://wordnet-rdf.princeton.edu/wn31/103001627-n nor http://wordnet-rdf.princeton.edu/wn30/103001627-n, but instead you need to use http://wordnet-rdf.princeton.edu/wn30/03001627-n which incorrectly redirects to 102992974-n 中查找它。
我认为 WordNet RDF 3.1 online app, because 102992974-n 中的错误并不正式存在。您甚至无法搜索它(在线和离线)。如果您在该页面上获得 RDF/JSON-LD 文件,它会为您提供 103005231-n.
在wn3.1.dict/dict/index.noun
中:
chair n 5 4 @ ~ %p + 5 2 03005231 00599171 10488547 03275941 03005700
该文件中的任何地方都没有提及 02992974
。
这两个问题都令人困惑。我想知道为什么他们在小版本中更改了同义词集 ID。
关于 WordNet 同义词集 ID 的状态:
结论是,目前使用 WordNet 3.0 同义词集 ID 是最安全的。
为了以后的工作,可以考虑使用来自Global Wordnet Association的Inter-Lingual Index(即将推出)。它将具有与 Wordnet 3.0 兼容的 ID。
来自 wn-users mailing list, 30 Oct 2015 的引用:
From: Raphael, Nicholas
The URI is built from the “dblocation” field, which is a byte offset from the beginning of the relevant character-based database file (I’m not sure which). This will change from release to release as items are removed and added and moved around.
.
From: Peter Clark
To the best of my knowledge…. FYI a little known fact is that the sense keys (e.g., “ability%1:07:00::”) are stable between releases, except when senses are split or merged. This provides a stable way to refer to synsets across releases, rather than use synset numbers. Also you can find the mappings between synset numbers in different releases by looking for the same sense keys. (sensekey->synset is a many-to-1 mapping: A synset may have multiple sense keys, one for each word+sense in the synset. But a sense key maps to exactly one synset). Best wishes, Pete
.
From: John McCrae
Hello Hendy,
Yes WordNet synset Identifiers are based on the byte offset of the descriptor in a given release of WordNet, as such they are far from stable across versions of WordNets. The sense identifiers are more stable but still can be unreliable as sense do get split and merged. Also, there are two slightly different versions of WordNet 3.1 and the WordNet RDF version accepts synset identifiers from either... this is of course, as others have commented, all very confusing.
For this reason, the Global WordNet Association has started work on an Inter-Lingual Index, which we expect to be online soon (i.e., in time for the Global WordNet Conference in January), and will give each synset a single unchanging URI.
Piek Vossen gave a good talk about this recently and this slides are online here: http://ldl2014.org/slides/Vossen-LOD-CILI.pdf
For the moment, I would recommend using WN 3.0 identifiers to link synsets, which the WordNet Interlingual Index will also be based on.
Regards, John