Gremlin：在单独的查询中读取边缘 GraphML 文件和节点 GraphML 文件

Question

我有两个文件要使用 g.io(<name file>).read().iterate() 加载：nodes.xml 和 edges.xml。

nodes.xml文件包含了我要上传的图的节点，内容是这样的：

<?xml version='1.0' encoding='utf-8'?>
<graphml xmlns="http://graphml.graphdrawing.org/xmlns" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://graphml.graphdrawing.org/xmlns http://graphml.graphdrawing.org/xmlns/1.0/graphml.xsd">
    <key id="labelV" for="node" attr.name="labelV" attr.type="string" />
    <key id="name" for="node" attr.name="name" attr.type="string" />
    <key id="age" for="node" attr.name="age" attr.type="int" />
    <graph id="G" edgedefault="directed">
        <node id="1">
            <data key="labelV">person</data>
            <data key="name">marko</data>
            <data key="age">29</data>
        </node>
        <node id="2">
            <data key="labelV">person</data>
            <data key="name">vadas</data>
            <data key="age">27</data>
        </node>
    </graph>
</graphml>

edges.xml文件包含我要上传的图的边，其内容是这样的：

<?xml version='1.0' encoding='utf-8'?>
<graphml xmlns="http://graphml.graphdrawing.org/xmlns" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://graphml.graphdrawing.org/xmlns http://graphml.graphdrawing.org/xmlns/1.0/graphml.xsd">
    <key id="labelE" for="edge" attr.name="labelE" attr.type="string" />
    <key id="weight" for="edge" attr.name="weight" attr.type="double" />
    <graph id="G" edgedefault="directed">
        <edge id="7" source="1" target="2">
            <data key="labelE">knows</data>
            <data key="weight">0.5</data>
        </edge>
    </graph>
</graphml>

我想先上传节点运行 g.io('nodes.xml').read().iterate() 然后 [=] 37=] g.io('edges.xml').read().iterate()。但是当我上传 edges.xml 时，它不会向先前创建的节点添加边，而是创建新节点。

在 Gremlin 中使用 类似命令 可以轻松地先加载节点，然后在单独的查询中加载边？我知道这可以通过复杂的查询来完成，这些查询涉及通过用户查询逐条读取和创建 edges.xml 文件中的边缘，但我想知道是否有更简单的方法。另外，我不想上传包含所有节点和边的单个文件。

Answer 1

恐怕 GraphMLReader 不是那样工作的。它不是为读入现有图表而设计的。老实说，我不记得这是有意还是无意的。

虽然代码并不太复杂。您可能只是修改它以按照您想要的方式工作。您可以看到 here，其中代码检查 id 的顶点缓存。该缓存在您第二次执行时为空，因为它仅通过添加新顶点的方式填充 - 它不记得您第一次运行的任何内容，并且它不会直接从图中读取您的第二次运行.只需将其更改为逻辑以更好地满足您的需求。

Gremlin：在单独的查询中读取边缘 GraphML 文件和节点 GraphML 文件

Gremlin: Read edge GraphML file and node GraphML file in separate queries

graphml

neo4j

gremlin

tinkerpop

janusgraph