在给定标签处将 XML 文件分成多个,例如:使用 python 的标签

Splitting XML file into multiple at given tags ex: tags using python

您好,我有一个大 xml 文件,我想根据 ID(这里的 ID 是唯一的)将 xml 文件拆分成多个文件。目前我有 xml 文件,其中包含 3 个唯一的标签 ID,我想拆分它们。

我的文件如下

main.xml

    <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<tal xmlns="http://hello.com" schemaVersion="5.0.0" refSchemaFile="tal.xsd" status="Executable">
    <ExecProperties supportsParallelMostFlash="false"/>
    <talLine id="tl_1" status="Executable" baseVariant="DKOMBI8" diagAddress="60">
        <blFlash status="Executable">
            <blFlashTA status="Executable">
                <sgbmid>
                    <processClass>BTLD</processClass>
                    <id>00007732</id>
                    <mainVersion>2</mainVersion>
                    <subVersion>3</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </blFlashTA>
            <blFlashTA status="Executable">
                <sgbmid>
                    <processClass>FLSL</processClass>
                    <id>00007735</id>
                    <mainVersion>2</mainVersion>
                    <subVersion>3</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </blFlashTA>
        </blFlash>
    </talLine>
    <talLine id="tl_2" status="Executable" baseVariant="DKOMBI8" diagAddress="60">
        <swDeploy status="Executable">
            <swDeployTA status="Executable">
                <sgbmid>
                    <processClass>SWFL</processClass>
                    <id>00007736</id>
                    <mainVersion>2</mainVersion>
                    <subVersion>3</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </swDeployTA>
            <swDeployTA status="Executable">
                <sgbmid>
                    <processClass>SWFL</processClass>
                    <id>00007bfc</id>
                    <mainVersion>2</mainVersion>
                    <subVersion>3</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </swDeployTA>
        </swDeploy>
    </talLine>
    <talLine id="tl_3" status="Executable" baseVariant="DKOMBI8" diagAddress="60">
        <cdDeploy status="Executable">
            <cdDeployTA status="Executable">
                <sgbmid>
                    <processClass>CAFD</processClass>
                    <id>00006d4e</id>
                    <mainVersion>0</mainVersion>
                    <subVersion>4</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </cdDeployTA>
        </cdDeploy>
    </talLine>
    <executionTime actualEndTime="0" actualStartTime="0" plannedEndTime="0" plannedStartTime="0"/>
    <installedECUList_Ist/>
    <installedECUList_Soll/>
</tal>

我需要每个 "id"-data 在一个带有页眉和页脚的文件中(你可以观察下面的文件),上面是示例文件。我需要像下面这样的随地吐痰的文件

1.xml

    <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<tal xmlns="http://hello.com" schemaVersion="5.0.0" refSchemaFile="tal.xsd" status="Executable">
    <ExecProperties supportsParallelMostFlash="false"/>
    <talLine id="tl_1" status="Executable" baseVariant="DKOMBI8" diagAddress="60">
        <blFlash status="Executable">
            <blFlashTA status="Executable">
                <sgbmid>
                    <processClass>BTLD</processClass>
                    <id>00007732</id>
                    <mainVersion>2</mainVersion>
                    <subVersion>3</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </blFlashTA>
            <blFlashTA status="Executable">
                <sgbmid>
                    <processClass>FLSL</processClass>
                    <id>00007735</id>
                    <mainVersion>2</mainVersion>
                    <subVersion>3</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </blFlashTA>
        </blFlash>
    </talLine>
    <executionTime actualEndTime="0" actualStartTime="0" plannedEndTime="0" plannedStartTime="0"/>
    <installedECUList_Ist/>
    <installedECUList_Soll/>
</tal>

2.xml

  <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<tal xmlns="http://hello.com" schemaVersion="5.0.0" refSchemaFile="tal.xsd" status="Executable">
    <ExecProperties supportsParallelMostFlash="false"/>
    <talLine id="tl_3" status="Executable" baseVariant="DKOMBI8" diagAddress="60">
        <cdDeploy status="Executable">
            <cdDeployTA status="Executable">
                <sgbmid>
                    <processClass>CAFD</processClass>
                    <id>00006d4e</id>
                    <mainVersion>0</mainVersion>
                    <subVersion>4</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </cdDeployTA>
        </cdDeploy>
    </talLine>
    <executionTime actualEndTime="0" actualStartTime="0" plannedEndTime="0" plannedStartTime="0"/>
    <installedECUList_Ist/>
    <installedECUList_Soll/>
</tal>

3.xml

    <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<tal xmlns="http://hello.com" schemaVersion="5.0.0" refSchemaFile="tal.xsd" status="Executable">
    <ExecProperties supportsParallelMostFlash="false"/>
    <talLine id="tl_2" status="Executable" baseVariant="DKOMBI8" diagAddress="60">
        <swDeploy status="Executable">
            <swDeployTA status="Executable">
                <sgbmid>
                    <processClass>SWFL</processClass>
                    <id>00007736</id>
                    <mainVersion>2</mainVersion>
                    <subVersion>3</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </swDeployTA>
            <swDeployTA status="Executable">
                <sgbmid>
                    <processClass>SWFL</processClass>
                    <id>00007bfc</id>
                    <mainVersion>2</mainVersion>
                    <subVersion>3</subVersion>
                    <patchVersion>11</patchVersion>
                </sgbmid>
            </swDeployTA>
        </swDeploy>
    </talLine>
    <executionTime actualEndTime="0" actualStartTime="0" plannedEndTime="0" plannedStartTime="0"/>
    <installedECUList_Ist/>
    <installedECUList_Soll/>
</tal>

我试图删除一些带有 id 的特定标签数据,但没有成功。你能建议我更好的方法来实现我的目标吗?

import xml.etree.ElementTree as ET
tree = ET.parse('main.xml')
root = tree.getroot()
mydata = root.find(".talLine[@id='tl_1']")
mydata.remove(mydata)

提前致谢。

我们只需要从根目录中找出您的标签的索引并清除它们并保存到新的 xml 文件中

import xml.etree.ElementTree as ET
mytree=ET.parse('D://talfiles//TAL_High_Hud_Dcs_002_003_011.xml')
myroot=mytree.getroot()
myroot[1].clear()
myroot[2].clear()
mytree.write('D://talfiles//1.xml')

然后您的输出将保存到新文件中

1.xml 文件

<ns0:tal xmlns:ns0="http:hello.com" refSchemaFile="tal.xsd" schemaVersion="5.0.0" status="Executable">
<ns0:ExecProperties supportsParallelMostFlash="false" />
<ns0:talLine /><ns0:talLine /><ns0:talLine baseVariant="DKOMBI8" diagAddress="60" id="tl_3" status="Executable">
    <ns0:cdDeploy status="Executable">
        <ns0:cdDeployTA status="Executable">
            <ns0:sgbmid>
                <ns0:processClass>CAFD</ns0:processClass>
                <ns0:id>00006d4e</ns0:id>
                <ns0:mainVersion>0</ns0:mainVersion>
                <ns0:subVersion>4</ns0:subVersion>
                <ns0:patchVersion>11</ns0:patchVersion>
            </ns0:sgbmid>
        </ns0:cdDeployTA>
    </ns0:cdDeploy>
</ns0:talLine>
<ns0:executionTime actualEndTime="0" actualStartTime="0" plannedEndTime="0" plannedStartTime="0" />
<ns0:installedECUList_Ist />
<ns0:installedECUList_Soll />

但这里我们面临挑战,我们需要避免在输出的每一行中出现 ns0 字符串 xml 文件