如果条件在第二级或第三级部分元素中匹配,则更改属性值

Change attribute value if condition matches in second or third level section elements

现在我有一个 XML 文件并要修改部分级别属性(特别是二级或三级)。例如:

输入:

<?xml version="1.0"?>
<article>
<front></front>
<body>
<sec id="sec1">
<title>1. Introduction</title><p>The cerebrospinal venous system has been the focus of many studies in the last few years, because of the hypothesized involvement of insufficient extracranial venous drainage in central nervous system disorders such as multiple sclerosis, normal-pressure hydrocephalus, and transient monocular blindness [<xref ref-type="bibr" rid="B1">1</xref>&ndash;<xref ref-type="bibr" rid="B4">4</xref>]. An insufficiency in venous blood drainage can be due to the presence of single or multiple stenosis on the main routes of cerebrospinal venous system [<xref ref-type="bibr" rid="B5">5</xref>].</p>
<sec id="sec1.1">
<title>Section level 2</title>
<p><def-list><def-item><term>term I:</term><def><p>defintion I</p></def></def-item><def-item><term>term 2:</term><def><p>defintion 2</p></def></def-item></def-list>In the past years, great efforts have been made to develop excellent algorithms and tools for the processing and analyzing of traditional BS-Seq data [<xref ref-type="bibr" rid="B7">7</xref>&#x2013;<xref ref-type="bibr" rid="B10">10</xref>] but none for hairpin-BS-Seq data. In this study, we designed and implemented HBS-tools and compared them against other state-of-the-art mapping tools. Our result indicated that HBS-tools have a reduced mapping time and improved mapping efficiency.</p>
</sec>
</sec></body>
</article>

如果第二级或第三级部分元素在 def-list 之前,那么我已插入特定部分级别的属性 att1="deflist"

预期输出:

<?xml version="1.0"?>
<article>
<front></front>
<body>
<sec id="sec1">
<title>1. Introduction</title><p>The cerebrospinal venous system has been the focus of many studies in the last few years, because of the hypothesized involvement of insufficient extracranial venous drainage in central nervous system disorders such as multiple sclerosis, normal-pressure hydrocephalus, and transient monocular blindness [<xref ref-type="bibr" rid="B1">1</xref>&ndash;<xref ref-type="bibr" rid="B4">4</xref>]. An insufficiency in venous blood drainage can be due to the presence of single or multiple stenosis on the main routes of cerebrospinal venous system [<xref ref-type="bibr" rid="B5">5</xref>].</p>
<sec id="sec1.1" att1="deflist">
<title>Section level 2</title>
<p><def-list><def-item><term>term I:</term><def><p>defintion I</p></def></def-item><def-item><term>term 2:</term><def><p>defintion 2</p></def></def-item></def-list>In the past years, great efforts have been made to develop excellent algorithms and tools for the processing and analyzing of traditional BS-Seq data [<xref ref-type="bibr" rid="B7">7</xref>&#x2013;<xref ref-type="bibr" rid="B10">10</xref>] but none for hairpin-BS-Seq data. In this study, we designed and implemented HBS-tools and compared them against other state-of-the-art mapping tools. Our result indicated that HBS-tools have a reduced mapping time and improved mapping efficiency.</p>
</sec>
</sec></body>
</article>

我的代码:

use strict;
use warnings;
use XML::Twig;

my $t= XML::Twig->new( twig_handlers => 
               { 'sec/section/def-list' => \&Check_deflist }
               )
          ->parsefile('input.xml');

sub Check_deflist
{  }

对于肮脏的代码表示歉意...任何人都可以在这方面帮助我,我们将不胜感激。

首先您需要修正您的 xpath 表达式。您的元素称为 <sec>,而不是 <section>。然后您需要使用正确的表达式来定位 <def-list> 元素。它们不直接在第二个 <sec> 之前,因此您需要使用两个斜杠 //.

sec/sec//def-list

现在对于处理程序,您可以获取元素和 go up its tree to find <sec>s. We put that into a list and take the first one, which is another element. On that, we set the attribute。就是这样。

use strict;
use warnings;
use XML::Twig;

my $t = XML::Twig->new(
    twig_handlers => { 'sec/sec//def-list' => \&Check_deflist },
    pretty_print  => 'indented'
)->parse( \*DATA )->print;

sub Check_deflist {
    ( $_->ancestors('sec') )[0]->set_att( att1 => 'deflist' );
}

__DATA__
<sec id="sec1">
<title>Section level 1</title>
<p>.......</p>
    <sec id="sec1.1">
    <title>Section level 2</title>
    <p><def-list><p>...</p>...</def-list></p>
    </sec>
</sec>

漂亮的输出:

<sec id="sec1">
  <title>Section level 1</title>
  <p>.......</p>
  <sec att1="deflist" id="sec1.1">
    <title>Section level 2</title>
    <p>
      <def-list><p>...</p>...</def-list>
    </p>
  </sec>
</sec>

如果第二层<def-list>超过一个<sec>,它也可以工作<sec>