如何避免在字典中覆盖 k、v

Question

我的程序正在解析 XML 文件中的值，然后将它们放入字典中。

在这里，我使用了一个 for 循环来迭代文件和属性以及文本中的所有标签

但是当有像[250][155]这样的子标签时<name>，它会覆盖[4]<name>

而这一切都是运行在for循环

下

现在，我想阻止循环在进入循环后覆盖值

import pprint as p  # Importing pprint/pretty print for formatting dict
import xml.etree.ElementTree as ETT  # Importing xml.etree for xml parsing
import csv  # Importing csv to write in CSV file


def fetch_data():
    # Asking input from user for the file path
    xml_file_path = input(
        "Enter the path to the file. \n Note* use 'Double back slash' instead of 'single back slash': \n \t \t \t \t \t")

    # Initiating variable parser which will parse from the file
    parser = ETT.parse(xml_file_path)

    # Initiating variable root to get root element which will be useful in further queries
    root = parser.getroot()

    # Initiating variable main_d which is the dictionary in which the parsed data will be stored
    main_d = {}

    for w in root.iter():  # Making a for loop which will iter all tags out of the file
        value = w.attrib  # Initiating variable value for storing attributes where attributes are in the form of dictionaries
        value['value'] = w.text  # Hence, appending the text/value of the tag in the value dict
        if w not in main_d:
            main_d[w.tag] = value  # Writing all the keys and values in main_d
        else:
            main_d.pop(w)
    p.pprint(main_d, sort_dicts=False, width=200, depth=100)


fetch_data()

这就是 XML 的样子

<?xml version="1.0" encoding="UTF-8"?>
<Data data_version="1">
    <modified_on_date>some_time</modified_on_date>
    <file_version>some version</file_version>
    <name>h</name>
    <class>Hot</class>
    <fct>
        <fc_tem di="value1" un="value2" unn="value3">some integer</fc_tem>
        <fc_str di="value1" un="value2" unn="value3">some integer</fc_str>
        <DataTable name="namee" type="0" columns="2" rows="2" version="some version">
            <name>this will be overwritten on the first one up there</name>
            <type>0</type>
        </DataTable>    
    </fct>
</Data>

这是我目前的进度

考虑到程序的保密性，我只能分享这些了

Answer 1

首先，感谢@PatrickArtner，他的方法奏效了

所以你只需要做 w.tag 而不是 w

完整的片段是：

# This program is to fetch/parse data(tags, attribs, text) from the XML/XMT
# file provided


# Importing required libraries

import pprint as p                                      # Importing pprint/pretty print for formatting dict
import xml.etree.ElementTree as ETT                     # Importing xml.etree for xml parsing
import csv                                              # Importing csv to write in CSV file


# Creating a method/function fetch_data() to fetch/parse data from the given XML/XMT file

def fetch_data():


    # Asking input from user for the file path
    xml_file_path = input(
        "Enter the path to the file \n \t \t :")

    # Asking input from user for the name of the csv file which will be created
    file_name = input(str("Enter the file name with extension you want as output \n \t \t : "))


    # Initiating variable parser which will parse from the file
    parser = ETT.parse(xml_file_path)


    # Initiating variable root to get root element which will be useful in further queries
    root = parser.getroot()


    # Initiating variable main_d which is the dictionary in which the parsed data will be stored
    main_d = {}


    for w in root.iter():                               # Making a for loop which will iter all tags out of the file
        value = w.attrib                                # Initiating variable value for storing attributes where attributes are in the form of dictionaries
        value['value'] = w.text                         # Hence, appending the text/value of the tag in the value dict
        if w.tag not in main_d:                         # Checking if the tag exists or not, this will help to avoid overwriting of tag values
            main_d[w.tag] = value                       # Writing all the keys and values in main_d
        else:
            pass

    p.pprint(main_d, sort_dicts=False, width=200)               # This is just to check the output

    with open(file_name, 'w+', buffering=True) as file:         # Opening a file with the filename provided by the user
        csvwriter = csv.writer(file, quoting=csv.QUOTE_ALL)     # Initiating a variable csvwriter for the file and passing QUOTE_ALL agr.
        for x in main_d.keys():                                 # Creating a loop to write the tags
            csvwriter.writerow({x})                             # Writing the tags



fetch_data()

如何避免在字典中覆盖 k、v

How to avoid overwriting of k, v in dictionaries

python

xml

dictionary

for-loop