如何遍历字典形状 .txt 文件以创建 DataFrame

Question

我有一个 .txt 文件，里面有一个字典，比如（几行）：

{
    "data": [
        {
            "title": "Greatest chess game",
            "created_time": "2020-02-17T16:51:44+0000"
         } 
             ]
}

我需要打开这个文件并创建类似的 df：

     title                 created_time
0    Greatest chess game   2020-02-17T16:51:44+0000
....

当我打开 txt 文件时：

output_file=open('data\data_new.txt', 'w')
with open(output_file, 'r') as reader:
    print(reader.readline(5))

它被视为：

TypeError: expected str, bytes or os.PathLike object, not _io.TextIOWrapper

如何转换它？

Answer 1

不确定为什么要打开文件两次，一次在读取模式下，一次在写入模式下，不确定是否真的可行。

无论如何，如前所述，您似乎拥有 JSON，因此请使用 json 库来读取文件。

import json
import pandas as pd

with open('data_new.txt', 'r', encoding='utf-8') as reader:
    json = json.load(reader )
    
df = pd.DataFrame(json['data'])

print(df)

Answer 2

我认为您可以使用 json 库将您的文本文件作为 json 文件打开，例如：

import json

with open('data.txt') as json_file:
    data = json.load(json_file)
    #here you can make you dataframe

如何遍历字典形状 .txt 文件以创建 DataFrame

How to iterate through dictionary shape .txt file to create DataFrame

python

dictionary

dataframe

txt