使用 python 将 CSV 转换为所需格式的 JSON

Convert CSV to JSON in desired format using python

我想收集一些关于从相应输入获得低于输出的见解,我尝试了一些代码但无法获得我想要的结果。希望看到 table 转换为理想的格式,因为我必须在后期处理巨大的 CSV。非常感谢任何意见。

输入:

Reference mcc Value currency
abcd1234 5300 134.09 USD
abcd1235 5411 38.48 USD

使用代码:

from csv import DictReader
from itertools import groupby
from pprint import pprint
import json

with open('Test_bulk_transactions_data.csv') as csvfile:
    r = DictReader(csvfile, skipinitialspace=True)
    data = [dict(d) for d in r]

    group = []
    uniquekeys = []

    for k, g in groupby(data, lambda r: (r['reference'], r['mcc'])):
        group.append({
            "reference": k[0],
            "mcc": k[1],
            "amount": [{k:v for k, v in d.items() if k not in ['reference','mcc']} for d in list(g)]})
        uniquekeys.append(k)

print(json.dumps(group, indent = 3) + '}')

当前输出:

[
   {
      "reference": "abcd1234",
      "mcc": "5300",
      "amount": [
         {
            "value": "134.09",
            "currency": "USD"
         }
      ]
   },
   {
      "reference": "abcd1235",
      "mcc": "5411",
      "amount": [
         {
            "value": "38.48",
            "currency": "USD"
         }
      ]
   }
]}

期望输出:

{
  "cardTransactions": [
    {
      "reference": "abcd1234",
      "mcc": "5300",
      "amount": {
        "value": 134.09,
        "currency": "USD"
      }
    },
    {
      "reference": "abcd1235",
      "mcc": "5411",
      "amount": {
        "value": 38.48,
        "currency": "USD"
      }
    }
  ]
}

看起来您只需要将所有内容附加到“cardTransactions”键中,并且该值可以在创建时转换为浮点数。

"amount": [{k: float(v) for k, v in d.items() if k not in ['reference','mcc']} for d in list(g)]})
group = [] to group = defaultdict(list)
and group['cardTransactions'].append(... code as usual ...)

您想要的输出不允许在给定交易中有多个 value/currency,因此您根本不需要使用 groupby。

过程可能是这样的:

data = { 'cardTransactions':
         [{ 'amount': {'value'   : float(d.pop('Value')),
                       'currency': d.pop('currency')}, 
             **d }
          for d in r ]}

print(json.dumps(data, indent = 3))