Convert CSV to JSON in desired format using Python
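I would like some insight into how to get the output below from the corresponding input. I tried some code but could not get the result I want. I would like to see the table converted into the desired format, since I will have to process huge CSVs at a later stage. Any input is much appreciated.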
Input:
Reference | mcc | Value | currency |
---|---|---|---|
abcd1234 | 5300 | 134.09 | USD |
abcd1235 | 5411 | 38.48 | USD |
Code used:

    from csv import DictReader
    from itertools import groupby
    from pprint import pprint
    import json

    with open('Test_bulk_transactions_data.csv') as csvfile:
        r = DictReader(csvfile, skipinitialspace=True)
        data = [dict(d) for d in r]

        group = []
        uniquekeys = []

        for k, g in groupby(data, lambda r: (r['reference'], r['mcc'])):
            group.append({
                "reference": k[0],
                "mcc": k[1],
                "amount": [{k:v for k, v in d.items() if k not in ['reference','mcc']} for d in list(g)]})
            uniquekeys.append(k)

    print(json.dumps(group, indent = 3) + '}')
Current output:

    [
       {
          "reference": "abcd1234",
          "mcc": "5300",
          "amount": [
             {
                "value": "134.09",
                "currency": "USD"
             }
          ]
       },
       {
          "reference": "abcd1235",
          "mcc": "5411",
          "amount": [
             {
                "value": "38.48",
                "currency": "USD"
             }
          ]
       }
    ]}
Desired output:

    {
       "cardTransactions": [
          {
             "reference": "abcd1234",
             "mcc": "5300",
             "amount": {
                "value": 134.09,
                "currency": "USD"
             }
          },
          {
             "reference": "abcd1235",
             "mcc": "5411",
             "amount": {
                "value": 38.48,
                "currency": "USD"
             }
          }
       ]
    }
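It looks like you just need to append everything under a "cardTransactions" key, and the value can be converted to a float when the entry is created. Convert only the value column, since calling float() on "USD" raises a ValueError:

    "amount": [{k: float(v) if k == 'value' else v for k, v in d.items() if k not in ['reference','mcc']} for d in list(g)]})

Then change group = [] to group = defaultdict(list) (imported from collections) and call group['cardTransactions'].append(... code as usual ...).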
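Put together, the changes suggested in this answer might look like the sketch below. It assumes the CSV headers are lowercase (reference, mcc, value, currency), as the question's current output suggests. The helper name group_transactions is made up for illustration, and the sample rows are inlined through io.StringIO so the snippet runs without the real file. Note that "amount" remains a list with this approach.

```python
import csv
import io
import json
from collections import defaultdict
from itertools import groupby

# Inline sample data; lowercase headers are an assumption based on the question's output.
SAMPLE_CSV = """reference,mcc,value,currency
abcd1234,5300,134.09,USD
abcd1235,5411,38.48,USD
"""


def group_transactions(csvfile):
    """Group rows by (reference, mcc) under a 'cardTransactions' key (hypothetical helper)."""
    rows = csv.DictReader(csvfile, skipinitialspace=True)
    group = defaultdict(list)
    # groupby only merges adjacent rows, so the input is assumed to be sorted by key.
    for key, items in groupby(rows, lambda row: (row["reference"], row["mcc"])):
        group["cardTransactions"].append({
            "reference": key[0],
            "mcc": key[1],
            # Convert only the numeric column; currency stays a string.
            "amount": [
                {k: float(v) if k == "value" else v
                 for k, v in d.items() if k not in ("reference", "mcc")}
                for d in items
            ],
        })
    return group


result = group_transactions(io.StringIO(SAMPLE_CSV))
print(json.dumps(result, indent=3))
```

To read the real file instead, pass the handle from open('Test_bulk_transactions_data.csv', newline='') to the helper inside a with block.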
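Your desired output does not allow more than one value/currency pair per transaction, so you do not need groupby at all.
The process could look like this:

    # r is the DictReader from the question; run this while the file is still open
    data = {'cardTransactions':
            [{'amount': {'value': float(d.pop('Value')),
                         'currency': d.pop('currency')},
              **d}
             for d in r]}
    print(json.dumps(data, indent=3))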
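As a self-contained illustration of the no-groupby approach, here is a minimal sketch. It assumes the header spelling shown in the question's input table (Reference, mcc, Value, currency) and lowercases the header names so the keys match the desired output. The helper name to_card_transactions is hypothetical, and the sample rows are inlined through io.StringIO so the snippet runs on its own.

```python
import csv
import io
import json

# Inline sample mirroring the question's table; header spelling is taken from that table.
SAMPLE_CSV = """Reference,mcc,Value,currency
abcd1234,5300,134.09,USD
abcd1235,5411,38.48,USD
"""


def to_card_transactions(csvfile):
    """Build the desired structure, one transaction per CSV row (hypothetical helper)."""
    reader = csv.DictReader(csvfile, skipinitialspace=True)
    transactions = []
    for row in reader:
        # Lowercase the header names so 'Reference' and 'Value' match the desired keys.
        row = {name.lower(): field for name, field in row.items()}
        transactions.append({
            "reference": row["reference"],
            "mcc": row["mcc"],
            "amount": {
                "value": float(row["value"]),  # numeric in the JSON output
                "currency": row["currency"],
            },
        })
    return {"cardTransactions": transactions}


data = to_card_transactions(io.StringIO(SAMPLE_CSV))
print(json.dumps(data, indent=3))
```

For the real file, the same helper can be called with the handle from open('Test_bulk_transactions_data.csv', newline='') inside a with block. The whole result is still held in memory before it is serialized, which may matter for very large inputs.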