如何在传递给@csv 过滤器之前用嵌套结构展平 JSON

Question

我正在尝试解析一些 JSON，这是 AWS CLI 命令的输出以显示快照。我想将这些数据加载到电子表格中，以便能够对其进行过滤、分组和审核。

我一直对如何将嵌套的 Tags 数组展平到父对象中感到困惑，以便中间可以传递给 @csv 过滤器。

示例如下：

初始输入JSON:

{
  "Snapshots": [
    {
      "SnapshotId": "snap-fff",
      "StartTime": "2014-04-01T06:00:13.000Z",
      "VolumeId": "vol-fff",
      "VolumeSize": 50,
      "Description": "desc1",
      "Tags": [
        {
          "Value": "/dev/sdf",
          "Key": "device"
        },
        {
          "Value": "a name",
          "Key": "Name"
        },
        {
          "Value": "Internal",
          "Key": "Customer"
        },
        {
          "Value": "Demo",
          "Key": "Environment"
        },
        {
          "Value": "Brand 1",
          "Key": "Branding"
        },
        {
          "Value": "i-fff",
          "Key": "instance_id"
        }
      ]
    },
    {
      "SnapshotId": "snap-ccc",
      "StartTime": "2014-07-01T05:59:14.000Z",
      "VolumeId": "vol-ccc",
      "VolumeSize": 8,
      "Description": "B Desc",
      "Tags": [
        {
          "Value": "/dev/sda1",
          "Key": "device"
        },
        {
          "Value": "External",
          "Key": "Customer"
        },
        {
          "Value": "Production",
          "Key": "Environment"
        },
        {
          "Value": "i-ccc",
          "Key": "instance_id"
        },
        {
          "Value": "B Brand",
          "Key": "Branding"
        },
        {
          "Value": "B Name",
          "Key": "Name"
        },
        {
          "Value": "AnotherValue",
          "Key": "AnotherKey"
        }
      ]
    }
  ]
}

期望中级：

[
  {
    "SnapshotId": "snap-fff",
    "StartTime": "2014-04-01T06:00:13.000Z",
    "VolumeId": "vol-fff",
    "VolumeSize": 50,
    "Description": "desc1",
    "device": "/dev/sdf",
    "Name": "a name",
    "Customer": "Internal",
    "Environment": "Demo",
    "Branding": "Brand 1",
    "instance_id": "i-fff",
  }
  {
    "SnapshotId": "snap-ccc",
    "StartTime": "2014-07-01T05:59:14.000Z",
    "VolumeId": "vol-ccc",
    "VolumeSize": 8,
    "Description": "B Desc",
    "device": "/dev/sda1",
    "Customer": "External",
    "Environment": "Production",
    "instance_id": "i-ccc",
    "Branding": "B Brand",
    "Name": "B Name",
    "AnotherKey": "AnotherValue",
  }
]

最终输出：

"SnapshotId","StartTime","VolumeId","VolumeSize","Description","device","Name","Customer","Environment","Branding","instance_id","AnotherKey"
"snap-fff","2014-04-01T06:00:13.000Z","vol-fff",50,"desc1","/dev/sdf","a name","Internal","Demo","Brand 1","i-fff",""
"snap-ccc","2014-07-01T05:59:14.000Z","vol-ccc",8,"B Desc","/dev/sda1","External","Production","i-ccc","B Brand","B Name","AnotherValue"

Answer 1

以下 jq 过滤器生成请求的中间输出：

.Snapshots[] | (. + (.Tags|from_entries)) | del(.Tags)

解释：from_entries将键值对象数组转换为具有给定键值对的对象。这被添加到目标对象，最后 "Tags" 键被删除。

如果 "target" 对象有一个键也出现在 "Tags" 数组中，那么上面的过滤器将优先选择 "Tags" 数组中的值。因此，您可能希望更改“+”操作数的顺序，或以其他方式解决冲突。

如何在传递给@csv 过滤器之前用嵌套结构展平 JSON

How to flatten JSON with nested structure before passing to @csv filter

nested

flatten

jq