颠簸分组在一起

Jolt grouping together

我在 JSON 中有这个输入,我很难将它们组合在一起:

[   
    {
        "PK": "123",
        "SURNAME": "CHEN",
        "SEX": "F",
        "DATE_OF_BIRTH": "1962-08-29 00:00:00.0",
        "PHONE_TYPE": "05",
        "PHONE_NO": "12312312",
        "OPERATION": "INSERT",
    }, {
        "PK": "123",
        "SURNAME": "CHEN",
        "SEX": "F",
        "DATE_OF_BIRTH": "1962-08-29 00:00:00.0",
        "PHONE_TYPE": "04",
        "PHONE_NO": "78787878",
        "OPERATION": "UPDATE"
    },{
        "PK": "456",
        "SURNAME": "DEV",
        "SEX": "M",
        "DATE_OF_BIRTH": "1953-06-06 00:00:00.0",
        "PHONE_TYPE": "05",
        "PHONE_NO": "34343434",
        "OPERATION": "INSERT"
    }, {
        "CLIENT_ID": "456",
        "SURNAME": "DEV",
        "SEX": "M",
        "DATE_OF_BIRTH": "1953-06-06 00:00:00.0",
        "PHONE_TYPE": "02",
        "PHONE_NO": "56565656",
        "OPERATION": "DELETE",
    }
]

这是预期的输出:

{
    "Customers": [{
            "MatchingProfile": {
                "CustomerNumber": "", // leave blank
                "DBType": "Oracle",
                "DBKey": "123",
                "LastName": "CHEN",
                "Gender": "Female",
                "Birthdate": "1962-08-29",
            },
            "Contacts": [{
                    "ContactType": "Fax",
                    "CountryCode": "", // leave blank
                    "Phone_Number": "12312312",
                    "Status": "Active"
                }, {
                    "ContactType": "Mobile",
                    "CountryCode": "", // leave blank
                    "PhoneNumber": "78787878",
                    "Status": "Active"
                }
            ]
        },{
            "MatchingProfile": {
                "CustomerNumber": "", // leave blank
                "DBType": "Oracle",
                "DBKey": "456",
                "LastName": "DEV",
                "Gender": "Male",
                "Birthdate": "1953-06-06",
            },
            "Contacts": [{
                    "ContactType": "Fax",
                    "CountryCode": "", // leave blank
                    "PhoneNumber": "34343434",
                    "Status": "Active"
                }, {
                    "ContactType": "Office",
                    "CountryCode": "", // leave blank
                    "PhoneNumber": "56565656",
                    "Status": "Inactive"
                }
            ]
        }
    ]
}

输入的 SEX 是 "M"、"F" 加上一些其他编码值。输出 Gender 的对应值为 "Male"、"Female",否则为“”(空白)。 (不要指责我性别歧视,我知道,这是项目要求好吗?不是我说的)

来自输入的 "INSERT" 和 "UPDATE" 的 OPERATION 将是相应的 Status:"Active" ;对于 "DELETE" 它将是 Status : "Inactive".

此外,生日输出被截断为 DATE_OF_BIRTH,减去时间。

PHONE_TYPE 如下:02 - "Office"、04 - "Mobile"、05 - "Fax"(我故意遗漏了其他的)。

是否可以在 Jolt 中对此进行映射?你能展示一个规格吗?我是 Jolt 的新手,我有点困惑。这比 Excel Pivot 难 10 倍。

这几乎是 OOTB Jolt 所能达到的最接近了。注意 Jolt 用于更改数据结构,而不是对 "PHONE_TYPE" 之类的内容进行自定义数据映射:“04”表示 "Fax".

转换后的输出

{
  "Customers" : [ {
    "MatchingProfile" : {
      "DBKey" : "123",
      "Gender" : "F",
      "LastName" : "CHEN",
      "Birthdate" : "1962-08-29 00:00:00.0",
      "Contacts" : [ {
        "ContactType" : "05",
        "Phone_Number" : "12312312",
        "Status" : "INSERT"
      }, {
        "ContactType" : "04",
        "Phone_Number" : "78787878",
        "Status" : "UPDATE"
      } ]
    }
  }, {
    "MatchingProfile" : {
      "DBKey" : "456",
      "Gender" : "M",
      "LastName" : "DEV",
      "Birthdate" : "1953-06-06 00:00:00.0",
      "Contacts" : [ {
        "ContactType" : "05",
        "Phone_Number" : "34343434",
        "Status" : "INSERT"
      }, {
        "ContactType" : "02",
        "Phone_Number" : "56565656",
        "Status" : "DELETE"
      } ]
    }
  } ]
}

震动规格

[
  // first pivot by the value of SURNAME
  {
    "operation": "shift",
    "spec": {
      "*": { // for each item in the array
        "SURNAME": { // match SURNAME
          "*": { // match any value of SURNAME
            "@2": "&[]" // copy the whole record from 2 levels up to the SURNAME as an array, so we know that in the next step it is always an array
          }
        }
      }
    }
  },
  {
    "operation": "shift",
    "spec": {
      "*": { // match CHEN or DEV
        "0": {
          // only pull pk, sex, dob from the first entry of the SURNAME array so as to not duplicate output
          "PK": "Customers[#3].MatchingProfile.DBKey",
          "SEX": "Customers[#3].MatchingProfile.Gender",
          "SURNAME": "Customers[#3].MatchingProfile.LastName",
          "DATE_OF_BIRTH": "Customers[#3].MatchingProfile.Birthdate",

          // this does mean that the PHONE_TYPE has to be dealt with twice
          // once for the zeroth item, and then once again for the rest
          "PHONE_TYPE": "Customers[#3].MatchingProfile.Contacts[0].ContactType",
          "PHONE_NO": "Customers[#3].MatchingProfile.Contacts[0].Phone_Number",
          "OPERATION": "Customers[#3].MatchingProfile.Contacts[0].Status"
        },
        "*": {
          // handle PHONE_TYPE and friends for the other records
          "PHONE_TYPE": "Customers[#3].MatchingProfile.Contacts[&1].ContactType",
          "PHONE_NO": "Customers[#3].MatchingProfile.Contacts[&1].Phone_Number",
          "OPERATION": "Customers[#3].MatchingProfile.Contacts[&1].Status"
        }
      }
    }
  }
]

如果您发现 Jolt 对于枢轴和结构变化很有价值,那么您最好的选择是 "fixup" 您的输入数据数组,也就是映射 "PHONE_TYPE": "04" 到 "Fax", trim 来自生日的 00:00:00,然后使用 Jolt 制作嵌套的 "Customers[].MatchingProfile.Contacts[]" 结构。