在击中s3之前如何解密kinesis中的数据

How is data in kinesis decrypted before hitting s3

我目前有一个架构,我的 kinesis -> kinesis firehouse -> s3

我正在使用以下方法直接在运动中创建记录:

 aws kinesis put-record --stream-name <some_kinesis_stream> --partition-key 123 --data testdata --profile sandbox

我运行时的数据:

aws kinesis get-records --shard-iterator --profile 沙箱

看起来像这样:

 {
            "SequenceNumber": "49597697038430366340153578495294928515816248592826368002",
            "ApproximateArrivalTimestamp": 1563835989.441,
            "Data": "eyJrZXkiOnsiZW1wX25vIjo1Mjc2OCwiZGVwdF9ubyI6ImQwMDUifSwidmFsdWUiOnsiYmVmb3JlIjpudWxsLCJhZnRlciI6eyJlbXBfbm8iOjUyNzY4LCJkZXB0X25vIjoiZDAwNSIsImZyb21fZGF0ZSI6Nzk2NSwidG9fZGF0ZSI6MjkzMjUzMX0sInNvdXJjZSI6eyJ2ZXJzaW9uIjoiMC45LjUuRmluYWwiLCJjb25uZWN0b3IiOiJteXNxbCIsIm5hbWUiOiJraW5lc2lzIiwic2VydmVyX2lkIjowLCJ0c19zZWMiOjAsImd0aWQiOm51bGwsImZpbGUiOiJteXNxbC1iaW4tY2hhbmdlbG9nLjAwMDAwMiIsInBvcyI6MTU0LCJyb3ciOjAsInNuYXBzaG90Ijp0cnVlLCJ0aHJlYWQiOm51bGwsImRiIjoiZW1wbG95ZWVzIiwidGFibGUiOiJkZXB0X2VtcCIsInF1ZXJ5IjpudWxsfSwib3AiOiJjIiwidHNfbXMiOjE1NjM4MzEzMTI2Njh9fQ==",
            "PartitionKey": "-591791328"
        }

但在 s3 中,它看起来像:

`testdatatestdatatestdatatestdatatestdatatestdatatestdatatestdata`

因为我运行 putrecords 好几次了。

这是怎么回事?当我 运行 get-records 时,我获得了哪些记录?那是什么数据?这些数据是如何解密成我的原始字符串的?怎么回事?

您似乎已将您的消防水管配置为启用 server-side 数据加密。如果是这种情况,则适用以下规则:

When you configure a Kinesis data stream as the data source of a Kinesis Data Firehose delivery stream, Kinesis Data Firehose no longer stores the data at rest. Instead, the data is stored in the data stream.

When you send data from your data producers to your data stream, Kinesis Data Streams encrypts your data using an AWS Key Management Service (AWS KMS) key before storing the data at rest. When your Kinesis Data Firehose delivery stream reads the data from your data stream, Kinesis Data Streams first decrypts the data and then sends it to Kinesis Data Firehose. Kinesis Data Firehose buffers the data in memory based on the buffering hints that you specify. It then delivers it to your destinations without storing the unencrypted data at rest.

了解更多信息:https://docs.aws.amazon.com/firehose/latest/dev/encryption.html

15 天了,希望你已经找到了答案。

如果不是,那么 get-records 与您在 S3 中看到的数据不匹配的原因似乎是基于您执行 aws kinesis get-records --shard-iterator --profile sandbox 调用的方式,您没有明确提供分片迭代器值。

根据您的 --data testdata put-record 调用,您在 S3 中看到的是正确的和预期的。 testdatatestdatatestdatatestdatatestdatatestdatatestdatatestdata

您在 Kinesis 中看到的是 base64 编码的: "Data": "eyJrZXkiOnsiZW1wX25vIjo1Mjc2OCwiZGVwdF9ubyI6ImQwMDUifSwidmFsdWUiOnsiYmVmb3JlIjpudWxsLCJhZnRlciI6eyJlbXBfbm8iOjUyNzY4LCJkZXB0X25vIjoiZDAwNSIsImZyb21fZGF0ZSI6Nzk2NSwidG9fZGF0ZSI6MjkzMjUzMX0sInNvdXJjZSI6eyJ2ZXJzaW9uIjoiMC45LjUuRmluYWwiLCJjb25uZWN0b3IiOiJteXNxbCIsIm5hbWUiOiJraW5lc2lzIiwic2VydmVyX2lkIjowLCJ0c19zZWMiOjAsImd0aWQiOm51bGwsImZpbGUiOiJteXNxbC1iaW4tY2hhbmdlbG9nLjAwMDAwMiIsInBvcyI6MTU0LCJyb3ciOjAsInNuYXBzaG90Ijp0cnVlLCJ0aHJlYWQiOm51bGwsImRiIjoiZW1wbG95ZWVzIiwidGFibGUiOiJkZXB0X2VtcCIsInF1ZXJ5IjpudWxsfSwib3AiOiJjIiwidHNfbXMiOjE1NjM4MzEzMTI2Njh9fQ==",

所以解码可以让你: { "key": { "emp_no": 52768, "dept_no": "d005" }, "value": { "before": null, "after": { "emp_no": 52768, "dept_no": "d005", "from_date": 7965, "to_date": 2932531 }, "source": { "version": "0.9.5.Final", "connector": "mysql", "name": "kinesis", "server_id": 0, "ts_sec": 0, "gtid": null, "file": "mysql-bin-changelog.000002", "pos": 154, "row": 0, "snapshot": true, "thread": null, "db": "employees", "table": "dept_emp", "query": null }, "op": "c", "ts_ms": 1563831312668 } }

它与您的 "testdata" 不匹配的原因是您在可能错误的分片上查看了错误的分片迭代器。不确定你的运动学设置到底是什么。

把这篇文章看一遍,https://docs.aws.amazon.com/streams/latest/dev/fundamental-stream.html。应该给你测试这个工作流程的步骤。