Is a checkpoint required for the Delta Lake merge operation in a streaming job?
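My understanding is that for a Spark streaming merge it's helpful to specify a checkpoint location, so that data isn't processed twice when the job restarts (even if the operation is idempotent, and even though the example notebook doesn't mention it). Is that correct?
As far as I can tell, if no checkpoint location is specified, all of the data gets reprocessed every time the job restarts.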
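For reference, here is roughly the setup I have in mind, as a minimal sketch rather than the notebook's actual code. The target path, the `id` join key, and the function names `upsert_batch` and `start_merge_stream` are all placeholders I made up for illustration; `batch_df.sparkSession` assumes PySpark 3.3 or later.

```python
TARGET_PATH = "/tmp/delta/target"  # hypothetical location of the Delta table


def upsert_batch(batch_df, batch_id):
    """Merge one micro-batch into the Delta table (called by foreachBatch)."""
    # Imported lazily so the sketch can be loaded without delta-spark installed.
    from delta.tables import DeltaTable

    target = DeltaTable.forPath(batch_df.sparkSession, TARGET_PATH)
    (
        target.alias("t")
        .merge(batch_df.alias("s"), "s.id = t.id")  # "id" is an assumed key column
        .whenMatchedUpdateAll()
        .whenNotMatchedInsertAll()
        .execute()
    )


def start_merge_stream(source_df, checkpoint_path):
    """Start the streaming merge with an explicit checkpoint location."""
    return (
        source_df.writeStream
        .foreachBatch(upsert_batch)
        # Source offsets are tracked here, so a restarted query can resume
        # from where it stopped instead of re-reading the source.
        .option("checkpointLocation", checkpoint_path)
        .outputMode("update")
        .start()
    )
```

I would call it as `start_merge_stream(spark.readStream.format("delta").load(source_path), "/tmp/checkpoints/merge")`, where both paths are again placeholders. My assumption is that, since `foreachBatch` is at-least-once, a batch can still be replayed after a failure even with the checkpoint, which is why the merge being idempotent still matters.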