Scala 数据框 - github 上 explode 的 spark/scala 数据框源代码在哪里
Scala dataframe - where is the spark/ scala dataframe source code for explode on github
如 this article, . Without moving to higher spark versions, alternate methods to improve it are also slow. Since the issue has been fixed in later versions of spark, one approach would be to copy the fixed source code. In looking for the source code, I found a reference to explode in functions, but, I do not know how to track the function further 中所述。我如何在新的 spark 源代码中找到用于工作 Explode 的源代码 - 所以,我可以使用它而不是当前版本的 explode?
https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/generators.scala 是 link 我认为你正在寻找
在看到爆炸函数只有 def explode(e: Column): Column = withExpr { Explode(e.expr) }
之后,我通过在 https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/functions.scala 中展开所有 import org.apache._
导入找到了它
如果你想导入底层的 Explode 函数,我相信直接导入会是 import org.apache.spark.sql.catalyst.expressions.Explode
如 this article,
https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/generators.scala 是 link 我认为你正在寻找
在看到爆炸函数只有 def explode(e: Column): Column = withExpr { Explode(e.expr) }
import org.apache._
导入找到了它
如果你想导入底层的 Explode 函数,我相信直接导入会是 import org.apache.spark.sql.catalyst.expressions.Explode