PySpark 中的回归。使用哪个库

Regression in PySpark. Which library to Use

"pyspark.mllib.regression"和"pyspark.ml.regression"有什么区别

应该使用哪一个

这取决于您的 spark 版本。官方建议ml。

As of Spark 2.0, the RDD-based APIs in the spark.mllib package have entered maintenance mode. The primary Machine Learning API for Spark is now the DataFrame-based API in the spark.ml package.

希望这会有所帮助!