如何在 Amazon EMR 5.16.0 上为 Zeppelin 0.7.3 安装 R 解释器

How can I install the R interpreter for Zeppelin 0.7.3 on Amazon EMR 5.16.0

当我使用 emr-5.16.0 版本创建 EMR 集群并包含 Zeppelin 时,它会随之安装 R,但我似乎无法加载解释器。即使我 运行 "sudo bash bin/install-interpreter.sh -a" 它也没有出现。

终于想通了。从 5.16.0 开始,EMR 不支持开箱即用的 Zeppelin 中的 R,如记录 here

通过设置不包含 Zeppelin 的 EMR,我能够从源代码构建 Zeppelin,并且 运行 在 SSH 进入主节点时执行以下操作:

sudo yum -y update
sudo yum -y install R R-devel libcurl-devel openssl-devel git
sudo R -e "install.packages('devtools', repos = 'http://cran.us.r-project.org')"
sudo R -e "install.packages('sparklyr', repos = 'http://cran.us.r-project.org')"
sudo R -e "install.packages('evaluate', repos = 'http://cran.us.r-project.org')"
sudo R -e "install.packages('knitr', repos = 'http://cran.us.r-project.org')"
sudo R -e "install.packages('ggplot2', repos = 'http://cran.us.r-project.org')"
sudo R -e "install.packages(c('devtools','mplot', 'googleVis'), repos = 'http://cran.us.r-project.org');
require(devtools); install_github('ramnathv/rCharts')"


mkdir build
cd build
wget http://www.eu.apache.org/dist/maven/maven-3/3.3.9/binaries/apache-maven-3.3.9-bin.tar.gz
sudo tar -zxf apache-maven-3.3.9-bin.tar.gz -C /usr/local/
sudo ln -s /usr/local/apache-maven-3.3.9/bin/mvn /usr/local/bin/mvn

git clone https://github.com/apache/zeppelin.git
cd zeppelin
git checkout tags/v0.8.0

mvn clean package -DskipTests -Pscala-2.11 -Pr -Dspark.version=2.2.0 -DHadoop.version=2.7.7
#NEED TO CHANGE PORT IN CONFIG
#need to set export SPARK_HOME=/usr/lib/spark in zeppelin-env.sh

./bin/zeppelin-daemon.sh start