如何将保存为 RData 的数据框从 R 导入到 pandas?
How can I import a data frame from R saved as RData to pandas?
我正在尝试将保存为 RData 的数据框从 R 导入到 pandas 数据框。我该怎么做?我尝试使用 rpy2 失败,如下所示:
import pandas as pd
from rpy2.robjects import r
from rpy2.robjects import pandas2ri
pandas2ri.activate()
# I use iris for convenience but I could have done r.load('my_data.RData')
print(r.data('iris'))
print(r['iris'].head())
print(type(r.data('iris')))
print(pandas2ri.ri2py_dataframe(r.data('iris')))
print(pandas2ri.ri2py(r.data('iris')))
print(pd.DataFrame(r.data('iris')))
输出:
[1] "iris"
Sepal.Length Sepal.Width Petal.Length Petal.Width Species
1 5.1 3.5 1.4 0.2 setosa
2 4.9 3.0 1.4 0.2 setosa
3 4.7 3.2 1.3 0.2 setosa
4 4.6 3.1 1.5 0.2 setosa
5 5.0 3.6 1.4 0.2 setosa
<class 'rpy2.robjects.vectors.StrVector'>
0 1 2 3
0 i r i s
['iris']
我用 pandas 0.20.1 + python 3.6 x64 + Windows 7.
数据帧的广义转换被证明是一项昂贵的操作,因为某些类型的列需要复制。本地转换规则可能会更好:
from rpy2.robjects import pandas2ri
from rpy2.robjects import default_converter
from rpy2.robjects.conversion import localconverter
print(r.data('iris'))
with localconverter(default_converter + pandas2ri.converter) as cv:
pd_iris = r('iris')
# this is a pandas DataFrame
pd_iris
否则,下面是"just working"这一端(Linux,默认为rpy2的分支头):
import pandas as pd
from rpy2.robjects import r
from rpy2.robjects import pandas2ri
pandas2ri.activate()
pd_iris = r('iris')
pd_iris
如果它不适合您,则 Windows 上的 rpy2 可能存在问题(还有另一个问题 - Windows 上不完全支持 rpy2)。
我正在尝试将保存为 RData 的数据框从 R 导入到 pandas 数据框。我该怎么做?我尝试使用 rpy2 失败,如下所示:
import pandas as pd
from rpy2.robjects import r
from rpy2.robjects import pandas2ri
pandas2ri.activate()
# I use iris for convenience but I could have done r.load('my_data.RData')
print(r.data('iris'))
print(r['iris'].head())
print(type(r.data('iris')))
print(pandas2ri.ri2py_dataframe(r.data('iris')))
print(pandas2ri.ri2py(r.data('iris')))
print(pd.DataFrame(r.data('iris')))
输出:
[1] "iris"
Sepal.Length Sepal.Width Petal.Length Petal.Width Species
1 5.1 3.5 1.4 0.2 setosa
2 4.9 3.0 1.4 0.2 setosa
3 4.7 3.2 1.3 0.2 setosa
4 4.6 3.1 1.5 0.2 setosa
5 5.0 3.6 1.4 0.2 setosa
<class 'rpy2.robjects.vectors.StrVector'>
0 1 2 3
0 i r i s
['iris']
我用 pandas 0.20.1 + python 3.6 x64 + Windows 7.
数据帧的广义转换被证明是一项昂贵的操作,因为某些类型的列需要复制。本地转换规则可能会更好:
from rpy2.robjects import pandas2ri
from rpy2.robjects import default_converter
from rpy2.robjects.conversion import localconverter
print(r.data('iris'))
with localconverter(default_converter + pandas2ri.converter) as cv:
pd_iris = r('iris')
# this is a pandas DataFrame
pd_iris
否则,下面是"just working"这一端(Linux,默认为rpy2的分支头):
import pandas as pd
from rpy2.robjects import r
from rpy2.robjects import pandas2ri
pandas2ri.activate()
pd_iris = r('iris')
pd_iris
如果它不适合您,则 Windows 上的 rpy2 可能存在问题(还有另一个问题 - Windows 上不完全支持 rpy2)。