Pandas to_dict 使用 outtype='records' 更改索引类型

Question

我正尝试在以下 DataFrame 上调用 to_dict 函数：

导入 pandas 作为 pd

数据 = {"a": [1,2,3,4,5], "b": [90,80,40,60,30]}

df = pd.DataFrame(数据)

df.reset_index().to_dict("r")

[{'a': 1, 'b': 90, 'index': 0},
 {'a': 2, 'b': 80, 'index': 1},
 {'a': 3, 'b': 40, 'index': 2},
 {'a': 4, 'b': 60, 'index': 3},
 {'a': 5, 'b': 30, 'index': 4}]

但是，如果我对数据帧执行浮点运算，就会出现我的问题，这会将索引变为浮点数：

(df*1.0).reset_index().to_dict("r")

[{'a': 1.0, 'b': 90.0, 'index': 0.0},  
{'a': 2.0, 'b': 80.0, 'index': 1.0},  
{'a': 3.0, 'b': 40.0, 'index': 2.0},  
{'a': 4.0, 'b': 60.0, 'index': 3.0},  
{'a': 5.0, 'b': 30.0, 'index': 4.0}]

任何人都可以解释上述行为或推荐解决方法，或验证这是否可能是 pandas 错误？ to_dict 方法中其他外类型的 None 会改变索引，如上所示。

我已经在 pandas 0.14 和 0.18（最新）

上复制了这个

非常感谢！

Answer 1

此问题已在 github here

上得到解答

我会在这里传达答案，这样问题可能会被标记为已解决并从未回答的 pandas 问题列表中移出。

来自Github：

Nothing to do with the index, just the fact that you have any float dtypes in the data

If you look at the code, we use DataFrame.values, which returns a NumPy array, which must have a single dtype (float64 in this case).

--TomAugspurger

该问题的解决方法是：

[x._asdict() for x in df.itertuples()]

生成 OrderedDict 对象列表

[OrderedDict([('Index', 0), ('a', 1.0), ('b', 90)]),
 OrderedDict([('Index', 1), ('a', 2.0), ('b', 80)]),
 OrderedDict([('Index', 2), ('a', 3.0), ('b', 40)]),
 OrderedDict([('Index', 3), ('a', 4.0), ('b', 60)]),
 OrderedDict([('Index', 4), ('a', 5.0), ('b', 30)])]

Pandas to_dict 使用 outtype='records' 更改索引类型

Pandas to_dict changes index type with outtype='records'

python

indexing

records

pandas