Upload Microsoft Excel Workbook with Many Sheets into Azure ML Studio

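I want to upload my Excel workbook to Azure Machine Learning Studio, because it contains data that I want to join with my other .csv files to create a training dataset. When I go to upload the Excel file, the dialog does not offer .xlsx or .xls, only other extensions such as .csv and .txt.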

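I uploaded it anyway, but now I get strange characters. How can I upload an Excel workbook and get at its sheets, so that I can join the data and do my data preparation? Any suggestions?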

You can save your workbook as (a set of) CSV files and upload them separately.

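A CSV file, a "Comma Separated Values" file, is exactly that: a flat file containing values separated by commas. If you load an Excel file as if it were CSV, the result is garbled, because an Excel file holds much more information than just comma-separated values. In Excel, go to File -> Save As -> Save as type, where you can select 'CSV (Comma delimited) (*.csv)'. This saves only the active sheet, so repeat it for each sheet you need.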
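For a workbook with many sheets, saving each one by hand gets tedious. A minimal sketch of automating the export with pandas follows, assuming pandas and openpyxl are installed locally; the function names `sheet_to_filename` and `export_sheets` and the output naming scheme are my own choices for illustration, not anything Azure ML Studio requires.

```python
import re
from pathlib import Path
from typing import List


def sheet_to_filename(workbook_stem: str, sheet_name: str) -> str:
    """Build a file-system-safe CSV name, e.g. 'sales_Q1_2020.csv'."""
    # Replace runs of characters other than letters, digits, '_' and '-'.
    safe = re.sub(r"[^\w-]+", "_", sheet_name).strip("_")
    return f"{workbook_stem}_{safe}.csv"


def export_sheets(workbook_path: str, out_dir: str = ".") -> List[Path]:
    """Write every sheet of the workbook to its own UTF-8 CSV file."""
    # Imported here so sheet_to_filename stays usable without pandas.
    # Reading .xlsx files additionally requires the openpyxl package.
    import pandas as pd

    workbook = Path(workbook_path)
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)

    # sheet_name=None loads all sheets into a dict {sheet name: DataFrame}.
    sheets = pd.read_excel(workbook, sheet_name=None)

    written = []
    for name, frame in sheets.items():
        csv_path = target / sheet_to_filename(workbook.stem, name)
        # Write explicit UTF-8, without the DataFrame's row index column.
        frame.to_csv(csv_path, index=False, encoding="utf-8")
        written.append(csv_path)
    return written
```

Calling `export_sheets("workbook.xlsx", "csv_out")` (a hypothetical file name) should produce one CSV per sheet, each of which can then be uploaded as a separate dataset.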

Disclaimer: no, it isn't always a comma...

In addition, the term "CSV" also denotes some closely related delimiter-separated formats that use different field delimiters. These include tab-separated values and space-separated values. A delimiter that is not present in the field data (such as tab) keeps the format parsing simple. These alternate delimiter-separated files are often even given a .csv extension despite the use of a non-comma field separator.
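Because a file named .csv may use a semicolon or tab instead, it can help to check the delimiter before uploading. The sketch below uses `csv.Sniffer` from the Python standard library; the helper name `detect_delimiter` and the candidate list are assumptions for illustration, and the sniffer is a heuristic that can fail on short or irregular samples.

```python
import csv


def detect_delimiter(sample: str, candidates: str = ",;\t|") -> str:
    """Guess the field delimiter from a sample of a delimited text file.

    Raises csv.Error if no candidate delimiter fits the sample.
    """
    dialect = csv.Sniffer().sniff(sample, delimiters=candidates)
    return dialect.delimiter
```

In practice you would pass in the first few kilobytes of the file, for example `detect_delimiter(open(path, encoding="utf-8").read(4096))`, and then choose the matching separator option when importing the data.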

Edit
Apparently Excel files are supported: Supported data sources for Azure Machine Learning data preparation

Excel (.xls/.xlsx)
Read an Excel file one sheet at a time by specifying the sheet name or number.

Also, only UTF-8 is supported: Import Data - Technical notes

Azure Machine Learning requires UTF-8 encoding. If the data you are importing uses a different encoding, or was exported from a data source that uses a different default encoding, various problems might appear in the text.
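This may explain the strange characters in the question. If a CSV was exported in a different encoding, it can be rewritten as UTF-8 before upload. The following is a minimal sketch using only the standard library; the helper name `convert_to_utf8` is hypothetical, and the default source encoding `cp1252` is an assumption (a common code page on Western-locale Windows) that you would need to adjust to match your actual file.

```python
def convert_to_utf8(src: str, dst: str, source_encoding: str = "cp1252") -> None:
    """Rewrite a text file as UTF-8, decoding it with source_encoding.

    source_encoding is an assumption about the input; a wrong guess
    either raises UnicodeDecodeError or silently produces wrong characters.
    """
    # newline="" disables newline translation, so line endings are preserved.
    with open(src, "r", encoding=source_encoding, newline="") as fin, \
            open(dst, "w", encoding="utf-8", newline="") as fout:
        for line in fin:
            fout.write(line)
```

For example, `convert_to_utf8("sheet1.csv", "sheet1_utf8.csv")` (hypothetical file names) writes a UTF-8 copy and leaves the original untouched.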