MlContext 不了解我的温度类型

MlContext doesn't understand the type of my temperature

我是 ML.NET 的初学者,我的数据有点问题。当我把它们放在 mlContext.Fit(...);这是我收到的错误:

Column 'Temperature' has values of I4which is not the same as earlier observed type of R4.

这是我的代码:

try
            {
                var mlContext = new MLContext();
                var reader = mlContext.Data.CreateTextReader<TrainData>(separatorChar: ',', hasHeader: false);
                var trainData = _context.Datas.Last();
                IDataView trainingdataView = reader.Read(Path.Combine(hostingEnvironment.WebRootPath, "data010220192341.txt"));
                var pipeline = mlContext.Transforms.Conversion.MapValueToKey("Delay")
                    .Append(mlContext.Transforms.Categorical.OneHotEncoding("StationDepart"))
                    .Append(mlContext.Transforms.Categorical.OneHotEncoding("StationArrival"))
                    .Append(mlContext.Transforms.Categorical.OneHotEncoding("Day"))
                    .Append(mlContext.Transforms.Categorical.OneHotEncoding("Train"))
                    .Append(mlContext.Transforms.Categorical.OneHotEncoding("WeatherText"))
                    .Append(mlContext.Transforms.Categorical.OneHotEncoding("HasPrecipitation"))
                    .Append(mlContext.Transforms.Categorical.OneHotEncoding("PrecipitationType"))
                    .Append(mlContext.Transforms.Concatenate("Features", "StationDepart", "StationArrival", "Day", "Train", "WeatherText", "Temperature", "Humidity", "HasPrecipitation", "PrecipitationType", "Time"))
                    .Append(mlContext.MulticlassClassification.Trainers.StochasticDualCoordinateAscent(labelColumn: "Delay", featureColumn: "Features"))
                    .Append(mlContext.Transforms.Conversion.MapKeyToValue("PredictedTime"));
                var model = pipeline.Fit(trainingdataView);
                var prediction = model.CreatePredictionEngine<TrainData, TrainPrediction>(mlContext).Predict(
                    new TrainData()
                    {
                        StationDepart = "Charleroi-Sud",
                        StationArrival = "Mons",
                        Day = "Friday",
                        Train = "BE.NMBS.IC3825",
                        WeatherText = "Partly cloudy",
                        Temperature = -1,
                        Humidity=0,
                        HasPrecipitation = false,
                        PrecipitationType=null,
                        Time=0444
                    });
                return prediction.PredictedTime.ToString();
            }
            catch (Exception e)
            {
                return e.Message;
            }

因此,当我在文本文件中获取数据时,我对字符串列进行编码,然后尝试训练模型,但此时我收到了错误。 我的数据是:

Charleroi-Sud,Mons,Thursday,BE.NMBS.IC3831,Partly sunny,-2,0,False,,1044,0 Charleroi-Sud,Mons,Thursday,BE.NMBS.IC932,Mostly cloudy,-2,0,False,,1112,0 Charleroi-Sud,Mons,Thursday,BE.NMBS.IC3832,Mostly cloudy,-1,0,False,,1144,0 Charleroi-Sud,Mons,Thursday,BE.NMBS.IC933,Cloudy,-1,0,False,,1212,0 Charleroi-Sud,Mons,Thursday,BE.NMBS.IC3842,Mostly cloudy,-1,0,False,,2144,0 Charleroi-Sud,Mons,Thursday,BE.NMBS.IC943,Mostly cloudy,-1,0,False,,2212,0 Charleroi-Sud,Mons,Thursday,BE.NMBS.IC3843,Mostly cloudy,-1,0,False,,2247,0 Charleroi-Sud,Mons,Friday,BE.NMBS.IC3825,Partly cloudy,-1,0,False,,0444,0 Charleroi-Sud,Mons,Friday,BE.NMBS.IC3826,Mostly cloudy,-1,0,False,,0544,0 Charleroi-Sud,Mons,Friday,BE.NMBS.IC927,Mostly cloudy,-1,0,False,,0612,0

如您所见,数据和温度之间的每个“,”都是一个整数。 在 TrainData 中,这样做是这样的:

public class TrainData
    {
        [LoadColumn(0)]
        public string StationDepart { get; set; }
        [LoadColumn(1)]
        public string StationArrival { get; set; }
        [LoadColumn(2)]
        public string Day { get; set; }
        [LoadColumn(3)]
        public string Train { get; set; }
        [LoadColumn(4)]
        public string WeatherText { get; set; }
        [LoadColumn(5)]
        public int Temperature { get; set; }
        [LoadColumn(6)]
        public int Humidity { get; set; }
        [LoadColumn(7)]
        public bool HasPrecipitation { get; set; }
        [LoadColumn(8)]
        public string PrecipitationType { get; set; }
        [LoadColumn(9)]
        public int Time { get; set; }
        [LoadColumn(10)]
        public int Delay { get; set; }
    }

问题是时间、延迟和温度需要是浮点数而不是整数。

public class TrainData
    {
        [LoadColumn(0)]
        public string StationDepart { get; set; }
        [LoadColumn(1)]
        public string StationArrival { get; set; }
        [LoadColumn(2)]
        public string Day { get; set; }
        [LoadColumn(3)]
        public string Train { get; set; }
        [LoadColumn(4)]
        public string WeatherText { get; set; }
        [LoadColumn(5)]
        public float Temperature { get; set; }
        [LoadColumn(6)]
        public float Humidity { get; set; }
        [LoadColumn(7)]
        public bool HasPrecipitation { get; set; }
        [LoadColumn(8)]
        public string PrecipitationType { get; set; }
        [LoadColumn(9)]
        public float Time { get; set; }
        [LoadColumn(10)]
        public float Delay { get; set; }
    }