Azure ML Web 服务超时

Question

我在 Azure ML 中创建了一个简单的实验并使用 http 客户端触发它。在 Azure ML 工作区中，执行时一切正常。但是，当我使用 http 客户端触发实验时，实验超时并失败。为 http 客户端设置超时值似乎不起作用。

有什么方法可以设置这个超时值，使实验不会失败？

Answer 1

确保您正确设置了客户端超时值。如果支持 Web 服务的服务器超时，那么它将发回带有 HTTP 状态代码 504 BackendScoreTimeout（或可能 409 GatewayTimeout）的响应。但是，如果您根本没有收到回复，那么您的客户等待的时间还不够长。

您可以通过运行在 ML Studio 中进行实验来找出大量时间。转到实验属性以找出它运行持续了多长时间，然后将大约两倍的时间作为超时值。

Answer 2

我在作为 Web 服务发布的 Azure ML 实验中遇到了类似的问题。大多数情况下运行正常，但有时会返回超时错误。问题是实验本身有90秒运行的时间限制。因此，很可能您的实验有运行时间超过此限制并且 returns 出现超时错误。 hth

Answer 3

看起来无法根据 a feature request that is still marked as "planned" as of 4/1/2018 设置此超时。

MSDN forums from 2017的建议是使用Batch Execution Service，启动机器学习实验，然后异步询问是否完成。

这是来自 Azure ML Web 服务管理示例代码的代码片段（所有注释均来自其示例代码）：

        using (HttpClient client = new HttpClient())
        {
            var request = new BatchExecutionRequest()
            {

                Outputs = new Dictionary<string, AzureBlobDataReference> () {
                    {
                        "output",
                        new AzureBlobDataReference()
                        {
                            ConnectionString = storageConnectionString,
                            RelativeLocation = string.Format("{0}/outputresults.file_extension", StorageContainerName) /*Replace this with the location you would like to use for your output file, and valid file extension (usually .csv for scoring results, or .ilearner for trained models)*/
                        }
                    },
                },    

                GlobalParameters = new Dictionary<string, string>() {
                }
            };

            client.DefaultRequestHeaders.Authorization = new AuthenticationHeaderValue("Bearer", apiKey);

            // WARNING: The 'await' statement below can result in a deadlock
            // if you are calling this code from the UI thread of an ASP.Net application.
            // One way to address this would be to call ConfigureAwait(false)
            // so that the execution does not attempt to resume on the original context.
            // For instance, replace code such as:
            //      result = await DoSomeTask()
            // with the following:
            //      result = await DoSomeTask().ConfigureAwait(false)

            Console.WriteLine("Submitting the job...");

            // submit the job
            var response = await client.PostAsJsonAsync(BaseUrl + "?api-version=2.0", request);

            if (!response.IsSuccessStatusCode)
            {
                await WriteFailedResponse(response);
                return;
            }

            string jobId = await response.Content.ReadAsAsync<string>();
            Console.WriteLine(string.Format("Job ID: {0}", jobId));

            // start the job
            Console.WriteLine("Starting the job...");
            response = await client.PostAsync(BaseUrl + "/" + jobId + "/start?api-version=2.0", null);
            if (!response.IsSuccessStatusCode)
            {
                await WriteFailedResponse(response);
                return;
            }

            string jobLocation = BaseUrl + "/" + jobId + "?api-version=2.0";
            Stopwatch watch = Stopwatch.StartNew();
            bool done = false;
            while (!done)
            {
                Console.WriteLine("Checking the job status...");
                response = await client.GetAsync(jobLocation);
                if (!response.IsSuccessStatusCode)
                {
                    await WriteFailedResponse(response);
                    return;
                }

                BatchScoreStatus status = await response.Content.ReadAsAsync<BatchScoreStatus>();
                if (watch.ElapsedMilliseconds > TimeOutInMilliseconds)
                {
                    done = true;
                    Console.WriteLine(string.Format("Timed out. Deleting job {0} ...", jobId));
                    await client.DeleteAsync(jobLocation);
                }
                switch (status.StatusCode) {
                    case BatchScoreStatusCode.NotStarted:
                        Console.WriteLine(string.Format("Job {0} not yet started...", jobId));
                        break;
                    case BatchScoreStatusCode.Running:
                        Console.WriteLine(string.Format("Job {0} running...", jobId));
                        break;
                    case BatchScoreStatusCode.Failed:
                        Console.WriteLine(string.Format("Job {0} failed!", jobId));
                        Console.WriteLine(string.Format("Error details: {0}", status.Details));
                        done = true;
                        break;
                    case BatchScoreStatusCode.Cancelled:
                        Console.WriteLine(string.Format("Job {0} cancelled!", jobId));
                        done = true;
                        break;
                    case BatchScoreStatusCode.Finished:
                        done = true;
                        Console.WriteLine(string.Format("Job {0} finished!", jobId));
                        ProcessResults(status);
                        break;
                }

                if (!done) {
                    Thread.Sleep(1000); // Wait one second
                }
            }
        }

Azure ML Web 服务超时

Azure ML web service times out

c#

azure

azure-machine-learning-studio