Ignite SqlQuery 超时和取消 - If/when 会抛出 QueryCancelledException

Ignite SqlQuery timeout & cancellation - If/when does QueryCancelledException get thrown

正如该文档建议的那样,可以通过设置 https://ignite.apache.org/releases/2.4.0/javadoc/org/apache/ignite/cache/query/SqlQuery.html#setTimeout-int-java.util.concurrent.TimeUnit-

在执行 SqlQuery 时设置超时

QueryCancelledException 的文档还提到,如果查询在执行时被取消或超时,则会抛出已检查的异常,https://ignite.apache.org/releases/2.4.0/javadoc/org/apache/ignite/cache/query/QueryCancelledException.html

这里提到的也是cancel/timeout长运行查询的一种方式, https://apacheignite-sql.readme.io/v2.4/docs/query-cancellation

但奇怪的是 java 所有 IgniteCache.query(..) 方法的文档,https://ignite.apache.org/releases/2.4.0/javadoc/org/apache/ignite/IgniteCache.html#query-org.apache.ignite.cache.query.Query- 没有声明这个已检查的异常或任何已检查的异常被抛出(与 QueryCursor.getAll() 方法相同)导致混淆在何处以及如何编码查询超时处理。

我编写了以下代码,但无法使查询超时以快速测试我的代码路径部分并查看其是否正确。我希望在 IgniteCache.query(..) 方法和 QueryCursor.getAll() 及其相关方法中都会抛出异常。

显然 SqlQuery.setTimeout(int timeout, TimeUnit timeUnit) 的最小超时粒度是 TimeUnit.MILLISECONDS,我在初始测试期间意识到这使得强制测试超时变得更加困难。

下面的代码看起来正确吗? (我想避免使用游标方法并依赖在 try-with-resources 中调用的 IgniteCache.query(..) 来检测超时)。这行得通吗?

@Scheduled(fixedDelayString = "${checkInterval}", initialDelayString = "${checkDelay}")
private final void monitorHealth() {
    if(!isReady) {
        return;
    }
    try (QueryCursor<Entry<Integer, FabricInfo>> cursor = fabricInfoCache.query(SQL_QUERY)) {
        cursor.iterator();
        // Reset the query time out counter..
        if(retryCount != 0) {
            retryCount = 0;
            LOGGER.warn("Client health check query executed without getting timed out before the configured maximum number of timeout retries was reached. Reseting retryCount to zero.");
        }
    } catch (Exception e) {
        if(e.getCause() instanceof QueryCancelledException) {
            retryCount++;
            LOGGER.warn("Client health check query timed out for the {} time.", retryCount);

            if(retryCount > QUERY_MAX_RETRIES_VALUE) {
                // Query timed out the maximum number of times..
                LOGGER.error("Client health check query timed out repeatedly for the maximum number of times configured : {}. Initating a disconnect-reconnect.", retryCount);
                reconnectAction();
            }
        } else {
            if (e.getCause() instanceof IgniteClientDisconnectedException) {
                LOGGER.error("Client health check query failed due to client node getting disconnected from cluster. Initating a disconnect-reconnect.", e.getCause());
            } else {
                // Treat other failures like CacheStoppedException, etc same as IgniteClientDisconnectedException...
                LOGGER.error("Client health check query failed. Initating a disconnect-reconnect.", e.getCause());
            }
            reconnectAction();
        }
    }
}

谢谢 穆图

QueryCancelledException is thrown from methods of QueryCursor, wrapped into IgniteException,是RuntimeException的子类。

调用IgniteCache#query(...)方法后,查询并没有立即执行。只有在调用 QueryCursor#iterator() 方法时才会发生。

例如,您可以查看 Ignite 项目中的以下测试,该测试检查是否遵守查询取消和超时:IgniteCacheLocalQueryCancelOrTimeoutSelfTest