如何使用 return 一个 Mono 的生成包装调用来创建 Flux
How do I create a Flux using generate wrapping calls that return a Mono
我有一个我想使用 Flux.generate 的示例,因为我不想进行昂贵的阻塞调用,除非/直到订阅者要求它。具体来说,我多次调用 Elasticsearch(有效地进行分页)直到没有更多的匹配。我在 Iterator<SearchResponse>
中使用标准阻塞调用实现了这一点。每次调用生成 lambda 块和通量然后以 .subscribeOn(Schedulers.boundedElastic())
结束。但是,我想使用 Spring 的 ReactiveElasticsearchClient
,return 是一个 Mono<SearchResponse>
,但我仍然想一次一个地使用它。
这是之前使用分块的代码:
public Iterator<SearchResponse> createDeepQueryIterator(@NonNull PITSearchInput input){
return new PointInTimeIterator(elasticClient, input);
}
public Flux<SearchResponse> createDeepQueryFlux(@NonNull PITSearchInput input){
return Flux.<SearchResponse, PointInTimeIterator>generate(
() -> new PointInTimeIterator(elasticClient, input),
(deepQueryIterator, sink) -> {
if (deepQueryIterator.hasNext()) {
sink.next(deepQueryIterator.next());
}else{
sink.complete();
}
return deepQueryIterator;
},
(deepQueryIterator) -> deepQueryIterator.shutdown())
.subscribeOn(Schedulers.boundedElastic());
}
上面的代码运行良好,因为它会等待对 ES 进行下一次调用,直到订阅者准备好接收下一个数据块。
在下面我尝试使用 Spring 的 ReactiveElasticsearchClient
但问题是在订阅者处理第一个之前对 ES 进行了多次调用。
public Flux<SearchResponse> createDeepQuery(PointInTimeIteratorFactory.PITSearchInput input) {
log.info("Creating flux");
AtomicReference<PitId> pitId = new AtomicReference<>();
AtomicInteger count = new AtomicInteger();
Mono<PitId> pitIdMono =
Mono.fromCallable(
() -> {
pitId.set(createPIT(input));
return pitId.get();
})
.subscribeOn(Schedulers.boundedElastic());
Mono<SearchResponse> searchResponseMono =
pitIdMono.flatMap(
p -> {
log.info("Calling search");
return reactiveElasticsearchClient.searchForResponse(createSearchRequestFrom(p, input));
});
Flux<SearchResponse> expand =
searchResponseMono
.expand(
(searchResponse -> {
int hitCount = searchResponse.getHits().getHits().length;
count.addAndGet(hitCount);
log.info("Previous returned {} hits totaling {}", hitCount, count.get());
if (count.get() > input.getMaxTotalSize()
|| hitCount < input.getMaxSizePerQuery()){
log.info("Returning empty");
return Mono.empty();
}
log.info("Calling search");
pitId.set(new PitId(searchResponse.pointInTimeId()));
return reactiveElasticsearchClient.searchForResponse(
createSearchRequestFrom(searchResponse, input));
}))
.doFinally(
p -> {
deletePIT(pitId.get());
});
return expand;
}
所以问题不是使用反应式客户端的能力来 return 在 Flux
中 Mono<SearchResponse>
而是只在需要时一次这样做订户。
下面是来自上面 Flux -> Mono 方法的日志记录,PitTest
日志记录来自 flux 的 onNext() 测试。
2021-12-02 13:13:37.300 INFO 13704 --- [ main] a.a.t.ReactivePointInTimeIteratorFactory : Creating flux
2021-12-02 13:13:37.346 INFO 13704 --- [oundedElastic-1] a.a.t.ReactivePointInTimeIteratorFactory : Creating PIT
2021-12-02 13:13:37.407 INFO 13704 --- [oundedElastic-1] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.176 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 50
2021-12-02 13:13:38.177 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.177 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Setting searchAfter to 1634877306267
2021-12-02 13:13:38.228 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 100
2021-12-02 13:13:38.228 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.228 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Setting searchAfter to 1634877606162
2021-12-02 13:13:38.271 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 150
2021-12-02 13:13:38.271 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.272 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Setting searchAfter to 1634877606362
2021-12-02 13:13:38.311 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 200
2021-12-02 13:13:38.312 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.312 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Setting searchAfter to 1634877906244
2021-12-02 13:13:38.344 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 250
2021-12-02 13:13:38.345 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Returning empty
2021-12-02 13:13:38.345 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Closing PIT ReactivePointInTimeIteratorFactory.PitId(id=m_2xAwENYWN0aXZpdHlzdG9yZRZQQkRGWldmclI2cWZITEpoWDI1cGlRABZCZU8xbm55ZlFabXREYmNEdThESG1RAAAAAAAAWQcTFm5BcXdPU2xTUWE2bEU4dkVPVkpkWFEBFlBCREZaV2ZyUjZxZkhMSmhYMjVwaVEAAA==)
2021-12-02 13:13:40.171 INFO 13704 --- [ parallel-1] p.actss.activity.store.PitTest : [1634877306066]
2021-12-02 13:13:42.172 INFO 13704 --- [ parallel-2] p.actss.activity.store.PitTest : [1634877306272]
2021-12-02 13:13:44.172 INFO 13704 --- [ parallel-3] p.actss.activity.store.PitTest : [1634877606166]
2021-12-02 13:13:46.173 INFO 13704 --- [ parallel-4] p.actss.activity.store.PitTest : [1634877906057]
2021-12-02 13:13:48.174 INFO 13704 --- [ parallel-1] p.actss.activity.store.PitTest : [1634877906248]
2021-12-02 13:13:48.174 INFO 13704 --- [ parallel-1] p.actss.activity.store.PitTest : Complete
2021-12-02 13:13:48.174 INFO 13704 --- [ main] p.actss.activity.store.PitTest : blah
2021-12-02 13:13:48.175 INFO 13704 --- [ parallel-1] p.actss.activity.store.PitTest : onComplete
更新:为完整性添加 PitTest 代码:
@Test
void testReactoiveFluxIt() throws InterruptedException {
Flux<SearchResponse> deepQuery = reactivePointInTimeIteratorFactory.createDeepQuery(...);
deepQuery
.delayElements(Duration.ofMillis(2000))
.doOnNext(p -> log.info(Arrays.toString(p.getHits().getHits()[0].getSortValues()))) //
.doOnComplete(() -> log.info("Complete")) //
.doFinally(p -> log.info(p.toString()))
.blockLast();
log.info("blah");
Thread.sleep(5000);
}
delayElements
切换到并行调度程序并将每个发出的元素延迟 2 秒。这就是排序值在之后打印的原因。
我有一个我想使用 Flux.generate 的示例,因为我不想进行昂贵的阻塞调用,除非/直到订阅者要求它。具体来说,我多次调用 Elasticsearch(有效地进行分页)直到没有更多的匹配。我在 Iterator<SearchResponse>
中使用标准阻塞调用实现了这一点。每次调用生成 lambda 块和通量然后以 .subscribeOn(Schedulers.boundedElastic())
结束。但是,我想使用 Spring 的 ReactiveElasticsearchClient
,return 是一个 Mono<SearchResponse>
,但我仍然想一次一个地使用它。
这是之前使用分块的代码:
public Iterator<SearchResponse> createDeepQueryIterator(@NonNull PITSearchInput input){
return new PointInTimeIterator(elasticClient, input);
}
public Flux<SearchResponse> createDeepQueryFlux(@NonNull PITSearchInput input){
return Flux.<SearchResponse, PointInTimeIterator>generate(
() -> new PointInTimeIterator(elasticClient, input),
(deepQueryIterator, sink) -> {
if (deepQueryIterator.hasNext()) {
sink.next(deepQueryIterator.next());
}else{
sink.complete();
}
return deepQueryIterator;
},
(deepQueryIterator) -> deepQueryIterator.shutdown())
.subscribeOn(Schedulers.boundedElastic());
}
上面的代码运行良好,因为它会等待对 ES 进行下一次调用,直到订阅者准备好接收下一个数据块。
在下面我尝试使用 Spring 的 ReactiveElasticsearchClient
但问题是在订阅者处理第一个之前对 ES 进行了多次调用。
public Flux<SearchResponse> createDeepQuery(PointInTimeIteratorFactory.PITSearchInput input) {
log.info("Creating flux");
AtomicReference<PitId> pitId = new AtomicReference<>();
AtomicInteger count = new AtomicInteger();
Mono<PitId> pitIdMono =
Mono.fromCallable(
() -> {
pitId.set(createPIT(input));
return pitId.get();
})
.subscribeOn(Schedulers.boundedElastic());
Mono<SearchResponse> searchResponseMono =
pitIdMono.flatMap(
p -> {
log.info("Calling search");
return reactiveElasticsearchClient.searchForResponse(createSearchRequestFrom(p, input));
});
Flux<SearchResponse> expand =
searchResponseMono
.expand(
(searchResponse -> {
int hitCount = searchResponse.getHits().getHits().length;
count.addAndGet(hitCount);
log.info("Previous returned {} hits totaling {}", hitCount, count.get());
if (count.get() > input.getMaxTotalSize()
|| hitCount < input.getMaxSizePerQuery()){
log.info("Returning empty");
return Mono.empty();
}
log.info("Calling search");
pitId.set(new PitId(searchResponse.pointInTimeId()));
return reactiveElasticsearchClient.searchForResponse(
createSearchRequestFrom(searchResponse, input));
}))
.doFinally(
p -> {
deletePIT(pitId.get());
});
return expand;
}
所以问题不是使用反应式客户端的能力来 return 在 Flux
中 Mono<SearchResponse>
而是只在需要时一次这样做订户。
下面是来自上面 Flux -> Mono 方法的日志记录,PitTest
日志记录来自 flux 的 onNext() 测试。
2021-12-02 13:13:37.300 INFO 13704 --- [ main] a.a.t.ReactivePointInTimeIteratorFactory : Creating flux
2021-12-02 13:13:37.346 INFO 13704 --- [oundedElastic-1] a.a.t.ReactivePointInTimeIteratorFactory : Creating PIT
2021-12-02 13:13:37.407 INFO 13704 --- [oundedElastic-1] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.176 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 50
2021-12-02 13:13:38.177 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.177 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Setting searchAfter to 1634877306267
2021-12-02 13:13:38.228 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 100
2021-12-02 13:13:38.228 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.228 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Setting searchAfter to 1634877606162
2021-12-02 13:13:38.271 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 150
2021-12-02 13:13:38.271 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.272 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Setting searchAfter to 1634877606362
2021-12-02 13:13:38.311 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 200
2021-12-02 13:13:38.312 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Calling search
2021-12-02 13:13:38.312 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Setting searchAfter to 1634877906244
2021-12-02 13:13:38.344 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Previous returned 50 hits totaling 250
2021-12-02 13:13:38.345 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Returning empty
2021-12-02 13:13:38.345 INFO 13704 --- [or-http-epoll-2] a.a.t.ReactivePointInTimeIteratorFactory : Closing PIT ReactivePointInTimeIteratorFactory.PitId(id=m_2xAwENYWN0aXZpdHlzdG9yZRZQQkRGWldmclI2cWZITEpoWDI1cGlRABZCZU8xbm55ZlFabXREYmNEdThESG1RAAAAAAAAWQcTFm5BcXdPU2xTUWE2bEU4dkVPVkpkWFEBFlBCREZaV2ZyUjZxZkhMSmhYMjVwaVEAAA==)
2021-12-02 13:13:40.171 INFO 13704 --- [ parallel-1] p.actss.activity.store.PitTest : [1634877306066]
2021-12-02 13:13:42.172 INFO 13704 --- [ parallel-2] p.actss.activity.store.PitTest : [1634877306272]
2021-12-02 13:13:44.172 INFO 13704 --- [ parallel-3] p.actss.activity.store.PitTest : [1634877606166]
2021-12-02 13:13:46.173 INFO 13704 --- [ parallel-4] p.actss.activity.store.PitTest : [1634877906057]
2021-12-02 13:13:48.174 INFO 13704 --- [ parallel-1] p.actss.activity.store.PitTest : [1634877906248]
2021-12-02 13:13:48.174 INFO 13704 --- [ parallel-1] p.actss.activity.store.PitTest : Complete
2021-12-02 13:13:48.174 INFO 13704 --- [ main] p.actss.activity.store.PitTest : blah
2021-12-02 13:13:48.175 INFO 13704 --- [ parallel-1] p.actss.activity.store.PitTest : onComplete
更新:为完整性添加 PitTest 代码:
@Test
void testReactoiveFluxIt() throws InterruptedException {
Flux<SearchResponse> deepQuery = reactivePointInTimeIteratorFactory.createDeepQuery(...);
deepQuery
.delayElements(Duration.ofMillis(2000))
.doOnNext(p -> log.info(Arrays.toString(p.getHits().getHits()[0].getSortValues()))) //
.doOnComplete(() -> log.info("Complete")) //
.doFinally(p -> log.info(p.toString()))
.blockLast();
log.info("blah");
Thread.sleep(5000);
}
delayElements
切换到并行调度程序并将每个发出的元素延迟 2 秒。这就是排序值在之后打印的原因。