ElasticSearch对字段去重统计技术，实现SQL中的count(distinct(*)) 效果

本文转载自查看原文 2020-06-18 18:47 1690 RestHighlevelClinet/ DISTINCT/ ES

 1 public long countDistinctField(String esIndex, String countField, SearchSourceBuilder sourceBuilder) {
 2          long count = 0;
 3          if (StringUtils.isBlank(esIndex) || StringUtils.isBlank(countField)) {
 4              return count;
 5          }
 6  
 7          SearchRequest searchRequest = new SearchRequest();
 8          searchRequest.indices(esIndex);
 9          String identifier = UUID.randomUUID().toString();
10          AggregationBuilder aggregationBuilder = AggregationBuilders.cardinality(identifier).field(countField);
11          sourceBuilder.aggregation(aggregationBuilder);
12          sourceBuilder.size(0);
13          searchRequest.source(sourceBuilder);
14          try {
15              SearchResponse result = client.search(searchRequest, RequestOptions.DEFAULT);
16              Histogram histogram = (Histogram) result.getAggregations().asMap().get(countField);
17              long total_value = 0;
18              for (Histogram.Bucket t : histogram.getBuckets()) {
19                  Cardinality cardinality = t.getAggregations().get(identifier);
20                  long value = cardinality.getValue();
21                  total_value = total_value + value;
22              }
23              return total_value;
24          } catch (Exception e) {
25              log.error("Count field failed!", e);
26          }
27          return 0;
28      }

免责声明！

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 mysql count distinct 统计结果去重 (转)ElasticSearch中"distinct","count"和"group by"的实现 SQL distinct用法---count(distinct 字段1,字段2) count distinct 多个字段或者 count(*) 统计group by 结果 ES中对应的SQL的count(distinct 列名) java实现统计符合条件的去重过的数量 - - count distinct if case Python中实现count(distinct ) SQL server 中 COUNT DISTINCT 函数 SQL COUNT DISTINCT 函数 ES elasticsearch 实现 count单字段，分组取前多少位，以地理位置中心进行统计