• ElasticSearch:文章检索


    实现目标

    image.png

    思路与ES前期准备

    image.png

    使用postman添加映射put请求 :

    搜索结果展示内容:标题、布局、枫叶图片、发布时间、作者名称、文章id、作者id、静态url 需要对:内容、标题进行分词

    json "content":{ "type":"text", "ananlyze":"ik_smart" }

    http://${url}:${port}/appinfoarticle

    image.png

    校验尝试:

    GET请求查询映射:http://${url}:${port}/appinfoarticle

    DELETE请求,删除索引及映射:http://${url}:${port}/appinfoarticle

    GET请求,查询所有文档:http://${url}:${port}/appinfoarticle/_search

    image.png

    查询所有文章,批量导入到es库中

    ```java BulkRequest bulkRequest = new BulkRequest("appinfoarticle");

    for (SearchArticleVo searchArticleVo : searchArticleVos) {

    1. IndexRequest indexRequest = new IndexRequest().id(searchArticleVo.getId().toString())
    2. .source(JSON.toJSONString(searchArticleVo), XContentType.JSON);
    3. //批量添加数据
    4. bulkRequest.add(indexRequest);

    } restHighLevelClient.bulk(bulkRequest, RequestOptions.DEFAULT);

    ``` 其中: bulkRequest用法

    ```java RestHighLevelClient restHighLevelClient = new RestHighLevelClient();//创建一个es客户端,用于执行请求

    BulkRequest bulkRequest = new BulkRequest(); //创建一个批量请求

    IndexRequest indexRequest1 = new IndexRequest("indexname"); //创建第一个“添加文档”的请求 indexRequest1.id("1") .source(XContentType.JSON,"fieldname", "foo1");//设置添加的文档信息 IndexRequest indexRequest2 = new IndexRequest("indexname"); //创建第二个“添加文档”的请求 indexRequest2.id("1") .source(XContentType.JSON,"fieldname", "foo2");//设置添加的文档信息

    bulkRequest.add(indexRequest1); //将第一个“添加文档”请求加入到批量请求中 bulkRequest.add(indexRequest2); //将第二个“添加文档”请求加入到批量请求中

    BulkResponse bulkResponse = restHighLevelClient.bulk(bulkRequest, RequestOptions.DEFAULT);//执行批量请求,返回执行结果

    ```

    查询时各种条件的构造的正确姿势

    由于es在java中查询没法像mybatis那样方便,而且es的构造器使用也比较繁琐,理解不是很方便,所以下面来记录es构造器BoolQueryBuilder查询时各种条件的构造的正确姿势。

    1.构造准备

    java //1.构建SearchRequest请求对象,指定索引库 SearchRequest searchRequest = new SearchRequest("data_info"); //2.构建SearchSourceBuilder查询对象 SearchSourceBuilder sourceBuilder = new SearchSourceBuilder(); //2.1 这个条件用于返回所有命中条件的数据数量, 不设置则返回大概数值 sourceBuilder.trackTotalHits(true); //3.检索条件构造 BoolQueryBuilder bqb = QueryBuilders.boolQuery();

    2.条件构造:must可以使用filter代替,效率更高,因为must会对_score评估

    ```java //3.1 完全匹配 bqb.must(QueryBuilders.matchQuery("code", 666L);

    //3.2 模糊匹配 bqb.must(QueryBuilders.matchPhraseQuery("name", "张");

    //3.3 in的效果 传单个参数就是完全匹配 bqb.must(QueryBuilders.termsQuery("code", new Long[]{1L, 2L, 3L}); bqb.must(QueryBuilders.termsQuery("code", 1L, 2L, 3L);

    //3.4 or条件 BoolQueryBuilder shouldQuery = QueryBuilders.boolQuery(); shouldQuery.should(QueryBuilders.matchQuery("code", 1L); shouldQuery.should(QueryBuilders.matchQuery("code", 2L); shouldQuery.minimumShouldMatch(1); //至少满足一个 bqb.must(shouldQuery);

    //3.5 非null bqb.must(QueryBuilders.existsQuery("iden")); //是null
    bqb.mustNot(QueryBuilders.existsQuery("iden"));

    //3.6 大于等于gte (gt-大于 lt-小于 lte-小于等于) bqb.must(QueryBuilders.rangeQuery("time").gte(new Date());

    //3.7 中文完全匹配 bqb.must(queryBuilder.matchPhraseQuery("key", value));

    //3.8 匹配多个字段 bqb.must(queryBuilder.multiMatchQuery(value, key1, key2, key3)); ```

    3.构造完成 准备查询

    ```java //4.将QuseryBuilder对象设置到SearchSourceBuilder对象中 sourceBuilder.query(bqb); //5.排序 sourceBuilder.sort("updateTime", SortOrder.DESC); //6.分页 sourceBuilder.from((dto.getPageNum() - 1) * dto.getPageSize()); sourceBuilder.size(dto.getPageSize());

    //设置高亮 title HighlightBuilder highlightBuilder = new HighlightBuilder(); highlightBuilder.field("title"); highlightBuilder.preTags(""); highlightBuilder.postTags("");

    searchBuilder.highlighter(highlightBuilder);

    //7.将SearchSourceBuilder设置到SearchRequest中 searchRequest.source(sourceBuilder);

    //8.调用方法查询数据 SearchResponse searchResponse = client.search(searchRequest, RequestOptions.DEFAULT);

    //9.解析返回结果 SearchHit[] hits = searchResponse.getHits().getHits(); for (int i = 0; i < hits.length; i++) { System.out.println("返回的结果: " + hits[i].getSourceAsString()); }

    log.info("返回总数为:" + searchResponse.getHits().getTotalHits()); int total = (int)searchResponse.getHits().getTotalHits().value; ```

    检索业务实现

    1. 配置config ```java @Getter @Setter @Configuration @ConfigurationProperties(prefix = "elasticsearch") public class ElasticSearchConfig { private String host; private int port;

      @Bean public RestHighLevelClient client(){ System.out.println(host); System.out.println(port); return new RestHighLevelClient(RestClient.builder( new HttpHost( host, port, "http" ) )); } } ```

    2. 构造一个UserSearchDto其中包含字段 搜索关键字searchWords、当前页pageNum、分页条数pageSize、最小时间minBehotTime。其中提供一个分页算法

    java public int getFromIndex(){ if(this.pageNum<1)return 0; if(this.pageSize<1) this.pageSize = 10; return this.pageSize * (pageNum-1); } 2. 进行编写es代码

    ```java // 传入参数为UserSearchDto // 1 设置查询条件 SearchRequest searchRequest = new SearchRequest("key_name"); SearchSourceBuilder searchSourceBuilder = new SearchSourceBuilder();

    // 2 设置查询条件 BoolQueryBuilder boolQueryBuilder = QueryBuilders.query();

    // 关键字的分词后查询 QueryStringQueryBuilder queryStringQueryBuilder = QueryBuilders.queryStringQuery(dto.getSearchWords()).field("title").field("content").defaultOperatior(Operator.OR); boolQueryBuilder.must(queryStringQueryBuilder);

    // 查询小于minBehotTim的数据 RangeQueryBuilder rangeQueryBuilder = QueryBuilders.rangeQuery("publishTime").lt(dto.getMinHotTime().getTime()); boolQueryBuilder.filter(rangeQueryBuilder);

    // 分页查询 searchSourceBuilder.from(0); searchSourceBuilder.size(dto.getPageSize());

    // 按照发布时间倒序查询 searchSourceBuilder.sort("publishTime", SortOrder.DESC);

    // 设置高亮 title HighlightBuilder highlightBuilder = new HighlightBuilder(); highlightBuilder.field("title"); highlightBuilder.preTags(""); highlightBuilder.postTags("") searchSourceSearch.highlighter(highlightBuilder);

    // 执行查询 searchSourceBuilder.query(boolQueryBuilder); searchRequest.source(searchSourceBuilder); SearchResponse searchResponse = restHighLevelClient.search(searchRequet, RequestOptions.DEFAULT); ```

    文章审核后构建索引

    收集数据并发送消息

    ```java //发送消息,创建索引 createArticleESIndex(apArticle,content,path);

    @Autowired private KafkaTemplate kafkaTemplate;

    /** * 送消息,创建索引 * @param apArticle * @param content * @param path */ private void createArticleESIndex(ApArticle apArticle, String content, String path) { SearchArticleVo vo = new SearchArticleVo(); BeanUtils.copyProperties(apArticle,vo); vo.setContent(content); vo.setStaticUrl(path);

    kafkaTemplate.send(ArticleConstants.ARTICLE_ES_SYNC_TOPIC, JSON.toJSONString(vo));

    }

    public static final String ARTICLEESSYNC_TOPIC = "article.es.sync.topic"; ```

    搜索的微服务 接收消息并创建场景

    1. 搜索微服务中添加kafka的配置,nacos配置

    yaml spring: kafka: bootstrap-servers: ${url}:${port} consumer: group-id: ${spring.application.name} key-deserializer: org.apache.kafka.common.serialization.StringDeserializer value-deserializer: org.apache.kafka.common.serialization.StringDeserializer 2. 定义监听接收消息,保存索引数据,class SyncArticleListener

    ```java @AutoWired private RestHighLevelClient restHighLevelClient;

    @KafkaListener(topics = ArticleConstants.ARTICLEESSYNC_TOPIC) public void onMessage(String message){ if(StringUtils.isNotBlank(message)){ log.info("SyncArticleListener message = {}", message); SearchArticleVo searchArticleVo = JSON.parseObject } } ```

  • 相关阅读:
    WEB前端网页设计 CSS网页代码 基础参数(三)
    JAVA计算机毕业设计电影网站系统设计Mybatis+系统+数据库+调试部署
    怎么在 CSDN 写好技术博客
    【数据挖掘工程师-笔试】2022年海尔 公司
    Linux权限
    【Java】解决安全漏洞
    MySQL-sql的优化
    软件测试基本技能点
    Java环境准备——JDK下载和安装、IDEA下载和安装
    制作一个简单HTML抗疫逆行者网页作业(HTML+CSS)
  • 原文地址:https://blog.csdn.net/Dream__Y/article/details/132974094