温馨提示×

温馨提示×

您好,登录后才能下订单哦!

密码登录×
登录注册×
其他方式登录
点击 登录注册 即表示同意《亿速云用户服务条款》

elasticsearch分词器怎么使用

发布时间:2022-03-22 16:10:20 来源:亿速云 阅读:354 作者:iii 栏目:大数据

本篇内容主要讲解“elasticsearch分词器怎么使用”,感兴趣的朋友不妨来看看。本文介绍的方法操作简单快捷,实用性强。下面就让小编来带大家学习“elasticsearch分词器怎么使用”吧!

1、在Elasticsearch的plugins目录下建一个ik目录,将ik安装包解压到该目录 或 ./bin/elasticsearch-plugin install 

2、重启Elasticsearch

3、测试分词器

GET /_analyze
{
  "text": "中华人民共和国国徽",
  # 两种:ik_smart较粗粒度的拆分 和 ik_max_word较细粒度的拆分
  "analyzer": "ik_max_word"
}

4、使用分词器 在模版中定义好分词

"settings": {
    "index": {
      "number_of_shards": "1",
      "number_of_replicas": "0"
    },
    "analysis": {
      "analyzer": {
        "ik":{
          "tokenizer" : "ik_max_word"
        }
      }
    }
  },
  .......
  "mappings": {
      "_doc" : {   
        "dynamic_templates": [
          {
            "integers": {
              "mapping": {
                "type": "long"
              },
              "match_mapping_type": "long",
              "match": "*"
            }
          },
          {
            "doubles": {
              "mapping": {
                "type": "double"
              },
              "match_mapping_type": "double",
              "match": "*"
            }
          },
          {
            "time_mapping": {
              "match_pattern": "regex",
              "mapping": {
                "format": "yyyy-MM-dd HH:mm:ss||strict_date_optional_time||epoch_millis",
                "type": "date"
              },
              "match_mapping_type": "string",
              "match": ".+(time|Time|TIME|date|Date|DATE)"
            }
          },
          {
            "ip_match": {
              "match_pattern": "regex",
              "mapping": {
                "type": "ip"
              },
              "match_mapping_type": "string",
              "match": ".+(IP|Ip|ip|iP)"
            }
          },
          {
            "port_match": {
              "match_pattern": "regex",
              "mapping": {
                "type": "long"
              },
              "match_mapping_type": "string",
              "match": ".+(Port|port|PORT)"
            }
          },
          {
            "name_mapping": {
              "match_pattern": "regex",
              "mapping": {
                "fields": {
                  "keyword": {
                    "ignore_above": 2560,
                    "type": "keyword"
                  }
                },
                "type": "text"
              },
              "match_mapping_type": "string",
              "match": ".+(Name|name)"
            }
          },
          {
            "strings": {
              "mapping": {
                "type": "keyword"
              },
              "match_mapping_type": "string",
              "match": "*"
            }
          }
        ],
        "date_detection": false,
        "properties": {
          "content": {
            "type" : "text",
            "analyzer": "ik_max_word",
            "fields" : {
              "keyword" : {
                "type" : "keyword",
                "ignore_above" : 256
              }
            }
          },
          "level": {
            "type": "keyword"
          },
          "target": {
            "type": "keyword"
          },
          "source": {
            "type": "keyword"
          },
          "alerttime": {
            "type": "date",
            "format": "yyyy-MM-dd HH:mm:ss"
          }
        }
      }
     }

到此,相信大家对“elasticsearch分词器怎么使用”有了更深的了解,不妨来实际操作一番吧!这里是亿速云网站,更多相关内容可以进入相关频道进行查询,关注我们,继续学习!

向AI问一下细节

免责声明:本站发布的内容(图片、视频和文字)以原创、转载和分享为主,文章观点不代表本网站立场,如果涉及侵权请联系站长邮箱:is@yisu.com进行举报,并提供相关证据,一经查实,将立刻删除涉嫌侵权内容。

AI