Change default mapping of string to "not analyzed" in Elasticsearch

In my system, data is always inserted from CSV files via Logstash, and I never pre-define the mapping. Whenever I index a string it is always analyzed, so an entry like hello I am Sinha is split into hello, I, am, Sinha. Is there any way to change the default/dynamic mapping of Elasticsearch so that all strings, irrespective of index and type, are not analyzed? Or is there a way of setting it in the .conf file? Say my conf file looks like this:

input {
    file {
        path => "/home/sagnik/work/logstash-1.4.2/bin/promosms_dec15.csv"
        type => "promosms_dec15"
        start_position => "beginning"
        sincedb_path => "/dev/null"
    }
}
filter {
    csv {
        columns => ["Comm_Plan","Queue_Booking","Order_Reference","Multi_Ordertype"]
        separator => ","
    }
    ruby {
        code => "event['Generation_Date'] = Date.parse(event['Generation_Date']);"
    }
}
output {
    elasticsearch {
        action => "index"
        host => "localhost"
        index => "promosms-%{+dd.MM.YYYY}"
        workers => 1
    }
}

I want all strings to be not analyzed, and I don't mind this becoming the default for all future data inserted into Elasticsearch.

asked Dec 15 '14 11:12

Sagnik Sinha

2 Answers

Just create an index template. Run:

curl -XPUT localhost:9200/_template/template_1 -d '{
    "template": "*",
    "settings": {
        "index.refresh_interval": "5s"
    },
    "mappings": {
        "_default_": {
            "_all": {
                "enabled": true
            },
            "dynamic_templates": [
                {
                    "string_fields": {
                        "match": "*",
                        "match_mapping_type": "string",
                        "mapping": {
                            "index": "not_analyzed",
                            "omit_norms": true,
                            "type": "string"
                        }
                    }
                }
            ],
            "properties": {
                "@version": {
                    "type": "string",
                    "index": "not_analyzed"
                },
                "geoip": {
                    "type": "object",
                    "dynamic": true,
                    "path": "full",
                    "properties": {
                        "location": {
                            "type": "geo_point"
                        }
                    }
                }
            }
        }
    }
}'
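If you would rather register the template from a script than from curl, here is a minimal Python sketch of the same idea. It keeps only the dynamic-template rule for strings and leaves out the refresh interval, _all, @version and geoip parts. The function names build_string_template and put_template and the base_url default are my own, not part of any Elasticsearch client. The "string" type and "not_analyzed" option belong to the older mapping syntax used in this answer, so this assumes an Elasticsearch version that still accepts them.

```python
import json
import urllib.request


def build_string_template(index_pattern="*"):
    """Build a template body that maps dynamically detected strings as not_analyzed.

    Hypothetical helper: it mirrors only the dynamic_templates part of the
    curl command and targets the legacy "string" mapping type.
    """
    return {
        "template": index_pattern,  # which index names the template applies to
        "mappings": {
            "_default_": {
                "dynamic_templates": [
                    {
                        "string_fields": {
                            "match": "*",  # any field name
                            "match_mapping_type": "string",  # only detected strings
                            "mapping": {
                                "type": "string",
                                "index": "not_analyzed",
                            },
                        }
                    }
                ]
            }
        },
    }


def put_template(name, body, base_url="http://localhost:9200"):
    """PUT the template to /_template/<name>; base_url is an assumed local node."""
    request = urllib.request.Request(
        url="{}/_template/{}".format(base_url, name),
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (needs a running node, so it is left commented out):
# put_template("template_1", build_string_template())
```

Templates are only applied when an index is created, so indices that already exist keep their old mapping until they are recreated or reindexed.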

answered Oct 20 '22 10:10

Sagnik Sinha

You can query the .raw version of your field. This was added in Logstash 1.3.1:

The logstash index template we provide adds a “.raw” field to every field you index. These “.raw” fields are set by logstash as “not_analyzed” so that no analysis or tokenization takes place – our original value is used as-is!

So if your field is called foo, you'd query foo.raw to return the not_analyzed (not split on delimiters) version.
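As a rough illustration, a term query against foo.raw could be built like this in Python. The helper names build_raw_term_query and search and the base_url default are made up for this sketch. It also assumes your index is actually covered by the Logstash template, which by default matches index names starting with logstash-; an index named promosms-... would not get the .raw fields unless the template pattern is changed.

```python
import json
import urllib.parse
import urllib.request


def build_raw_term_query(field, value):
    """Build a term query on the not_analyzed ".raw" sub-field of `field`.

    Hypothetical helper: a term query looks for the exact stored value,
    which is what you want for a field that was not tokenized.
    """
    return {"query": {"term": {field + ".raw": value}}}


def search(index, body, base_url="http://localhost:9200"):
    """POST the query to /<index>/_search; base_url is an assumed local node."""
    url = "{}/{}/_search".format(base_url, urllib.parse.quote(index, safe="*,"))
    request = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (needs a running node, so it is left commented out):
# search("logstash-*", build_raw_term_query("foo", "hello I am Sinha"))
```

The same query against plain foo would go to the analyzed version of the field, where the value was split into separate tokens, so the full phrase would not match as a single term.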

answered Oct 20 '22 11:10

Banjer

Tags: elasticsearch, elasticsearch-mapping, logstash, logstash-grok
