博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
es Update API
阅读量:6720 次
发布时间:2019-06-25

本文共 4271 字,大约阅读时间需要 14 分钟。

hot3.png

es Update API 博客分类: 搜索引擎,爬虫

The update API allows to update a document based on a script provided. The operation gets the document (collocated with the shard) from the index, runs the script (with optional script language and parameters), and index back the result (also allows to delete, or ignore the operation). It uses versioning to make sure no updates have happened during the "get" and "reindex".

Note, this operation still means full reindex of the document, it just removes some network roundtrips and reduces chances of version conflicts between the get and the index. The _source field need to be enabled for this feature to work.

For example, lets index a simple doc:

curl -XPUT localhost:9200/test/type1/1-d '{
"counter":1,"tags":["red"]}'

Scripted updates

Now, we can execute a script that would increment the counter:

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"script":{
"inline":"ctx._source.counter += count","params":{
"count":4}}}'

We can add a tag to the list of tags (note, if the tag exists, it will still add it, since its a list):

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"script":{
"inline":"ctx._source.tags += tag","params":{
"tag":"blue"}}}'

In addition to _source, the following variables are available through the ctx map: _index, _type, _id, _version, _routing, _parent, _timestamp, _ttl.

We can also add a new field to the document:

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"script":"ctx._source.name_of_new_field = \"value_of_new_field\""}'

Or remove a field from the document:

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"script":"ctx._source.remove(\"name_of_field\")"}'

And, we can even change the operation that is executed. This example deletes the doc if the tags field contain blue, otherwise it does nothing (noop):

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"script":{
"inline":"ctx._source.tags.contains(tag) ? ctx.op = \"delete\" : ctx.op = \"none\"","params":{
"tag":"blue"}}}'

Updates with a partial document

The update API also support passing a partial document, which will be merged into the existing document (simple recursive merge, inner merging of objects, replacing core "keys/values" and arrays). For example:

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"doc":{
"name":"new_name"}}'

If both doc and script is specified, then doc is ignored. Best is to put your field pairs of the partial document in the script itself.

Detecting noop updates

If doc is specified its value is merged with the existing _source. By default the document is only reindexed if the new _source field differs from the old. Setting detect_noop to false will cause Elasticsearch to always update the document even if it hasn’t changed. For example:

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"doc":{
"name":"new_name"},"detect_noop":false}'

If name was new_name before the request was sent then document is still reindexed.

Upserts

If the document does not already exist, the contents of the upsert element will be inserted as a new document. If the document does exist, then the script will be executed instead:

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"script":{
"inline":"ctx._source.counter += count","params":{
"count":4}},"upsert":{
"counter":1}}'

scripted_upsert

If you would like your script to run regardless of whether the document exists or not — i.e. the script handles initializing the document instead of the upsert element — then set scripted_upsert to true:

curl -XPOST 'localhost:9200/sessions/session/dh3sgudg8gsrgl/_update'-d '{
"scripted_upsert":true,"script":{
"id":"my_web_session_summariser","params":{
"pageViewEvent":{
"url":"foo.com/bar","response":404,"time":"2014-01-01 12:32"}}},"upsert":{}}'

doc_as_upsert

Instead of sending a partial doc plus an upsert doc, setting doc_as_upsert to true will use the contents of doc as the upsert value:

curl -XPOST 'localhost:9200/test/type1/1/_update'-d '{
"doc":{
"name":"new_name"},"doc_as_upsert":true}'https://www.elastic.co/guide/en/elasticsearch/reference/2.1/docs-update.html#docs-update

转载于:https://my.oschina.net/xiaominmin/blog/1597001

你可能感兴趣的文章
用IE删除软件删除IE出现问题解决方法
查看>>
从“窃听门”事件解读手机Rootkit攻击
查看>>
[Android学习笔记三] Support v7提供交错式网格布局开发示例
查看>>
QoS令牌桶工作原理
查看>>
重提URL Rewrite(4):不同级别URL Rewrite的一些细节与特点
查看>>
linux子系统的初始化_subsys_initcall()【转】
查看>>
x60系统安装步骤
查看>>
Linux内核同步机制之(三):memory barrier【转】
查看>>
安装Fedora 24后必要的设置
查看>>
ES JVM使用如果超过75%就会GC较多,导致ES索引性能下降
查看>>
elasticsearch源码分析之search模块(server端)
查看>>
【转】如何用 Chrome for Android 做远程遥控 debugging
查看>>
Rails中Form标签JavaScript验证
查看>>
ArcObject开发,程序编译通过,但无法启动的解决
查看>>
Oracle 存储过程的创建,及触发器调用存储过程
查看>>
关于Socket通信服务的心跳包(转)
查看>>
.net面试问答(大汇总)
查看>>
BST B-树 B+树 B*树简介
查看>>
21、ASP.NET MVC入门到精通——ASP.NET MVC4优化
查看>>
Arm-kernel 内存收集【转】
查看>>