-
Notifications
You must be signed in to change notification settings - Fork 552
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
修复startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards #206
Conversation
update fork
…t, and offsets must not go backwards修复 当将参数“ignore_pinyin_offset”设置为false后,并向pinyin分词字段批量写入数据,即会出现“startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards”异常。测试校验发现为在reset()时,应该同样将this.processedSortCandidate = false,即可修复此问题。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个bug并么有解决。。。。 |
@oIdmonk ,复现步骤麻烦提供一下,谢谢。 |
就是跟之前一样,参数设置false,同步数据就报错另外问下,7.8.0版本不能用于es7.2.0版本么
…------------------ 原始邮件 ------------------
发件人: "Medcl"<notifications@github.com>;
发送时间: 2020年7月29日(星期三) 晚上6:39
收件人: "medcl/elasticsearch-analysis-pinyin"<elasticsearch-analysis-pinyin@noreply.github.com>;
抄送: "oldmonk"<470612348@qq.com>; "Mention"<mention@noreply.github.com>;
主题: Re: [medcl/elasticsearch-analysis-pinyin] 修复startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards (#206)
@oIdmonk ,复现步骤麻烦提供一下,谢谢。
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
用新版本吧。发自我的手机-------- 原始邮件 --------发件人: oldmonk <notifications@github.com>日期: 2020年7月29日周三 傍晚6:44收件人: medcl/elasticsearch-analysis-pinyin <elasticsearch-analysis-pinyin@noreply.github.com>抄送: Medcl <medcl@elastic.co>, State change <state_change@noreply.github.com>主 题: Re: [medcl/elasticsearch-analysis-pinyin] 修复startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards (#206)
就是跟之前一样,参数设置false,同步数据就报错另外问下,7.8.0版本不能用于es7.2.0版本么
…------------------ 原始邮件 ------------------
发件人: "Medcl"<notifications@github.com>;
发送时间: 2020年7月29日(星期三) 晚上6:39
收件人: "medcl/elasticsearch-analysis-pinyin"<elasticsearch-analysis-pinyin@noreply.github.com>;
抄送: "oldmonk"<470612348@qq.com>; "Mention"<mention@noreply.github.com>;
主题: Re: [medcl/elasticsearch-analysis-pinyin] 修复startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards (#206)
@oIdmonk ,复现步骤麻烦提供一下,谢谢。
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or unsubscribe.
—You are receiving this because you modified the open/close state.Reply to this email directly, view it on GitHub, or unsubscribe.
|
公司用的项目,版本也不能随意更换的
…------------------ 原始邮件 ------------------
发件人: "medcl/elasticsearch-analysis-pinyin" <notifications@github.com>;
发送时间: 2020年7月29日(星期三) 晚上7:05
收件人: "medcl/elasticsearch-analysis-pinyin"<elasticsearch-analysis-pinyin@noreply.github.com>;
抄送: "oldmonk"<470612348@qq.com>;"Mention"<mention@noreply.github.com>;
主题: Re: [medcl/elasticsearch-analysis-pinyin] 修复startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards (#206)
用新版本吧。发自我的手机-------- 原始邮件 --------发件人: oldmonk <notifications@github.com>日期: 2020年7月29日周三 傍晚6:44收件人: medcl/elasticsearch-analysis-pinyin <elasticsearch-analysis-pinyin@noreply.github.com>抄送: Medcl <medcl@elastic.co>, State change <state_change@noreply.github.com>主 题: Re: [medcl/elasticsearch-analysis-pinyin] 修复startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards (#206)
就是跟之前一样,参数设置false,同步数据就报错另外问下,7.8.0版本不能用于es7.2.0版本么
------------------&nbsp;原始邮件&nbsp;------------------
发件人: "Medcl"<notifications@github.com&gt;;
发送时间: 2020年7月29日(星期三) 晚上6:39
收件人: "medcl/elasticsearch-analysis-pinyin"<elasticsearch-analysis-pinyin@noreply.github.com&gt;;
抄送: "oldmonk"<470612348@qq.com&gt;; "Mention"<mention@noreply.github.com&gt;;
主题: Re: [medcl/elasticsearch-analysis-pinyin] 修复startOffset must be non-negative, and endOffset must be &gt;= startOffset, and offsets must not go backwards (#206)
@oIdmonk ,复现步骤麻烦提供一下,谢谢。
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or unsubscribe.
—You are receiving this because you modified the open/close state.Reply to this email directly, view it on GitHub, or unsubscribe.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
7.5.1 问题依旧存在, 具体触发情景为:如果文本是以英文加分词开头的,index会出现 lastStartOffset 前移的情况。 GET /medcl/_analyze
{
"text": ["liu 德华"],
"analyzer": "pinyin_analyzer"
} 结果: {
"tokens": [
{
"token": "liu",
"start_offset": 1,
"end_offset": 4,
"type": "word",
"position": 0
},
{
"token": "liu 德华",
"start_offset": 0,
"end_offset": 6,
"type": "word",
"position": 0
},
]
} |
ES6.8.8、ik6.8.8、pinyin6.8.8,写入索引数据报同样错误,数据可以写入,但是1000条会丢掉1条。请问使用哪个ik版本和pinyin版本可以解决? |
ik 6.6.1 应该可以,不用改es版本,解压ik后修改里面描述文件中es的版本做下适配 |
@liqunlin @ytk929 大佬我这边的版本怎么还有这么个问题。貌似很新的样子 |
当将参数“ignore_pinyin_offset”设置为false后,并向pinyin分词字段批量写入数据,即会出现“startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards”异常。
测试校验发现为在reset()时,应该同样将this.processedSortCandidate = false,即可修复此问题。