apache
diff --git a/‎.github/workflows/flink_cdc_migration_test.yml‎
Lines changed: 1 addition & 2 deletions b/‎.github/workflows/flink_cdc_migration_test.yml‎
Lines changed: 1 addition & 2 deletions
diff --git a/‎docs/content.zh/docs/connectors/flink-sources/mysql-cdc.md‎
Lines changed: 1 addition & 1 deletion b/‎docs/content.zh/docs/connectors/flink-sources/mysql-cdc.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/content.zh/docs/connectors/flink-sources/vitess-cdc.md‎
Lines changed: 1 addition & 1 deletion b/‎docs/content.zh/docs/connectors/flink-sources/vitess-cdc.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/content.zh/docs/connectors/pipeline-connectors/elasticsearch.md‎
Lines changed: 275 additions & 0 deletions b/‎docs/content.zh/docs/connectors/pipeline-connectors/elasticsearch.md‎
Lines changed: 275 additions & 0 deletions
diff --git a/‎docs/content.zh/docs/connectors/pipeline-connectors/mysql.md‎
Lines changed: 4 additions & 1 deletion b/‎docs/content.zh/docs/connectors/pipeline-connectors/mysql.md‎
Lines changed: 4 additions & 1 deletion
diff --git a/‎docs/content.zh/docs/core-concept/transform.md‎
Lines changed: 19 additions & 1 deletion b/‎docs/content.zh/docs/core-concept/transform.md‎
Lines changed: 19 additions & 1 deletion
@@ -39,8 +39,7 @@ jobs:
     runs-on: ubuntu-latest
     strategy:
       matrix:
-        # '1.20.0' is excluded since FLINK-36105 has not been merged.
-        flink-version: [ '1.18.1', '1.19.1' ]
+        flink-version: [ '1.18.1', '1.19.1', '1.20.0' ]
 
     steps:
       - uses: actions/checkout@v4
 
@@ -581,7 +581,7 @@ MySQL CDC Source 使用主键列将表划分为多个分片（chunk）。 默认
  [100, +∞)
 ```
 
-对于其他主键列类型， MySQL CDC Source 将以下形式执行语句： `SELECT MAX(STR_ID) AS chunk_high FROM (SELECT * FROM TestTable WHERE STR_ID > 'uuid-001' limit 25)` 来获得每个区块的低值和高值，
+对于其他主键列类型， MySQL CDC Source 将以下形式执行语句： `SELECT MAX(STR_ID) AS chunk_high FROM (SELECT * FROM TestTable WHERE STR_ID > 'uuid-001' ORDER BY STR_ID ASC LIMIT 25)` 来获得每个区块的低值和高值，
 分割块集如下所示：
 
  ```
 
@@ -49,7 +49,7 @@ more released versions will be available in the Maven central warehouse.
 Setup Vitess server
 ----------------
 
-You can follow the Local Install via [Docker guide](https://vitess.io/docs/get-started/local-docker/), or the Vitess Operator for [Kubernetes guide](https://vitess.io/docs/get-started/operator/) to install Vitess. No special setup is needed to support Vitess connector.
+You can follow the Local Install via [Docker guide](https://vitess.io/docs/get-started/vttestserver-docker-image/), or the Vitess Operator for [Kubernetes guide](https://vitess.io/docs/get-started/operator/) to install Vitess. No special setup is needed to support Vitess connector.
 
 ### Checklist
 * Make sure that the VTGate host and its gRPC port (default is 15991) is accessible from the machine where the Vitess connector is installed
 
@@ -0,0 +1,275 @@
+---
+title: "Elasticsearch"
+weight: 7
+type: docs
+aliases:
+- /connectors/pipeline-connectors/elasticsearch
+---
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+  http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+
+# Elasticsearch Pipeline Connector
+
+Elasticsearch Pipeline 连接器可以用作 Pipeline 的 Data Sink, 将数据写入 Elasticsearch。 本文档介绍如何设置 Elasticsearch Pipeline 连接器。
+
+
+How to create Pipeline
+----------------
+
+从 MySQL 读取数据同步到 Elasticsearch 的 Pipeline 可以定义如下：
+
+```yaml
+source:
+   type: mysql
+   name: MySQL Source
+   hostname: 127.0.0.1
+   port: 3306
+   username: admin
+   password: pass
+   tables: adb.\.*, bdb.user_table_[0-9]+, [app|web].order_\.*
+   server-id: 5401-5404
+
+sink:
+  type: elasticsearch
+  name: Elasticsearch Sink
+  hosts: http://127.0.0.1:9092,http://127.0.0.1:9093
+  
+route:
+  - source-table: adb.\.*
+    sink-table: default_index
+    description: sync adb.\.* table to default_index
+
+pipeline:
+  name: MySQL to Elasticsearch Pipeline
+  parallelism: 2
+```
+
+Pipeline Connector Options
+----------------
+<div class="highlight">
+<table class="colwidths-auto docutils">
+   <thead>
+      <tr>
+        <th class="text-left" style="width: 25%">Option</th>
+        <th class="text-left" style="width: 8%">Required</th>
+        <th class="text-left" style="width: 7%">Default</th>
+        <th class="text-left" style="width: 10%">Type</th>
+        <th class="text-left" style="width: 50%">Description</th>
+      </tr>
+    </thead>
+    <tbody>
+    <tr>
+      <td>type</td>
+      <td>required</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>指定要使用的连接器, 这里需要设置成 <code>'elasticsearch'</code>.</td>
+    </tr>
+    <tr>
+      <td>name</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>Sink 的名称。</td>
+    </tr>
+    <tr>
+      <td>hosts</td>
+      <td>required</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>要连接到的一台或多台 Elasticsearch 主机，例如: 'http://host_name:9092,http://host_name:9093'.</td>
+    </tr>
+    <tr>
+      <td>version</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">7</td>
+      <td>Integer</td>
+      <td>指定要使用的连接器，有效值为：
+      <ul>
+        <li>6: 连接到 Elasticsearch 6.x 的集群。</li>
+        <li>7: 连接到 Elasticsearch 7.x 的集群。</li>
+        <li>8: 连接到 Elasticsearch 8.x 的集群。</li>
+      </ul>
+      </td>
+    </tr>
+    <tr>
+      <td>username</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>用于连接 Elasticsearch 实例认证的用户名。</td>
+    </tr>
+    <tr>
+      <td>password</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>用于连接 Elasticsearch 实例认证的密码。</td>
+    </tr>
+    <tr>
+      <td>batch.size.max</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">500</td>
+      <td>Integer</td>
+      <td>每个批量请求的最大缓冲操作数。 可以设置为'0'来禁用它。</td>
+    </tr>
+    <tr>
+      <td>inflight.requests.max</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">5</td>
+      <td>Integer</td>
+      <td>连接器将尝试执行的最大并发请求数。</td>
+    </tr>
+    <tr>
+      <td>buffered.requests.max</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">1000</td>
+      <td>Integer</td>
+      <td>每个批量请求的内存缓冲区中保留的最大请求数。</td>
+    </tr>
+    <tr>
+      <td>batch.size.max.bytes</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">5242880</td>
+      <td>Long</td>
+      <td>每个批量请求的缓冲操作在内存中的最大值(以byte为单位)。</td>
+    </tr>
+    <tr>
+      <td>buffer.time.max.ms</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">5000</td>
+      <td>Long</td>
+      <td>每个批量请求的缓冲 flush 操作的间隔(以ms为单位)。</td>
+    </tr>
+    <tr>
+      <td>record.size.max.bytes</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">10485760</td>
+      <td>Long</td>
+      <td>单个记录的最大大小（以byte为单位）。</td>
+    </tr>
+    </tbody>
+</table>    
+</div>
+
+Usage Notes
+--------
+
+* 写入 Elasticsearch 的 index 默认为与上游表同名字符串，可以通过 pipeline 的 route 功能进行修改。
+
+* 如果写入 Elasticsearch 的 index 不存在，不会被默认创建。
+
+Data Type Mapping
+----------------
+Elasticsearch 将文档存储在 JSON 字符串中，数据类型之间的映射关系如下表所示：
+<div class="wy-table-responsive">
+<table class="colwidths-auto docutils">
+    <thead>
+      <tr>
+        <th class="text-left">CDC type</th>
+        <th class="text-left">JSON type</th>
+        <th class="text-left" style="width:60%;">NOTE</th>
+      </tr>
+    </thead>
+    <tbody>
+    <tr>
+      <td>TINYINT</td>
+      <td>NUMBER</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>SMALLINT</td>
+      <td>NUMBER</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>INT</td>
+      <td>NUMBER</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>BIGINT</td>
+      <td>NUMBER</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>FLOAT</td>
+      <td>NUMBER</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DOUBLE</td>
+      <td>NUMBER</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DECIMAL(p, s)</td>
+      <td>STRING</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>BOOLEAN</td>
+      <td>BOOLEAN</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DATE</td>
+      <td>STRING</td>
+      <td>with format: date (yyyy-MM-dd), example: 2024-10-21</td>
+    </tr>
+    <tr>
+      <td>TIMESTAMP</td>
+      <td>STRING</td>
+      <td>with format: date-time (yyyy-MM-dd HH:mm:ss.SSSSSS, with UTC time zone), example: 2024-10-21 14:10:56.000000</td>
+    </tr>
+    <tr>
+      <td>TIMESTAMP_LTZ</td>
+      <td>STRING</td>
+      <td>with format: date-time (yyyy-MM-dd HH:mm:ss.SSSSSS, with UTC time zone), example: 2024-10-21 14:10:56.000000</td>
+    </tr>
+    <tr>
+      <td>CHAR(n)</td>
+      <td>STRING</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>VARCHAR(n)</td>
+      <td>STRING</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>ARRAY</td>
+      <td>ARRAY</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>MAP</td>
+      <td>STRING</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>ROW</td>
+      <td>STRING</td>
+      <td></td>
+    </tr>
+    </tbody>
+</table>
+</div>
+
+{{< top >}}
@@ -393,7 +393,10 @@ source:
         DOUBLE UNSIGNED ZEROFILL<br>
         DOUBLE PRECISION<br>
         DOUBLE PRECISION UNSIGNED<br>
-        DOUBLE PRECISION UNSIGNED ZEROFILL
+        DOUBLE PRECISION UNSIGNED ZEROFILL<br>
+        FLOAT(p, s)<br>
+        REAL(p, s)<br>
+        DOUBLE(p, s)
       </td>
       <td>DOUBLE</td>
       <td></td>
 
@@ -126,7 +126,8 @@ Flink CDC uses [Calcite](https://calcite.apache.org/) to parse expressions and [
 | LOWER(string) | lower(string) | Returns string in lowercase. |
 | TRIM(string1) | trim('BOTH',string1) | Returns a string that removes whitespaces at both sides. |
 | REGEXP_REPLACE(string1, string2, string3) | regexpReplace(string1, string2, string3) | Returns a string from STRING1 with all the substrings that match a regular expression STRING2 consecutively being replaced with STRING3. E.g., 'foobar'.regexpReplace('oo\|ar', '') returns "fb". |
-| SUBSTRING(string FROM integer1 [ FOR integer2 ]) | substring(string,integer1,integer2) | Returns a substring of STRING starting from position INT1 with length INT2 (to the end by default). |
+| SUBSTR(string, integer1[, integer2]) | substr(string,integer1,integer2) | Returns a substring of STRING starting from position integer1 with length integer2 (to the end by default). |
+| SUBSTRING(string FROM integer1 [ FOR integer2 ]) | substring(string,integer1,integer2) | Returns a substring of STRING starting from position integer1 with length integer2 (to the end by default). |
 | CONCAT(string1, string2,…) | concat(string1, string2,…) | Returns a string that concatenates string1, string2, …. E.g., CONCAT('AA', 'BB', 'CC') returns 'AABBCC'. |
 
 ## Temporal Functions
@@ -153,6 +154,23 @@ Flink CDC uses [Calcite](https://calcite.apache.org/) to parse expressions and [
 | COALESCE(value1 [, value2]*) | coalesce(Object... objects) | Returns the first argument that is not NULL.If all arguments are NULL, it returns NULL as well. The return type is the least restrictive, common type of all of its arguments. The return type is nullable if all arguments are nullable as well. |
 | IF(condition, true_value, false_value)   | condition ? true_value : false_value | Returns the true_value if condition is met, otherwise false_value. E.g., IF(5 > 3, 5, 3) returns 5. |
 
+## Casting Functions
+
+You can use `CAST( <EXPR> AS <T> )` syntax to convert any valid expression `<EXPR>` to a specific type `<T>`. Possible conversion paths are:
+
+| Source Type                         | Target Type | Notes                                                                                      |
+|-------------------------------------|-------------|--------------------------------------------------------------------------------------------|
+| ANY                                 | STRING      | All types can be cast to STRING.                                                           |
+| NUMERIC, STRING                     | BOOLEAN     | Any non-zero numerics will be evaluated to `TRUE`.                                         |
+| NUMERIC                             | BYTE        | Value must be in the range of Byte (-128 ~ 127).                                           |
+| NUMERIC                             | SHORT       | Value must be in the range of Short (-32768 ~ 32767).                                      |
+| NUMERIC                             | INTEGER     | Value must be in the range of Integer (-2147483648 ~ 2147483647).                          |
+| NUMERIC                             | LONG        | Value must be in the range of Long (-9223372036854775808 ~ 9223372036854775807).           |
+| NUMERIC                             | FLOAT       | Value must be in the range of Float (1.40239846e-45f ~ 3.40282347e+38f).                   |
+| NUMERIC                             | DOUBLE      | Value must be in the range of Double (4.94065645841246544e-324 ~ 1.79769313486231570e+308) |
+| NUMERIC                             | DECIMAL     | Value must be in the range of BigDecimal(10, 0).                                           |
+| STRING, TIMESTAMP_TZ, TIMESTAMP_LTZ | TIMESTAMP   | String type value must be a valid `ISO_LOCAL_DATE_TIME` string.                            |
+
 # Example
 ## Add computed columns
 Evaluation expressions can be used to generate new columns. For example, if we want to append two computed columns based on the table `web_order` in the database `mydb`, we may define a transform rule as follows: