@@ -19,8 +19,10 @@ List of these files (see https://www.unicode.org/Public/UCD/latest/ucd/):
Process:
* The “source of truth” is the Unihan database maintained by the CJK/Unihan group, including data maintained by Michel.
- * The CJK/Unihan group posts data files into an internal location.
- * KenW vets these files and posts them to https://www.unicode.org/Public/draft/UCD/ucd/ .
+ * The CJK/Unihan group maintains the data files in the Unicode-internal unihan-tools repo
+ and creates GitHub releases with the /Public data files.
+ * These include RSIndex.txt and RSIndex.pdf which are published in the charts folder, not in the ucd folder.
+ * An infrastructure person copies these files to /Public/draft/ucd, /Public/draft/charts, /Public/{version}/... as appropriate.
* A unicodetools GitHub contributor fetches these files, preprocesses the contents of Unihan.zip,
and creates a pull request as for “regular” data files.
(The processed data files go into .../unicodetools/data/ucd/dev/Unihan.)
@@ -48,6 +50,7 @@ Changes are made in a GitHub pull request.
* Updated files could be shared in various ways including via email or via private FTP areas.
* Updated files should be based on the latest (or fairly recent) data in the unicodetools repo.
* Updated files should not be posted directly to https://www.unicode.org/Public/ ...
+ * We work with an infra person to publish whole UCD/alpha/beta/final data file drops into /Public .

Pull request cycle:
* One commit for manual or contributed data changes.
@@ -81,8 +84,8 @@ https://github.com/unicode-org/unicodetools/tree/main/unicodetools/data/emoji/de
Certain snapshots of the .../dev/ files are copied into https://www.unicode.org/Public/draft/
for Unicode alpha, beta, and final releases, and more as appropriate.
- * UCD files go into https://www.unicode.org/Public/draft/UCD/
- * UCA files go into https://www.unicode.org/Public/draft/UCA/
+ * UCD files go into https://www.unicode.org/Public/draft/ucd/
+ * UCA files go into https://www.unicode.org/Public/draft/uca/
* emoji files go into https://www.unicode.org/Public/draft/emoji/
* etc.
* Inside “draft” there are no folder levels with version numbers.
@@ -104,18 +107,16 @@ script from an up-to-date repo workspace.
The script copies the set of the .../dev/ data files for an alpha snapshot
from a unicodetools workspace to a target folder with the layout of https://www.unicode.org/Public/draft/ .

- Send the resulting zip file to Rick for posting to https://www.unicode.org/Public/draft/ .
- Ask Rick to add other files that are not tracked in the unicodetools repo:
- * Unihan.zip to .../draft/UCD/ucd
-
- TODO: Figure out new process & people replacing Rick in 2025.
+ Send the resulting zip file to an infra person for posting to https://www.unicode.org/Public/draft/ .
+ Ask the infra person to add other files that are not tracked in the unicodetools repo:
+ * Unihan.zip to .../draft/ucd

Note: No version/delta infixes in names of data files.
We simply use the “draft” folder and the file-internal time stamps for versioning.

### Publish an alpha snapshot

- For the alpha review, publish (at least) the UCD and emoji files, and the charts.
+ For the alpha review, publish (at least) the UCD and emoji files, the IDNA files, and the charts.

Review/edit the pub/*.sh scripts and advance the version numbers and copyright years.
@@ -124,10 +125,10 @@ script from an up-to-date repo workspace.
The script copies the set of the .../dev/ data files for an alpha snapshot
from a unicodetools workspace to a target folder with the layout of https://www.unicode.org/Public/draft/ .

- Send the resulting zip file to Rick for posting to https://www.unicode.org/Public/draft/ .
- Ask Rick to add other files that are not tracked in the unicodetools repo:
- * Unihan.zip to .../draft/UCD/ucd
- * alpha charts to .../draft/UCD/charts
+ Send the resulting zip file to an infra person for posting to https://www.unicode.org/Public/draft/ .
+ Ask the infra person to add other files that are not tracked in the unicodetools repo:
+ * Unihan.zip to .../draft/ucd
+ * alpha charts to .../draft/charts

Note: No version/delta infixes in names of data files.
We simply use the “draft” folder and the file-internal time stamps for versioning.
@@ -141,11 +142,11 @@ script from an up-to-date repo workspace.
The script copies the set of the .../dev/ data files for a beta snapshot
from a unicodetools workspace to a target folder with the layout of https://www.unicode.org/Public/draft/ .

- Send the resulting zip file to Rick for posting to https://www.unicode.org/Public/draft/ .
- Ask Rick to add other files that are not tracked in the unicodetools repo:
- * Unihan.zip to .../draft/UCD/ucd
- * UCDXML files to .../draft/UCD/ucdxml
- * beta charts to .../draft/UCD/charts
+ Send the resulting zip file to an infra person for posting to https://www.unicode.org/Public/draft/ .
+ Ask the infra person to add other files that are not tracked in the unicodetools repo:
+ * Unihan.zip to .../draft/ucd
+ * UCDXML files to .../draft/ucdxml
+ * beta charts to .../draft/charts

### Publish a release snapshot
@@ -158,19 +159,19 @@ Verify the final set of files in the draft folder.
Run the [pub/copy-final.sh](https://github.com/unicode-org/unicodetools/blob/main/pub/copy-final.sh)
script from an up-to-date repo workspace.

- Send the resulting zip file to Rick for posting to https://www.unicode.org/Public/ (not .../Public/draft/).
- Ask Rick to add other files that are not tracked in the unicodetools repo:
+ Send the resulting zip file to an infra person for posting to https://www.unicode.org/Public/ (not .../Public/draft/).
+ Ask the infra person to add other files that are not tracked in the unicodetools repo:
* Unihan.zip to .../{version}/ucd
* UCDXML files to .../{version}/ucdxml
* final charts to .../{version}/charts

- This script works much like the beta script, except it:
- * assembles all of the files for Public/ in their release folder structure,
- rather than for Public/draft/
- * creates a zipped/{version} folder with UCD.zip
+ TODO: Starting with 17.0, the folder structure of /Public/draft is the same as that of /Public/{version} .
+ Consider moving the final files from /Public/draft to /Public/{version} rather than running another script.
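
Illustration only (not part of the commit): the move suggested in the TODO could look roughly like the following Python sketch. It assumes that draft/ and {version}/ really do share the same top-level layout, that both live under one local root folder, and that the version folder is named like "17.0.0". The function name `promote_draft` is hypothetical and is not an existing unicodetools script.

```python
import shutil
import tempfile
from pathlib import Path


def promote_draft(public_root: Path, version: str) -> list[Path]:
    """Move every top-level entry of <public_root>/draft into <public_root>/<version>.

    Assumption: draft/ and <version>/ use the same layout (ucd, charts, ...).
    Refuses to overwrite anything that already exists in the target folder.
    """
    draft = public_root / "draft"
    target = public_root / version
    target.mkdir(exist_ok=True)
    moved = []
    for child in sorted(draft.iterdir()):
        dest = target / child.name
        if dest.exists():
            raise FileExistsError(f"{dest} already exists; not overwriting")
        shutil.move(str(child), str(dest))
        moved.append(dest)
    return moved


if __name__ == "__main__":
    # Try it on a throwaway folder tree rather than on real data.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        for name in ("ucd", "charts"):
            (root / "draft" / name).mkdir(parents=True)
        (root / "draft" / "ucd" / "Unihan.zip").write_bytes(b"")
        for path in promote_draft(root, "17.0.0"):
            print(path.relative_to(root))
```

The sketch fails loudly instead of overwriting, since a release folder that already contains files probably needs a human to look at it first.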

### Before a release

+ TODO: Review this section, and merge it into the previous one.
+
When the data files are supposed to be final, about a week or two before the release:

Verify once more that the unicodetools repo .../dev/ files match the released/published files.
@@ -182,7 +183,7 @@ https://github.com/unicode-org/unicodetools/releases/tag/final-15.1-20230908
### After a release

Copy a snapshot of the unicodetools repo .../dev/ files to a versioned unicodetools folder;
- for example: .../unicodetools/data/ucd/16.0.0/ .
+ for example: .../unicodetools/data/ucd/17.0.0/ .
(We no longer append a “-Update” suffix to the folder name.)
List: emoji, idna, security, uca, ucd, ucdxml
Watch for different naming conventions: emoji versions use only two fields, not three.
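
Illustration only (not part of the commit): the naming convention above can be sketched as a small Python helper. It assumes that every listed component other than emoji uses three version fields, as the ucd example does. The helper name `versioned_folder` is hypothetical and does not exist in the unicodetools repo.

```python
COMPONENTS = ("emoji", "idna", "security", "uca", "ucd", "ucdxml")


def versioned_folder(component: str, major: int, minor: int = 0, update: int = 0) -> str:
    """Return '<component>/<version>' relative to .../unicodetools/data/.

    Assumption: only emoji uses two version fields; all others use three.
    """
    if component not in COMPONENTS:
        raise ValueError(f"unknown component: {component!r}")
    if component == "emoji":
        version = f"{major}.{minor}"
    else:
        version = f"{major}.{minor}.{update}"
    return f"{component}/{version}"


if __name__ == "__main__":
    for name in COMPONENTS:
        print(versioned_folder(name, 17))
```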