-
Notifications
You must be signed in to change notification settings - Fork 59
/
reformat.html
287 lines (244 loc) · 7.23 KB
/
reformat.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
<html>
<head>
<title>
REFORMAT - Rewrite a Text File With Fixed Number of Words Per Line
</title>
</head>
<body bgcolor="#EEEEEE" link="#CC0000" alink="#FF3300" vlink="#000055">
<h1 align = "center">
REFORMAT <br> Rewrite a Text File With Fixed Number of Words Per Line
</h1>
<hr>
<p>
<b>REFORMAT</b> is a FORTRAN90 program
which reads an ASCII text file,
and writes a copy, with the property that every line (except
perhaps the last one) has the same number of values on it.
</p>
<p>
This program was "inspired" by a situation in which the data in
a file was not arranged as expected.
</p>
<p>
It was assumed that there were 4 items in every line.
</p>
<p>
But in fact, a free format had been used, and the system "felt free"
to write 3 values per line, resulting in a ragged file of lines
of alternating lengths of 3 and 1.
</p>
<p>
One can easily imagine other cases where 11 values would be written
as 3/3/3/2.
</p>
<p>
And if the system had used another limit for the number of values
per line, we would find that L values had been written as
M/M/M/M/N instead.
</p>
<p>
The purpose of this program is allow for the repair of these
"mistakes" in the simplest case. Thus, we imagine the command
<pre><code>
reformat ragged.txt smooth.txt 4
</code></pre>
would read data from the file "ragged.txt", and write it out 4 words
at a time.
</p>
<p>
In my experience, it has always been the case that several lines
of the ragged file always exactly equaled one line of the smooth
file. But it is easy to imagine more complicated situations.
</p>
<p>
The reasonable thing to do, I think, is to interpret the command
as an attempt to get ALL the information in the ragged file into
the smooth file, using the fixed limit.
</p>
<p>
This decision means the program can also be used to take a
file of four words per line and turn it into one of 3 words per line.
</p>
<p>
This version of the program is only able to deal with files that
contain nothing but real numbers. For a version that works with
abstract 'words', see REWORD below.
</p>
<h3 align = "center">
Usage:
</h3>
<p>
<blockquote>
<b>reformat</b> <i>input</i> <i>output</i> <i>length</i>
</blockquote>
where
<ul>
<li>
<i>input</i> is the file to read;
</li>
<li>
<i>output</i> is the file to create;
</li>
<li>
<i>length</i> is the number of values on each line of the output file.
</li>
</ul>
</p>
<h3 align = "center">
Licensing:
</h3>
<p>
The computer code and data files described and made available on this web page
are distributed under
<a href = "../../txt/gnu_lgpl.txt">the GNU LGPL license.</a>
</p>
<h3 align = "center">
Related Data and Programs:
</h3>
<p>
<a href = "../../cpp_src/crrm/crrm.html">
CRRM</a>,
a C++ program which
reads a text file and writes a copy which has no carriage returns.
</p>
<p>
<a href = "../../cpp_src/deblank/deblank.html">
DEBLANK</a>,
a C++ program which
reads a text file and writes a copy which has no blank lines.
</p>
<p>
<a href = "../../cpp_src/decomment/decomment.html">
DECOMMENT</a>,
a C++ program which
makes a copy of a text file which contains no "comment" lines
(that begin with "#").
</p>
<p>
<a href = "../../f_src/file_transpose/file_transpose.html">
FILE_TRANSPOSE</a>,
a FORTRAN90 program which
makes a "transposed" copy of a file, in which the I-th "word" of the J-th line
becomes the J-th word of the I-th line.
</p>
<p>
<a href = "../../f_src/filum/filum.html">
FILUM</a>,
a FORTRAN90 library which
performs various operations on files.
</p>
<p>
<a href = "../../cpp_src/reword/reword.html">
REWORD</a>,
a C++ program which
reads a text file and writes a copy which has a fixed number
of "words" per line.
</p>
<p>
<a href = "../../f_src/table_columns/table_columns.html">
TABLE_COLUMNS</a>,
a FORTRAN90 program which
will extract the data in certain columns of a file.
</p>
<p>
<a href = "../../cpp_src/uncontrol/uncontrol.html">
UNCONTROL</a>,
a C++ program which
makes a copy of a text file which contains no control characters.
</p>
<p>
<a href = "../../cpp_src/wrap/wrap.html">
WRAP</a>,
a C++ program which
makes a copy of a text file
in which no line is longer than a user-specified wrap length.
</p>
<p>
<a href = "../../cpp_src/wrap2/wrap2.html">
WRAP2</a>,
a C++ program which
wraps long lines in a text file, but which
wraps some lines "early", so as to avoid breaking words.
</p>
<h3 align = "center">
Source Code:
</h3>
<p>
<ul>
<li>
<a href = "reformat.f90">reformat.f90</a>, the source code.
</li>
<li>
<a href = "reformat.sh">reformat.sh</a>,
commands to compile the source code.
</li>
</ul>
</p>
<h3 align = "center">
Examples and Tests:
</h3>
<p>
<ul>
<li>
<a href = "ragged.txt">ragged.txt</a>,
a "ragged" file containing lines of alternating length of 3 and 1.
</li>
<li>
<a href = "smooth4.txt">smooth4.txt</a>,
a "smoothed" file containing lines of length 4.
</li>
<li>
<a href = "smooth7.txt">smooth7.txt</a>,
a "smoothed" file containing lines of length 7.
</li>
</ul>
</p>
<h3 align = "center">
List of Routines:
</h3>
<p>
<ul>
<li>
<b>MAIN</b> is the main program for REFORMAT.
</li>
<li>
<b>CH_CAP</b> capitalizes a single character.
</li>
<li>
<b>CH_EQI</b> is a case insensitive comparison of two characters for equality.
</li>
<li>
<b>CH_TO_DIGIT</b> returns the integer value of a base 10 digit.
</li>
<li>
<b>GET_UNIT</b> returns a free FORTRAN unit number.
</li>
<li>
<b>REFORMAT_FILE</b> rewrites a datafile with a given number of "words" per line.
</li>
<li>
<b>S_TO_I4</b> reads an integer value from a string.
</li>
<li>
<b>S_TO_R8</b> reads an R8 from a string.
</li>
<li>
<b>TIMESTAMP</b> prints the current YMDHMS date as a time stamp.
</li>
<li>
<b>WORD_EXTRACT</b> extracts the next word from a string.
</li>
</ul>
</p>
<p>
You can go up one level to <a href = "../f_src.html">
the FORTRAN90 source codes</a>.
</p>
<hr>
<i>
Last revised on 12 December 2005.
</i>
<!-- John Burkardt -->
</body>
<!-- Initial HTML skeleton created by HTMLINDEX. -->
</html>