-
Notifications
You must be signed in to change notification settings - Fork 1
/
tuv.diff
371 lines (371 loc) · 49.6 KB
/
tuv.diff
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
325d324
< Updated ignore patterns. 2019-10-23T19:03:04+00:00
331d329
< Force unix line endings, to make sure it works ok also on the Windows subsystem for Linux. 2019-10-07T17:52:29+00:00
340d337
< Updating svn ignores for tools/analysers/. 2019-06-14T06:39:23+00:00
345,346d341
< Updating svn ignores. 2019-05-24T09:58:34+00:00
< Updating svn ignores. 2019-05-24T09:45:30+00:00
352d346
< Updated svn ignores. 2019-02-27T10:22:06+00:00
364d357
< Ignore more files, including files that are automatically added to svn when populating a new language. This is done to avoid them showing up as noise for external languages, in which case these files might not be in our svn (but in the external svn repo instead). 2019-01-08T07:01:30+00:00
394d386
< Updated svn ignores. 2018-09-25T08:02:11+00:00
399d390
< More general ignore pattern for tools/mt/apertium/tagsets/. 2018-09-10T11:17:14+00:00
402d392
< Updated svn ignore patterns. 2018-09-08T05:21:34+00:00
412d401
< Updated svn ignores. 2018-08-30T16:17:17+00:00
415d403
< Updated svn ignores. 2018-08-29T05:36:10+00:00
417d404
< Updating svn ignores. 2018-08-28T10:49:42+00:00
431d417
< More things to ignore. 2018-05-14T10:30:20+00:00
445,446d430
< Added svn:ignore for files being output of testing. 2018-03-01T06:37:16+00:00
< Added svnignore pattern for sigma.txt. 2018-02-21T10:00:49+00:00
449d432
< Two more files to ignore. 2018-02-06T09:46:57+00:00
460d442
< Updated svn ignores. 2018-01-31T12:15:22+00:00
489d470
< Updated svn ignores. 2017-12-11T12:54:11+00:00
510,511d490
< Updated svn ignores for tokenisers and grammar checkers + subdirs. 2017-10-11T11:59:35+00:00
< Updated svn ignores for tokenisers and grammar checkers + subdirs. 2017-10-11T11:06:09+00:00
523d501
< Updating svn ignores. 2017-08-25T10:17:49+00:00
537,538d514
< Updating svn ignores. 2017-06-29T00:05:40+00:00
< Updated svn ignores. 2017-06-28T22:59:36+00:00
556d531
< Updated svn ignores. 2017-03-01T12:03:10+00:00
572d546
< Updated svn ignores. 2017-01-30T09:58:10+00:00
638d611
< Updated svn ignores. 2016-06-09T20:05:58+00:00
656d628
< Setting svn ignore patterns on tools/spellcheckers/filters/. 2016-05-10T01:01:42+00:00
677d648
< Ignore more preprocessor files = fst’s. 2016-04-14T16:01:35+00:00
681d651
< Updated svn ignores. 2016-03-15T19:55:06+00:00
684d653
< Use a more general svn ignore pattern in src/morphology/. 2016-03-07T17:11:09+00:00
704d672
< Updated the svn ignore property for recent changes in the infrastructure. 2016-02-16T22:31:44+00:00
709d676
< Updating svn:ignore’s. 2016-02-02T15:31:48+00:00
714,715d680
< Updated svn:ignore’s. 2016-02-02T10:35:45+00:00
< Updated svn:ignore’s. 2016-02-02T09:04:35+00:00
719d683
< Updated svn ignores. 2016-01-25T08:12:56+00:00
731d694
< Updated svn:ignore’s. 2015-11-18T23:09:59+00:00
746d708
< Updated svn ignores. 2015-10-20T07:52:23+00:00
771d732
< Ignore temporary files generated by the speller suggestion test script. 2015-09-02T20:01:50+00:00
815d775
< Ignore txt files in speller dirs. 2015-04-09T11:49:00+00:00
824d783
< Updated svn ignores. 2015-03-14T10:56:07+00:00
829d787
< Updated svn ignores. 2015-03-12T08:28:25+00:00
835d792
< Updated svn ignores. 2015-03-09T10:43:19+00:00
837d793
< Updated svn ignores. 2015-03-06T15:57:33+00:00
840d795
< Updated svn ignores. 2015-03-06T09:24:56+00:00
847d801
< Update svn ignores. 2015-02-27T12:58:52+00:00
883d836
< Special svn:ignore on src/orthography/. 2015-01-26T10:34:37+00:00
897d849
< Updated svn:ignore's. 2015-01-12T21:52:28+00:00
913d864
< Update ignores for src/morphology/. 2014-10-23T08:28:02+00:00
950d900
< Updated svn:ignore's. 2014-09-08T21:42:39+00:00
969d918
< Now also the svn:ignore is updated. 2014-05-28T08:03:59+00:00
973d921
< Svn ignore. 2014-04-09T15:41:19+00:00
976c924,1192
< This language, Turkana, was empty, and goes to the startup heat. 2014-01-24T12:45:39+00:00
---
> [Template merge - und] Make the the xfscript compilers quiet in silent mode, verbose in verbose mode. 2014-01-21T14:01:15+00:00
> [Template merge - und] When running LexC tests, if no tests were found, the test bench will now report that the whole test was skipped. Earlier it reported a pass. 2014-01-20T22:14:05+00:00
> [Template merge - und] Tailored silent build output for Vislcg3. 2014-01-20T20:48:44+00:00
> [Template merge - und] Increased the actual and required version number after a small bugfix in the speller version easter egg, to ensure all generated spellers have proper version info. 2014-01-20T19:04:36+00:00
> [Template merge - und] Corrected a fatal bug for non-latin spell checkers: the error model contained one letter from the easter egg not found in the acceptor. This symbol mismatch is fatal for hfst-ospell, and caused all non-latin spellers to crash (the latin spellers would all have this symbol ('p') anyway, so no problem was noticed earlier). 2014-01-19T16:23:55+00:00
> [Template merge - und] Corrected the compilation of xfscript files such that we still have a general build rule for xfscript files, but now with a following inversion when needed. Also added better feedback on the build steps in silent mode. 2014-01-17T09:37:15+00:00
> [Template merge - und] Updated required and actual version number of gtdcore. The easter egg creation for hfst spellers depends on new files in the core, and also the abbr.txt building does so. Without an updated core e.g. speller builds will fail. 2014-01-15T14:16:08+00:00
> [Template merge - und] Renamed more am-shared files. 2014-01-15T13:56:51+00:00
> [Template merge - und] Renamed topdir-include.am to src-include.am to follow the correct naming pattern. 2014-01-14T16:56:32+00:00
> [Template merge - und] Experimenting with feedback on silent builds (make V=0). Looks good. 2014-01-14T16:44:00+00:00
> [Template merge - und] Added Autotools support for building the abbr.txt file. This file is _not_ included in the regular make commands, one has to cd into the tools/preprocess/ directory, and to 'make abbr' there. This is on purpose. 2014-01-14T13:59:32+00:00
> [Template merge - und] Added a new dir tools/preprocess/ to hold resources for the preprocess utility. 2014-01-14T08:32:20+00:00
> [Template merge - und] Added automatic switch between hfst-foma and hfst-xfst for compiling xfscript files into transducers. hfst-foma is the default, with fallback to hfst-xfst if hfst-foma is not found. There are still issues with hfst-xfst. 2014-01-13T15:37:44+00:00
> [Template merge - und] Moved xfscript compilation out of phonetics-include and hyphenation include. These am-files contained and invert command that combined with an invert command in the actual xfscripts created a meaningless double inversion. 2014-01-13T14:40:29+00:00
> Removed all instances of 'invert net' in xfscript files. In reality it was a double inversion in almost all cases: first inversion within the xfscript, followed by an inversion in the command given to xfst. This double inversion made it impossible to unify otherwise identical operations for the same set of files, and would generally cause confusion as to what was going on. 2014-01-13T14:13:56+00:00
> [Template merge - und] Removed unused twolc.am file. Reorganised the code for twolc and xfscript compilation, to avoid duplicate code and prepare for improvements. Added M4 macro check that either hfst-xfst or hfst-foma is included, hfst compilation is turned off if none of them is. 2014-01-13T13:21:03+00:00
> [Template merge - und] Reduced weights for the easter egg suggestions, to avoid other suggestions to come in between. 2014-01-12T14:26:09+00:00
> [Template merge - und] Easter egg with version info now working in the hfst spellers. 2014-01-12T11:39:04+00:00
> [Template merge - und] Added initial version file for the hfst-based spellers. 2014-01-10T16:25:56+00:00
> Explicit support for local source files and targets for the syntax. 2014-01-10T08:52:27+00:00
> [Template merge - und] Added support for building (compiling into binary form) cg3 files for syntactic functions and dependency graphs. Added a template file for syntactic functions. Made the compiled binary files installable through 'make install'. 2014-01-10T08:22:51+00:00
> [Template merge - und] Added version checking of vislcg3, renamed a couple of variables, and improved configuration feedback a bit. Now we require a vislcg3 new enough to not complain about recent addition of new features. 2014-01-07T14:32:54+00:00
> [Template merge - und] Changed the file order when building zhfst files - there are still issues caused by the index.xml file being non-first. Now it is always first. 2013-12-20T11:02:17+00:00
> [Template merge - und] Finally fixed the libvoikko/zhfst spellers. Ready for Windows! 2013-12-20T09:27:13+00:00
> [Template merge - und; core reorg] Moved the common src/filters/ inside a common/ dir, to allow for other parallel dirs like smi/ and und-Cyrl/ that target only a subset of the languages. At the same time renamed gtshared/ to gtdshared/. This change requires version 0.2.0 of the gtdcore, which is part of this commit. 2013-12-19T08:58:23+00:00
> Removed the OLANG/XXX tag also from dict generators, not only from analysers. 2013-12-18T22:23:39+00:00
> [Template merge - und] Fixed a bug that hindered the GTD core from finding the version info script in the core (as opposed to installed). 2013-12-18T12:52:38+00:00
> [Template merge - und] Added version checking of the GTD core: if the core is too old, configure will stop and print an error message with instructions on how to proceed. Added an external Autoconf M4 macro for version comparison, and renamed the file of an existing module, to be more consistent and explicit in the filenames. 2013-12-18T12:25:42+00:00
> Removed the filter "remove-NG-string.regex" from the analyser-dict-gt-norm.xfst target, in order to allow Use/NG entries in dict fsts. 2013-12-17T13:38:03+00:00
> [Template merge - und] No PCDATA text elements should be on a line of its own, that seems to trip off TinyXML2. 2013-12-14T00:40:54+00:00
> [Template merge - und] Another whitespace change to make TinyXML2 happy. 2013-12-13T13:48:35+00:00
> More white space changes 2013-12-13T12:38:35+00:00
> [Template merge - und] Removed a space that tripped off TinyXML2. Tiny typo correction. 2013-12-13T11:51:55+00:00
> [Template merge - und] Added some default content to the description element, to avoid hfst-ospell to segfault. Corrected e-mail address, some other small corrections. 2013-12-11T22:41:06+00:00
> updating dict templates from und, to include the proper mobile/non-mobile spellrelax, orig_lang and semantic tag removal. 2013-12-06T01:14:18+00:00
> updating from template: r84648, two dict analysers, one with mobile spellrelax, and one without. Also removing certain semantic tags and orig_lang tags which prevent POS from being the first tag, and messing with lookups for NDS 2013-12-06T00:36:34+00:00
> [Template merge - und] Adding possibility to first look for specific regex creation shell script before falling back to a default shell script. This will allow us to create more complex or tailored regexes for certain tag sets (like the semantic tags), while having a reasonable fallback for other cases. 2013-12-02T09:06:13+00:00
> [Template merge - und] Keeping intermediate files didn't work, created an error. Now it works. 2013-12-01T21:15:02+00:00
> [Template merge - und] Fixed a make warning, made generated regex files survive the build. 2013-12-01T09:44:14+00:00
> [Template merge - und] Further cleanup of semantic tag filtering: no processing of semantic filters in the shared makefiles. 2013-11-28T10:45:26+00:00
> [Template merge - und] Added rules to generate regexes automatically from the list of extracted tags. First out is the regex to make semantic tags optional, and another to remove them completely. Also fixed file references in the relabel targets. 2013-11-25T19:41:33+00:00
> docu phon update. 2013-11-25T15:34:58+00:00
> [Template merge - und] Only build one file of tags, using hfst or xfst depending on the configuration. Extract semantic tags. 2013-11-24T13:26:35+00:00
> [Template merge - und] Reverted a change to hfst lexc compilation - the -f option doesn't work. 2013-11-23T10:46:08+00:00
> [Template merge - und] Moved tag extraction from tagsets to filters, as it has a more general use as the basis for dynamic filter construction. Tag extraction now works with both Xerox and Hfst, and handles both prefixed and suffixed tags. 2013-11-22T22:06:46+00:00
> [Template merge - und] Xerox will now stop on lexc syntax errors (done by replacing lexc with xfst - it was impossible to get lexc to stop; this is also how it is done in the old infrastructure). Hfst will not until (hfst_)foma is fixed, because foma doesn't stop on syntax errors. But one is better than none. 2013-11-22T13:10:38+00:00
> [Template merge - und] Removed one harmless but irritating warning. 2013-11-20T19:51:01+00:00
> [Template merge - und] Commented out weighting of the acceptor fst for the zhfst speller file - it causes a segfault in hfst-ospell. 2013-11-19T13:27:08+00:00
> [Template merge - und] Added a filter to remove dynamic derivation. 2013-11-18T09:57:40+00:00
> [Template merge - und] YES! Finally got weighted automata working in the speller. Added missing hfst tools, and sorted all the hfst tools alphabetically. Updated the required hfst to version 3.5.1. Weighted speller automata are now the default (change the weights and what is weighted as needed pr. language). Thanks to Krister Lindén for giving instructions on how to get this working. 2013-11-15T09:25:30+00:00
> [Template merge - und] Changed build files to support Hfst 3.5, requires 3.5. 2013-10-28T09:24:09+00:00
> [Template merge - und] Added LexSub string filter. 2013-10-23T15:14:45+00:00
> [Template merge - und] Changed voikko compression back to zip - gzip isn't voikko compatible. 2013-10-21T18:56:40+00:00
> [Template merge - und] * FINALLY fixed the automake 1.11 vs 1.13 test incompatibilities. Now we can allow version 1.11, and still get the pretty output we want in newer automakes. * Fixed references to GTCORE in test scripts. Earlier we relied solely on it beingset in the environment, now we take it from configure (which can take it from the environment or from a script). 2013-10-21T14:28:09+00:00
> [Template merge - und] One more gzip option fix. 2013-10-21T08:24:32+00:00
> [Template merge - und] Fixed argument structure of gzip - zipping was broken for hfst and gramcheck. 2013-10-21T07:42:57+00:00
> [Template merge - und] Consistently use gzip instead of zip, and find gzip outside any conditionals. 2013-10-18T16:56:26+00:00
> [Template merge - und] Redirected command feedback of the analyser shell script to stderr, to avoid cluttering the analysed text in pipe use. 2013-10-18T13:22:58+00:00
> [Template merke - und] The first lookup shell script added, with supporting infrastructure. Part 2 - now with shell script and Makefile. It's possible to make again. 2013-10-17T16:11:08+00:00
> [Template merke - und] The first lookup shell script added, with supporting infrastructure. Part 1 - no actual shell script, no Makefile. Comming in the next commit. NB! Right now making and building will break, sorry for the inconvenience. 2013-10-17T15:51:01+00:00
> [Template merge - und] Added option to automatically create a language home dir environment variable. The idea is that by setting this variable, we can reliably find transducers in the working copy dirs of the users. The default is to not do anything (but give a warning). As part of this change, I switched the shell from sh to bash, as I don't know how portable the extra code is with respect to other shells. 2013-10-17T07:56:17+00:00
> [Template merke - und:] Changed back the Automake requirement to 1.11 - 1.12 is creating too much trouble. We'll have to see what to do with the test output - the version requirements change must be followed by another change that will substantially degrade test reports on newer automakes. 2013-10-16T17:19:36+00:00
> [Template merke - und:] Made the check for GTCORE functional, looking for both the gt-core.sh script (and using its output if found), and the environment variable $GTCORE. This means that there is no need anymore to set the GTCORE variable as long as one configure, make and make install in the gtcore directory. 2013-10-16T16:50:46+00:00
> [Template merge] Corrected bug/feedback e-mail address to one actually working. 2013-10-14T07:00:14+00:00
> [template merge] Made LexC compilation break on error, at least for Xerox (Hfst only gives a warning for the same error tested). 2013-10-11T16:38:43+00:00
> [Template merge] Moved the compilation of remove-illegal-derivation-strings.regex from all langs to only the three Sámi langs actually using it. Even though potentially useful for more languages, it can hardly be considered a language universal... 2013-10-11T14:50:41+00:00
> [template merge] More build rules for the grammar checker. Now it will install. 2013-10-09T09:52:01+00:00
> [template merge] Corrected the --enable-grammarchecker option testing. 2013-10-08T17:02:38+00:00
> [template merge] Changed the order of the configure macros, to allow for testing for program availability when checking the enable options. 2013-10-08T16:55:28+00:00
> [template merge] Forgot to add the new Makefile to configure.ac. 2013-10-08T15:33:52+00:00
> [template merge] Added basic build infrastructure for a CG-based grammar checker. No template source files added yet, as this is still pretty experimental. The grammar checker is disabled by default (naturally). 2013-10-08T14:19:19+00:00
> Template merge: Copy-paste error introduced scanning of a subdir test that doesn't exist for any language but SME. Now corrected. 2013-10-04T15:25:56+00:00
> Template merge: Reorganised the phonetic build code to better support parallel phonetic transcription depending on the source language of loan words and foreign names. 2013-10-04T14:31:06+00:00
> Added check for the availability of 'see' when testing, to avoid bad fails on systems without 'see'. 2013-10-04T08:37:39+00:00
> Tempate merge: Added config feedback about vislcg3/syntactic parsing status. Added config check for the see tool (SubEthaEdit). 2013-10-04T07:53:49+00:00
> Template merge: Remove copying of the timestamp file for non-maintainers. It breaks the automatic merge, and requires a revision-explicit merge for each such language. Also added removal of originating language tags - they are only used in TTS. 2013-10-04T07:05:34+00:00
> Added compilation of the remove-orig_lang-tags filter. Sorted the filter targets alphabetically within each logical block. Template merge. 2013-10-04T05:20:26+00:00
> Improved and corrected configure feedback for spellers. Template merge. 2013-10-03T13:27:58+00:00
> Template merge: * Now all speller fst's are turned off by default (I missed a few in the previous commit). The configure feedback is slightly improved. * Corrected syntax error in a test. Improved config feedback further. 2013-10-03T12:44:18+00:00
> Template merge: Changed the default setup to only include morphological analysis and generation. This is done to reduce the build time during regular development. This means that to build spellers and other specialised fst's, they must now be enabled using ./configure. Cf. bugzilla #1710: http://giellatekno.uit.no/bugzilla/show_bug.cgi?id=1710. 2013-10-03T10:16:39+00:00
> editing. 2013-09-28T18:38:15+00:00
> Corrected filter order for the text2X transcriptors. Template merge. 2013-09-20T14:00:35+00:00
> Completely redid the text2num etc transducers. The previous solution was in the wrong place, and didn't incorporate the actual filtering. Now it does, but whether this is the way it should be needs to be tested. Template merge. 2013-09-20T13:19:16+00:00
> Another Xerox error correction - we're using LexC, not Xfst. Skipped the result stack - not needed. Finally the basic compilation works. Template merge. 2013-09-20T12:20:17+00:00
> Corrected Xerox error. Template merge. 2013-09-20T09:20:59+00:00
> Added the inverse transcriptors, to go from text to numerical expressions. Template merge. 2013-09-20T09:07:54+00:00
> Wrapped phonetic / IPA conversion in a configure option, default is 'no'. Now compiling SME with Xerox should be back to normal speed again. Template merge. 2013-09-19T19:34:37+00:00
> Added Remove ACR filter. Template merge. 2013-09-06T13:18:25+00:00
> Added compilation of the filters for the orthographic tags, and added removal of them and the IPA strings in all regular fst's. Template merge. 2013-09-06T10:18:27+00:00
> Added missing hfst tool hfst-fst2strings to the M4 autoconf macros. Template merge. 2013-09-03T09:37:36+00:00
> Forgot to rename a variable after copy-paste. Template merge. 2013-08-29T07:34:06+00:00
> Reorganised the build code for dictionaries, added a dictionary option for configure (disabled by default), and added the new filter for mobile keyboard spellrelax. 2013-08-29T05:51:16+00:00
> Several bug fixes for the apertium build targets. Now it seems to work correctly for both sme and sms, and thus hopefully all languages. Template merge. 2013-08-19T07:31:48+00:00
> [bugfix] hfst-substitute can't take lookup-optimised fst's as input. Template merge. 2013-08-18T09:32:37+00:00
> [bugfix] Removed a sma-specific filter that had crept in and stopped compilation. Added att output fst to the default apertium analyser target. Template merge. 2013-08-17T13:25:23+00:00
> Added support for building apertium transducers to all languages. It requires the use of a configure flag, i.e. it is disabled by default - --enable-apertium if you want to test. 2013-08-17T12:25:27+00:00
> Added remove-variant-string.regex, for removing strings containing +v2, +v3, +v4, +v5, but not removing +v1. (template merge). 2013-08-14T07:54:51+00:00
> Change echo to printf for cross-platform compatibility. Template update. 2013-08-12T15:03:43+00:00
> Improved error handling in testing shell scripts. Template update. 2013-08-12T08:41:50+00:00
> Merged last template changes (tagset updates by Fran). 2013-08-10T17:34:12+00:00
> docu 2013-07-13T18:14:22+00:00
> Added and formatted documentation. 2013-07-09T10:12:05+00:00
> Added ref to twolc documentation file. 2013-07-09T10:10:16+00:00
> Documentation update: Added a file WhatIsThis, it shall contain a short explanation to outsider, as the name tells. 2013-07-09T10:08:36+00:00
> Renamed refs to template dir in preparation for support for multiple template dirs. Template update. 2013-06-29T20:44:20+00:00
> Commented out examples of error models for string and word pairs - they would in most cases add symbols to the error model not found in the acceptor, and this combination would crash the speller badly. Template update. 2013-06-25T08:17:53+00:00
> Cleaned up speller fst building, removing all unnecessary inverts and streamlining the code. Prepared for the introduction of weights, but commented out for now because of bugs or inefficiences in openfst. Renamed the included hfst speller build file, to follow an emerging naming standard for the include files. 2013-06-13T14:05:40+00:00
> Added support for making variant analysers and generators using the Apertium tag convensions. The generated transducers are still not fully Apertium-compatible but they are a major step forward. Template update. 2013-06-12T13:44:33+00:00
> Renamed analyser-raw-gt-desc.hfst to generator-raw-gt-desc.hfst, to make the behavior in hfst-lookup explicit and clear. Still, the "generator" behaves as the Xerox "analyser" in hfst when in comes to composition and filtering. Confusing, I know. Template update. 2013-06-11T09:48:17+00:00
> Build the filter to remove CLB strings from speller transducers, and use it. Template update. 2013-06-10T13:40:31+00:00
> Added missing hfst tools. Removed commented-out code in the index.xml file. Template update. 2013-06-10T12:25:27+00:00
> Removed the ocr error model from the zhfst building, it causes libvoikko 3.4 to segfault. Template update. 2013-06-07T06:28:04+00:00
> Added an explicit copy operation into the hfst speller dir, to facilitate local modifications of the speller transducer before further processing, by just replacing the copy operation with whatever is needed. Template update. 2013-06-07T00:03:18+00:00
> Added string pairs and whole-word corrections to the speller error model. Added support for an ocr error model. Removed obsolete Voikko config file. Corrected bugs in the hfst M4 macros. Template update. 2013-06-06T23:22:21+00:00
> Moved the initial spell checker processing to the top spellchecker dir, to serve as the default starting point for all spell checkers. Template update. 2013-06-06T07:58:35+00:00
> Added a tagset directory in preparation for generating Apertium transducers automatically. Corrected and expanded a few M4 macros for the hfst tools. Template update. 2013-06-05T12:41:41+00:00
> Added support for testing analysers and generators only. For several of our more specialised transducers, this is more practical and useful than always generating both pairs of transducers to test both directions. 2013-05-09T09:05:51+00:00
> Corrected the existing oahpa transducer. Added dummy hfst oahpa target. Template update. 2013-05-07T00:27:03+00:00
> [bugfix] Corrected a bug in the hyphenator hfst build: fst's must be inverted in hfst. Template update. 2013-04-30T20:42:31+00:00
> [bugfix] Corrected another copy-paste error that broke speller fst's. Template update. 2013-04-27T07:36:43+00:00
> Splitted and renamed the remove-morph-border filter. Rewrote a number of targets to reflect this. There are now three filters instead of one, to allow for more flexible fst building for speech processing. 2013-04-26T12:49:21+00:00
> Added gzip compression of foma speller transducer, and proper checks for prerequisites. Foma spellers can now be disabled, they are enabled by default. Template update. 2013-04-24T11:21:36+00:00
> Corrected a bug when building foma-based spellers. Changed one fst filename to follow the naming scheme for the new infra. Improved building of the zfst speller file. 2013-04-24T07:02:32+00:00
> For some reason wasn't the und.timestamp file updated during a template merge earlier this week. Now done. 2013-04-19T11:32:09+00:00
> Added processing of new filters. Template update. 2013-04-18T10:55:34+00:00
> Do not try to build hfst-based tools if hfst building is not enabled. Template update. 2013-04-15T09:46:48+00:00
> [feature] Moved some of the fst-speller building one level up, and added support for building foma-based spellers. Template update. 2013-04-11T16:51:27+00:00
> Renamed phonetics source and target files to reflect the actual purpose. Template update. 2013-04-10T05:50:34+00:00
> Add possibility to build morph segmenting automaton. Template update. 2013-04-09T21:56:28+00:00
> Added a top-level misc/ dir to hold private / non-svn files needed during development of the language. All files are ignored. 2013-04-09T19:54:22+00:00
> [bugfix] Corrected hfst text2ipa fst: the final fst needs to be inverted before being used in lookup. Template update. 2013-04-08T06:58:53+00:00
> [bugfix] Corrected the homonymy and variant filters used for generators - those tags should be optional, not completely removed. Template update. 2013-04-05T12:35:06+00:00
> [infra] We require gawk specifically, not any awk whatsoever. Improved config feedback. Template update. 2013-04-05T10:16:54+00:00
> [bugfix-infra] Corrected reference to the built fst's. Template update. 2013-03-20T17:03:42+00:00
> Updated the zhfst building to reflect recent changes in Voikko. There is now official support for zhfst speller files, but with a new location and no *.pro file. Also added simple support for local loading of the zhfst file - voikkospell requires that the file is located within a dir named '3'. 2013-03-18T09:34:55+00:00
> Further improvements to the test run output. Template update. 2013-03-13T18:26:55+00:00
> More tweaks to make the test output compact and readable. Template update. 2013-03-13T16:27:15+00:00
> Moved Oahpa transducer compilation to a separate (included) file, and added support for compiling dictionary transducers, also in a separate include file. Template update. 2013-03-13T11:48:35+00:00
> We need the last part of the path to properly identify the lexc file tested. Template update. 2013-03-13T10:16:59+00:00
> Made the morph-tester test runner (LexC and YAML tests) less verbose. All messages are one-liners, except for FAILs. Commented the code. Template update. 2013-03-13T09:26:17+00:00
> More thorough cleaning in src/morphology/. Template update. 2013-03-12T08:03:23+00:00
> Moved the definitions of the transducer variables to the Makefile.am, to make it possible to extend them by local modifications. Template update. 2013-03-11T09:13:03+00:00
> Forgot to update the src/filter/Makefile.am file. Template update. 2013-03-07T15:04:55+00:00
> Split the filter 'remove-dictionary-tags' in two to remove homonymy and variant tags separately. Template update. 2013-03-07T14:47:53+00:00
> Added filter to remove NGminip strings, ie paths that should not be used for generating miniparadigms in dictionaries. Template update. 2013-03-07T11:11:14+00:00
> Added infrastructure for building fst's for list-based spellers. The actual building is not yet implemented. Template update. 2013-03-06T07:19:50+00:00
> Remove doc build dir when cleaning. Template update. 2013-02-27T07:12:03+00:00
> Deleted files in obsolete locations. Moved one file not previously moved. 2013-02-26T23:57:43+00:00
> Forgot to update the config file. Template update. 2013-02-26T23:45:39+00:00
> Reorganised the tools/ dir to fit better with coming development. Template update. 2013-02-26T23:26:57+00:00
> Second part of update to handle validation of generated documentation. Essentially whenever new documentation is created due to source files being changed, a forrest site is built. During that build process, any blocking issues with the generated jspwiki pages will be revealed, thus going a long way towards ensuring that such errors do not end up in svn and from there block the building of our public sites. 2013-02-21T12:57:26+00:00
> Added check for forrest as part of configuring the documentation extraction. Forrest will be used to validate the jspwiki documents during the build, to avoid that invalid documents enter the svn repository and corrupts the web page building. First step towards that goal. 2013-02-18T18:08:44+00:00
> Upped the required automake version from 1.11 to 1.12, to avoid all hassles with the test harnesses and backwards compatibility. Template update. 2013-02-14T15:45:55+00:00
> These files should have been removed in the earlier commits regarding changes to the test bench, but where lost during the template merge earlier today, and not noticed until now. Finally deleted. 2013-02-14T15:26:19+00:00
> Even more portable testing... 2013-02-14T15:14:52+00:00
> Even more portable testing... 2013-02-14T10:44:31+00:00
> Improved portability & correctness of conditional tests in the morphology testing. Template merge. 2013-02-14T09:13:00+00:00
> Major update to the LexC testing. Now test data directly in the LexC code is supported by the python test script morph-tester.py (it reads the lexc files directly), which solves the bugs with multiple wordforms for the same morphosyntactic inflection. It is also a bit faster than the awk solution, and allows an unlimited number of different transducers to be tested dynamically directly in the lexc code. 2013-02-13T18:24:04+00:00
> Two more source files copied from gt/sme/src/. Template update. 2013-02-11T14:10:37+00:00
> Generated files checked in. 2013-01-26T12:25:46+00:00
> Added generated file. 2013-01-24T08:08:14+00:00
> Added links file. 2013-01-24T08:07:47+00:00
> Finally found out how to get the old test behaviour back. We want the serial tests, because it gives direct feedback to the linguists. Automake 1.13 uses parallel testing by default, which logs all test results to files. 2013-01-23T23:23:07+00:00
> Added support for processing twolc files for documentation extraction. 2013-01-23T21:41:04+00:00
> Some files may contain digits in their filename. Extended the filename match pattern for the Links target. Template update. 2013-01-23T21:02:34+00:00
> Added support for automatically building a file with links to each individual jspwiki file generated based. Template update. 2013-01-23T20:24:41+00:00
> Aajege has only been involved with the SMA source... 2013-01-23T16:14:04+00:00
> Forgot to add the jspwiki preamble file. Now added. 2013-01-23T15:11:44+00:00
> Forgot to add support for the conditional CAN_DOCC in the previous commit. Template update. 2013-01-22T17:38:39+00:00
> * Added initial support for extracting documentation from comments in the source code. Only jspwiki supported initially. * Also added initial support for extracting test data from source code comments. Only yaml tests in lexc is supported initially. 2013-01-22T16:48:30+00:00
> The final fix to get the XML-to-LexC conversion working on Cygwin. Template update. 2013-01-11T14:35:43+00:00
> Concatenate all LexC source files into one file explicitly, instead of letting hfst-lexc do it. This is more robust cross-platform, and makes the file used for transducer compilation easily available for debugging. Template update. 2013-01-11T12:47:29+00:00
> Corrected the host detection test for Cygwin. Template update. 2013-01-11T09:51:07+00:00
> Template updates: * Added support for XSL conversion of XML source files on Cygwin. * Made spell-relax a language-specific file. 2013-01-11T09:24:52+00:00
> Made Voikko support optional instead of required. Template update. 2013-01-10T10:38:26+00:00
> Rewrote LexC and TwolC Xerox rules to make them work on Cygwin: the Windows Xerox tools need a script file as input, the scripts can't be piped in as on *nix systems. Removed the hack in the previous commit. The bug can be worked around by avoiding linebreaks in the piped script. 2013-01-10T09:40:39+00:00
> Added hack to work around a very strange bug in LexC transducer saving - the filename is slightly garbled if the save command is passed in from a script generated by a make file (but the same command passed in from a manually typed script works correctly). This hack is required for the new infra to work on the virtual Linux machines gtlab, gtoahpa, at least. The hack should be removed as soon as we have a correctly working LexC (the broken LexC is the newest one). 2013-01-08T10:06:02+00:00
> More robust Saxon/Java setup: no need to define CLASSPATH. The M4 macros will look for a couple of predefined pathnames, and pick the first saxon9he.jar file it finds. More locations should be added as needed. Also corrected the logic for reporting whether xslt transformation could be enabled or not, and added a warning if xml source files are found but no xml transformations could be enabled. 2013-01-04T17:18:53+00:00
> Require at least HFST 3.4 - it includes all backends, and simplifies dependency handling quite a bit. Template update. 2013-01-03T16:32:23+00:00
> Fixed parsing of regexes for hfst, due to a bug in hfst-regex2fst when parsing regexes with comments after the regex is closed. Template update. 2012-12-19T09:22:48+00:00
> Refactored the yaml test code, moving duplicate parts to a separate file. Makes for much easier adaption to new transducer types, as well as much simplified maintenance. The svn merge test from sma turned out well - renaming files ahead of the merge seems to make the merge go smoother. 2012-12-18T14:12:10+00:00
> First step in making the digit transcriptor transducers work. The transducers are compiled, and are given proper names according to the fst naming conventions, and the Xerox transducers work in the digit-2-string direction. The Hfst transducers do not yet work (segmentation fault due to running out of memory because of an infinite recursion), and the string-2-digit direction is not yet in place. Still, moving forward towards a working system. 2012-12-13T00:03:29+00:00
> Made the yaml test scrips obey configuration options, ie only run the hfst tests if hfst is turned on at configuration time. Template update. 2012-12-12T21:52:39+00:00
> new dummy from sma, this one with uppar:lower like for number. 2012-12-12T00:07:18+00:00
> new dummy from sma, this one with uppar:lower like for number. 2012-12-12T00:04:15+00:00
> Automake requirement reduced to 1.11, after getting confirmation that that version is fine for finding Python (the main problem issue that triggered the version requirement). Template merge. 2012-11-29T09:44:43+00:00
> There's too much trouble with finding the correct Python version when using Automake v.1.10, causing a lot of frustration and wasted time for users. We thus require 1.12 from now on. Template update. 2012-11-29T07:30:10+00:00
> The noun lemma generation test script has been updated to only test the transducer types that have been turned on at configuration time. Template update. 2012-11-28T13:21:53+00:00
> This is after running the update-all-from-core.sh. 2012-11-21T09:34:18+00:00
> Grep out comments from regex files in orthography/, as there is a bug in hfst-regexp2fst. Also added an initial Hunspell include file. Template update. 2012-11-14T15:37:23+00:00
> Several updates: * slightly improved feedback from the configure script * improved hfst spell checker building * added basic support for building Oahpa transducers, *disabled* by default 2012-11-14T11:32:13+00:00
> Renamed 'dictionary' `spellerautomaton` in giellatekno.m4. The old variable name and printouts were confusing - 'dictionary' has manh meanings, and some very concrete ones in the context of the GT/Divvun work. Template update. 2012-11-14T05:28:47+00:00
> Minimise after every compose operation - always. Template update. 2012-11-12T08:21:41+00:00
> Bug fixes to the Saxon/Java configuration. Template update. 2012-11-06T13:55:09+00:00
> Call saxon checks from confugre. In previous commit: Ubuntu version of xml2lexc with autostuff. Template update. 2012-11-05T15:19:50+00:00
> Only check hfst version if requested using '--with-hfst', otherwise disable. Likewise, disable xfst if requested and print warning if both are disabled. Template update. 2012-11-05T13:44:16+00:00
> Added simple feedback to autogen.sh, valuable when processing many languages. Some reformatting of am-shared/hfst-spellchecker-include.am. 2012-10-30T08:51:56+00:00
> Fixed a simple syntax error in src/hyphenation/hyphenation.xfscript and src/phonetics/convert2ipa.xfscript. Now all languages finally build cleanly, thus making real errors easier to spot. 2012-10-30T07:57:50+00:00
> Removed double inversion from the hfst generator - it didn't work. Template update. 2012-10-27T09:41:29+00:00
> Tried to silence the build of transducers etc, but its effect is not very profound. Still it does make for nice and readable breaks in the stream of compiler messages, so I think it is an improvement. Merge from the template. 2012-10-26T12:14:45+00:00
> Make the silent build rules backwards compatible. Template update. 2012-10-26T09:42:47+00:00
> Enabled spellrelax functionality. Simplified compilation of hfst regex expressions. Corrected hfst filter compilation. Took the first steps in silencing the verbose make output. Update from the template. 2012-10-26T01:57:50+00:00
> hfst-preprocess-for-optimized-lookup-format has been removed from the hfst distribution. Merge from template. 2012-10-10T17:19:56+00:00
> Always fail hfst check if hfst-info can't be found. As it was, really old hfst installations were accepted as good-enough, which broke compilation really bad. Merge from template. 2012-10-09T16:28:09+00:00
> Add requirement of foma for hfst compilation; remove distinction between WANT_[HX]FST and CAN_[HX]FST. Merge from template. 2012-10-09T05:11:05+00:00
> Remove NBSP; Inspired by svn r63631, went through all files in gt and langs 2012-10-05T09:30:48+00:00
> The local modifications to Makefile.am files must be before the fallback pattern targets, it seems, otherwise the fallback targets are used. Template merge. 2012-10-04T15:33:20+00:00
> I believe I finally have fixed the yaml testing shell scripts, at least testing now works as intended in fao, izh, sma and smj. Merge from the template. 2012-10-04T10:24:44+00:00
> Escaping in the yaml test scripts didn't work - removing the single quotes did. Also added an underscore in front of the transducer string in the yaml testing, to avoid that the test scripts get too greedy when we get more transducers and test data. Merge from template. 2012-10-04T08:39:54+00:00
> Forgot to escape the single quotes used within the backtic expression. Small variable correction in the yaml test bench. Merge from template. 2012-10-03T16:55:27+00:00
> Corrected fail check in noun lemma generation test. Template merge. 2012-10-03T11:01:29+00:00
> Split the yaml test runner in two, one for norm and one for desc transducers, and updated the autoconf file correspondingly. Updated the lemma generator test to work with the renamed transducer. Made all test runners more robust. 2012-10-03T10:37:53+00:00
> Renamed the yaml test runner in anticipation of changes coming from the template. 2012-10-03T09:51:47+00:00
> Renamed all existing targets to follow the naming scheme defined at http://divvun.no/doc/infra/infraremake/TransducerNamesInTheNewInfra.html. Also added making of true normative and descriptive analysers and generators, as well as moved all of the hfst speller building to the tools/spellcheckers/hfstspeller/ dir. More explicit separation of local and central code in src/. 2012-10-02T16:40:19+00:00
> Added a simple header to the beginning of the compilation, to make it easier to spot each new language when building all languages in $GTHOME/langs/. Merge from the template. 2012-10-02T14:05:39+00:00
> With the recent fixes to regexp parsing in hfst-regexp2fst it was possible to bring the hfst compilation up to par with the Xerox compilation. In principle the Xerox and Hfst transducers should behave exactly the same - any deviation is a candidate bug in either the Xerox or the Hfst tools. This update requires hfst 3.3.14 to work properly, the requirement is added to the configure.ac file. 2012-10-02T10:33:50+00:00
> Removed references to newinfra/. Corrected info in INSTALL. 2012-09-25T18:04:03+00:00
> Added warning about missing YAML testing, with short instructions on how to enable them. Template update. 2012-09-25T06:19:32+00:00
> The top-level syntax include AM file had not been changed to reflect the rle->cg3 suffix change. Merge from template. The previous merge was incomplete due to a bug in the merge script. 2012-09-21T14:16:04+00:00
> Corrected a bug in the default generate-noun-lemmas.sh test script. Made file references more robust. Update from template. This also clears any warnings left over from the cg3 file renaming, such that we get a clean merge in the next template update. 2012-09-20T09:26:56+00:00
> The VislCG3 team has lately switched to a *.cg3 suffix. Now we do the same in the new infra - the new suffix is definitely more transparent. 2012-09-19T18:26:12+00:00
> Variables=cleaner code. Update from the template. 2012-09-18T06:56:51+00:00
> Updated the yaml test runner to properly report the exit value of the yaml tests, and also to give directions for how to see the details of each test if it failed. Update from the template. 2012-09-17T14:10:42+00:00
> Corrected typo in shell scripts. 2012-09-17T11:03:08+00:00
> Several testing shell script updates: correct exit value when data files are not found, proper use of Autoconf-made variables (will free the test scripts from relying on the user setting up environment variables), and better checks on the availability of test data for the lemma and replaced all hard-coded file refs with variables in the noun generation test. 2012-09-17T09:30:31+00:00
> Added check for the Xerox lookup tool, which also defines the LOOKUP variable. Update from template. 2012-09-17T08:14:29+00:00
> Reorganised AC processing of shell scripts to be more future-proof and avoid annoying (and useless) warning from chmod. Added AC variable to the AC-processed shell script to make casual by-lookers aware of the fact that the resulting shell script file is generated by AC. 2012-09-17T07:37:40+00:00
> Corrected error in previous commit. Finally things are working as they should. It might be necessary to run ./autogen.sh and ./configure before compilation is running smoothly again. 2012-09-15T12:52:14+00:00
> Forgot to update configure.ac. 2012-09-15T10:43:23+00:00
> Refined the yaml test runner: more informative banner, ignore extra analyses (= removes false alarms). Merge from template. 2012-09-15T08:17:53+00:00
> Added basic setup for running YAML tests in the test/src/morphology/ dir. The default setup will run all *.yaml files found in this dir, but this can be modified in the shell (*.sh.in) script. If there are yaml files in that dir, they will be automatically run by 'make check'. 2012-09-14T19:28:03+00:00
> Enable yaml tests by magic. Merge from the template. 2012-09-14T11:54:49+00:00
> Added conditional support for running python-based tests in test/src/morphology. 2012-09-14T11:27:52+00:00
> Added checks for Python 3.1+ and py-yaml, and defined CAN_YAML_TEST. The idea is that we will run the python-based tests only if the prerequisites are available to us, and skip them if not. 2012-09-14T10:54:10+00:00
> Added support for transcribing transducers, ie transducers that change the input from one orthographical representation to another, e.g. date and time expressions as strings or digits to the opposite form. 2012-09-10T10:37:35+00:00
> Renamed the default error model file, to follow the naming scheme used in the zhfst guidelines. 2012-09-10T09:30:47+00:00
> Renamed the default error model file, to follow the naming scheme used in the zhfst guidelines. This makes compilation much easier, and should cause the present makefile to actually build spellers. Tommi already did this for FIN. 2012-09-10T09:25:01+00:00
> Don't remove the *.tmp files - that destroys the dependency relationships for (auto)make, which forces a full recompilation of all target fst's, and a lot of extra waiting time. 2012-09-10T09:00:36+00:00
> Add missing src to hfst spellchecker automaton path. Merge from template. 2012-09-08T13:00:22+00:00
> ign 2012-09-08T12:54:23+00:00
> Added missing reference to dialect tag filter. Update from the template. 2012-09-08T08:42:58+00:00
> Updated my simplistic noun generation script to be aware of its new location. 2012-09-07T14:48:50+00:00
> Reorganised the test dir, in anticipation of a larger set of tools and source types in need of testing. Merge from the template. 2012-09-07T13:49:17+00:00
> Added test/data/typos.txt to hold a list of collected typos. The list is used both for testing spellers, and as part of the preprocessor used with the Xerox lookup tool. 2012-09-07T06:27:36+00:00
> Major template update of all languages (except those already updated by Jack): * proper tag deletion of tags only used for transducer manipulation, not for analysis (manipulations mostly not yet implemented) * making optional some tag sets for the generators * updated README with correct and working instructions for first time installers, also for svn users * added hooks for easily adding language-specific operations on transducers * silenced the und.timestamp message unless you are a GTMAINTAINER (thanks to Tommi) => more synchronized template merges, less noise for regular users * a number of other small fixes 2012-09-06T19:19:58+00:00
> Autoconf updates from the template. Intended goal: better hfst testing before enabling it. 2012-08-30T05:39:16+00:00
> Added border removal to the basic analyser and generator, such that they become useful. Also changed the order of the dir processing in src/, to ensure that the filters are built before they are needed. 2012-08-29T03:12:39+00:00
> Added border removal to the basic analyser and generator, such that they become useful. Also changed the order of the dir processing in src/, to ensure that the filters are built before they are needed. 2012-08-29T03:06:20+00:00
> Corrected syntax error. 2012-08-29T01:46:18+00:00
> Made the first test script more robust: it bails out if no transducer is found, and gives basic feedback to whether it is testing Xerox or Hfst. The test data files are not deleted after the test run, so that they can be easily inspected if needed, even after a successful test run. 2012-08-29T01:34:20+00:00
> Added the first test script: it tests whether noun lemmas do generate. The script does contain some language-specific bits, and must thus be adapted to the requirements of each language. 2012-08-29T00:17:53+00:00
> Corrected reference to inituppercase.?fst. Template update. 2012-08-28T19:34:40+00:00
> Corrected compilation of hyphenation rules. Template update. 2012-08-28T18:44:51+00:00
> Corrected compilation of phonetic/orth2ipa rules. Merge from the template. 2012-08-28T17:41:58+00:00
> Added basic structure for hyphenation and conversion to IPA. Merge from the template. 2012-08-28T05:45:14+00:00
> Added Hunspell dir. Merge from templates. 2012-08-27T16:44:43+00:00
> Two template updates at once: 2012-08-27T15:48:00+00:00
> Added build support for xml source files. Updates from the template. 2012-08-27T07:33:38+00:00
> Added initial support for xml source files. NB! The support isn't fully according to GNU (autotools) standards yet, but will have to do for the moment. 2012-08-25T07:24:18+00:00
> A lot of cleanup and corrections: * suffix rules in more places (although not in all - that is not possible) * removed automake warning about pattern rules - we need them * checked all *-include.am files for consistency, added missing Xerox and HFST targets were needed, corrected vars to HFST tools, added comments and generally made the files easier to maintain (I hope). 2012-08-24T13:47:44+00:00
> Added Autoconf processing of the Makefile.am files in test/. Update from the template. 2012-08-22T18:08:58+00:00
> Updated Test dir with subdirs and make-files. Updates from the template. 2012-08-22T13:36:55+00:00
> Initial ATR harmony implementation (overgenerates) 2012-07-06T18:30:13+00:00
> Few verbs to test 2012-06-27T22:51:13+00:00
> Merge filters and correct speller script nicely to tuv 2012-06-27T18:35:00+00:00
> Merge from core: Capital initials for Xerox transducers, more comments. 2012-06-21T22:21:51+00:00
> More merge testing, now fixing the xfst script file extension (Xerox fst compilation) for TUV. 2012-06-21T09:07:06+00:00
> Basic data for tuv experiment 2012-06-21T08:03:08+00:00
> Add initial stuff for turkana 2012-06-21T00:35:48+00:00