|
1 | 1 | [[analysis-stemmer-tokenfilter]]
|
2 | 2 | === Stemmer Token Filter
|
3 | 3 |
|
| 4 | +// Adds attribute for the 'minimal_portuguese' stemmer values link. |
| 5 | +// This link contains ~, which is converted to subscript. |
| 6 | +// This attribute prevents that substitution. |
| 7 | +// See https://github.com/asciidoctor/asciidoctor/wiki/How-to-prevent-URLs-containing-formatting-characters-from-getting-mangled |
| 8 | +:min-pt-stemmer-values-url: http://www.inf.ufrgs.br/~buriol/papers/Orengo_CLEF07.pdf |
| 9 | + |
4 | 10 | A filter that stems words (similar to `snowball`, but with more
|
5 | 11 | options). The `language`/`name` parameter controls the stemmer with the
|
6 | 12 | following available values:
|
@@ -47,7 +53,7 @@ http://dl.acm.org/citation.cfm?id=1141523&dl=ACM&coll=DL&CFID=179095584&CFTOKEN=
|
47 | 53 | http://www.illc.uva.nl/Publications/ResearchReports/MoL-2003-02.text.pdf[indonesian],
|
48 | 54 | http://www.ercim.eu/publication/ws-proceedings/CLEF2/savoy.pdf[light_italian],
|
49 | 55 | http://dl.acm.org/citation.cfm?id=1141523&dl=ACM&coll=DL&CFID=179095584&CFTOKEN=80067181[light_portuguese],
|
50 |
| -http://www.inf.ufrgs.br/\~buriol/papers/Orengo_CLEF07.pdf[minimal_portuguese], |
| 56 | +{min-pt-stemmer-values-url}[minimal_portuguese], |
51 | 57 | http://www.inf.ufrgs.br/\~viviane/rslp/index.htm[portuguese],
|
52 | 58 | http://doc.rero.ch/lm.php?url=1000%2C43%2C4%2C20091209094227-CA%2FDolamic_Ljiljana_-_Indexing_and_Searching_Strategies_for_the_Russian_20091209.pdf[light_russian],
|
53 | 59 | http://www.ercim.eu/publication/ws-proceedings/CLEF2/savoy.pdf[light_spanish],
|
|
0 commit comments