]> git.openstreetmap.org Git - nominatim.git/commit
new word table layout for icu tokenizer
authorSarah Hoffmann <lonvia@denofr.de>
Tue, 20 Jul 2021 08:27:06 +0000 (10:27 +0200)
committerSarah Hoffmann <lonvia@denofr.de>
Wed, 28 Jul 2021 09:31:47 +0000 (11:31 +0200)
commit83775289523eda29fe8d82ff2e92c6faa5c76898
tree45bee7da2d6820181d4381669a83d679061072a1
parent34dcf02deebe9d57a328389dd0b000defdcba697
new word table layout for icu tokenizer

The table now directly reflects the different token types.
Extra information is saved in a json structure that may be
dynamically extended in the future without affecting the
table layout.
lib-sql/tokenizer/icu_tokenizer_tables.sql [new file with mode: 0644]
nominatim/tokenizer/legacy_icu_tokenizer.py