]> git.openstreetmap.org Git - nominatim.git/commit
move abbreviation computation into import phase
authorSarah Hoffmann <lonvia@denofr.de>
Fri, 28 May 2021 20:06:13 +0000 (22:06 +0200)
committerSarah Hoffmann <lonvia@denofr.de>
Sun, 4 Jul 2021 08:28:20 +0000 (10:28 +0200)
commit8413075249e1bb2832df4edd0f66d61f77fb9f99
tree58e9eff6709ef97120c46b925986b49440a0bc56
parent6ba00e6aee2645ad958857a0df170915c8e59cdb
move abbreviation computation into import phase

This adds precomputation of abbreviated terms for names and removes
abbreviation of terms in the query. Basic import works but still
needs some thorough testing as well as speed improvements during
import.

New dependency for python library datrie.
lib-php/tokenizer/legacy_icu_tokenizer.php
lib-sql/tokenizer/legacy_icu_tokenizer.sql
nominatim/tokenizer/icu_name_processor.py [new file with mode: 0644]
nominatim/tokenizer/icu_rule_loader.py [new file with mode: 0644]
nominatim/tokenizer/legacy_icu_tokenizer.py
nominatim/tokenizer/legacy_tokenizer.py
nominatim/tools/database_import.py
settings/legacy_icu_tokenizer.yaml [new file with mode: 0644]
test/python/test_tokenizer_icu_name_processor.py [new file with mode: 0644]
test/python/test_tokenizer_icu_rule_loader.py [new file with mode: 0644]