git.openstreetmap.org Git - nominatim.git/log
6 months agoMerge pull request #3692 from lonvia/word-lookup-variants
Sarah Hoffmann [Mon, 31 Mar 2025 14:38:31 +0000 (16:38 +0200)]
Merge pull request #3692 from lonvia/word-lookup-variants

Avoid matching penalty for abbreviated search terms

6 months agoMerge pull request #3691 from lonvia/more-search-tweaks
Sarah Hoffmann [Mon, 31 Mar 2025 13:06:09 +0000 (15:06 +0200)]
Merge pull request #3691 from lonvia/more-search-tweaks

More tweaks to search weights

6 months agoadapt tests to extended results
Sarah Hoffmann [Fri, 21 Mar 2025 19:24:09 +0000 (20:24 +0100)]
adapt tests to extended results

6 months agoadd lookup word to variants in word table
Sarah Hoffmann [Fri, 21 Mar 2025 08:02:52 +0000 (09:02 +0100)]
add lookup word to variants in word table

6 months agodo not write any word counts on initial word insert
Sarah Hoffmann [Thu, 20 Mar 2025 20:33:27 +0000 (21:33 +0100)]
do not write any word counts on initial word insert

6 months agodo not save word counts of 1
Sarah Hoffmann [Thu, 20 Mar 2025 19:01:46 +0000 (20:01 +0100)]
do not save word counts of 1

This is the default setting, which will be assumed when the count is
missing.
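
The convention described above can be sketched as follows. This is a minimal illustration, assuming word metadata is held in a plain dictionary; the names `DEFAULT_COUNT`, `make_info` and `word_count` are hypothetical and are not taken from the Nominatim code.

```python
from typing import Any, Dict

# Assumption for this sketch: a missing count is read as 1.
DEFAULT_COUNT = 1


def make_info(count: int) -> Dict[str, Any]:
    """Build the stored metadata, leaving out the count when it is the default."""
    return {} if count == DEFAULT_COUNT else {'count': count}


def word_count(info: Dict[str, Any]) -> int:
    """Read the count back, falling back to the default when it is absent."""
    return info.get('count', DEFAULT_COUNT)
```

Leaving out the default keeps the stored data smaller for the many words that occur only once, while readers still see a count of 1.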

6 months agoalways use lookup when requested
Sarah Hoffmann [Mon, 31 Mar 2025 09:15:26 +0000 (11:15 +0200)]
always use lookup when requested

Doesn't seem to cause any issues in production.

6 months agoincrease allowable address counts
Sarah Hoffmann [Mon, 31 Mar 2025 09:12:08 +0000 (11:12 +0200)]
increase allowable address counts

6 months agomore aggressively reduce expected count for multi-word terms
Sarah Hoffmann [Mon, 31 Mar 2025 09:06:21 +0000 (11:06 +0200)]
more aggressively reduce expected count for multi-word terms

Improves searching of non-latin scripts with forced token spaces.

6 months agoMerge pull request #3690 from lonvia/fix-signature
Sarah Hoffmann [Mon, 31 Mar 2025 09:17:03 +0000 (11:17 +0200)]
Merge pull request #3690 from lonvia/fix-signature

Fix function signature for newer SQLAlchemy

6 months agofix function signature for newer SQLAlchemy
Sarah Hoffmann [Mon, 31 Mar 2025 07:42:29 +0000 (09:42 +0200)]
fix function signature for newer SQLAlchemy

6 months agoMerge pull request #3684 from lonvia/compact-en-variants
Sarah Hoffmann [Mon, 24 Mar 2025 14:15:13 +0000 (15:15 +0100)]
Merge pull request #3684 from lonvia/compact-en-variants

Clean up English variants

6 months agorestrict some English variants to end of word
Sarah Hoffmann [Fri, 21 Mar 2025 20:22:38 +0000 (21:22 +0100)]
restrict some English variants to end of word

6 months agomake English variant list more compact
Sarah Hoffmann [Fri, 21 Mar 2025 20:13:34 +0000 (21:13 +0100)]
make English variant list more compact

6 months agoMerge pull request #3682 from lonvia/fix-postcode-case
Sarah Hoffmann [Fri, 21 Mar 2025 10:41:24 +0000 (11:41 +0100)]
Merge pull request #3682 from lonvia/fix-postcode-case

Fix case issues when parsing postcodes

6 months agomake query upper-case when parsing postcodes
Sarah Hoffmann [Fri, 21 Mar 2025 08:44:15 +0000 (09:44 +0100)]
make query upper-case when parsing postcodes

The postcode patterns expect upper-case letters.
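
A minimal sketch of why the case matters, assuming a pattern written with upper-case character classes only. The pattern and the function name `match_postcode` are hypothetical examples and do not come from the Nominatim configuration.

```python
import re
from typing import Optional

# Hypothetical UK-style pattern that only accepts upper-case letters.
PATTERN = re.compile(r'[A-Z]{1,2}[0-9][0-9A-Z]? ?[0-9][A-Z]{2}')


def match_postcode(query: str) -> Optional[str]:
    """Upper-case the query first so that lower-case input can still match."""
    match = PATTERN.fullmatch(query.upper())
    return match.group(0) if match else None
```

Without the call to `upper()`, a query such as `sw1a 1aa` would not match this pattern at all.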

6 months agoMerge pull request #3679 from lonvia/output-fixes
Sarah Hoffmann [Wed, 19 Mar 2025 20:56:28 +0000 (21:56 +0100)]
Merge pull request #3679 from lonvia/output-fixes

Minor fixes for v1 frontend code

6 months agofix layer setting for structured search
Sarah Hoffmann [Tue, 18 Mar 2025 10:20:20 +0000 (11:20 +0100)]
fix layer setting for structured search

6 months agofix debug output for lookup type
Sarah Hoffmann [Tue, 18 Mar 2025 10:21:54 +0000 (11:21 +0100)]
fix debug output for lookup type

6 months agoMerge pull request #3678 from lonvia/search-tweaks
Sarah Hoffmann [Wed, 19 Mar 2025 15:00:52 +0000 (16:00 +0100)]
Merge pull request #3678 from lonvia/search-tweaks

Some minor tweaks to postcode parsing in query

6 months agopenalize postcode searches with multiple name qualifiers
Sarah Hoffmann [Wed, 19 Mar 2025 09:05:36 +0000 (10:05 +0100)]
penalize postcode searches with multiple name qualifiers

6 months agouse word_token length when penalizing against postcodes
Sarah Hoffmann [Tue, 18 Mar 2025 22:07:27 +0000 (23:07 +0100)]
use word_token length when penalizing against postcodes

6 months agopostcode token should have transliterated term in word_token
Sarah Hoffmann [Tue, 18 Mar 2025 21:55:33 +0000 (22:55 +0100)]
postcode token should have transliterated term in word_token

6 months agoimprove handling of leading postcodes
Sarah Hoffmann [Tue, 18 Mar 2025 21:32:58 +0000 (22:32 +0100)]
improve handling of leading postcodes

Setting the direction of the query while yielding assignments is
a bad idea because it may override a direction already set.

6 months agoMerge pull request #3673 from otbutz/parallel_safe
Sarah Hoffmann [Tue, 18 Mar 2025 20:46:53 +0000 (21:46 +0100)]
Merge pull request #3673 from otbutz/parallel_safe

Mark functions as PARALLEL SAFE

6 months agoMerge pull request #3676 from lonvia/adjust-place-levels-sa
Sarah Hoffmann [Tue, 18 Mar 2025 17:31:48 +0000 (18:31 +0100)]
Merge pull request #3676 from lonvia/adjust-place-levels-sa

Adjust place ranks for Saudi-Arabia

6 months agoadjust for special use of province and municipality in Saudi-Arabia
Sarah Hoffmann [Tue, 18 Mar 2025 15:38:10 +0000 (16:38 +0100)]
adjust for special use of province and municipality in Saudi-Arabia

6 months agoMark functions as PARALLEL SAFE
Thomas Butz [Thu, 13 Mar 2025 09:04:40 +0000 (10:04 +0100)]
Mark functions as PARALLEL SAFE

6 months agoadapt typing for latest SQLAlchemy version
Sarah Hoffmann [Thu, 13 Mar 2025 09:49:08 +0000 (10:49 +0100)]
adapt typing for latest SQLAlchemy version

7 months agoMerge pull request #3671 from lonvia/remove-osm2pgsql-libdir
Sarah Hoffmann [Tue, 11 Mar 2025 10:22:46 +0000 (11:22 +0100)]
Merge pull request #3671 from lonvia/remove-osm2pgsql-libdir

Remove code for setting osm2pgsql location via config.lib_dir

7 months agoremove code for setting osm2pgsql via config.lib_dir
Sarah Hoffmann [Tue, 11 Mar 2025 07:45:10 +0000 (08:45 +0100)]
remove code for setting osm2pgsql via config.lib_dir

With the internal osm2pgsql gone, configuration of the binary location
via settings is the only option left that makes sense.

7 months agoMerge pull request #3670 from lonvia/flake-for-tests
Sarah Hoffmann [Mon, 10 Mar 2025 08:35:24 +0000 (09:35 +0100)]
Merge pull request #3670 from lonvia/flake-for-tests

Extend linting with flake to tests

7 months agoremove unused bdd util functions
Sarah Hoffmann [Sun, 9 Mar 2025 16:34:40 +0000 (17:34 +0100)]
remove unused bdd util functions

7 months agoenable flake for bdd test code
Sarah Hoffmann [Sun, 9 Mar 2025 16:34:04 +0000 (17:34 +0100)]
enable flake for bdd test code

7 months agoalso enable flake for tests in github actions
Sarah Hoffmann [Sun, 9 Mar 2025 15:03:02 +0000 (16:03 +0100)]
also enable flake for tests in github actions

7 months agoenable flake for Python tests
Sarah Hoffmann [Sun, 9 Mar 2025 14:33:24 +0000 (15:33 +0100)]
enable flake for Python tests

7 months agoMerge pull request #3667 from eumiro/simplify-int-float
Sarah Hoffmann [Sun, 9 Mar 2025 08:44:15 +0000 (09:44 +0100)]
Merge pull request #3667 from eumiro/simplify-int-float

Simplify int/float manipulation

7 months agoSimplify int/float manipulation
Miroslav Šedivý [Wed, 5 Mar 2025 19:55:53 +0000 (20:55 +0100)]
Simplify int/float manipulation

7 months agoMerge pull request #3666 from eumiro/math-isclose
Sarah Hoffmann [Thu, 6 Mar 2025 16:53:01 +0000 (17:53 +0100)]
Merge pull request #3666 from eumiro/math-isclose

Replace custom Almost with stdlib math.isclose

7 months agoMerge pull request #3664 from eumiro/consolidate-random
Sarah Hoffmann [Thu, 6 Mar 2025 16:52:19 +0000 (17:52 +0100)]
Merge pull request #3664 from eumiro/consolidate-random

Consolidate usage of random module

7 months agoReplace custom Almost with stdlib math.isclose
Miroslav Šedivý [Wed, 5 Mar 2025 19:35:01 +0000 (20:35 +0100)]
Replace custom Almost with stdlib math.isclose
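
For illustration, a comparison helper built on the standard library might look like the sketch below. The helper name `coords_close` and the tolerance are assumptions for this example; the behaviour of the removed Almost class is not described in the log.

```python
import math


def coords_close(actual: float, expected: float, abs_tol: float = 1e-7) -> bool:
    """Compare two floats with an absolute tolerance.

    math.isclose() uses only a relative tolerance by default, which
    never matches against exactly 0.0, so abs_tol is set explicitly.
    """
    return math.isclose(actual, expected, abs_tol=abs_tol)
```

In a test this reads as `assert coords_close(result.lon, 8.5)`, where `result.lon` stands for any computed value.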

7 months agoConsolidate usage of random module
Miroslav Šedivý [Tue, 4 Mar 2025 19:01:47 +0000 (20:01 +0100)]
Consolidate usage of random module

7 months agoMerge pull request #3665 from lonvia/pattern-matching-postcodes
Sarah Hoffmann [Wed, 5 Mar 2025 15:02:03 +0000 (16:02 +0100)]
Merge pull request #3665 from lonvia/pattern-matching-postcodes

Add full parsing of postcodes in query

7 months agorestrict postcode parsing in typed phrases
Sarah Hoffmann [Wed, 5 Mar 2025 09:08:07 +0000 (10:08 +0100)]
restrict postcode parsing in typed phrases

Postcodes can only appear in postcode-type phrases and must then
cover the full phrase
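
The rule above can be sketched as follows. This is an illustration only: the phrase type names, the five-digit pattern and the function `find_postcode` are hypothetical, and the handling of untyped phrases is an assumption.

```python
import re
from typing import Optional

# Hypothetical five-digit postcode pattern.
POSTCODE = re.compile(r'\b[0-9]{5}\b')


def find_postcode(phrase_type: str, text: str) -> Optional[str]:
    """Return a postcode found in the phrase, or None.

    'any' stands for an untyped phrase, 'postcode' for a phrase
    explicitly typed as postcode; every other value is another type.
    """
    if phrase_type == 'any':
        # Assumption: untyped phrases may contain a postcode anywhere.
        match = POSTCODE.search(text)
    elif phrase_type == 'postcode':
        # Typed postcode phrases must consist of the postcode alone.
        match = POSTCODE.fullmatch(text)
    else:
        # Other typed phrases never yield a postcode.
        match = None
    return match.group(0) if match else None
```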

7 months agoadd unit tests for postcode parser
Sarah Hoffmann [Tue, 4 Mar 2025 15:25:00 +0000 (16:25 +0100)]
add unit tests for postcode parser

7 months agoadapt BDD tests to new postcode handling
Sarah Hoffmann [Tue, 4 Mar 2025 14:18:46 +0000 (15:18 +0100)]
adapt BDD tests to new postcode handling

7 months agoadd support for country prefixes in postcodes
Sarah Hoffmann [Tue, 4 Mar 2025 14:18:27 +0000 (15:18 +0100)]
add support for country prefixes in postcodes

7 months agocache all info of ICUQueryAnalyser in a single object
Sarah Hoffmann [Sun, 2 Mar 2025 16:31:04 +0000 (17:31 +0100)]
cache all info of ICUQueryAnalyser in a single object

7 months agoremove postcode computation for word table during import
Sarah Hoffmann [Sat, 1 Mar 2025 09:20:33 +0000 (10:20 +0100)]
remove postcode computation for word table during import

7 months agoreenable ZIP+ test
Sarah Hoffmann [Fri, 28 Feb 2025 16:19:56 +0000 (17:19 +0100)]
reenable ZIP+ test

7 months agoshow token begin and end in debug output
Sarah Hoffmann [Fri, 28 Feb 2025 16:19:37 +0000 (17:19 +0100)]
show token begin and end in debug output

7 months agoadd postcode parser
Sarah Hoffmann [Tue, 25 Feb 2025 19:56:07 +0000 (20:56 +0100)]
add postcode parser

7 months agomake word generation from query a class method
Sarah Hoffmann [Wed, 26 Feb 2025 16:22:14 +0000 (17:22 +0100)]
make word generation from query a class method

7 months agosearch: merge QueryPart array with QueryNodes
Sarah Hoffmann [Wed, 26 Feb 2025 13:37:08 +0000 (14:37 +0100)]
search: merge QueryPart array with QueryNodes

The basic information on terms is pretty much always used together
with the node information. Merging them together saves some
allocations while making lookup easier at the same time.

7 months agoenable parsing of US ZIP+ codes
Sarah Hoffmann [Tue, 25 Feb 2025 19:29:06 +0000 (20:29 +0100)]
enable parsing of US ZIP+ codes

The four-digit part of these postcodes will simply be ignored.
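
A minimal sketch of this behaviour, assuming a simple regular expression. The pattern and the name `normalize_us_zip` are hypothetical and not taken from the parser itself.

```python
import re
from typing import Optional

# Five digits, optionally followed by the four-digit ZIP+4 extension.
ZIP_PLUS = re.compile(r'([0-9]{5})(?:-[0-9]{4})?')


def normalize_us_zip(text: str) -> Optional[str]:
    """Return the five-digit ZIP code, dropping any four-digit extension."""
    match = ZIP_PLUS.fullmatch(text.strip())
    return match.group(1) if match else None
```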

7 months agoMerge pull request #3659 from lonvia/custom-datrie-structure
Sarah Hoffmann [Mon, 24 Feb 2025 15:49:42 +0000 (16:49 +0100)]
Merge pull request #3659 from lonvia/custom-datrie-structure

Replace datrie library with a simple custom Python implementation

7 months agoremove datrie dependency
Sarah Hoffmann [Tue, 18 Feb 2025 20:12:02 +0000 (21:12 +0100)]
remove datrie dependency

7 months agoreplace datrie library with a more simple pure-Python class
Sarah Hoffmann [Tue, 18 Feb 2025 20:09:12 +0000 (21:09 +0100)]
replace datrie library with a more simple pure-Python class
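
To give an idea of what a simple pure-Python trie can look like, here is a short sketch. It is not the class added in this commit; the name `SimpleTrie` and its methods are hypothetical.

```python
from typing import Dict, Generic, Optional, Tuple, TypeVar

T = TypeVar('T')


class SimpleTrie(Generic[T]):
    """Character trie built from nested dictionaries."""

    def __init__(self) -> None:
        self._children: Dict[str, 'SimpleTrie[T]'] = {}
        self._value: Optional[T] = None

    def insert(self, key: str, value: T) -> None:
        """Store value under key, creating nodes as needed."""
        node = self
        for char in key:
            node = node._children.setdefault(char, SimpleTrie())
        node._value = value

    def longest_prefix(self, text: str) -> Tuple[Optional[T], int]:
        """Return the value of the longest stored prefix of text
        together with the length of that prefix."""
        node: Optional['SimpleTrie[T]'] = self
        best: Tuple[Optional[T], int] = (None, 0)
        for pos, char in enumerate(text, start=1):
            node = node._children.get(char) if node else None
            if node is None:
                break
            if node._value is not None:
                best = (node._value, pos)
        return best
```

Such a class trades some speed for the removal of a compiled dependency, which is the trade-off the commit title suggests.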

7 months agoMerge pull request #3658 from lonvia/minor-query-parsing-optimisations
Sarah Hoffmann [Mon, 24 Feb 2025 09:16:47 +0000 (10:16 +0100)]
Merge pull request #3658 from lonvia/minor-query-parsing-optimisations

Minor query parsing optimisations

7 months agoadapt documentation for PhraseType type
Sarah Hoffmann [Fri, 21 Feb 2025 16:16:42 +0000 (17:16 +0100)]
adapt documentation for PhraseType type

7 months agoreplace PhraseType enum with simple int constants
Sarah Hoffmann [Fri, 21 Feb 2025 15:44:12 +0000 (16:44 +0100)]
replace PhraseType enum with simple int constants

7 months agoreplace TokenType enum with simple char constants
Sarah Hoffmann [Fri, 21 Feb 2025 09:23:41 +0000 (10:23 +0100)]
replace TokenType enum with simple char constants

7 months agoreplace BreakType enum with simple char constants
Sarah Hoffmann [Fri, 21 Feb 2025 08:57:48 +0000 (09:57 +0100)]
replace BreakType enum with simple char constants
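
The three enum replacements above follow the same idea, sketched here for break types. The constant names and characters are assumptions for illustration, and the claim that plain constants are cheaper to compare than enum members is a guess at the motivation based on the pull request title.

```python
import enum


class BreakTypeEnum(enum.Enum):
    """Enum style, shown only for comparison."""
    WORD = ' '
    PHRASE = ','


# Constant style: a type alias plus module-level constants.
BreakType = str
BREAK_WORD: BreakType = ' '
BREAK_PHRASE: BreakType = ','


def is_phrase_break(btype: BreakType) -> bool:
    """Plain string comparison, no enum member lookup involved."""
    return btype == BREAK_PHRASE
```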

7 months agoconsistently use query module as qmod
Sarah Hoffmann [Fri, 21 Feb 2025 08:31:21 +0000 (09:31 +0100)]
consistently use query module as qmod

7 months agoavoid yielding when extracting words from query
Sarah Hoffmann [Thu, 20 Feb 2025 22:32:39 +0000 (23:32 +0100)]
avoid yielding when extracting words from query

7 months agoremove word_number counting for phrases
Sarah Hoffmann [Thu, 20 Feb 2025 16:36:50 +0000 (17:36 +0100)]
remove word_number counting for phrases

We can just examine the break types to know if we are dealing
with a partial token.

7 months agoMerge pull request #3655 from lonvia/remove-name-ranking-in-postcode-search
Sarah Hoffmann [Thu, 20 Feb 2025 13:32:43 +0000 (14:32 +0100)]
Merge pull request #3655 from lonvia/remove-name-ranking-in-postcode-search

Tweak penalties for postcode searches

7 months agoremove address penalty for postcode search
Sarah Hoffmann [Thu, 20 Feb 2025 10:05:28 +0000 (11:05 +0100)]
remove address penalty for postcode search

Searches of the form <postcode> <city> are in fact quite common.

7 months agodo not rerank address by full match in postcode search
Sarah Hoffmann [Thu, 20 Feb 2025 09:15:03 +0000 (10:15 +0100)]
do not rerank address by full match in postcode search

The reranking result will not be completely correct because
the address of a postcode refers to the address _and_ name
of the parent and reranking was only done against the
address. We assume here that the postcode is precise enough
as to not require a penalty due to partial matches.

7 months agoMerge pull request #3653 from lonvia/trailing-spaces-in-normalization
Sarah Hoffmann [Wed, 19 Feb 2025 16:25:59 +0000 (17:25 +0100)]
Merge pull request #3653 from lonvia/trailing-spaces-in-normalization

Strip leading and trailing space markers during normalization

7 months agostrip normalisation results of normal and special spaces
Sarah Hoffmann [Wed, 19 Feb 2025 09:26:06 +0000 (10:26 +0100)]
strip normalisation results of normal and special spaces

7 months agoadd test for structured query with leading spaces
Sarah Hoffmann [Wed, 19 Feb 2025 08:38:10 +0000 (09:38 +0100)]
add test for structured query with leading spaces

7 months agoMerge pull request #3652 from lonvia/update-variants
Sarah Hoffmann [Tue, 18 Feb 2025 18:47:45 +0000 (19:47 +0100)]
Merge pull request #3652 from lonvia/update-variants

Cleanup and updates of tokenizer variant configuration

7 months agoMerge pull request #3649 from lonvia/actions-move-to-ubuntu22
Sarah Hoffmann [Tue, 18 Feb 2025 12:21:09 +0000 (13:21 +0100)]
Merge pull request #3649 from lonvia/actions-move-to-ubuntu22

Move GitHub actions to Ubuntu-22 image

7 months agoremove e-ë mutation, this is taken care of by transliteration
Sarah Hoffmann [Tue, 18 Feb 2025 09:31:44 +0000 (10:31 +0100)]
remove e-ë mutation, this is taken care of by transliteration

7 months agoAdd USPS Standard Suffix Abbreviation
mhsr21 [Tue, 7 May 2024 07:21:10 +0000 (03:21 -0400)]
Add USPS Standard Suffix Abbreviation

7 months agofix Norwegian variants
Sarah Hoffmann [Mon, 17 Feb 2025 16:51:33 +0000 (17:51 +0100)]
fix Norwegian variants

More cases of 'no' being interpreted as false by YAML.

7 months agoremove ineffective and duplicate variants
Sarah Hoffmann [Mon, 17 Feb 2025 11:07:53 +0000 (12:07 +0100)]
remove ineffective and duplicate variants

7 months agoMerge pull request #3555 from IvanShift/patch-1
Sarah Hoffmann [Mon, 17 Feb 2025 17:44:11 +0000 (18:44 +0100)]
Merge pull request #3555 from IvanShift/patch-1

Fixed Russian abbreviation list

7 months agoactions: use Debian's script for adding the Postgres apt repo
Sarah Hoffmann [Mon, 17 Feb 2025 16:56:23 +0000 (17:56 +0100)]
actions: use Debian's script for adding the Postgres apt repo

7 months agoactions: move tests to Ubuntu-20
Sarah Hoffmann [Mon, 17 Feb 2025 16:54:49 +0000 (17:54 +0100)]
actions: move tests to Ubuntu-20

7 months agoMerge pull request #3648 from lonvia/extratags-for-geocodejson
Sarah Hoffmann [Mon, 17 Feb 2025 10:14:52 +0000 (11:14 +0100)]
Merge pull request #3648 from lonvia/extratags-for-geocodejson

Enable output of extratags for geocodejson format

7 months agoadd documentation for new extra field
Sarah Hoffmann [Mon, 17 Feb 2025 09:04:23 +0000 (10:04 +0100)]
add documentation for new extra field

7 months agoadd optional output of extratags to geocodejson
Sarah Hoffmann [Sun, 16 Feb 2025 09:16:40 +0000 (10:16 +0100)]
add optional output of extratags to geocodejson

8 months agoprepare 5.0.0 release
Sarah Hoffmann [Wed, 5 Feb 2025 09:54:38 +0000 (10:54 +0100)]
prepare 5.0.0 release

8 months agoMerge pull request #3635 from lonvia/replace-wikimedia-importance-test-data
Sarah Hoffmann [Tue, 14 Jan 2025 15:49:52 +0000 (16:49 +0100)]
Merge pull request #3635 from lonvia/replace-wikimedia-importance-test-data

Update wikimedia importance file for test database

8 months agoadapt tests to changed wikimedia importance test table
Sarah Hoffmann [Tue, 14 Jan 2025 13:19:17 +0000 (14:19 +0100)]
adapt tests to changed wikimedia importance test table

8 months agoreplace wikimedia importance file for test data with CSV version
Sarah Hoffmann [Tue, 14 Jan 2025 08:16:25 +0000 (09:16 +0100)]
replace wikimedia importance file for test data with CSV version

8 months agoMerge pull request #3626 from lonvia/import-performance
Sarah Hoffmann [Fri, 10 Jan 2025 15:44:33 +0000 (16:44 +0100)]
Merge pull request #3626 from lonvia/import-performance

Import performance

8 months agoMerge pull request #3631 from lonvia/avoid-transactions
Sarah Hoffmann [Fri, 10 Jan 2025 15:44:18 +0000 (16:44 +0100)]
Merge pull request #3631 from lonvia/avoid-transactions

Creating tables and indexes in autocommit mode

8 months agoMerge pull request #3633 from lonvia/restrict-long-ways
Sarah Hoffmann [Fri, 10 Jan 2025 15:06:37 +0000 (16:06 +0100)]
Merge pull request #3633 from lonvia/restrict-long-ways

Ignore overly long ways during import

8 months agoignore overly long ways during import
Sarah Hoffmann [Fri, 10 Jan 2025 12:55:43 +0000 (13:55 +0100)]
ignore overly long ways during import

9 months agouse autocommit when creating tables and indexes
Sarah Hoffmann [Thu, 9 Jan 2025 16:14:37 +0000 (17:14 +0100)]
use autocommit when creating tables and indexes

Might avoid some deadlock situations with autovacuum.

9 months agoMerge pull request #3629 from lonvia/additional-breaks
Sarah Hoffmann [Thu, 9 Jan 2025 12:55:29 +0000 (13:55 +0100)]
Merge pull request #3629 from lonvia/additional-breaks

Introduce new break types and phrase splitting for Japanese addresses

9 months agoadd Japanese phrase preprocessing
Sarah Hoffmann [Wed, 8 Jan 2025 18:43:25 +0000 (19:43 +0100)]
add Japanese phrase preprocessing

Code adapted from GSOC code by @miku.

9 months agokeep break indicators [:-] during normalisation
Sarah Hoffmann [Tue, 7 Jan 2025 20:32:32 +0000 (21:32 +0100)]
keep break indicators [:-] during normalisation

All punctuation will be converted to '-'. Soft breaks : may be
added by preprocessors. The break signs are only used during
query analysis and are ignored during import token analysis.

9 months agoadd inner word break penalty
Sarah Hoffmann [Tue, 7 Jan 2025 19:40:58 +0000 (20:40 +0100)]
add inner word break penalty

9 months agoadd SOFT_PHRASE break and enable parsing
Sarah Hoffmann [Mon, 6 Jan 2025 16:10:24 +0000 (17:10 +0100)]
add SOFT_PHRASE break and enable parsing

Also enables parsing of PART breaks.

9 months agosplit up MultiPolygons before adding them to large_areas table
Sarah Hoffmann [Fri, 20 Dec 2024 10:03:15 +0000 (11:03 +0100)]
split up MultiPolygons before adding them to large_areas table

9 months agomove geometry split into insertLocationAreaLarge()
Sarah Hoffmann [Thu, 19 Dec 2024 19:09:27 +0000 (20:09 +0100)]
move geometry split into insertLocationAreaLarge()

Thus the insert only needs to be called once.