]> git.openstreetmap.org Git - nominatim.git/log
nominatim.git
4 months agofix layer setting for structured search
Sarah Hoffmann [Tue, 18 Mar 2025 10:20:20 +0000 (11:20 +0100)]
fix layer setting for structured search

4 months agofix debug output for lookup type
Sarah Hoffmann [Tue, 18 Mar 2025 10:21:54 +0000 (11:21 +0100)]
fix debug output for lookup type

4 months agoMerge pull request #3678 from lonvia/search-tweaks
Sarah Hoffmann [Wed, 19 Mar 2025 15:00:52 +0000 (16:00 +0100)]
Merge pull request #3678 from lonvia/search-tweaks

Some minor tweaks to postcode parsing in query

4 months agopenalize postcode searches with multiple name qualifiers
Sarah Hoffmann [Wed, 19 Mar 2025 09:05:36 +0000 (10:05 +0100)]
penalize postcode searches with multiple name qualifiers

4 months agouse word_token length when penalizing against postcodes
Sarah Hoffmann [Tue, 18 Mar 2025 22:07:27 +0000 (23:07 +0100)]
use word_token length when penalizing against postcodes

4 months agopostcode token should have transliterated term in word_token
Sarah Hoffmann [Tue, 18 Mar 2025 21:55:33 +0000 (22:55 +0100)]
postcode token should have transliterated term in word_token

4 months agoimprove handling of leading postcodes
Sarah Hoffmann [Tue, 18 Mar 2025 21:32:58 +0000 (22:32 +0100)]
improve handling of leading postcodes

Setting the direction of the query while yielding assignments is
a bad idea because it may override a direction already set.

4 months agoMerge pull request #3673 from otbutz/parallel_safe
Sarah Hoffmann [Tue, 18 Mar 2025 20:46:53 +0000 (21:46 +0100)]
Merge pull request #3673 from otbutz/parallel_safe

Mark functions as PARALLEL SAFE

4 months agoMerge pull request #3676 from lonvia/adjust-place-levels-sa
Sarah Hoffmann [Tue, 18 Mar 2025 17:31:48 +0000 (18:31 +0100)]
Merge pull request #3676 from lonvia/adjust-place-levels-sa

Adjust place ranks for Saudi-Arabia

4 months agoadjust for special use of province and municipality in Saudi-Arabia
Sarah Hoffmann [Tue, 18 Mar 2025 15:38:10 +0000 (16:38 +0100)]
adjust for special use of province and municipality in Saudi-Arabia

4 months agoMark functions as PARALLEL SAFE
Thomas Butz [Thu, 13 Mar 2025 09:04:40 +0000 (10:04 +0100)]
Mark functions as PARALLEL SAFE

4 months agoadapt typing for latest SQLAlchemy version
Sarah Hoffmann [Thu, 13 Mar 2025 09:49:08 +0000 (10:49 +0100)]
adapt typing for latest SQLAlchemy version

5 months agoMerge pull request #3671 from lonvia/remove-osm2pgsql-libdir
Sarah Hoffmann [Tue, 11 Mar 2025 10:22:46 +0000 (11:22 +0100)]
Merge pull request #3671 from lonvia/remove-osm2pgsql-libdir

Remove code for setting osm2pgsql location via config.lib_dir

5 months agoremove code for setting osm2pgsql via config.lib_dir
Sarah Hoffmann [Tue, 11 Mar 2025 07:45:10 +0000 (08:45 +0100)]
remove code for setting osm2pgsql via config.lib_dir

With the internal osm2pgsql gone, configuration of the binary location
via settings is the only option left that makes sense.

5 months agoMerge pull request #3670 from lonvia/flake-for-tests
Sarah Hoffmann [Mon, 10 Mar 2025 08:35:24 +0000 (09:35 +0100)]
Merge pull request #3670 from lonvia/flake-for-tests

Extend linting with flake to tests

5 months agoremove unused bdd util functions
Sarah Hoffmann [Sun, 9 Mar 2025 16:34:40 +0000 (17:34 +0100)]
remove unused bdd util functions

5 months agoenable flake for bdd test code
Sarah Hoffmann [Sun, 9 Mar 2025 16:34:04 +0000 (17:34 +0100)]
enable flake for bdd test code

5 months agoalso enable flake for tests in github actions
Sarah Hoffmann [Sun, 9 Mar 2025 15:03:02 +0000 (16:03 +0100)]
also enable flake for tests in github actions

5 months agoenable flake for Python tests
Sarah Hoffmann [Sun, 9 Mar 2025 14:33:24 +0000 (15:33 +0100)]
enable flake for Python tests

5 months agoMerge pull request #3667 from eumiro/simplify-int-float
Sarah Hoffmann [Sun, 9 Mar 2025 08:44:15 +0000 (09:44 +0100)]
Merge pull request #3667 from eumiro/simplify-int-float

Simplify  int/float manipulation

5 months agoSimplify int/float manipulation
Miroslav Šedivý [Wed, 5 Mar 2025 19:55:53 +0000 (20:55 +0100)]
Simplify  int/float manipulation

5 months agoMerge pull request #3666 from eumiro/math-isclose
Sarah Hoffmann [Thu, 6 Mar 2025 16:53:01 +0000 (17:53 +0100)]
Merge pull request #3666 from eumiro/math-isclose

Replace custom Almost with stdlib math.isclose

5 months agoMerge pull request #3664 from eumiro/consolidate-random
Sarah Hoffmann [Thu, 6 Mar 2025 16:52:19 +0000 (17:52 +0100)]
Merge pull request #3664 from eumiro/consolidate-random

Consolidate usage of random module

5 months agoReplace custom Almost with stdlib math.isclose
Miroslav Šedivý [Wed, 5 Mar 2025 19:35:01 +0000 (20:35 +0100)]
Replace custom Almost with stdlib math.isclose

5 months agoConsolidate usage of random module
Miroslav Šedivý [Tue, 4 Mar 2025 19:01:47 +0000 (20:01 +0100)]
Consolidate usage of random module

5 months agoMerge pull request #3665 from lonvia/pattern-matching-postcodes
Sarah Hoffmann [Wed, 5 Mar 2025 15:02:03 +0000 (16:02 +0100)]
Merge pull request #3665 from lonvia/pattern-matching-postcodes

Add full parsing of postcodes in query

5 months agorestrict postcode parsing in typed phrases
Sarah Hoffmann [Wed, 5 Mar 2025 09:08:07 +0000 (10:08 +0100)]
restrict postcode parsing in typed phrases

Postcodes can only appear in postcode-type phrases and must then
cover the full phrase

5 months agoadd unit tests for postcode parser
Sarah Hoffmann [Tue, 4 Mar 2025 15:25:00 +0000 (16:25 +0100)]
add unit tests for postcode parser

5 months agoadapt BDD tests to new postcode handling
Sarah Hoffmann [Tue, 4 Mar 2025 14:18:46 +0000 (15:18 +0100)]
adapt BDD tests to new postcode handling

5 months agoadd support for country prefixes in postcodes
Sarah Hoffmann [Tue, 4 Mar 2025 14:18:27 +0000 (15:18 +0100)]
add support for country prefixes in postcodes

5 months agocache all info of ICUQueryAnalyser in a single object
Sarah Hoffmann [Sun, 2 Mar 2025 16:31:04 +0000 (17:31 +0100)]
cache all info of ICUQueryAnalyser in a single object

5 months agoremove postcode computation for word table during import
Sarah Hoffmann [Sat, 1 Mar 2025 09:20:33 +0000 (10:20 +0100)]
remove postcode computation for word table during import

5 months agoreenable ZIP+ test
Sarah Hoffmann [Fri, 28 Feb 2025 16:19:56 +0000 (17:19 +0100)]
reenable ZIP+ test

5 months agoshow token begin and end in debug output
Sarah Hoffmann [Fri, 28 Feb 2025 16:19:37 +0000 (17:19 +0100)]
show token begin and end in debug output

5 months agoadd postcode parser
Sarah Hoffmann [Tue, 25 Feb 2025 19:56:07 +0000 (20:56 +0100)]
add postcode parser

5 months agomake word generation from query a class method
Sarah Hoffmann [Wed, 26 Feb 2025 16:22:14 +0000 (17:22 +0100)]
make word generation from query a class method

5 months agosearch: merge QueryPart array with QueryNodes
Sarah Hoffmann [Wed, 26 Feb 2025 13:37:08 +0000 (14:37 +0100)]
search: merge QueryPart array with QueryNodes

The basic information on terms is pretty much always used together
with the node inforamtion. Merging them together saves some
allocation while making lookup easier at the same time.

5 months agoenable parsing of US ZIP+ codes
Sarah Hoffmann [Tue, 25 Feb 2025 19:29:06 +0000 (20:29 +0100)]
enable parsing of US ZIP+ codes

The four-digit part of these postcodes will simply be ignored.

5 months agoMerge pull request #3659 from lonvia/custom-datrie-structure
Sarah Hoffmann [Mon, 24 Feb 2025 15:49:42 +0000 (16:49 +0100)]
Merge pull request #3659 from lonvia/custom-datrie-structure

Replace datrie library with a simple custom Python implementation

5 months agoremove datrie dependency
Sarah Hoffmann [Tue, 18 Feb 2025 20:12:02 +0000 (21:12 +0100)]
remove datrie dependency

5 months agoreplace datrie library with a more simple pure-Python class
Sarah Hoffmann [Tue, 18 Feb 2025 20:09:12 +0000 (21:09 +0100)]
replace datrie library with a more simple pure-Python class

5 months agoMerge pull request #3658 from lonvia/minor-query-parsing-optimisations
Sarah Hoffmann [Mon, 24 Feb 2025 09:16:47 +0000 (10:16 +0100)]
Merge pull request #3658 from lonvia/minor-query-parsing-optimisations

Minor query parsing optimisations

5 months agoadapt documentation for PhraseType type
Sarah Hoffmann [Fri, 21 Feb 2025 16:16:42 +0000 (17:16 +0100)]
adapt documentation for PhraseType type

5 months agoreplace PhraseType enum with simple int constants
Sarah Hoffmann [Fri, 21 Feb 2025 15:44:12 +0000 (16:44 +0100)]
replace PhraseType enum with simple int constants

5 months agoreplace TokenType enum with simple char constants
Sarah Hoffmann [Fri, 21 Feb 2025 09:23:41 +0000 (10:23 +0100)]
replace TokenType enum with simple char constants

5 months agoreplace BreakType enum with simple char constants
Sarah Hoffmann [Fri, 21 Feb 2025 08:57:48 +0000 (09:57 +0100)]
replace BreakType enum with simple char constants

5 months agoconsistently use query module as qmod
Sarah Hoffmann [Fri, 21 Feb 2025 08:31:21 +0000 (09:31 +0100)]
consistently use query module as qmod

5 months agoavoid yielding when extracting words from query
Sarah Hoffmann [Thu, 20 Feb 2025 22:32:39 +0000 (23:32 +0100)]
avoid yielding when extracting words from query

5 months agoremove word_number counting for phrases
Sarah Hoffmann [Thu, 20 Feb 2025 16:36:50 +0000 (17:36 +0100)]
remove word_number counting for phrases

We can just examine the break types to know if we are dealing
with a partial token.

5 months agoMerge pull request #3655 from lonvia/remove-name-ranking-in-postcode-search
Sarah Hoffmann [Thu, 20 Feb 2025 13:32:43 +0000 (14:32 +0100)]
Merge pull request #3655 from lonvia/remove-name-ranking-in-postcode-search

Tweak penalties for postcode searches

5 months agoremove address penalty for postcode search
Sarah Hoffmann [Thu, 20 Feb 2025 10:05:28 +0000 (11:05 +0100)]
remove address penalty for postcode search

Searches of the form <postcode> <city> are in fact quite common.

5 months agodo not rerank address by full match in postcode search
Sarah Hoffmann [Thu, 20 Feb 2025 09:15:03 +0000 (10:15 +0100)]
do not rerank address by full match in postcode search

The reranking result will not be completely correct because
the address of a postcode refer to the address _and_ name
of the parent and reranking was only done against the
address. We assume here that the postcode is precise enough
as to not require a penalty to to partial matches.

5 months agoMerge pull request #3653 from lonvia/trailing-spaces-in-normalization
Sarah Hoffmann [Wed, 19 Feb 2025 16:25:59 +0000 (17:25 +0100)]
Merge pull request #3653 from lonvia/trailing-spaces-in-normalization

Strip leading and trailing space markers during normalization

5 months agostrip normalisation results of normal and special spaces
Sarah Hoffmann [Wed, 19 Feb 2025 09:26:06 +0000 (10:26 +0100)]
strip normalisation results of normal and special spaces

5 months agoadd test for structured query with leading spaces
Sarah Hoffmann [Wed, 19 Feb 2025 08:38:10 +0000 (09:38 +0100)]
add test for structured query with leading spaces

5 months agoMerge pull request #3652 from lonvia/update-variants
Sarah Hoffmann [Tue, 18 Feb 2025 18:47:45 +0000 (19:47 +0100)]
Merge pull request #3652 from lonvia/update-variants

Cleanup and updates of tokenizer variant configuration

5 months agoMerge pull request #3649 from lonvia/actions-move-to-ubuntu22
Sarah Hoffmann [Tue, 18 Feb 2025 12:21:09 +0000 (13:21 +0100)]
Merge pull request #3649 from lonvia/actions-move-to-ubuntu22

Move Github actions to Unbuntu-22 image

5 months agoremove e-ë mutation, this is taken care of by transliteration
Sarah Hoffmann [Tue, 18 Feb 2025 09:31:44 +0000 (10:31 +0100)]
remove e-ë mutation, this is taken care of by transliteration

5 months agoAdd USPS Standard Suffix Abbreviation
mhsr21 [Tue, 7 May 2024 07:21:10 +0000 (03:21 -0400)]
Add USPS Standard Suffix Abbreviation

5 months agofix Norwegian variants
Sarah Hoffmann [Mon, 17 Feb 2025 16:51:33 +0000 (17:51 +0100)]
fix Norwegian variants

More cases of 'no' being interpreted as fasle by yaml.

5 months agoremove ineffective and dupicate variants
Sarah Hoffmann [Mon, 17 Feb 2025 11:07:53 +0000 (12:07 +0100)]
remove ineffective and dupicate variants

5 months agoMerge pull request #3555 from IvanShift/patch-1
Sarah Hoffmann [Mon, 17 Feb 2025 17:44:11 +0000 (18:44 +0100)]
Merge pull request #3555 from IvanShift/patch-1

Fixed Russian abbreviation list

5 months agoactions: use Debians's script for adding the Postgres apt repo
Sarah Hoffmann [Mon, 17 Feb 2025 16:56:23 +0000 (17:56 +0100)]
actions: use Debians's script for adding the Postgres apt repo

5 months agoactions: move tests to Ubuntu-20
Sarah Hoffmann [Mon, 17 Feb 2025 16:54:49 +0000 (17:54 +0100)]
actions: move tests to Ubuntu-20

5 months agoMerge pull request #3648 from lonvia/extratags-for-geocodejson
Sarah Hoffmann [Mon, 17 Feb 2025 10:14:52 +0000 (11:14 +0100)]
Merge pull request #3648 from lonvia/extratags-for-geocodejson

Enable output of extratags for geocodejson format

5 months agoadd documentation for new extra field
Sarah Hoffmann [Mon, 17 Feb 2025 09:04:23 +0000 (10:04 +0100)]
add documentation for new extra field

5 months agoadd optional output of extratags to geocodejson
Sarah Hoffmann [Sun, 16 Feb 2025 09:16:40 +0000 (10:16 +0100)]
add optional output of extratags to geocodejson

6 months agoprepare 5.0.0 release
Sarah Hoffmann [Wed, 5 Feb 2025 09:54:38 +0000 (10:54 +0100)]
prepare 5.0.0 release

6 months agoMerge pull request #3635 from lonvia/replace-wikimedia-importance-test-data
Sarah Hoffmann [Tue, 14 Jan 2025 15:49:52 +0000 (16:49 +0100)]
Merge pull request #3635 from lonvia/replace-wikimedia-importance-test-data

Update wikimedia importance file for test database

6 months agoadapt tests to changed wikimedia importance test table
Sarah Hoffmann [Tue, 14 Jan 2025 13:19:17 +0000 (14:19 +0100)]
adapt tests to changed wikimedia importance test table

6 months agoreplace wikimedia importance file for test data with CSV version
Sarah Hoffmann [Tue, 14 Jan 2025 08:16:25 +0000 (09:16 +0100)]
replace wikimedia importance file for test data with CSV version

7 months agoMerge pull request #3626 from lonvia/import-performance
Sarah Hoffmann [Fri, 10 Jan 2025 15:44:33 +0000 (16:44 +0100)]
Merge pull request #3626 from lonvia/import-performance

Import performance

7 months agoMerge pull request #3631 from lonvia/avoid-transactions
Sarah Hoffmann [Fri, 10 Jan 2025 15:44:18 +0000 (16:44 +0100)]
Merge pull request #3631 from lonvia/avoid-transactions

Creating tables and indexes in autocommit mode

7 months agoMerge pull request #3633 from lonvia/restrict-long-ways
Sarah Hoffmann [Fri, 10 Jan 2025 15:06:37 +0000 (16:06 +0100)]
Merge pull request #3633 from lonvia/restrict-long-ways

Ignore overly long ways during import

7 months agoignore overly long ways during import
Sarah Hoffmann [Fri, 10 Jan 2025 12:55:43 +0000 (13:55 +0100)]
ignore overly long ways during import

7 months agouse autocommit when creating tables and indexes
Sarah Hoffmann [Thu, 9 Jan 2025 16:14:37 +0000 (17:14 +0100)]
use autocommit when creating tables and indexes

Might avoid some deadlock situations with autovacuum.

7 months agoMerge pull request #3629 from lonvia/additional-breaks
Sarah Hoffmann [Thu, 9 Jan 2025 12:55:29 +0000 (13:55 +0100)]
Merge pull request #3629 from lonvia/additional-breaks

Introduce new break types and phrase splitting for Japanese addresses

7 months agoadd japanese phrase preprocessing
Sarah Hoffmann [Wed, 8 Jan 2025 18:43:25 +0000 (19:43 +0100)]
add japanese phrase preprocessing

Code adapted from GSOC code by @miku.

7 months agokeep break indicators [:-] during normalisation
Sarah Hoffmann [Tue, 7 Jan 2025 20:32:32 +0000 (21:32 +0100)]
keep break indicators [:-] during normalisation

All punctuation will be converted to '-'. Soft breaks : may be
added by preprocessors. The break signs are only used during
query analysis and are ignored during import token analysis.

7 months agoadd inner word break penalty
Sarah Hoffmann [Tue, 7 Jan 2025 19:40:58 +0000 (20:40 +0100)]
add inner word break penalty

7 months agoadd SOFT_PHRASE break and enable parsing
Sarah Hoffmann [Mon, 6 Jan 2025 16:10:24 +0000 (17:10 +0100)]
add SOFT_PHRASE break and enable parsing

Also enables parsing of PART breaks.

7 months agosplit up MultiPolygons before adding them to large_areas table
Sarah Hoffmann [Fri, 20 Dec 2024 10:03:15 +0000 (11:03 +0100)]
split up MultiPolygons before adding them to large_areas table

7 months agomove geometry split into insertLocationAreaLarge()
Sarah Hoffmann [Thu, 19 Dec 2024 19:09:27 +0000 (20:09 +0100)]
move geometry split into insertLocationAreaLarge()

thus insert only needs to be called once.

7 months agomore efficient belongs-to-address determination
Sarah Hoffmann [Thu, 19 Dec 2024 17:08:56 +0000 (18:08 +0100)]
more efficient belongs-to-address determination

7 months agoMerge pull request #3619 from lonvia/demote-farms
Sarah Hoffmann [Sun, 22 Dec 2024 08:13:42 +0000 (09:13 +0100)]
Merge pull request #3619 from lonvia/demote-farms

Remove farms and isolated dwellings from computed addresses

7 months agoremove farms and isolated dwellings from computed addresses
Sarah Hoffmann [Fri, 20 Dec 2024 21:59:02 +0000 (22:59 +0100)]
remove farms and isolated dwellings from computed addresses

Farms and isolated dwellings are usually confined to a very small
area. It does not make sense if they are automatically used in
addressing surrounding features. Still works to use them for
parenting when used with addr:place.

7 months agodocs: update Update docs for virtualenv use
Sarah Hoffmann [Fri, 20 Dec 2024 10:27:45 +0000 (11:27 +0100)]
docs: update Update docs for virtualenv use

7 months agoMerge pull request #3618 from osm-search/settings-md-table-space-osm-index
Sarah Hoffmann [Thu, 19 Dec 2024 07:40:31 +0000 (08:40 +0100)]
Merge pull request #3618 from osm-search/settings-md-table-space-osm-index

Settings.md - one setting was repeated

7 months agoSettings.md - one setting was repeated
mtmail [Wed, 18 Dec 2024 20:58:51 +0000 (21:58 +0100)]
Settings.md - one setting was repeated

7 months agoMerge pull request #3617 from mtmail/pr-3615-wording
Sarah Hoffmann [Wed, 18 Dec 2024 10:04:21 +0000 (11:04 +0100)]
Merge pull request #3617 from mtmail/pr-3615-wording

Slight wording changes for Import-Styles.md

7 months agoSlight wording changes for Import-Styles.md
marc tobias [Tue, 17 Dec 2024 00:28:02 +0000 (01:28 +0100)]
Slight wording changes for Import-Styles.md

7 months agoMerge pull request #3616 from osm-search/tokenizers-md-typo
Sarah Hoffmann [Tue, 17 Dec 2024 07:43:16 +0000 (08:43 +0100)]
Merge pull request #3616 from osm-search/tokenizers-md-typo

fix typo in Tokenizers.md

7 months agofix typo in Tokenizers.md
mtmail [Mon, 16 Dec 2024 22:38:18 +0000 (23:38 +0100)]
fix typo in Tokenizers.md

7 months agoMerge pull request #3615 from lonvia/overhaul-osm2pgsql-style
Sarah Hoffmann [Mon, 16 Dec 2024 18:13:46 +0000 (19:13 +0100)]
Merge pull request #3615 from lonvia/overhaul-osm2pgsql-style

Overhaul osm2pgsql style

7 months agoexclude more tourism=information types
Sarah Hoffmann [Mon, 16 Dec 2024 09:44:37 +0000 (10:44 +0100)]
exclude more tourism=information types

7 months agoreintroduce brand and remove etymology
Sarah Hoffmann [Thu, 12 Dec 2024 09:23:50 +0000 (10:23 +0100)]
reintroduce brand and remove etymology

7 months agoexclude unnamed swimming pools
Sarah Hoffmann [Wed, 11 Dec 2024 09:59:32 +0000 (10:59 +0100)]
exclude unnamed swimming pools

Publicly accessible ones are usually mapped as the bigger area
with the water park.

7 months agoinclude lock names mapped with "lock_name"
Sarah Hoffmann [Mon, 9 Dec 2024 16:57:08 +0000 (17:57 +0100)]
include lock names mapped with "lock_name"

Fixes #3365.

7 months agodo not save names when falling back to addresses
Sarah Hoffmann [Mon, 9 Dec 2024 16:28:30 +0000 (17:28 +0100)]
do not save names when falling back to addresses

If an object doesn't have a useable main tag, then the names should
always be ignored, independently of the presence of housenumbers.
We have to assume that the name belongs to a feature that was
intentionally filtered out.

7 months agoswitch to subtags for tourism=information and natural=water
Sarah Hoffmann [Mon, 9 Dec 2024 16:12:35 +0000 (17:12 +0100)]
switch to subtags for tourism=information and natural=water