]>
git.openstreetmap.org Git - nominatim.git/log
Sarah Hoffmann [Tue, 18 Mar 2025 10:20:20 +0000 (11:20 +0100)]
fix layer setting for structured search
Sarah Hoffmann [Tue, 18 Mar 2025 10:21:54 +0000 (11:21 +0100)]
fix debug output for lookup type
Sarah Hoffmann [Wed, 19 Mar 2025 15:00:52 +0000 (16:00 +0100)]
Merge pull request #3678 from lonvia/search-tweaks
Some minor tweaks to postcode parsing in query
Sarah Hoffmann [Wed, 19 Mar 2025 09:05:36 +0000 (10:05 +0100)]
penalize postcode searches with multiple name qualifiers
Sarah Hoffmann [Tue, 18 Mar 2025 22:07:27 +0000 (23:07 +0100)]
use word_token length when penalizing against postcodes
Sarah Hoffmann [Tue, 18 Mar 2025 21:55:33 +0000 (22:55 +0100)]
postcode token should have transliterated term in word_token
Sarah Hoffmann [Tue, 18 Mar 2025 21:32:58 +0000 (22:32 +0100)]
improve handling of leading postcodes
Setting the direction of the query while yielding assignments is
a bad idea because it may override a direction already set.
Sarah Hoffmann [Tue, 18 Mar 2025 20:46:53 +0000 (21:46 +0100)]
Merge pull request #3673 from otbutz/parallel_safe
Mark functions as PARALLEL SAFE
Sarah Hoffmann [Tue, 18 Mar 2025 17:31:48 +0000 (18:31 +0100)]
Merge pull request #3676 from lonvia/adjust-place-levels-sa
Adjust place ranks for Saudi-Arabia
Sarah Hoffmann [Tue, 18 Mar 2025 15:38:10 +0000 (16:38 +0100)]
adjust for special use of province and municipality in Saudi-Arabia
Thomas Butz [Thu, 13 Mar 2025 09:04:40 +0000 (10:04 +0100)]
Mark functions as PARALLEL SAFE
Sarah Hoffmann [Thu, 13 Mar 2025 09:49:08 +0000 (10:49 +0100)]
adapt typing for latest SQLAlchemy version
Sarah Hoffmann [Tue, 11 Mar 2025 10:22:46 +0000 (11:22 +0100)]
Merge pull request #3671 from lonvia/remove-osm2pgsql-libdir
Remove code for setting osm2pgsql location via config.lib_dir
Sarah Hoffmann [Tue, 11 Mar 2025 07:45:10 +0000 (08:45 +0100)]
remove code for setting osm2pgsql via config.lib_dir
With the internal osm2pgsql gone, configuration of the binary location
via settings is the only option left that makes sense.
Sarah Hoffmann [Mon, 10 Mar 2025 08:35:24 +0000 (09:35 +0100)]
Merge pull request #3670 from lonvia/flake-for-tests
Extend linting with flake to tests
Sarah Hoffmann [Sun, 9 Mar 2025 16:34:40 +0000 (17:34 +0100)]
remove unused bdd util functions
Sarah Hoffmann [Sun, 9 Mar 2025 16:34:04 +0000 (17:34 +0100)]
enable flake for bdd test code
Sarah Hoffmann [Sun, 9 Mar 2025 15:03:02 +0000 (16:03 +0100)]
also enable flake for tests in github actions
Sarah Hoffmann [Sun, 9 Mar 2025 14:33:24 +0000 (15:33 +0100)]
enable flake for Python tests
Sarah Hoffmann [Sun, 9 Mar 2025 08:44:15 +0000 (09:44 +0100)]
Merge pull request #3667 from eumiro/simplify-int-float
Simplify int/float manipulation
Miroslav Šedivý [Wed, 5 Mar 2025 19:55:53 +0000 (20:55 +0100)]
Simplify int/float manipulation
Sarah Hoffmann [Thu, 6 Mar 2025 16:53:01 +0000 (17:53 +0100)]
Merge pull request #3666 from eumiro/math-isclose
Replace custom Almost with stdlib math.isclose
Sarah Hoffmann [Thu, 6 Mar 2025 16:52:19 +0000 (17:52 +0100)]
Merge pull request #3664 from eumiro/consolidate-random
Consolidate usage of random module
Miroslav Šedivý [Wed, 5 Mar 2025 19:35:01 +0000 (20:35 +0100)]
Replace custom Almost with stdlib math.isclose
Miroslav Šedivý [Tue, 4 Mar 2025 19:01:47 +0000 (20:01 +0100)]
Consolidate usage of random module
Sarah Hoffmann [Wed, 5 Mar 2025 15:02:03 +0000 (16:02 +0100)]
Merge pull request #3665 from lonvia/pattern-matching-postcodes
Add full parsing of postcodes in query
Sarah Hoffmann [Wed, 5 Mar 2025 09:08:07 +0000 (10:08 +0100)]
restrict postcode parsing in typed phrases
Postcodes can only appear in postcode-type phrases and must then
cover the full phrase
Sarah Hoffmann [Tue, 4 Mar 2025 15:25:00 +0000 (16:25 +0100)]
add unit tests for postcode parser
Sarah Hoffmann [Tue, 4 Mar 2025 14:18:46 +0000 (15:18 +0100)]
adapt BDD tests to new postcode handling
Sarah Hoffmann [Tue, 4 Mar 2025 14:18:27 +0000 (15:18 +0100)]
add support for country prefixes in postcodes
Sarah Hoffmann [Sun, 2 Mar 2025 16:31:04 +0000 (17:31 +0100)]
cache all info of ICUQueryAnalyser in a single object
Sarah Hoffmann [Sat, 1 Mar 2025 09:20:33 +0000 (10:20 +0100)]
remove postcode computation for word table during import
Sarah Hoffmann [Fri, 28 Feb 2025 16:19:56 +0000 (17:19 +0100)]
reenable ZIP+ test
Sarah Hoffmann [Fri, 28 Feb 2025 16:19:37 +0000 (17:19 +0100)]
show token begin and end in debug output
Sarah Hoffmann [Tue, 25 Feb 2025 19:56:07 +0000 (20:56 +0100)]
add postcode parser
Sarah Hoffmann [Wed, 26 Feb 2025 16:22:14 +0000 (17:22 +0100)]
make word generation from query a class method
Sarah Hoffmann [Wed, 26 Feb 2025 13:37:08 +0000 (14:37 +0100)]
search: merge QueryPart array with QueryNodes
The basic information on terms is pretty much always used together
with the node inforamtion. Merging them together saves some
allocation while making lookup easier at the same time.
Sarah Hoffmann [Tue, 25 Feb 2025 19:29:06 +0000 (20:29 +0100)]
enable parsing of US ZIP+ codes
The four-digit part of these postcodes will simply be ignored.
Sarah Hoffmann [Mon, 24 Feb 2025 15:49:42 +0000 (16:49 +0100)]
Merge pull request #3659 from lonvia/custom-datrie-structure
Replace datrie library with a simple custom Python implementation
Sarah Hoffmann [Tue, 18 Feb 2025 20:12:02 +0000 (21:12 +0100)]
remove datrie dependency
Sarah Hoffmann [Tue, 18 Feb 2025 20:09:12 +0000 (21:09 +0100)]
replace datrie library with a more simple pure-Python class
Sarah Hoffmann [Mon, 24 Feb 2025 09:16:47 +0000 (10:16 +0100)]
Merge pull request #3658 from lonvia/minor-query-parsing-optimisations
Minor query parsing optimisations
Sarah Hoffmann [Fri, 21 Feb 2025 16:16:42 +0000 (17:16 +0100)]
adapt documentation for PhraseType type
Sarah Hoffmann [Fri, 21 Feb 2025 15:44:12 +0000 (16:44 +0100)]
replace PhraseType enum with simple int constants
Sarah Hoffmann [Fri, 21 Feb 2025 09:23:41 +0000 (10:23 +0100)]
replace TokenType enum with simple char constants
Sarah Hoffmann [Fri, 21 Feb 2025 08:57:48 +0000 (09:57 +0100)]
replace BreakType enum with simple char constants
Sarah Hoffmann [Fri, 21 Feb 2025 08:31:21 +0000 (09:31 +0100)]
consistently use query module as qmod
Sarah Hoffmann [Thu, 20 Feb 2025 22:32:39 +0000 (23:32 +0100)]
avoid yielding when extracting words from query
Sarah Hoffmann [Thu, 20 Feb 2025 16:36:50 +0000 (17:36 +0100)]
remove word_number counting for phrases
We can just examine the break types to know if we are dealing
with a partial token.
Sarah Hoffmann [Thu, 20 Feb 2025 13:32:43 +0000 (14:32 +0100)]
Merge pull request #3655 from lonvia/remove-name-ranking-in-postcode-search
Tweak penalties for postcode searches
Sarah Hoffmann [Thu, 20 Feb 2025 10:05:28 +0000 (11:05 +0100)]
remove address penalty for postcode search
Searches of the form <postcode> <city> are in fact quite common.
Sarah Hoffmann [Thu, 20 Feb 2025 09:15:03 +0000 (10:15 +0100)]
do not rerank address by full match in postcode search
The reranking result will not be completely correct because
the address of a postcode refer to the address _and_ name
of the parent and reranking was only done against the
address. We assume here that the postcode is precise enough
as to not require a penalty to to partial matches.
Sarah Hoffmann [Wed, 19 Feb 2025 16:25:59 +0000 (17:25 +0100)]
Merge pull request #3653 from lonvia/trailing-spaces-in-normalization
Strip leading and trailing space markers during normalization
Sarah Hoffmann [Wed, 19 Feb 2025 09:26:06 +0000 (10:26 +0100)]
strip normalisation results of normal and special spaces
Sarah Hoffmann [Wed, 19 Feb 2025 08:38:10 +0000 (09:38 +0100)]
add test for structured query with leading spaces
Sarah Hoffmann [Tue, 18 Feb 2025 18:47:45 +0000 (19:47 +0100)]
Merge pull request #3652 from lonvia/update-variants
Cleanup and updates of tokenizer variant configuration
Sarah Hoffmann [Tue, 18 Feb 2025 12:21:09 +0000 (13:21 +0100)]
Merge pull request #3649 from lonvia/actions-move-to-ubuntu22
Move Github actions to Unbuntu-22 image
Sarah Hoffmann [Tue, 18 Feb 2025 09:31:44 +0000 (10:31 +0100)]
remove e-ë mutation, this is taken care of by transliteration
mhsr21 [Tue, 7 May 2024 07:21:10 +0000 (03:21 -0400)]
Add USPS Standard Suffix Abbreviation
Sarah Hoffmann [Mon, 17 Feb 2025 16:51:33 +0000 (17:51 +0100)]
fix Norwegian variants
More cases of 'no' being interpreted as fasle by yaml.
Sarah Hoffmann [Mon, 17 Feb 2025 11:07:53 +0000 (12:07 +0100)]
remove ineffective and dupicate variants
Sarah Hoffmann [Mon, 17 Feb 2025 17:44:11 +0000 (18:44 +0100)]
Merge pull request #3555 from IvanShift/patch-1
Fixed Russian abbreviation list
Sarah Hoffmann [Mon, 17 Feb 2025 16:56:23 +0000 (17:56 +0100)]
actions: use Debians's script for adding the Postgres apt repo
Sarah Hoffmann [Mon, 17 Feb 2025 16:54:49 +0000 (17:54 +0100)]
actions: move tests to Ubuntu-20
Sarah Hoffmann [Mon, 17 Feb 2025 10:14:52 +0000 (11:14 +0100)]
Merge pull request #3648 from lonvia/extratags-for-geocodejson
Enable output of extratags for geocodejson format
Sarah Hoffmann [Mon, 17 Feb 2025 09:04:23 +0000 (10:04 +0100)]
add documentation for new extra field
Sarah Hoffmann [Sun, 16 Feb 2025 09:16:40 +0000 (10:16 +0100)]
add optional output of extratags to geocodejson
Sarah Hoffmann [Wed, 5 Feb 2025 09:54:38 +0000 (10:54 +0100)]
prepare 5.0.0 release
Sarah Hoffmann [Tue, 14 Jan 2025 15:49:52 +0000 (16:49 +0100)]
Merge pull request #3635 from lonvia/replace-wikimedia-importance-test-data
Update wikimedia importance file for test database
Sarah Hoffmann [Tue, 14 Jan 2025 13:19:17 +0000 (14:19 +0100)]
adapt tests to changed wikimedia importance test table
Sarah Hoffmann [Tue, 14 Jan 2025 08:16:25 +0000 (09:16 +0100)]
replace wikimedia importance file for test data with CSV version
Sarah Hoffmann [Fri, 10 Jan 2025 15:44:33 +0000 (16:44 +0100)]
Merge pull request #3626 from lonvia/import-performance
Import performance
Sarah Hoffmann [Fri, 10 Jan 2025 15:44:18 +0000 (16:44 +0100)]
Merge pull request #3631 from lonvia/avoid-transactions
Creating tables and indexes in autocommit mode
Sarah Hoffmann [Fri, 10 Jan 2025 15:06:37 +0000 (16:06 +0100)]
Merge pull request #3633 from lonvia/restrict-long-ways
Ignore overly long ways during import
Sarah Hoffmann [Fri, 10 Jan 2025 12:55:43 +0000 (13:55 +0100)]
ignore overly long ways during import
Sarah Hoffmann [Thu, 9 Jan 2025 16:14:37 +0000 (17:14 +0100)]
use autocommit when creating tables and indexes
Might avoid some deadlock situations with autovacuum.
Sarah Hoffmann [Thu, 9 Jan 2025 12:55:29 +0000 (13:55 +0100)]
Merge pull request #3629 from lonvia/additional-breaks
Introduce new break types and phrase splitting for Japanese addresses
Sarah Hoffmann [Wed, 8 Jan 2025 18:43:25 +0000 (19:43 +0100)]
add japanese phrase preprocessing
Code adapted from GSOC code by @miku.
Sarah Hoffmann [Tue, 7 Jan 2025 20:32:32 +0000 (21:32 +0100)]
keep break indicators [:-] during normalisation
All punctuation will be converted to '-'. Soft breaks : may be
added by preprocessors. The break signs are only used during
query analysis and are ignored during import token analysis.
Sarah Hoffmann [Tue, 7 Jan 2025 19:40:58 +0000 (20:40 +0100)]
add inner word break penalty
Sarah Hoffmann [Mon, 6 Jan 2025 16:10:24 +0000 (17:10 +0100)]
add SOFT_PHRASE break and enable parsing
Also enables parsing of PART breaks.
Sarah Hoffmann [Fri, 20 Dec 2024 10:03:15 +0000 (11:03 +0100)]
split up MultiPolygons before adding them to large_areas table
Sarah Hoffmann [Thu, 19 Dec 2024 19:09:27 +0000 (20:09 +0100)]
move geometry split into insertLocationAreaLarge()
thus insert only needs to be called once.
Sarah Hoffmann [Thu, 19 Dec 2024 17:08:56 +0000 (18:08 +0100)]
more efficient belongs-to-address determination
Sarah Hoffmann [Sun, 22 Dec 2024 08:13:42 +0000 (09:13 +0100)]
Merge pull request #3619 from lonvia/demote-farms
Remove farms and isolated dwellings from computed addresses
Sarah Hoffmann [Fri, 20 Dec 2024 21:59:02 +0000 (22:59 +0100)]
remove farms and isolated dwellings from computed addresses
Farms and isolated dwellings are usually confined to a very small
area. It does not make sense if they are automatically used in
addressing surrounding features. Still works to use them for
parenting when used with addr:place.
Sarah Hoffmann [Fri, 20 Dec 2024 10:27:45 +0000 (11:27 +0100)]
docs: update Update docs for virtualenv use
Sarah Hoffmann [Thu, 19 Dec 2024 07:40:31 +0000 (08:40 +0100)]
Merge pull request #3618 from osm-search/settings-md-table-space-osm-index
Settings.md - one setting was repeated
mtmail [Wed, 18 Dec 2024 20:58:51 +0000 (21:58 +0100)]
Settings.md - one setting was repeated
Sarah Hoffmann [Wed, 18 Dec 2024 10:04:21 +0000 (11:04 +0100)]
Merge pull request #3617 from mtmail/pr-3615-wording
Slight wording changes for Import-Styles.md
marc tobias [Tue, 17 Dec 2024 00:28:02 +0000 (01:28 +0100)]
Slight wording changes for Import-Styles.md
Sarah Hoffmann [Tue, 17 Dec 2024 07:43:16 +0000 (08:43 +0100)]
Merge pull request #3616 from osm-search/tokenizers-md-typo
fix typo in Tokenizers.md
mtmail [Mon, 16 Dec 2024 22:38:18 +0000 (23:38 +0100)]
fix typo in Tokenizers.md
Sarah Hoffmann [Mon, 16 Dec 2024 18:13:46 +0000 (19:13 +0100)]
Merge pull request #3615 from lonvia/overhaul-osm2pgsql-style
Overhaul osm2pgsql style
Sarah Hoffmann [Mon, 16 Dec 2024 09:44:37 +0000 (10:44 +0100)]
exclude more tourism=information types
Sarah Hoffmann [Thu, 12 Dec 2024 09:23:50 +0000 (10:23 +0100)]
reintroduce brand and remove etymology
Sarah Hoffmann [Wed, 11 Dec 2024 09:59:32 +0000 (10:59 +0100)]
exclude unnamed swimming pools
Publicly accessible ones are usually mapped as the bigger area
with the water park.
Sarah Hoffmann [Mon, 9 Dec 2024 16:57:08 +0000 (17:57 +0100)]
include lock names mapped with "lock_name"
Fixes #3365.
Sarah Hoffmann [Mon, 9 Dec 2024 16:28:30 +0000 (17:28 +0100)]
do not save names when falling back to addresses
If an object doesn't have a useable main tag, then the names should
always be ignored, independently of the presence of housenumbers.
We have to assume that the name belongs to a feature that was
intentionally filtered out.
Sarah Hoffmann [Mon, 9 Dec 2024 16:12:35 +0000 (17:12 +0100)]
switch to subtags for tourism=information and natural=water