Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added --no-superuser flag #3204

Closed
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
36 commits
Select commit Hold shift + click to select a range
67e0330
Added --no-superuser flag
robbe-haesendonck Sep 19, 2023
276bd05
Added --prepare-database flag
robbe-haesendonck Sep 20, 2023
7408888
Updated CI tests to check new import flags
robbe-haesendonck Sep 20, 2023
73ba0f7
Added version check for PostGis and Postgres
robbe-haesendonck Sep 20, 2023
efb7d91
Merge branch 'osm-search:master' into no-superuser-flag
robbe-haesendonck Sep 21, 2023
f8f4c02
Fixed linting errors
robbe-haesendonck Sep 21, 2023
d7ae23e
Fixed typo
robbe-haesendonck Sep 21, 2023
c670d19
Added check for hstore extension
robbe-haesendonck Sep 21, 2023
2f93f1b
Fixed pylint warnings
robbe-haesendonck Sep 21, 2023
0128922
Renamed flags
robbe-haesendonck Sep 25, 2023
8f07410
Fixed ci-tests, osm-file flag
robbe-haesendonck Sep 25, 2023
a9c24a9
Added check to see if hstore is loaded
robbe-haesendonck Sep 25, 2023
a151f50
Changed naming of flags.
robbe-haesendonck Sep 26, 2023
a4b71fc
Added new psql user for importing the data
robbe-haesendonck Sep 26, 2023
b285ff5
Improved logic.
robbe-haesendonck Sep 26, 2023
a66cb94
Removed _is_complete_import check
robbe-haesendonck Sep 27, 2023
e36b5be
Removed unused variable, fixed connection
robbe-haesendonck Sep 27, 2023
1ba8bc1
Made sure legacy import command still works
robbe-haesendonck Sep 27, 2023
36dfe73
Updated ci tests to reflect changes in Nominatim CLI
robbe-haesendonck Sep 27, 2023
8733c10
Fixed formatting
robbe-haesendonck Sep 27, 2023
13c3c61
Updated check to see if osm_file is set
robbe-haesendonck Sep 27, 2023
ddedd92
Updated no-superuser install ci test
robbe-haesendonck Sep 27, 2023
4cdeb8a
Fixed setting tokenizer property
robbe-haesendonck Sep 27, 2023
3077887
Disabled pylint too-many-branches
robbe-haesendonck Sep 27, 2023
e310972
Fixed legacy import command
robbe-haesendonck Sep 28, 2023
51f0a6f
CI tests: Removed creation of user www-data.
robbe-haesendonck Sep 29, 2023
8ac35ef
Added missing return types to functions
robbe-haesendonck Sep 29, 2023
2b20c14
Fixed typechecking error
robbe-haesendonck Nov 23, 2023
f03be8e
Merge branch 'osm-search:master' into no-superuser-flag
robbe-haesendonck Nov 23, 2023
0ad3bb6
Fixing CI tests for install-no-superuser
robbe-haesendonck Nov 23, 2023
771e1e0
Added osm-import to sudoers file for tokenizer setup
robbe-haesendonck Nov 23, 2023
f40f6d9
Added changing permissions of nominatim-project dir
robbe-haesendonck Nov 23, 2023
7d1b842
Connect using localhost instead of socket
robbe-haesendonck Nov 23, 2023
095aff2
Removed unnecessary check for --prepare-database flag
robbe-haesendonck Nov 23, 2023
afe90a9
Merge branch 'osm-search:master' into no-superuser-flag
robbe-haesendonck Nov 24, 2023
3efd1eb
Merge branch 'osm-search:master' into no-superuser-flag
robbe-haesendonck Dec 4, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
92 changes: 92 additions & 0 deletions .github/workflows/ci-tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -349,3 +349,95 @@ jobs:
- name: Clean up database (reverse-only import)
run: nominatim refresh --postcodes --word-tokens
working-directory: /home/nominatim/nominatim-project

install-no-superuser:
runs-on: ubuntu-latest
needs: create-archive

strategy:
matrix:
name: [Ubuntu-22]
include:
- name: Ubuntu-22
image: "ubuntu:22.04"
ubuntu: 22
install_mode: install-apache

container:
image: ${{ matrix.image }}
env:
LANG: en_US.UTF-8

defaults:
run:
shell: sudo -Hu nominatim bash --noprofile --norc -eo pipefail {0}

steps:
- name: Prepare container (Ubuntu)
run: |
export APT_LISTCHANGES_FRONTEND=none
export DEBIAN_FRONTEND=noninteractive
apt-get update -qq
apt-get install -y git sudo wget
ln -snf /usr/share/zoneinfo/$CONTAINER_TIMEZONE /etc/localtime && echo $CONTAINER_TIMEZONE > /etc/timezone
shell: bash

- name: Setup import user
run: |
useradd -m nominatim
echo 'nominatim ALL=(ALL:ALL) NOPASSWD: ALL' > /etc/sudoers.d/nominiatim
echo "/home/nominatim/Nominatim/vagrant/Install-on-${OS}.sh no $INSTALL_MODE" > /home/nominatim/vagrant.sh
shell: bash
env:
OS: ${{ matrix.name }}
INSTALL_MODE: ${{ matrix.install_mode }}

- uses: actions/download-artifact@v3
with:
name: full-source
path: /home/nominatim

- name: Install Nominatim
run: |
export USERNAME=nominatim
export USERHOME=/home/nominatim
export NOSYSTEMD=yes
export HAVE_SELINUX=no
tar xf nominatim-src.tar.bz2
. vagrant.sh
working-directory: /home/nominatim

- name: Prepare import environment
run: |
mv Nominatim/test/testdb/apidb-test-data.pbf test.pbf
mv Nominatim/settings/flex-base.lua flex-base.lua
mv Nominatim/settings/import-extratags.lua import-extratags.lua
mv Nominatim/settings/taginfo.lua taginfo.lua
rm -rf Nominatim
mkdir data-env-reverse
working-directory: /home/nominatim

- name: Prepare Database
run: |
nominatim import --prepare-database
working-directory: /home/nominatim/nominatim-project

- name: Create import user
run: |
sudo -u postgres createuser -S osm-import
sudo -u postgres psql -c "ALTER USER \"osm-import\" WITH PASSWORD 'osm-import';"
working-directory: /home/nominatim/nominatim-project

- name: Grant import user rights
run: |
sudo -u postgres psql -c "GRANT INSERT, UPDATE ON ALL TABLES IN SCHEMA public TO \"osm-import\";"
lonvia marked this conversation as resolved.
Show resolved Hide resolved
working-directory: /home/nominatim/nominatim-project

- name: Run import
run: |
NOMINATIM_DATABASE_DSN="pgsql:host=127.0.0.1;dbname=nominatim;user=osm-import;password=osm-import" nominatim import --continue import-from-file --osm-file ../test.pbf
working-directory: /home/nominatim/nominatim-project

- name: Check full import
run: nominatim admin --check-database
working-directory: /home/nominatim/nominatim-project
1 change: 1 addition & 0 deletions nominatim/clicmd/args.py
Original file line number Diff line number Diff line change
Expand Up @@ -87,6 +87,7 @@ class NominatimArgs:
offline: bool
ignore_errors: bool
index_noanalyse: bool
prepare_database: bool

# Arguments to 'index'
boundaries_only: bool
Expand Down
105 changes: 64 additions & 41 deletions nominatim/clicmd/setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -40,13 +40,15 @@ class SetupAll:

def add_args(self, parser: argparse.ArgumentParser) -> None:
group_name = parser.add_argument_group('Required arguments')
group1 = group_name.add_mutually_exclusive_group(required=True)
group1 = group_name.add_argument_group()
group1.add_argument('--osm-file', metavar='FILE', action='append',
help='OSM file to be imported'
' (repeat for importing multiple files)')
' (repeat for importing multiple files)',
default=None)
group1.add_argument('--continue', dest='continue_at',
choices=['load-data', 'indexing', 'db-postprocess'],
help='Continue an import that was interrupted')
choices=['import-from-file', 'load-data', 'indexing', 'db-postprocess'],
help='Continue an import that was interrupted',
default=None)
group2 = parser.add_argument_group('Optional arguments')
group2.add_argument('--osm2pgsql-cache', metavar='SIZE', type=int,
help='Size of cache to be used by osm2pgsql (in MB)')
Expand All @@ -65,9 +67,11 @@ def add_args(self, parser: argparse.ArgumentParser) -> None:
help='Continue import even when errors in SQL are present')
group3.add_argument('--index-noanalyse', action='store_true',
help='Do not perform analyse operations during index (expert only)')
group3.add_argument('--prepare-database', action='store_true',
help='Create the database but do not import any data')


def run(self, args: NominatimArgs) -> int: # pylint: disable=too-many-statements
def run(self, args: NominatimArgs) -> int: # pylint: disable=too-many-statements, too-many-branches
from ..data import country_info
from ..tools import database_import, refresh, postcodes, freeze
from ..indexer.indexer import Indexer
Expand All @@ -76,43 +80,61 @@ def run(self, args: NominatimArgs) -> int: # pylint: disable=too-many-statements

country_info.setup_country_config(args.config)

if args.continue_at is None:
if args.osm_file is None and args.continue_at is None and not args.prepare_database:
raise UsageError("No input files (use --osm-file).")

if args.osm_file is not None and args.continue_at not in ('import-from-file', None):
raise UsageError(f"Cannot use --continue {args.continue_at} and --osm-file together.")

if args.continue_at is not None and args.prepare_database:
raise UsageError(
"Cannot use --continue and --prepare-database together."
)


if args.prepare_database or args.continue_at is None:
LOG.warning('Creating database')
database_import.setup_database_skeleton(args.config.get_libpq_dsn(),
rouser=args.config.DATABASE_WEBUSER)
if args.prepare_database:
return 0

if args.continue_at in (None, 'import-from-file'):
files = args.get_osm_file_list()
if not files:
raise UsageError("No input files (use --osm-file).")

LOG.warning('Creating database')
database_import.setup_database_skeleton(args.config.get_libpq_dsn(),
rouser=args.config.DATABASE_WEBUSER)

LOG.warning('Setting up country tables')
country_info.setup_country_tables(args.config.get_libpq_dsn(),
args.config.lib_dir.data,
args.no_partitions)

LOG.warning('Importing OSM data file')
database_import.import_osm_data(files,
args.osm2pgsql_options(0, 1),
drop=args.no_updates,
ignore_errors=args.ignore_errors)

LOG.warning('Importing wikipedia importance data')
data_path = Path(args.config.WIKIPEDIA_DATA_PATH or args.project_dir)
if refresh.import_wikipedia_articles(args.config.get_libpq_dsn(),
data_path) > 0:
LOG.error('Wikipedia importance dump file not found. '
'Calculating importance values of locations will not '
'use Wikipedia importance data.')

LOG.warning('Importing secondary importance raster data')
if refresh.import_secondary_importance(args.config.get_libpq_dsn(),
args.project_dir) != 0:
LOG.error('Secondary importance file not imported. '
'Falling back to default ranking.')

self._setup_tables(args.config, args.reverse_only)

if args.continue_at is None or args.continue_at == 'load-data':
if args.continue_at in ('import-from-file', None):
# Check if the correct plugins are installed
database_import.check_existing_database_plugins(args.config.get_libpq_dsn())
LOG.warning('Setting up country tables')
country_info.setup_country_tables(args.config.get_libpq_dsn(),
args.config.lib_dir.data,
args.no_partitions)

LOG.warning('Importing OSM data file')
database_import.import_osm_data(files,
robbe-haesendonck marked this conversation as resolved.
Show resolved Hide resolved
args.osm2pgsql_options(0, 1),
drop=args.no_updates,
ignore_errors=args.ignore_errors)

LOG.warning('Importing wikipedia importance data')
data_path = Path(args.config.WIKIPEDIA_DATA_PATH or args.project_dir)
if refresh.import_wikipedia_articles(args.config.get_libpq_dsn(),
data_path) > 0:
LOG.error('Wikipedia importance dump file not found. '
'Calculating importance values of locations will not '
'use Wikipedia importance data.')

LOG.warning('Importing secondary importance raster data')
if refresh.import_secondary_importance(args.config.get_libpq_dsn(),
args.project_dir) != 0:
LOG.error('Secondary importance file not imported. '
'Falling back to default ranking.')

self._setup_tables(args.config, args.reverse_only)

if args.continue_at in ('import-from-file', 'load-data', None):
LOG.warning('Initialise tables')
with connect(args.config.get_libpq_dsn()) as conn:
database_import.truncate_data_tables(conn)
Expand All @@ -123,12 +145,13 @@ def run(self, args: NominatimArgs) -> int: # pylint: disable=too-many-statements
LOG.warning("Setting up tokenizer")
tokenizer = self._get_tokenizer(args.continue_at, args.config)

if args.continue_at is None or args.continue_at == 'load-data':
if args.continue_at in ('import-from-file', 'load-data', None):
LOG.warning('Calculate postcodes')
postcodes.update_postcodes(args.config.get_libpq_dsn(),
args.project_dir, tokenizer)

if args.continue_at is None or args.continue_at in ('load-data', 'indexing'):
if args.continue_at in \
('import-from-file', 'load-data', 'indexing', None):
LOG.warning('Indexing places')
indexer = Indexer(args.config.get_libpq_dsn(), tokenizer, num_threads)
indexer.index_full(analyse=not args.index_noanalyse)
Expand Down Expand Up @@ -185,7 +208,7 @@ def _get_tokenizer(self, continue_at: Optional[str],
"""
from ..tokenizer import factory as tokenizer_factory

if continue_at is None or continue_at == 'load-data':
if continue_at in ('import-from-file', 'load-data', None):
# (re)initialise the tokenizer data
return tokenizer_factory.create_tokenizer(config)

Expand Down
9 changes: 9 additions & 0 deletions nominatim/db/connection.py
Original file line number Diff line number Diff line change
Expand Up @@ -174,6 +174,15 @@ def postgis_version_tuple(self) -> Tuple[int, int]:

return (int(version_parts[0]), int(version_parts[1]))


def extension_loaded(self, extension_name: str) -> bool:
""" Return True if the hstore extension is loaded in the database.
"""
with self.cursor() as cur:
cur.execute('SELECT extname FROM pg_extension WHERE extname = %s', (extension_name, ))
return cur.rowcount > 0


class ConnectionContext(ContextManager[Connection]):
""" Context manager of the connection that also provides direct access
to the underlying connection.
Expand Down
22 changes: 21 additions & 1 deletion nominatim/tools/database_import.py
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,8 @@
from nominatim.db.sql_preprocessor import SQLPreprocessor
from nominatim.tools.exec_utils import run_osm2pgsql
from nominatim.errors import UsageError
from nominatim.version import POSTGRESQL_REQUIRED_VERSION, POSTGIS_REQUIRED_VERSION
from nominatim.version import POSTGRESQL_REQUIRED_VERSION, \
POSTGIS_REQUIRED_VERSION

LOG = logging.getLogger()

Expand All @@ -38,6 +39,25 @@ def _require_version(module: str, actual: Tuple[int, int], expected: Tuple[int,
raise UsageError(f'{module} is too old.')


def _require_loaded(extension_name: str, conn: Connection) -> None:
""" Check that the given extension is loaded. """
if not conn.extension_loaded(extension_name):
LOG.fatal('Required module %s is not loaded.', extension_name)
raise UsageError(f'{extension_name} is not loaded.')


def check_existing_database_plugins(dsn: str) -> None:
""" Check that the database has the required plugins installed."""
with connect(dsn) as conn:
_require_version('PostgreSQL server',
conn.server_version_tuple(),
POSTGRESQL_REQUIRED_VERSION)
_require_version('PostGIS',
conn.postgis_version_tuple(),
POSTGIS_REQUIRED_VERSION)
_require_loaded('hstore', conn)


def setup_database_skeleton(dsn: str, rouser: Optional[str] = None) -> None:
""" Create a new database for Nominatim and populate it with the
essential extensions.
Expand Down
Loading