You can find the rdb_host
and rdb_port
on the database information page.
Using the PostGIS extension on a Database for PostgreSQL
- PostGIS
- postgresql
- postgre
- database
- rdb
PostGIS is an extension to the PostgreSQL database management system developed by the Canadian company Refractions Research Inc. The extension allows storing and managing Geographic Information Systems (GIS) objects in PostgreSQL. It is available on Scaleway Databases.
PostGIS was published under the GNU GPL license, which means that the solution is available as open-source software, differently from other GIS systems, which are closed-source software. Spatial data types refer to shapes such as point, line, and polygon. Spatial databases fully integrate spatial data with an object-relational database. PostGIS includes support for GiST-based R-Tree spatial indexes and functions for the analysis and processing of GIS objects.
- “How close to my office is the nearest metro station?”
- “What is the shortest route from my home to the shopping mall?”
- “Where should a new hospital be built to be as near as possible to as many people as possible in a city?”
The answers to these frequently asked questions used to be provided by specialized GIS (Geographic Information Systems) applications. However, these were not a viable solution for dealing with massive spatial datasets (for example: a dataset that includes all the routes in Europe). This is where modern systems like PostGIS step in.
Before you start
To complete the actions presented below, you must have:
- A Scaleway account logged into the console
- Owner status or IAM permissions allowing you to perform actions in the intended Organization
- A Database Instance
Creating a new database for PostGIS
- Connect to your Scaleway console and enter the Managed Databases section.
- Choose your Database Instance by clicking on its name.
- Click the Managed Databases tab, then on Create Database.
- Enter the name of the new database. In this tutorial, we use
townssurvey
. - Click the Users tab, followed by and Update Permissions.
- Grant your user permission to access the newly created
townssurvey
database.
Enabling the PostGIS extension
-
Connect to your Database using the
psql
client:psql -h <rbd_host> --port <rdb_port> -d townssurvey -U <rdb_user>Enter the password for your database user when prompted.
Tip -
Once connected enable the
postgis
extension for the database:CREATE EXTENSION postgis; -
Verify that everything went well, by running the following command:
SELECT PostGIS_version();You will see an output like the following example, which indicates that the extension has been correctly installed on the database:
townssurvey=> SELECT PostGIS_version();postgis_version---------------------------------------3.0 USE_GEOS=1 USE_PROJ=1 USE_STATS=1(1 row) -
Log out of the PostgreSQL shell when the database is ready.
\q
Optimizing the Database Instance for GIS database objects
PostgreSQL is designed to run in very different configurations, from small test or development setups to large databases handling millions of rows. GIS database objects are large in comparison to plain text data. Scaleway Databases are optimized for performance by default, but it is possible to fine-tune them for some parameters.
- Enter the configuration section of your Database Instance in the Scaleway console.
- Choose your Database Instance from the list, and click it to see the Database information page. Then click Advanced Settings. The advanced settings tab displays.
- Click to configure the parameters of your PostgreSQL server.
Tip
The parameter
work_mem
defines the amount of memory that internal sorting operations and hash tables can consume in the RAM, set this value to 16 MB. - Save the configuration by clicking on the check mark symbol (✔). The configuration of your Database Instance is updated:
Your Database Instance is configured now, and it is time to load some spatial data.
Downloading and converting a dataset
To get familiar with spatial data, you can download demo data and enter it into the database.
You can run the following commands either on a local computer running Ubuntu or on an Instance.
-
Update and upgrade the system.
apt update && apt upgrade -
Install the required software for the following steps using
apt
:apt install -y wget unzip postgis -
Create a directory to store the downloaded data and change to it:
mkdir -p /postgis/townssurvey && cd /postgis/townssurvey -
Download a dataset. We use a sample provided by MassGIS Data which contains information about the legal boundary for each of the 351 municipalities in Massachusetts.
wget https://s3.us-east-1.amazonaws.com/download.massgis.digital.mass.gov/shapefiles/state/townssurvey_shp.zip -
Unzip the downloaded data and list the unpacked data to view it:
unzip townssurvey_shp.zip && ls -
You will notice several
.dbf
,.prj
,.shx
, and.shp
files in the folder. These files together are called a ShapeFile, a common digital vector storage format for storing geometric location and associated attribute information for legacy GIS software.To import the data into the database, it has to be converted to SQL. To achieve this, PostGIS provides a tool called
shp2pgsql
which can automatically convert.shp
files into readable.sql
files. Convert the fileTOWNSSURVEY_POLY.shp
into an SQL file by running the following command:shp2pgsql -d ./TOWNSSURVEY_POLY.shp > /postgis/TOWNSSURVEY_POLY.sql -
Import the generated SQL file into your database using the
psql
client:psql -h <rbd_host> --port <rdb_port> -d townssurvey -U <rdb_user> -f /postgis/TOWNSSURVEY_POLY.sqlAn output like the following displays on the screen:
INSERT 0 1INSERT 0 1[...]INSERT 0 1INSERT 0 1COMMITANALYZEThe import has been completed successfully once the word
ANALYZE
displays.
Querying spatial data
Now as some demo data is available in the database, you can start querying it.
-
Connect to the
psql
console:psql -h <rbd_host> --port <rdb_port> -d townssurvey -U <rdb_user> -
List the tables available in the database:
townssurvey=> \dtThe command above will return a list of two tables:
List of relationsSchema | Name | Type | Owner--------+------------------+-------+-----------------public | spatial_ref_sys | table | _rdb_superadminpublic | townssurvey_poly | table | rdb_user(2 rows) -
The table to query is
townssurvey_poly
. To view a list of all available columns in the table, run the following command:townssurvey=> \d townssurvey_polyWhich returns a list as follows:
Table "public.townssurvey_poly"Column | Type | Collation | Nullable | Default------------+------------------------+-----------+----------+-----------------------------------------------gid | integer | | not null | nextval('townssurvey_poly_gid_seq'::regclass)town | character varying(21) | | |town_id | integer | | |pop1980 | bigint | | |pop1990 | bigint | | |pop2000 | bigint | | |popch80_90 | integer | | |popch90_00 | bigint | | |type | character varying(2) | | |island | integer | | |coastal_po | character varying(3) | | |fourcolor | integer | | |fips_stco | bigint | | |ccd_mcd | character varying(3) | | |fips_place | character varying(5) | | |fips_mcd | bigint | | |fips_count | integer | | |acres | numeric | | |square_mil | numeric | | |pop2010 | bigint | | |popch00_10 | bigint | | |shape_leng | numeric | | |shape_area | numeric | | |geom | geometry(MultiPolygon) | | |Indexes:"townssurvey_poly_pkey" PRIMARY KEY, btree (gid)The
geom
column data type contains spatial data of theMultiPolygon
type.As the data is now ready, we can find answers to questions like ”What are the 10 towns located in the northernmost part of Massachusetts?“.
To get the answer to this question, it is possible to query the spatial data in the database. As the town boundaries are not linear, there is more than one value for latitude. To get the latitude for each town, the centroid of each town has to be gathered using the
ST_Centroid
function of PostGIS.The centroid’s Y value is then extracted using the
ST_Y
function and this value can be used as the latitude.As several cities have more than one entry in the database, the results are filtered using the
DISTINCT ON
function from PostgreSQL.The following query returns a list of the 10 northernmost towns in Massachusetts:
SELECT town, ST_Y(ST_Centroid(geom)) as latitude FROM (SELECT DISTINCT ON (town) *FROM townssurvey_poly) tORDER BY latitude DESCLIMIT 10;The result will look like the following list:
town | latitude--------------+------------------AMESBURY | 953947.884028235SALISBURY | 952906.606868324MERRIMAC | 952462.995879863NEWBURYPORT | 950674.541316222WEST NEWBURY | 949284.811099743NEWBURY | 945386.010693761HAVERHILL | 944694.91004576GROVELAND | 944664.684131484CLARKSBURG | 943028.430506038MONROE | 942395.280914487(10 rows) -
Quit the database.
\q
Conclusion
You have enabled the PostGIS extension for PostgreSQL, imported sample data, and created your first queries. For more information about PostGIS, refer to the official documentation.