Sqitch

sqitchtutorial-snowflake

Name

sqitchtutorial-snowflake - A tutorial introduction to Sqitch change management on Snowflake

Synopsis

sqitch *

Description

This tutorial explains how to create a sqitch-enabled Snowflake project, use a VCS for deployment planning, and work with other developers to make sure changes remain in sync and in the proper order.

We’ll start by creating a new project from scratch, a fictional antisocial networking site called Flipr. All examples use Git as the VCS and Snowflake as the storage engine, but for the most part you can substitute other VCSes and database engines in the examples as appropriate.

If you’d like to manage a PostgreSQL database, see sqitchtutorial.

If you’d like to manage an SQLite database, see sqitchtutorial-sqlite.

If you’d like to manage an Oracle database, see sqitchtutorial-oracle.

If you’d like to manage a MySQL database, see sqitchtutorial-mysql.

If you’d like to manage a Firebird database, see sqitchtutorial-firebird.

If you’d like to manage a Vertica database, see sqitchtutorial-vertica.

If you’d like to manage an Exasol database, see sqitchtutorial-exasol.

Connection Configuration

Sqitch requires ODBC to connect to the Snowflake database. As such, you’ll need to make sure that the Snowflake ODBC driver is installed and properly configured. At its simplest, on Unix-like systems, name the driver “Snowflake” by adding this entry to odbcinst.ini (usually found in /etc, /usr/etc, or /usr/local/etc):

[Snowflake]
Description = ODBC for Snowflake
Driver      = /usr/lib64/snowflake/odbc/lib/libSnowflake.so

Note that you’ll need to adjust the path depending on the version of the ODBC driver, and where you installed it.

See the Snowflake ODBC documentation for details on downloading, installing, and configuring ODBC for your platform.

Starting a New Project

Usually the first thing to do when starting a new project is to create a source code repository. So let’s do that with Git:

> mkdir flipr
> cd flipr
> git init .
Initialized empty Git repository in /flipr/.git/
> touch README.md
> git add .
> git commit -am 'Initialize project, add README.'

If you’re a Git user and want to follow along the history, the repository used in these examples is on GitHub.

Now that we have a repository, let’s get started with Sqitch. Every Sqitch project must have a name associated with it, and, optionally, a unique URI. We recommend including the URI, as it increases the uniqueness of object identifiers internally, and will prevent the deployment of a different project with the same name. So let’s specify one when we initialize Sqitch:

> sqitch init flipr --uri https://github.com/sqitchers/sqitch-snowflake-intro/ --engine snowflake
Created sqitch.conf
Created sqitch.plan
Created deploy/
Created revert/
Created verify/

Let’s have a look at sqitch.conf:

> cat sqitch.conf
[core]
  engine = snowflake
  # plan_file = sqitch.plan
  # top_dir = .
# [engine "snowflake"]
  # target = db:snowflake:
  # registry = sqitch
  # client = snowsql

Good, it picked up on the fact that we’re creating changes for the Snowflake engine, thanks to the --engine snowflake option, and saved it to the file. Furthermore, it wrote a commented-out [engine "snowflake"] section with all the available Snowflake engine-specific settings commented out and ready to be edited as appropriate.

By default, Sqitch will read sqitch.conf in the current directory for settings. But it will also read ~/.sqitch/sqitch.conf for user-specific settings. Since Snowflake’s snowsql client is not in the path on my system, let’s go ahead an tell it where to find the client on our computer (don’t bother if you’re using the Docker image because it uses the client inside the container, not on your host machine):

> sqitch config --user engine.snowflake.client /Applications/SnowSQL.app/Contents/MacOS/snowsql

And let’s also tell it who we are, since this data will be used in all of our projects:

> sqitch config --user user.name 'Marge N. O’Vera'
> sqitch config --user user.email 'marge@example.com'

Have a look at ~/.sqitch/sqitch.conf and you’ll see this:

> cat ~/.sqitch/sqitch.conf
[engine "snowflake"]
  client = /Applications/SnowSQL.app/Contents/MacOS/snowsql
[user]
  name = Marge N. O’Vera
  email = marge@example.com

Which means that Sqitch should be able to find snowsql for any project, and that it will always properly identify us when planning and committing changes.

Back to the repository. Have a look at the plan file, sqitch.plan:

> cat sqitch.plan
%syntax-version=1.0.0
%project=flipr
%uri=https://github.com/sqitchers/sqitch-snowflake-intro/

Note that it has picked up on the name and URI of the app we’re building. Sqitch uses this data to manage cross-project dependencies. The %syntax-version pragma is always set by Sqitch, so that it always knows how to parse the plan, even if the format changes in the future.

Let’s commit these changes and start creating the database changes.

> git add .
> git commit -am 'Initialize Sqitch configuration.'
[main b731cc3] Initialize Sqitch configuration.
 2 files changed, 15 insertions(+)
 create mode 100644 sqitch.conf
 create mode 100644 sqitch.plan

Our First Change

First, our project will need a schema. This creates a nice namespace for all of the objects that will be part of the flipr app. Run this command:

> sqitch add appschema -n 'Add schema for all flipr objects.'
Created deploy/appschema.sql
Created revert/appschema.sql
Created verify/appschema.sql
Added "appschema" to sqitch.plan

The add command adds a database change to the plan and writes deploy, revert, and verify scripts that represent the change. Now we edit these files. The deploy script’s job is to create the schema. So we add this to deploy/appschema.sql:

CREATE SCHEMA flipr;

The revert script’s job is to precisely revert the change to the deploy script, so we add this to revert/appschema.sql:

DROP SCHEMA flipr;

Now we can try deploying this change. We tell Sqitch where to send the change via a database URI. Let’s say we’re using the account name example, username movera, database flipr, and warehouse sqitch, and an ODBC driver named Snowflake (see “Connection Configuration” for details). The URI would be structured like this:

db:snowflake://movera@example/flipr?Driver=Snowflake;warehouse=sqitch

A few notes on the URL format:

Back to the change. We just tell Sqitch to use that URI to deploy it:

> sqitch deploy 'db:snowflake://movera@example/flipr?Driver=Snowflake'
Adding registry tables to db:snowflake://movera@example/flipr?Driver=Snowflake
Deploying changes to db:snowflake://movera@example/flipr?Driver=Snowflake
  + appschema .. ok

First Sqitch created registry tables used to track database changes. The structure and name of the registry varies between databases (Snowflake uses a schema to namespace its registry, while SQLite and MySQL use separate databases). Next, Sqitch deploys changes. We only have one so far; the + reinforces the idea that the change is being added to the database.

Note that this process can take quite a bit of time. Sqitch connects to the database via ODBC and retains the connection throughout, but the creation of the registry and all change scripts run through individual runs of snowsql. These connections can be quite slow. So if Sqitch seems hung, just wait; it’s most likely waiting on Snowflake.

With this change deployed, if you connect to the database, you’ll be able to see the schema:

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW TERSE SCHEMAS LIKE 'flipr'"
+-------------------------------+-------+------+---------------+-------------+
| created_on                    | name  | kind | database_name | schema_name |
|-------------------------------+-------+------+---------------+-------------|
| 2018-07-27 14:47:22.614 +0000 | FLIPR | NULL | DWHEELER      | NULL        |
+-------------------------------+-------+------+---------------+-------------+
1 Row(s) produced. Time Elapsed: 0.283s

Trust, But Verify

But that’s too much work. Do you really want to do something like that after every deploy?

Here’s where the verify script comes in. Its job is to test that the deploy did was it was supposed to. It should do so without regard to any data that might be in the database, and should throw an error if the deploy was not successful. In Snowflake, the simplest way to do so for schema is probably to simply create an object in the schema. Put this SQL into verify/appschema.sql:

CREATE TEMPORARY TABLE flipr.verify__ (id INT);

In truth, you can use any query that generates an SQL error if the schema doesn’t exist. Another handy way to do that is to divide by zero if an object doesn’t exist. For example, to throw an error when the flipr schema does not exist, you could do something like this:

USE WAREHOUSE &warehouse;
SELECT 1/COUNT(*) FROM information_schema.schemata WHERE schema_name = 'FLIPR';

Note the USE WAREHOUSE statement which is provided in the default Snowflake change script templates. For scripts that execute queries requiring compute resources (typically DML and SELECT statements), we’ll need to use a virtual warehouse. This statement lets the script use the warehouse that Sqitch itself uses for its registry, which should be a reasonable default, since Sqitch is already using this warehouse. You can always change it to a different warehouse if need be. If not, Sqitch always sets this variable (as well as &registry containing the name of the Sqitch registry schema) for all deploy, revert, and verify script executions.

Now run the verify script with the verify command:

> sqitch verify 'db:snowflake://movera@example/flipr?Driver=Snowflake'
Verifying db:snowflake://movera@example/flipr?Driver=Snowflake
  * appschema .. ok
Verify successful

Looks good! If you want to make sure that the verify script correctly dies if the schema doesn’t exist, temporarily change the schema name in the script to something that doesn’t exist, something like:

CREATE TEMPORARY TABLE nonesuch.verify__ (id INT);

Then verify again:

> sqitch verify 'db:snowflake://movera@example/flipr?Driver=Snowflake'
Verifying db:snowflake://movera@example/flipr?Driver=Snowflake
* appschema ..
002003 (02000): SQL compilation error:
Schema 'FLIPR.NONESUCH' does not exist.
# Verify script "verify/appschema.sql" failed.
not ok

Verify Summary Report
---------------------
Changes: 1
Errors:  1
Verify failed

It’s even nice enough to tell us what the problem is. Or, for the divide-by-zero example, change the schema name:

USE WAREHOUSE &warehouse;
SELECT 1/COUNT(*) FROM information_schema.schemata WHERE schema_name = 'NONESUCH';

Then the verify will look something like:

> sqitch verify 'db:snowflake://movera@example/flipr?Driver=Snowflake'
Verifying db:snowflake://movera@example/flipr?Driver=Snowflake
* appschema ..
100051 (22012): Division by zero
# Verify script "verify/appschema.sql" failed.
not ok

Verify Summary Report
---------------------
Changes: 1
Errors:  1
Verify failed

Less useful error output, but enough to alert us that something has gone wrong.

Don’t forget to change the schema name back before continuing!

Status, Revert, Log, Repeat

For purely informational purposes, we can always see how a deployment was recorded via the status command, which reads the registry tables from the database:

> sqitch status 'db:snowflake://movera@example/flipr?Driver=Snowflake'
# On database db:snowflake://movera@example/flipr?Driver=Snowflake
# Project:  flipr
# Change:   5a2ac4ae6801bfe392483ee5912b4e3592cdd57a
# Name:     appschema
# Deployed: 2018-07-27 10:47:23 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Nothing to deploy (up-to-date)

Let’s make sure that we can revert the change:

> sqitch revert 'db:snowflake://movera@example/flipr?Driver=Snowflake'
Revert all changes from db:snowflake://movera@example/flipr?Driver=Snowflake? [Yes]
  - appschema .. ok

The revert command first prompts to make sure that we really do want to revert. This is to prevent unnecessary accidents. You can pass the -y option to disable the prompt. Also, notice the - before the change name in the output, which reinforces that the change is being removed from the database. And now the schema should be gone:

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW TERSE SCHEMAS LIKE 'flipr'"
+------------+------+------+---------------+-------------+
| created_on | name | kind | database_name | schema_name |
|------------+------+------+---------------+-------------|
+------------+------+------+---------------+-------------+
0 Row(s) produced. Time Elapsed: 0.204s

And the status message should reflect as much:

> sqitch status 'db:snowflake://movera@example/flipr?Driver=Snowflake'
# On database db:snowflake://movera@example/flipr?Driver=Snowflake
No changes deployed

Of course, since nothing is deployed, the verify command has nothing to verify:

> sqitch verify 'db:snowflake://movera@example/flipr?Driver=Snowflake'
Verifying db:snowflake://movera@example/flipr?Driver=Snowflake
No changes deployed

However, we still have a record that the change happened, visible via the log command:

> sqitch log 'db:snowflake://movera@example/flipr?Driver=Snowflake'
On database db:snowflake://movera@example/flipr?Driver=Snowflake
Revert 5a2ac4ae6801bfe392483ee5912b4e3592cdd57a
Name:      appschema
Committer: Marge N. O’Vera <marge@example.com>
Date:      2018-07-27 10:48:48 -0400

    Add schema for all flipr objects.

Deploy 5a2ac4ae6801bfe392483ee5912b4e3592cdd57a
Name:      appschema
Committer: Marge N. O’Vera <marge@example.com>
Date:      2018-07-27 10:47:24 -0400

    Add schema for all flipr objects.

Note that the actions we took are shown in reverse chronological order, with the revert first and then the deploy.

Cool. Now let’s commit it.

> git add .
> git commit -m 'Add flipr schema.'
[main 7fd5ace] Add flipr schema.
4 files changed, 10 insertions(+)
create mode 100644 deploy/appschema.sql
create mode 100644 revert/appschema.sql
create mode 100644 verify/appschema.sql

And then deploy again. This time, let’s use the --verify option, so that the verify script is applied when the change is deployed:

> sqitch deploy --verify 'db:snowflake://movera@example/flipr?Driver=Snowflake'
Deploying changes to db:snowflake://movera@example/flipr?Driver=Snowflake
  + appschema .. ok

And now the schema should be back:

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW TERSE SCHEMAS LIKE 'flipr'"
+-------------------------------+-------+------+---------------+-------------+
| created_on                    | name  | kind | database_name | schema_name |
|-------------------------------+-------+------+---------------+-------------|
| 2018-07-27 14:52:50.116 +0000 | FLIPR | NULL | DWHEELER      | NULL        |
+-------------------------------+-------+------+---------------+-------------+
1 Row(s) produced. Time Elapsed: 0.283s

When we look at the status, the deployment will be there:

> sqitch status 'db:snowflake://movera@example/flipr?Driver=Snowflake'
# On database db:snowflake://movera@example/flipr?Driver=Snowflake
# Project:  flipr
# Change:   5a2ac4ae6801bfe392483ee5912b4e3592cdd57a
# Name:     appschema
# Deployed: 2018-07-27 10:52:54 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Nothing to deploy (up-to-date)

On Target

I’m getting a little tired of always having to type db:snowflake://movera@example/flipr?Driver=Snowflake, aren’t you? This database connection URI tells Sqitch how to connect to the deployment target, but we don’t have to keep using the URI. We can name the target:

> sqitch target add flipr_test 'db:snowflake://movera@example/flipr?Driver=Snowflake'

The target command, inspired by git-remote, allows management of one or more named deployment targets. We’ve just added a target named flipr_test, which means we can use the string flipr_test for the target, rather than the URI. But since we’re doing so much testing, we can also tell Sqitch to deploy to the flipr_test target by default:

> sqitch engine add snowflake flipr_test

Now we can omit the target argument altogether, unless we need to deploy to another database. Which we will, eventually, but at least our examples will be simpler from here on in, e.g.:

> sqitch status
# On database flipr_test
# Project:  flipr
# Change:   5a2ac4ae6801bfe392483ee5912b4e3592cdd57a
# Name:     appschema
# Deployed: 2018-07-27 10:52:54 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Nothing to deploy (up-to-date)

Yay, that allows things to be a little more concise. Let’s also make sure that changes are verified after deploying them:

> sqitch config --bool deploy.verify true
> sqitch config --bool rebase.verify true

We’ll see the rebase command a bit later. In the meantime, let’s commit the new configuration and and make some more changes!

> git commit -am 'Set default deployment target and always verify.'
[main 3834a8d] Set default deployment target and always verify.
 1 files changed, 8 insertions(+), 0 deletions(-)

Deploy with Dependency

Let’s add another change, this time to create a table. Our app will need users, of course, so we’ll create a table for them. First, add the new change:

> sqitch add users --requires appschema -n 'Creates table to track our users.'
Created deploy/users.sql
Created revert/users.sql
Created verify/users.sql
Added "users [appschema]" to sqitch.plan

Note that we’re requiring the appschema change as a dependency of the new users change. Although that change has already been added to the plan and therefore should always be applied before the users change, it’s a good idea to be explicit about dependencies.

Now edit the scripts. When you’re done, deploy/users.sql should look like this:

-- Deploy flipr:users to snowflake
-- requires: appschema

USE WAREHOUSE &warehouse;
CREATE TABLE flipr.users (
    nickname  TEXT         PRIMARY KEY,
    password  TEXT         NOT NULL,
    fullname  TEXT         NOT NULL,
    twitter   TEXT         NOT NULL,
    timestamp TIMESTAMP_TZ NOT NULL DEFAULT CURRENT_TIMESTAMP
);

A few things to notice here. On the second line, the dependence on the appschema change has been listed. This doesn’t do anything, but the default deploy Snowflake template lists it here for your reference while editing the file. Useful, right?

The table itself will be created in the flipr schema. This is why we need to require the appschema change.

On the fourth line, the USE WAREHOUSE statement was inserted by the default Snowflake template. We don’t actually need it to create a table, but there’s no harm in leaving it here.

Now for the verify script. The simplest way to check that the table was created and has the expected columns without touching the data? Just select from the table with a false WHERE clause. Here the USE WAREHOUSE statement is required so that the SELECT statement can actually execute. Probably easiest just to leave the default, which uses the warehouse that Sqitch uses to maintain its registry. Edit verify/users.sql to look like this:

USE WAREHOUSE &warehouse;
SELECT nickname, password, fullname, twitter, timestamp
  FROM flipr.users
WHERE FALSE;

Now for the revert script: all we have to do is drop the table. Add this to revert/users.sql:

DROP TABLE flipr.users;

Couldn’t be much simpler, right? Let’s deploy this bad boy:

> sqitch deploy
Deploying changes to flipr_test
  + users .. ok

We know, since verification is enabled, that the table must have been created. But for the purposes of visibility, let’s have a quick look:

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW TERSE TABLES LIKE 'users' IN flipr"
+-------------------------------+-------+-------+---------------+-------------+
| created_on                    | name  | kind  | database_name | schema_name |
|-------------------------------+-------+-------+---------------+-------------|
| 2018-07-27 15:13:21.767 +0000 | USERS | TABLE | DWHEELER      | FLIPR       |
+-------------------------------+-------+-------+---------------+-------------+
1 Row(s) produced. Time Elapsed: 0.318s

We can also verify all currently deployed changes with the verify command:

> sqitch verify
Verifying flipr_test
  * appschema .. ok
  * users ...... ok
Verify successful

Now have a look at the status:

> sqitch status
# On database flipr_test
# Project:  flipr
# Change:   d251b2c9b4bc46a4b4db6b7a8a637951484e6f6b
# Name:     users
# Deployed: 2018-07-27 11:09:12 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Nothing to deploy (up-to-date)

Success! Let’s make sure we can revert the change, as well:

> sqitch revert --to @HEAD^ -y
Reverting changes to appschema from flipr_test
  - users .. ok

Note that we’ve used the --to option to specify the change to revert to. And what do we revert to? The symbolic tag @HEAD, when passed to revert, always refers to the last change deployed to the database. (For other commands, it refers to the last change in the plan.) Appending the caret (^) tells Sqitch to select the change prior to the last deployed change. So we revert to appschema, the penultimate change. The other potentially useful symbolic tag is @ROOT, which refers to the first change deployed to the database (or in the plan, depending on the command).

Back to the database. The users table should be gone but the flipr schema should still be around:

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW TERSE TABLES LIKE 'users' IN flipr"
+------------+------+------+---------------+-------------+
| created_on | name | kind | database_name | schema_name |
|------------+------+------+---------------+-------------|
+------------+------+------+---------------+-------------+
0 Row(s) produced. Time Elapsed: 0.367s

The status command politely informs us that we have undeployed changes:

# On database flipr_test
# Project:  flipr
# Change:   5a2ac4ae6801bfe392483ee5912b4e3592cdd57a
# Name:     appschema
# Deployed: 2018-07-27 10:52:54 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Undeployed change:
  * users

As does the verify command:

> sqitch verify
Verifying flipr_test
  * appschema .. ok
Undeployed change:
  * users
Verify successful

Note that the verify is successful, because all currently-deployed changes are verified. The list of undeployed changes (just “users” here) reminds us about the current state.

Okay, let’s commit and deploy again:

> git add .
> git commit -am 'Add users table.'
[main 8c16c09] Add users table.
 4 files changed, 22 insertions(+)
 create mode 100644 deploy/users.sql
 create mode 100644 revert/users.sql
 create mode 100644 verify/users.sql
> sqitch deploy
Deploying changes to flipr_test
  + users .. ok

Looks good. Check the status:

> sqitch status
# Project:  flipr
# Change:   d251b2c9b4bc46a4b4db6b7a8a637951484e6f6b
# Name:     users
# Deployed: 2018-07-27 11:19:30 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Nothing to deploy (up-to-date)

Excellent. Let’s do some more!

Add Two at Once

Let’s add a couple more changes. Our app will need to store status messages from users. Let’s call them – and the table to store them – “flips”. And we’ll also need a view that lists user names with their flips. Let’s add changes for them both:

> sqitch add flips -r appschema -r users -n 'Adds table for storing flips.'
Created deploy/flips.sql
Created revert/flips.sql
Created verify/flips.sql
Added "flips [appschema users]" to sqitch.plan

> sqitch add userflips -r appschema -r users -r flips \
  -n 'Creates the userflips view.'
Created deploy/userflips.sql
Created revert/userflips.sql
Created verify/userflips.sql
Added "userflips [appschema users flips]" to sqitch.plan

Now might be a good time to have a look at the deployment plan:

> cat sqitch.plan
%syntax-version=1.0.0
%project=flipr
%uri=https://github.com/sqitchers/sqitch-snowflake-intro/

appschema 2018-07-27T14:27:24Z Marge N. O’Vera <marge@example.com> # Add schema for all flipr objects.
users [appschema] 2018-07-27T15:03:56Z Marge N. O’Vera <marge@example.com> # Creates table to track our users.
flips [appschema users] 2018-07-27T15:23:41Z Marge N. O’Vera <marge@example.com> # Adds table for storing flips.
userflips [appschema users flips] 2018-07-27T15:23:50Z Marge N. O’Vera <marge@example.com> # Creates the userflips view.

Each change appears on a single line with the name of the change, a bracketed list of dependencies, a timestamp, the name and email address of the user who planned the change, and a note.

Let’s write the code for the new changes. Here’s what deploy/flips.sql should look like:

-- Deploy flipr:flips to snowflake
-- requires: appschema
-- requires: users

USE WAREHOUSE &warehouse;
CREATE TABLE flipr.flips (
    id        INTEGER        PRIMARY KEY,
    nickname  TEXT           NOT NULL REFERENCES flipr.users(nickname),
    body      VARCHAR(180)   NOT NULL DEFAULT '',
    timestamp TIMESTAMP_TZ NOT NULL DEFAULT CURRENT_TIMESTAMP
);

Here’s what verify/flips.sql might look like:

-- Verify flipr:flips on snowflake

USE WAREHOUSE &warehouse;
SELECT id, nickname, body, timestamp
  FROM flipr.flips
 WHERE FALSE;

And revert/flips.sql should look something like this:

-- Revert flipr:flips from snowflake

USE WAREHOUSE &warehouse;
DROP TABLE flipr.flips;

Now for userflips; deploy/userflips.sql might look like this:

-- Deploy flipr:userflips to snowflake
-- requires: appschema
-- requires: users
-- requires: flips

USE WAREHOUSE &warehouse;
CREATE OR REPLACE VIEW flipr.userflips AS
SELECT f.id, u.nickname, u.fullname, f.body, f.timestamp
  FROM flipr.users u
  JOIN flipr.flips f ON u.nickname = f.nickname;

Use a SELECT statement in verify/userflips.sql again:

-- Verify flipr:userflips on snowflake

USE WAREHOUSE &warehouse;
SELECT id, nickname, fullname, body, timestamp
  FROM flipr.userflips
 WHERE FALSE;

And of course, its revert script, revert/userflips.sql, should look something like:

-- Revert flipr:userflips from snowflake

USE WAREHOUSE &warehouse;
DROP VIEW flipr.userflips;

Try em out!

> sqitch deploy
Deploying changes to flipr_test
  + flips ...... ok
  + userflips .. ok

Do we have the new table and view? Of course we do, they were verified. Still, have a look:

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW TERSE TABLES LIKE 'flips' IN flipr"
+-------------------------------+-------+-------+---------------+-------------+
| created_on                    | name  | kind  | database_name | schema_name |
|-------------------------------+-------+-------+---------------+-------------|
| 2018-07-27 15:31:07.137 +0000 | FLIPS | TABLE | DWHEELER      | FLIPR       |
+-------------------------------+-------+-------+---------------+-------------+
1 Row(s) produced. Time Elapsed: 0.225s

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW TERSE VIEWS LIKE 'userflips' IN flipr"
+-------------------------------+-----------+------+---------------+-------------+
| created_on                    | name      | kind | database_name | schema_name |
|-------------------------------+-----------+------+---------------+-------------|
| 2018-07-27 15:29:25.733 +0000 | USERFLIPS | VIEW | DWHEELER      | FLIPR       |
+-------------------------------+-----------+------+---------------+-------------+
1 Row(s) produced. Time Elapsed: 0.299s

And what’s the status?

> sqitch status
# On database flipr_test
# Project:  flipr
# Change:   73cd50c99de2a8b3eab206c73514afbeb952023c
# Name:     userflips
# Deployed: 2018-07-27 11:31:24 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Nothing to deploy (up-to-date)

Looks good. Let’s make sure revert works:

> sqitch revert -y --to @HEAD^^
Reverting changes to users from flipr_test
  - userflips .. ok
  - flips ...... ok

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW TERSE TABLES LIKE 'flips' IN flipr"
+------------+------+------+---------------+-------------+
| created_on | name | kind | database_name | schema_name |
|------------+------+------+---------------+-------------|
+------------+------+------+---------------+-------------+
0 Row(s) produced. Time Elapsed: 0.306s

Note the use of @HEAD^^ to specify that the revert be to two changes prior the last deployed change. Looks good. Let’s do the commit and re-deploy dance:

> git add .
> git commit -m 'Add flips table and userflips view.'
[main b36f48b] Add flips table and userflips view.
 7 files changed, 43 insertions(+)
 create mode 100644 deploy/flips.sql
 create mode 100644 deploy/userflips.sql
 create mode 100644 revert/flips.sql
 create mode 100644 revert/userflips.sql
 create mode 100644 verify/flips.sql
 create mode 100644 verify/userflips.sql

> sqitch deploy
Deploying changes to flipr_test
  + flips ...... ok
  + userflips .. ok

> sqitch status
# Project:  flipr
# Change:   73cd50c99de2a8b3eab206c73514afbeb952023c
# Name:     userflips
# Deployed: 2018-07-27 11:38:02 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Nothing to deploy (up-to-date)

> sqitch verify
Verifying flipr_test
  * appschema .. ok
  * users ...... ok
  * flips ...... ok
  * userflips .. ok
Verify successful

Great, we’re fully up-to-date!

Ship It!

Let’s do a first release of our app. Let’s call it 1.0.0-dev1 Since we want to have it go out with deployments tied to the release, let’s tag it:

> sqitch tag v1.0.0-dev1 -n 'Tag v1.0.0-dev1.'
Tagged "userflips" with @v1.0.0-dev1
> git commit -am 'Tag the database with v1.0.0-dev1.'
[main 84ed9db] Tag the database with v1.0.0-dev1.
 1 files changed, 1 insertion(+)
> git tag v1.0.0-dev1 -am 'Tag v1.0.0-dev1'

We can try deploying to make sure the tag gets picked up like so:

> sqitch deploy
Nothing to deploy (up-to-date)
> sqitch status
# On database flipr_test
# Project:  flipr
# Change:   73cd50c99de2a8b3eab206c73514afbeb952023c
# Name:     userflips
# Tag:      @v1.0.0-dev1
# Deployed: 2018-07-27 11:38:02 -0400
# By:       Marge N. O’Vera <marge@example.com>
#
Nothing to deploy (up-to-date)

Note the new “Tag” line in the output of sqitch status: no new changes needed to be deployed, but Sqitch did deploy the tag on the userflips change. Now let’s bundle everything up for release:

> sqitch bundle
Bundling into bundle
Writing config
Writing plan
Writing scripts
  + appschema
  + users
  + flips
  + userflips @v1.0.0-dev1

Now we can package the bundle directory and distribute it. When it gets installed somewhere, users can use Sqitch to deploy to the database. Let’s try deploying it to another database, flipr_prod:

> cd bundle
> sqitch deploy 'db:snowflake://movera@example/flipr_prod?Driver=Snowflake'
Adding registry tables to db:snowflake://movera@example/flipr_prod?Driver=Snowflake'
Deploying changes to db:snowflake://movera@example/flipr_prod?Driver=Snowflake'
  + appschema ............... ok
  + users ................... ok
  + flips ................... ok
  + userflips @v1.0.0-dev1 .. ok

Notice how the tag on userflips now appears in the deploy output. Nice, eh? Now, package it up and ship it!

> cd ..
> mv bundle flipr-v1.0.0-dev1
> tar -czf flipr-v1.0.0-dev1.tgz flipr-v1.0.0-dev1

Making a Hash of Things

Now that we’ve got the basics of the app done, let’s add a feature. Gotta track the hashtags associated with flips, right? Let’s add a table for them. But since other folks are working on other tasks in the repository, we’ll work on a branch, so we can all stay out of each other’s way. So let’s branch:

> git checkout -b hashtags
Switched to a new branch 'hashtags'

Now we can add a new change to create a table for hashtags.

> sqitch add hashtags --requires flips -n 'Adds table for storing hashtags.'
Created deploy/hashtags.sql
Created revert/hashtags.sql
Created verify/hashtags.sql
Added "hashtags [appschema flips]" to sqitch.plan

You know the drill by now. Add this to deploy/hashtags.sql

CREATE TABLE flipr.hashtags (
    flip_id   INTEGER       NOT NULL REFERENCES flipr.flips(id),
    hashtag   VARCHAR(128)  NOT NULL,
    PRIMARY KEY (flip_id, hashtag)
);

Again, select from the table in verify/hashtags.sql:

SELECT flip_id, hashtag FROM flipr.hashtags WHERE FALSE;

And drop it in revert/hashtags.sql

DROP TABLE flipr.hashtags;

And give it a whirl:

> sqitch deploy
Deploying changes to flipr_test
  + hashtags .. ok

Look good?

> sqitch status --show-tags
# On database flipr_test
# Project:  flipr
# Change:   d750cbeec487841c45715115a31297739fbb4046
# Name:     hashtags
# Deployed: 2018-07-27 11:53:02 -0400
# By:       Marge N. O’Vera <marge@example.com>
# 
# Tag:
#   @v1.0.0-dev1 - 2018-07-27 11:41:13 -0400 - Marge N. O’Vera <marge@example.com>
# 
Nothing to deploy (up-to-date)

Note the use of --show-tags to show all the deployed tags. Make sure we can revert, too:

> sqitch revert -y --onto @HEAD^
Reverting changes to userflips @v1.0.0-dev1 from flipr_test
  - hashtags .. ok
> sqitch deploy
Deploying changes to flipr_test
  + hashtags .. ok

Great! Now make it so:

> git add .
> git commit -m 'Add hashtags table.'
[hashtags 06a0bf4] Add hashtags table.
 4 files changed, 19 insertions(+)
 create mode 100644 deploy/hashtags.sql
 create mode 100644 revert/hashtags.sql
 create mode 100644 verify/hashtags.sql

Good, we’ve finished this feature. Time to merge back into main.

Emergency

Let’s do it:

> git checkout main
Switched to branch 'main'
> git pull
Updating 84ed9db..31d026c
Fast-forward
 deploy/lists.sql | 11 +++++++++++
 revert/lists.sql |  4 ++++
 sqitch.plan      |  2 ++
 verify/lists.sql |  6 ++++++
 4 files changed, 23 insertions(+)
 create mode 100644 deploy/lists.sql
 create mode 100644 revert/lists.sql
 create mode 100644 verify/lists.sql

Hrm, that’s interesting. Looks like someone made some changes to main. They added list support. Well, let’s see what happens when we merge our changes.

> git merge --no-ff hashtags
Auto-merging sqitch.plan
CONFLICT (content): Merge conflict in sqitch.plan
Automatic merge failed; fix conflicts and then commit the result.

Oh, a conflict in sqitch.plan. Not too surprising, since both the merged lists branch and our hashtags branch added changes to the plan. Let’s try a different approach.

The truth is, we got lazy. Those changes when we pulled main from the origin should have raised a red flag. It’s considered a bad practice not to look at what’s changed in main before merging in a branch. What one should do is either:

So let’s restore things to how they were at main:

> git reset --hard HEAD
HEAD is now at 31d026c Merge branch 'lists'

That throws out our botched merge. Now let’s go back to our branch and rebase it on main:

> git checkout hashtags
Switched to branch 'hashtags'
> git rebase main
First, rewinding head to replay your work on top of it...
Applying: Add hashtags table.
Using index info to reconstruct a base tree...
M       sqitch.plan
Falling back to patching base and 3-way merge...
Auto-merging sqitch.plan
CONFLICT (content): Merge conflict in sqitch.plan
error: Failed to merge in the changes.
Patch failed at 0001 Add hashtags table.
Use 'git am --show-current-patch' to see the failed patch

Resolve all conflicts manually, mark them as resolved with
"git add/rm <conflicted_files>", then run "git rebase --continue".
You can instead skip this commit: run "git rebase --skip".
To abort and get back to the state before "git rebase", run "git rebase --abort".

Oy, that’s kind of a pain. It seems like no matter what we do, we’ll need to resolve conflicts in that file. Except in Git. Fortunately for us, we can tell Git to resolve conflicts in sqitch.plan differently. Because we only ever append lines to the file, we can have it use the “union” merge driver, which, according to its docs:

Run 3-way file level merge for text files, but take lines from both versions, instead of leaving conflict markers. This tends to leave the added lines in the resulting file in random order and the user should verify the result. Do not use this if you do not understand the implications.

This has the effect of appending lines from all the merging files, which is exactly what we need. So let’s give it a try. First, back out the botched rebase:

> git rebase --abort
HEAD is now at 06a0bf4 Add hashtags table.

Now add the union merge driver to .gitattributes for sqitch.plan and rebase again:

> echo sqitch.plan merge=union > .gitattributes
> git rebase main
First, rewinding head to replay your work on top of it...
Applying: Add hashtags table.
Using index info to reconstruct a base tree...
M       sqitch.plan
Falling back to patching base and 3-way merge...
Auto-merging sqitch.plan

Ah, that looks a bit better. Let’s have a look at the plan:

> cat sqitch.plan
%syntax-version=1.0.0
%project=flipr
%uri=https://github.com/sqitchers/sqitch-snowflake-intro/

appschema 2018-07-27T14:27:24Z Marge N. O’Vera <marge@example.com> # Add schema for all flipr objects.
users [appschema] 2018-07-27T15:03:56Z Marge N. O’Vera <marge@example.com> # Creates table to track our users.
flips [appschema users] 2018-07-27T15:23:41Z Marge N. O’Vera <marge@example.com> # Adds table for storing flips.
userflips [appschema users flips] 2018-07-27T15:23:50Z Marge N. O’Vera <marge@example.com> # Creates the userflips view.
@v1.0.0-dev1 2018-07-27T15:40:25Z Marge N. O’Vera <marge@example.com> # Tag v1.0.0-dev1.

lists [appschema flips] 2018-07-27T16:00:00Z Marge N. O’Vera <marge@example.com> # Adds table for storing lists.
hashtags [flips] 2018-07-27T15:51:16Z Marge N. O’Vera <marge@example.com> # Adds table for storing hashtags.

Note that it has appended the changes from the merged “lists” branch, and then merged the changes from our “hashtags” branch. Test it to make sure it works as expected:

> sqitch rebase -y
Reverting all changes from flipr_test
  - hashtags ................ ok
  - userflips @v1.0.0-dev1 .. ok
  - flips ................... ok
  - users ................... ok
  - appschema ............... ok
Deploying changes to flipr_test
  + appschema ............... ok
  + users ................... ok
  + flips ................... ok
  + userflips @v1.0.0-dev1 .. ok
  + lists ................... ok
  + hashtags ................ ok

Note the use of rebase, which combines a revert and a deploy into a single command. Handy, right? It correctly reverted our changes, and then deployed them all again in the proper order. So let’s commit .gitattributes; seems worthwhile to keep that change:

> git add .
> git commit -m 'Add `.gitattributes` with union merge for `sqitch.plan`.'
[hashtags 86596a9] Add `.gitattributes` with union merge for `sqitch.plan`.
 1 files changed, 1 insertions(+), 0 deletions(-)
 create mode 100644 .gitattributes

Merges Mastered

And now, finally, we can merge into main:

> git checkout main
Switched to branch 'main'
> git merge --no-ff hashtags -m "Merge branch 'hashtags'"
Merge made by the 'recursive' strategy.
 .gitattributes      | 1 +
 deploy/hashtags.sql | 9 ++++++++++
 revert/hashtags.sql | 4 ++++
 sqitch.plan         | 1 +
 verify/hashtags.sql | 4 ++++
 5 files changed, 19 insertions(+)
 create mode 100644 .gitattributes
 create mode 100644 deploy/hashtags.sql
 create mode 100644 revert/hashtags.sql
 create mode 100644 verify/hashtags.sql

And double-check our work:

> cat sqitch.plan
%syntax-version=1.0.0
%project=flipr
%uri=https://github.com/sqitchers/sqitch-snowflake-intro/

appschema 2018-07-27T14:27:24Z Marge N. O’Vera <marge@example.com> # Add schema for all flipr objects.
users [appschema] 2018-07-27T15:03:56Z Marge N. O’Vera <marge@example.com> # Creates table to track our users.
flips [appschema users] 2018-07-27T15:23:41Z Marge N. O’Vera <marge@example.com> # Adds table for storing flips.
userflips [appschema users flips] 2018-07-27T15:23:50Z Marge N. O’Vera <marge@example.com> # Creates the userflips view.
@v1.0.0-dev1 2018-07-27T15:40:25Z Marge N. O’Vera <marge@example.com> # Tag v1.0.0-dev1.

lists [appschema flips] 2018-07-27T16:00:00Z Marge N. O’Vera <marge@example.com> # Adds table for storing lists.
hashtags [flips] 2018-07-27T15:51:16Z Marge N. O’Vera <marge@example.com> # Adds table for storing hashtags.

Much much better, a nice clean main now. And because it is now identical to the “hashtags” branch, we can just carry on. Go ahead and tag it, bundle, and release:

> sqitch tag v1.0.0-dev2 -n 'Tag v1.0.0-dev2.'
Tagged "hashtags" with @v1.0.0-dev2
> git commit -am 'Tag the database with v1.0.0-dev2.'
[main 1c67e0d] Tag the database with v1.0.0-dev2.
 1 files changed, 1 insertion(+)
> git tag v1.0.0-dev2 -am 'Tag v1.0.0-dev2'
> sqitch bundle --dest-dir flipr-1.0.0-dev2
Bundling into flipr-1.0.0-dev2
Writing config
Writing plan
Writing scripts
  + appschema
  + users
  + flips
  + userflips @v1.0.0-dev1
  + lists
  + hashtags @v1.0.0-dev2

Note the use of the --dest-dir option to sqitch bundle. Just a nicer way to create the top-level directory name so we don’t have to rename it from bundle.

In Place Changes

Well, some folks have been testing the 1.0.0-dev2 release and have demanded that Twitter user links be added to Flipr pages. Why anyone would want to include social network links in an anti-social networking app is beyond us programmers, but we’re just the plumbers, right? Gotta go with what Product demands. The upshot is that we need to update the userflips view, which is used for the feature in question, to include the Twitter user names.

Normally, modifying views in database changes is a PITA. You have to make changes like these:

  1. Copy deploy/userflips.sql to deploy/userflips_twitter.sql.
  2. Edit deploy/userflips_twitter.sql to drop and re-create the view with the twitter column to the view.
  3. Copy deploy/userflips.sql to revert/userflips_twitter.sql. Yes, copy the original change script to the new revert change.
  4. Add a DROP VIEW statement to revert/userflips_twitter.sql.
  5. Copy verify/userflips.sql to verify/userflips_twitter.sql.
  6. Modify verify/userflips_twitter.sql to include a check for the twiter column.
  7. Test the changes to make sure you can deploy and revert the userflips_twitter change.

But you can have Sqitch do most of the work for you. The only requirement is that a tag appear between the two instances of a change we want to modify. In general, you’re going to make a change like this after a release, which you’ve tagged anyway, right? Well we have, with @v1.0.0-dev2 added in the previous section. With that, we can let Sqitch do most of the hard work for us, thanks to the rework command, which is similar to add:

> sqitch rework userflips -n 'Adds userflips.twitter.'
Added "userflips [userflips@v1.0.0-dev2]" to sqitch.plan.
Modify these files as appropriate:
  * deploy/userflips.sql
  * revert/userflips.sql
  * verify/userflips.sql

Oh, so we can edit those files in place. Nice! How does Sqitch do it? Well, in point of fact, it has copied the files to stand in for the previous instance of the userflips change, which we can see via git status:

> git status
On branch main
Your branch is up to date with 'origin/main'.

Changes not staged for commit:
  (use "git add <file>..." to update what will be committed)
  (use "git checkout -- <file>..." to discard changes in working directory)

  modified:   revert/userflips.sql
  modified:   sqitch.plan

Untracked files:
  (use "git add <file>..." to include in what will be committed)

  deploy/userflips@v1.0.0-dev2.sql
  revert/userflips@v1.0.0-dev2.sql
  verify/userflips@v1.0.0-dev2.sql

no changes added to commit (use "git add" and/or "git commit -a")

The “Untracked files” part of the output is the first thing to notice. They’re all named userflips@v1.0.0-dev2.sql. What that means is: “the userflips change as it was implemented as of the @v1.0.0-dev2 tag.” These are copies of the original scripts, and thereafter Sqitch will find them when it needs to run scripts for the first instance of the userflips change. As such, it’s important not to change them again. But hey, if you’re reworking the change, you shouldn’t need to.

The other thing to notice is that revert/userflips.sql has changed. Sqitch replaced it with the original deploy script. As of now, deploy/userflips.sql and revert/userflips.sql are identical. This is on the assumption that the deploy script will be changed (we’re reworking it, remember?), and that the revert script should actually change things back to how they were before. Of course, the original deploy script may not be idempotent – that is, able to be applied multiple times without changing the result beyond the initial application. If it’s not, you will likely need to modify it so that it properly restores things to how they were after the original deploy script was deployed. Or, more simply, it should revert changes back to how they were as-of the deployment of deploy/userflips@v1.0.0-dev2.sql.

Fortunately, our view deploy scripts are already idempotent, thanks to the use of the OR REPLACE expression. No matter how many times a deployment script is run, the end result will be the same instance of the view, with no duplicates or errors.

As a result, there is no need to explicitly add changes. So go ahead. Modify the script to add the twitter column to the view. Make this change to deploy/userflips.sql:

@@ -5,6 +5,6 @@

USE WAREHOUSE &warehouse;
CREATE OR REPLACE VIEW flipr.userflips AS
-SELECT f.id, u.nickname, u.fullname, f.body, f.timestamp
+SELECT SELECT f.id, u.nickname, u.fullname, u.twitter, f.body, f.timestamp
  FROM flipr.users u
  JOIN flipr.flips f ON u.nickname = f.nickname;

Next, modify verify/userflips.sql to check for the twitter column. Here’s the diff:

@@ -1,6 +1,6 @@
 -- Verify flipr:userflips on snowflake

-SELECT id, nickname, fullname, body, timestamp
+SELECT id, nickname, fullname, twitter, body, timestamp
   FROM flipr.userflips
  WHERE FALSE;

Now try a deployment:

> sqitch deploy
Deploying changes to flipr_test
  + userflips .. ok

So, are the changes deployed?

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW VIEWS LIKE 'userflips' IN flipr"
+-------------------------------+-----------+----------+---------------+-------------+--------+---------+---------------------------------------------------------------------+-----------+
| created_on                    | name      | reserved | database_name | schema_name | owner  | comment | text                                                                | is_secure |
|-------------------------------+-----------+----------+---------------+-------------+--------+---------+---------------------------------------------------------------------+-----------|
| 2018-07-27 18:19:29.818 +0000 | USERFLIPS |          | DWHEELER      | FLIPR       | SQITCH |         | CREATE OR REPLACE VIEW flipr.userflips AS                           | false     |
|                               |           |          |               |             |        |         | SELECT f.id, u.nickname, u.fullname, u.twitter, f.body, f.timestamp |           |
|                               |           |          |               |             |        |         |   FROM flipr.users u                                                |           |
|                               |           |          |               |             |        |         |   JOIN flipr.flips f ON u.nickname = f.nickname;                    |           |
+-------------------------------+-----------+----------+---------------+-------------+--------+---------+---------------------------------------------------------------------+-----------+
1 Row(s) produced. Time Elapsed: 0.413s

Awesome, the view now includes the twitter column. But can we revert?

> sqitch revert --to @HEAD^ -y
Reverting changes to hashtags @v1.0.0-dev2 from flipr_test
  - userflips .. ok

Did that work, is the twitter column gone?

> snowsql --accountname example --username movera --dbname flipr -o friendly=false \
  --query "SHOW VIEWS LIKE 'userflips' IN flipr"
+-------------------------------+-----------+----------+---------------+-------------+--------+---------+----------------------------------------------------------+-----------+
| created_on                    | name      | reserved | database_name | schema_name | owner  | comment | text                                                     | is_secure |
|-------------------------------+-----------+----------+---------------+-------------+--------+---------+----------------------------------------------------------+-----------|
| 2018-07-27 18:50:52.064 +0000 | USERFLIPS |          | DWHEELER      | FLIPR       | SQITCH |         | CREATE OR REPLACE VIEW flipr.userflips AS                | false     |
|                               |           |          |               |             |        |         | SELECT f.id, u.nickname, u.fullname, f.body, f.timestamp |           |
|                               |           |          |               |             |        |         |   FROM flipr.users u                                     |           |
|                               |           |          |               |             |        |         |   JOIN flipr.flips f ON u.nickname = f.nickname;         |           |
+-------------------------------+-----------+----------+---------------+-------------+--------+---------+----------------------------------------------------------+-----------+
1 Row(s) produced. Time Elapsed: 0.362s

Yes, it works! Sqitch properly finds the original instances of these changes in the new script files that include tags.

Excellent. Let’s go ahead and commit these changes:

> git add .
> git commit -m 'Add the twitter column to the userflips view.'
[main c004445] Add the twitter column to the userflips view.
 7 files changed, 31 insertions(+), 4 deletions(-)
 create mode 100644 deploy/userflips@v1.0.0-dev2.sql
 create mode 100644 revert/userflips@v1.0.0-dev2.sql
 create mode 100644 verify/userflips@v1.0.0-dev2.sql

More to Come

Sqitch is a work in progress. Better integration with version control systems is planned to make managing idempotent reworkings even easier. Stay tuned.

Author

David E. Wheeler david@justatheory.com

License

Copyright (c) 2012-2024 iovation Inc., David E. Wheeler

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.