I would like to be able to run an ad-hoc Python script that accesses and runs analytics on the model calculated by a dbt run. Are there any best practices around this?
We recently built a tool that caters to exactly this scenario. It makes referencing dbt tables from Python-land easy. It's called dbt-fal.
The idea is that you define the Python scripts you would like to run after your dbt models are built:
# schema.yml
models:
  - name: iris
    meta:
      owner: "@matteo"
      fal:
        scripts:
          - "notify.py"
And then the file notify.py is called if the iris model was run in the last dbt run:
# notify.py
import os

from slack_sdk import WebClient
from slack_sdk.errors import SlackApiError

CHANNEL_ID = os.getenv("SLACK_BOT_CHANNEL")
SLACK_TOKEN = os.getenv("SLACK_BOT_TOKEN")

client = WebClient(token=SLACK_TOKEN)

# `context` is injected by fal and describes the model this script runs for
message_text = f"""Model: {context.current_model.name}
Status: {context.current_model.status}
Owner: {context.current_model.meta['owner']}"""

try:
    response = client.chat_postMessage(
        channel=CHANNEL_ID,
        text=message_text,
    )
except SlackApiError as e:
    # Slack reports the failure reason in the error response
    assert e.response["error"]
Each script is run with a context variable that holds a reference to the model it is running for.
To start using fal, just pip install fal and start writing your Python scripts.
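For the ad-hoc analytics part of the question, a fal script can also load the model's data directly. Here is a minimal sketch, assuming fal's ref helper (which, like context, is injected into the script's namespace) returns the model as a pandas DataFrame; the analyze.py name is made up for illustration, so check the fal docs for the exact API:

# analyze.py - hypothetical fal script for ad-hoc analytics.
# `context` and `ref` are assumed to be injected by fal, with
# ref() returning the model as a pandas DataFrame.
df = ref(context.current_model.name)

# Run whatever analytics you need on the freshly built model.
print(df.describe())

You would register analyze.py under the model's fal scripts the same way notify.py is registered above.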
For production, I'd recommend an orchestration layer such as Apache Airflow.
See this blog post to get started, but essentially you'll have an orchestration DAG (note - not a dbt DAG) that does something like:
dbt run <with args>
-> your Python code
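Here is a minimal sketch of such an orchestration DAG, assuming Airflow 2.x; the dag_id, schedule, and the body of run_analytics are placeholders for illustration:

# dags/dbt_then_analytics.py - minimal Airflow DAG sketch
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator
from airflow.operators.python import PythonOperator


def run_analytics():
    # Placeholder: query the tables the dbt run just built and
    # run your analytics; connection handling is up to you.
    ...


with DAG(
    dag_id="dbt_then_analytics",
    start_date=datetime(2022, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    dbt_run = BashOperator(
        task_id="dbt_run",
        bash_command="dbt run",  # add your args here
    )
    analytics = PythonOperator(
        task_id="analytics",
        python_callable=run_analytics,
    )

    # The dependency arrow gives you the "dbt run -> your Python code"
    # ordering: analytics only runs after the dbt run succeeds.
    dbt_run >> analytics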
Fair warning, though, this can add a bit of complexity to your project.
I suppose you could get a similar effect with a CI/CD tool like GitHub Actions or CircleCI.