It is not clear to me if the aws mwaa documentation says the following "With Amazon Managed Workflows for Apache Airflow (MWAA) you pay only for what you use. There are no minimum fees or upfront commitments. You pay for the time your Airflow Environment runs..", why there is a cost for environment 24/7? If I want to run ETL job only once per day, do I have to pay for the environment 24h?
MWAA can not be paused/stopped in the way that for example ec2 instances can. Even if there are no jobs active or running, the MWAA instance is still running and generating charges.
why there is a cost for environment 24/7?
MWAA is not a stateless service. It is a service for stateful Airflow deployments, which means it requires continuous compute resources to run.
Amazon Managed Workflows for Apache Airflow Pricing
If I want to run ETL job only once per day, do I have to pay for the environment 24h?
Yes. When your ETL job is not running, Airflow is still deployed and running.
MWAA can not be paused/stopped in the way that for example ec2 instances can. Even if there are no jobs active or running, the MWAA instance is still running and generating charges.
Just to elaborate on the accepted answer, which is correct...
The basic functionality of Apache Airflow provides 2 components: A web server that Data Engineers can interact with to do things like:
- review what job instances were run and their status
- See how the DAGs are organized
- review errors
- execute ad-hoc runs of a DAG
In addition, Apache Airflow has a scheduler that spawns containers using AWS Fargate. It is these containers that execute your ETL jobs
Both of these components of the applications are running 24x7 and you need to pay for the resources required to run these components.
The actual Fargate containers executing your ETL jobs only exist for the duration of each ETL job execution. So, you only pay for the execution time of that ETL job.
© 2022 - 2024 — McMap. All rights reserved.
per hour
. – Typify