Airflow Questions

2

I am new to Docker, so I followed the official guide for running Airflow with it (https://airflow.apache.org/docs/apache-airflow/stable/start/docker.html). When I start Airflow all services w...
Impish asked 26/12, 2021 at 20:48

3

I am running an Airflow pipeline. The code looks fine, but I am getting airflow.exceptions.AirflowException: Cycle detected in DAG. Faulty task: Can you please help to resolve this...
Monzon asked 17/8, 2017 at 9:51

2

Is it possible to run an Airflow task only when a specific event occurs, such as a file being dropped into a specific S3 bucket? Something similar to AWS Lambda events. There is S3KeySensor but ...
Meridel asked 4/11, 2019 at 13:52

4

In my DAG, I've got a task flow like this: ... >> EmrAddStepsOperator >> EmrStepSensor A success of EmrAddStepsOperator means, "I was able to tell EMR to start." A failure of EmrStep...
Chariot asked 14/8, 2019 at 17:16

5

I am using the Airflow CLI's backfill command to manually run some backfill jobs: airflow backfill mydag -i -s 2018-01-11T16-00-00 -e 2018-01-31T23-00-00 --reset_dagruns --rerun_failed_tasks The d...
Mouldy asked 20/9, 2018 at 15:47

2

My company uses git-sync to sync zipped DAGs to Airflow. We use the Airflow Helm charts to deploy Airflow. I wonder if I can make Airflow pick up zipped DAGs only in a specific folder such as dags-dev i...
Jilolo asked 27/8, 2020 at 4:28

9

Solved

I have a DAG that we'll deploy to multiple different Airflow instances, and in our airflow.cfg we have dags_are_paused_at_creation = True, but for this specific DAG we want it to be turned on without...
Liberal asked 5/6, 2017 at 1:01

0

I am deploying the latest Airflow release using Docker. Everything works fine until I point the database to an external Postgres database. As soon as the database is pointed to an external database the...
Humbug asked 15/2, 2023 at 20:01

4

I am using the official docker-compose file of airflow to spin it up. Some of my containers seem unhealthy: 34d8698d67e7 apache/airflow:2.0.2 "/usr/bin/dumb-init …" 31 minutes ago Up 28 m...
Brazell asked 11/5, 2021 at 13:16

13

Solved

I am using Airflow for my data pipeline project. I have configured my project in Airflow and started the Airflow server as a backend process using the following command: airflow webserver -p 8080 -D True...
Hither asked 22/8, 2016 at 7:19

3

Solved

I am getting started with Airflow and trying to use the KubernetesPodOperator, but I am having trouble with downloading images from private registries. I did some research but I couldn't find an an...
Epigone asked 21/8, 2018 at 20:58

4

I have Airflow jobs that run fine on an EMR cluster. Say I have 4 Airflow jobs that require an EMR cluster for, say, 20 minutes to complete their tasks. Why not...
Beaverboard asked 18/3, 2019 at 18:15

8

We're running into a permission error when using Airflow, receiving the following error: PermissionError: [Errno 13] Permission denied: '/usr/local/airflow/logs/scheduler/2019-12-18/../../../../ho...
Hankypanky asked 19/12, 2019 at 15:43

3

I have a DAG in Airflow that currently runs every hour (@hourly). Is it possible to have it run every 5 minutes?
Klayman asked 15/8, 2017 at 8:39

3

Solved

I am working on a project that grabs a set of input data from AWS S3, pre-processes and divvies it up, spins up 10K batch containers to process the divvied data in parallel on AWS Batch, post-aggre...
Plenitude asked 22/9, 2020 at 19:58

5

Solved

I made a small change to an existing workflow, and it has broken airflow. Here is the code: dag_name = platform + "_" + report['table'] dag = DAG( dag_name, catchup=True, default_args=default_...
Gregory asked 16/6, 2019 at 5:23

9

I am a new user of Airflow, Airbnb's open source workflow/data pipeline software. There are dozens of default example DAGs after the web UI is started. I tried many ways to remove these DAGs, but I'v...
Nipha asked 14/4, 2017 at 11:42

2

Solved

I have a little problem: I want to do the typical conditional, like setting_x = Variable.get('setting_x'); variable = setting_x if setting_x else 0. But since the Airflow model throws an exception...
Tyburn asked 14/6, 2018 at 17:00

1

I am using Docker Compose with Bitnami's Airflow image as well as MinIO. I can get Airflow to talk to AWS S3, but when I try to substitute MinIO I get this error: File "/opt/bitnami/air...
Purple asked 14/6, 2022 at 5:22

2

We have an Airflow deployment with Celery executors. Many of our DAGs require a local processing step of some file in a BashOperator or PythonOperator. However, in our understanding the tasks of ...
Semiramis asked 12/2, 2018 at 21:40

6

Solved

I need to reference a variable that's returned by a BashOperator. In my task_archive_s3_file, I need to get the filename from get_s3_file. The task simply prints {{ ti.xcom_pull(task_ids=submit_fil...
Gynarchy asked 5/9, 2017 at 15:59

4

Solved

I am using Airflow to trigger jobs on Databricks. I have many DAGs running Databricks jobs and I wish to use only one cluster instead of many, since to my understanding this will reduce th...
Robichaux asked 6/2, 2019 at 20:01

4

Solved

I'm trying to access external files in an Airflow task to read some SQL, and I'm getting "file not found". Has anyone come across this? from airflow import DAG from airflow.operators.python_operato...
Casanova asked 23/3, 2017 at 17:27

5

I haven't been able to move common code outside of the dag directory that airflow uses. I've looked in the airflow source and found imp.load_source. Is it possible to use imp.load_source to load ...
Spaceport asked 13/7, 2017 at 2:58

3

Solved

My questions: What is a good directory structure for organizing your DAGs and tasks? (The example DAGs show only a couple of tasks.) I currently have my DAGs at the root of the dags folder a...
Snuff asked 7/6, 2017 at 23:47
