 

How to use connection hooks with `KubernetesPodOperator` as environment variables on Apache Airflow on GCP Cloud Composer

I'd like to use connections saved in Airflow in a task that uses the KubernetesPodOperator.

When developing the image, I used environment variables to pass database connection information down to the container, but the production environment has the databases saved as connection hooks.

What is the best way to extract the database connection information and pass it down to the container?

# Import path for Airflow 1.10 / Cloud Composer; newer releases import the
# operator from the cncf.kubernetes provider package instead.
from airflow.contrib.operators.kubernetes_pod_operator import KubernetesPodOperator

env_vars = {'database_usr': 'xxx', 'database_pas': 'xxx'}

KubernetesPodOperator(
    dag=dag,
    task_id="example-task",
    name="example-task",
    namespace="default",
    image="eu.gcr.io/repo/image:tag",
    image_pull_policy="Always",
    arguments=["-v", "image-command", "image-arg"],
    env_vars=env_vars,
)
wab asked Mar 16 '20 at 17:03


People also ask

How does Kubernetes performance work in Cloud Composer?

When you create a Cloud Composer environment, you specify its performance parameters, including performance parameters for the environment's cluster. Launching Kubernetes pods into the environment cluster can cause competition for cluster resources, such as CPU or memory.

What is the difference between kubernetespodoperator and Google Kubernetes Engine operators?

KubernetesPodOperator launches Kubernetes pods in your environment's cluster. In comparison, Google Kubernetes Engine operators run Kubernetes pods in a specified cluster, which can be a separate cluster that is not related to your environment. You can also create and delete clusters using Google Kubernetes Engine operators.
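To illustrate the difference, here is a minimal sketch, assuming the Airflow 1.10-era contrib import paths that Cloud Composer used at the time; the project, location, and cluster names are placeholders:

from airflow.contrib.operators.kubernetes_pod_operator import KubernetesPodOperator
from airflow.contrib.operators.gcp_container_operator import GKEPodOperator

# Runs a pod inside the Composer environment's own GKE cluster.
in_environment = KubernetesPodOperator(
    task_id="pod-in-composer-cluster",
    name="pod-in-composer-cluster",
    namespace="default",
    image="eu.gcr.io/repo/image:tag",
)

# Runs a pod in a separate, explicitly named GKE cluster.
in_other_cluster = GKEPodOperator(
    task_id="pod-in-other-cluster",
    name="pod-in-other-cluster",
    project_id="my-project",          # placeholder
    location="europe-west1-b",        # placeholder
    cluster_name="my-gke-cluster",    # placeholder
    namespace="default",
    image="eu.gcr.io/repo/image:tag",
)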

How do I set up Kubernetes secrets in airflow?

The first secret, airflow-secrets, is set to a Kubernetes environment variable named SQL_CONN (as opposed to an Airflow or Cloud Composer environment variable). The second secret, service-account, mounts service-account.json, a file with a service account token, to /var/secrets/google.
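As a rough sketch of that setup (assuming the Secret helper shipped with the Kubernetes pod operator support; the secret names follow the example above, and sql_alchemy_conn is an assumed key name):

from airflow.contrib.kubernetes.secret import Secret  # airflow.kubernetes.secret in newer releases
from airflow.contrib.operators.kubernetes_pod_operator import KubernetesPodOperator

# Expose a key of the Kubernetes Secret `airflow-secrets` as the SQL_CONN
# environment variable inside the pod.
secret_env = Secret(
    deploy_type="env",
    deploy_target="SQL_CONN",
    secret="airflow-secrets",
    key="sql_alchemy_conn",  # assumed key name
)

# Mount the `service-account` Secret so service-account.json is available
# under /var/secrets/google inside the pod.
secret_volume = Secret(
    deploy_type="volume",
    deploy_target="/var/secrets/google",
    secret="service-account",
    key="service-account.json",
)

KubernetesPodOperator(
    task_id="example-with-secrets",
    name="example-with-secrets",
    namespace="default",
    image="eu.gcr.io/repo/image:tag",
    secrets=[secret_env, secret_volume],
)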

Why can't I launch Kubernetes pods into the Environment cluster?

Launching Kubernetes pods into the environment cluster can cause competition for cluster resources, such as CPU or memory. Because the Airflow schedulers and workers run in the same GKE cluster, they won't work properly if this competition results in resource starvation.


1 Answer

My current solution is to grab the variables from the connection using BaseHook:

from airflow.hooks.base_hook import BaseHook


def connection_to_dict(connection_id):
    """Returns connection params from Airflow as a dictionary.

    Parameters
    ----------
    connection_id : str
        Name of the connection in Airflow, e.g. `mysql_default`

    Returns
    -------
    dict
        Unencrypted values.
    """
    conn_obj = BaseHook.get_connection(connection_id)
    # The Connection model's attributes (host, login, port, ...) as a dict.
    d = conn_obj.__dict__
    # Passwords are stored encrypted; replace with the decrypted value.
    if d.get('is_encrypted'):
        d['password'] = conn_obj.get_password()
    return d

and then pass those values as environment variables to the KubernetesPodOperator.
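For example, something along these lines, with mysql_default as a hypothetical connection id and the env var names taken from the question:

conn = connection_to_dict("mysql_default")  # hypothetical connection id

KubernetesPodOperator(
    dag=dag,
    task_id="example-task",
    name="example-task",
    namespace="default",
    image="eu.gcr.io/repo/image:tag",
    image_pull_policy="Always",
    arguments=["-v", "image-command", "image-arg"],
    env_vars={
        # The dict keys mirror the Connection model's columns; depending on the
        # Airflow version the raw password column may be stored as `_password`,
        # while the helper above adds a decrypted `password` key for encrypted values.
        "database_usr": conn["login"],
        "database_pas": conn.get("password") or conn.get("_password"),
        "database_host": conn["host"],
    },
)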

wab answered Oct 12 '22 at 22:10