Airflow

Publisher: pcloudhosting

(1 rating)

Version 3.2.2 + Free Support on Ubuntu 24.04

Apache Airflow is an open-source workflow orchestration platform for building, scheduling, automating, and monitoring data pipelines and task-based workflows. Workflows are defined as Python-based DAGs (Directed Acyclic Graphs), and the platform provides a web-based UI and an API.

Features of Apache Airflow:
  • Web-based UI for monitoring and managing workflows.
  • Supports DAG (Directed Acyclic Graph) based workflow design.
  • Task scheduling, retries, and dependency management.
  • Highly scalable distributed task execution architecture.
  • Built-in logging, monitoring, and alerting system.
  • REST API support for automation and integrations.
  • Supports plugins and an extensible provider ecosystem.
  • Integration with databases, cloud platforms, and DevOps tools.
  • Role-based authentication and access control.
  • Widely used for ETL, data pipelines, and workflow automation.
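
As a rough illustration of the DAG and dependency-management features listed above, the sketch below uses only the Python standard library (graphlib), not Airflow's own API. The task names and the `execution_order` helper are hypothetical. It shows how declared dependencies determine a valid run order, and why a cycle has to be rejected.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical ETL workflow: each task maps to the set of tasks it depends on.
workflow = {
    "extract": set(),
    "transform": {"extract"},
    "validate": {"transform"},
    "load": {"transform", "validate"},
}


def execution_order(dependencies):
    """Return one valid run order, or raise ValueError if the graph has a cycle."""
    try:
        # static_order() yields every task after all of its dependencies.
        return list(TopologicalSorter(dependencies).static_order())
    except CycleError as exc:
        # args[1] holds the list of nodes that form the detected cycle.
        raise ValueError(f"not a DAG, cycle found: {exc.args[1]}") from exc


print(execution_order(workflow))
# ['extract', 'transform', 'validate', 'load']
```

In a real deployment the same idea is expressed as an Airflow DAG file; this sketch only mirrors the ordering logic.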

Usage Instructions:

To check the Apache Airflow version:
$ sudo su
$ cd /opt
$ cd ~/airflow
$ source airflow_venv/bin/activate
$ airflow version

To create an Airflow admin user:

$ airflow users create \
  --username newuser \
  --firstname New \
  --lastname User \
  --role Admin \
  --email newuser@example.com

Enter password when prompted.

To start the Apache Airflow API server:

$ airflow api-server --port 8080

To view the pre-created login details:

$ cat /var/airflow/credentials.txt

To access the Apache Airflow web interface, open http://your-server-ip:8080 in a browser, replacing your-server-ip with the address of your server.
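
Once the API server is running, a quick scripted check can confirm that it responds before you open the browser. The following is a minimal sketch using only the Python standard library. It assumes the Airflow 3 health endpoint is served at /api/v2/monitor/health on the same port and is reachable without credentials, so verify this against your installation. The `health_url` and `fetch_health` helpers are hypothetical names.

```python
import json
import sys
import urllib.request


def health_url(host, port=8080):
    """Build the health-check URL (the endpoint path is an assumption for Airflow 3)."""
    return f"http://{host}:{port}/api/v2/monitor/health"


def fetch_health(host, port=8080, timeout=5):
    """Request the health endpoint and return the decoded JSON body."""
    with urllib.request.urlopen(health_url(host, port), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Only contacts the server when a host is passed on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    print(json.dumps(fetch_health(sys.argv[1]), indent=2))
```

Saved under a name of your choice (for example check_health.py), it could be run as `python3 check_health.py your-server-ip`.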

Disclaimer: Apache Airflow is a powerful workflow orchestration tool. For production use, set up proper authentication, secure the database configuration, tune resources, and validate your DAGs.