BigData - NotJustTech

DataOps: Code of Conduct

Kate Belova Apr 18, 2023 0 Comments

I'm often asked what DataOps is and how to be effective in it. DataOps is a relatively new term that refers to the practice of applying DevOps principles to data…

airflow BigData DataOps

HowTo: Apache Airflow with Docker on Linux

Kate Belova Mar 28, 2023 0 Comments

Here's a guide to setting up Apache Airflow with Docker on a Linux machine, with shared DAGs and plugins folder, extra plugins, specific Python packages on Airflow workers, and a…

airflow BigData DataOps

HowTo: install Apache Airflow with Rancher Desktop (MacOS)

Kate Belova Mar 28, 2023 0 Comments

Setting up Apache Airflow on macOS using Rancher Desktop involves several steps. In this guide, we'll walk you through installing Rancher Desktop, deploying a Kubernetes cluster, and deploying Airflow using…

BigData

HowTo: Run Cloudera Quickstart in Docker

Kate Belova Jan 31, 2019 0 Comments

The best way to familiarize yourself with the Hadoop ecosystem or to do proof of concept: is to play with it in a sandbox. Cloudera provides 2 Quick Start options:…

BigData

How to create avro based table in Impala

Kate Belova Nov 20, 2018 0 Comments

Consider the following situation: A bundle of .avro files is stored on HDFS. They need to be converted to Impala tables. Schemas are not provided with files, at least not…

DataOps: Code of Conduct

HowTo: Apache Airflow with Docker on Linux

HowTo: install Apache Airflow with Rancher Desktop (MacOS)

HowTo: Run Cloudera Quickstart in Docker

How to create avro based table in Impala

You Missed

Expect unexpected

DataOps: Code of Conduct

HowTo: Apache Airflow with Docker on Linux

HowTo: install Apache Airflow with Rancher Desktop (MacOS)