Kate Belova - NotJustTech

Expect unexpected

Kate Belova May 16, 2023 0 Comments

It was a super early and gloomy autumn morning, and Mother was packing my things for a school trip: a day of forest camping. I was still only half awake…

BigData DataOps self-developing

DataOps: Code of Conduct

Kate Belova Apr 18, 2023 0 Comments

I'm often asked what DataOps is and how to be effective in it. DataOps is a relatively new term that refers to the practice of applying DevOps principles to data…

airflow BigData DataOps

HowTo: Apache Airflow with Docker on Linux

Kate Belova Mar 28, 2023 0 Comments

Here's a guide to setting up Apache Airflow with Docker on a Linux machine, with shared DAGs and plugins folder, extra plugins, specific Python packages on Airflow workers, and a…

airflow BigData DataOps

HowTo: install Apache Airflow with Rancher Desktop (macOS)

Kate Belova Mar 28, 2023 0 Comments

Setting up Apache Airflow on macOS using Rancher Desktop involves several steps. In this guide, we'll walk you through installing Rancher Desktop, deploying a Kubernetes cluster, and deploying Airflow using…

self-developing technical_communication

Mastering Technical Communication: Overcoming Challenges

Kate Belova Mar 8, 2023 0 Comments

Have you ever felt like banging your head against a wall of misunderstanding, or struggled to find the words to describe a technical solution and prove your point of view? Oh my,…

Testing

Testing Airflow data pipelines with Catcher end to end

Kate Belova May 20, 2020 0 Comments

This article is about writing end-to-end tests for a data pipeline. It will cover Airflow, as one of the most popular data pipeline schedulers nowadays, and one of the…

BigData

HowTo: Run Cloudera Quickstart in Docker

Kate Belova Jan 31, 2019 0 Comments

The best way to familiarize yourself with the Hadoop ecosystem, or to do a proof of concept, is to play with it in a sandbox. Cloudera provides 2 Quick Start options:…

BigData

How to create an Avro-based table in Impala

Kate Belova Nov 20, 2018 0 Comments

Consider the following situation: a bundle of .avro files is stored on HDFS. They need to be converted to Impala tables. Schemas are not provided with the files, at least not…
