Why Is Data Governance So Important for an Organization
Data is the new oil. If you google “what is data governance?”, you will find many different definitions, each sufficient on its own to explain it. In this article, I would like to share my perspective on data governance. Data…
Modern Data Engineering in Cloud Platform
Data engineering is a common term today; traditionally, the same work was referred to as ETL, BI, or big data development. Modern data engineering on a cloud platform ought to support fully developed and operationalized data pipelines, which should…
How to Set Up SonarQube Locally on Mac
SonarQube is an open platform for continuous inspection of code quality that can locate potential errors in the codebase. It currently supports more than 20 programming languages, including C#, JavaScript, C/C++, TypeScript, Go, Python, etc., and other…
How to Install Hive on Mac with Homebrew
Hive is a data warehouse tool built on Hadoop for data extraction, transformation, and loading. It provides a mechanism to store, query, and analyze large-scale data held in Hadoop. In this article, I will take you through how to Install…
How to Install Hadoop on Mac with Homebrew
In this article, I will walk you step by step through installing Hadoop 3.3.0 on macOS Big Sur (version 11.2.1) with Homebrew, as a single-node cluster in pseudo-distributed mode. Install Hadoop on Mac…
What Is the Difference Between Hot, Warm, and Cold Data Storage?
Data engineering has become very popular and now acts as the backbone of many companies such as Google, Uber, Amazon, and Tesla. Companies are also finding more ways to get meaningful insights from data to prevent threats, understand…
Python Unit Testing With PyTest 101
Unit testing is the practice of writing small test cases that validate the behavior of production code at the level of individual functions and classes, in isolation. In simpler terms, every line of code written in development will…