The role of .gitignore file is to ignore the changes of the specified file or folder. Say for example if you don’t want to track certain files and their changes throughout the life cycle of the project (example below), we […]
Hive is a data warehouse tool based on Hadoop for data extraction, transformation, and loading. This is a mechanism that can store, query, and analyze large-scale data stored in Hadoop. In this article, I will take you through how to
In this article, I will take you through step by step on how to easily install Hadoop 3.3.0 on a mac OS – Big Sur (version 11.2.1) with HomeBrew for a single node cluster in pseudo-distributed mode. Install Hadoop on
At present Data Engineering has become so popular and it is acting as the backbone for many companies like Google, Uber, Amazon, Tesla, etc. Also, companies are finding more ways to get meaningful insights from the data to prevent threats,
Recently, one of my good friend asked me privately how did I survive 15 years in the IT industry. So I thought to give my learnings and advice for young engineers. Who am I I started my IT career at
Foggy on DevOps and Cloud jargon? In this article, I have covered 70+ DevOps and Cloud computing IT glossary from A to Z. It has definitions for most popular DevOps/cloud computing terms and acronyms. A Availability Zones Availability Zones are unique
Summary: Kubernetes is an open-source version of Borg. It is a container orchestration platform that automates the deployment, scaling, management, and networking of containers. It has been open-sourced since 2014 and primarily developed by Google using the Go Programming Language.
Unit Testing is the practice of writing a series of small unit test cases and validating the behavior of production code at the level of functions & classes in isolation. In simpler terms, every line of code written in development