Hello, World!

  • Thanks for visiting this SRE and DevOps knowledge base!
  • We sincerely hope it helps organizations achieve greater efficiency and reliability in their software development and operations.
  • Let’s make the world reliable, sustainable, secure and efficient together!

A system architecture is the conceptual model that defines the structure, behavior, and other views of a system. Architecture helps stakeholders understand and analyze how the system will achieve essential qualities such as modifiability, availability, and security.


AIOps is the application of artificial intelligence (AI) capabilities, such as natural language processing and machine learning models, to automate and streamline operational workflows. AIOps combines big data and machine learning to automate IT operations processes, including event correlation, anomaly detection and causality determination.


Ansible is a suite of software tools that enables infrastructure as code. It is open-source and the suite includes software provisioning, configuration management, and application deployment functionality.


Big data is a combination of structured, semi-structured and unstructured data that organizations collect, analyze and mine for information and insights. It’s used in machine learning projects, predictive modeling and other advanced analytics applications.

Big Data

Bash is a Unix shell and command language written by Brian Fox for the GNU Project as a free software replacement for the Bourne shell. The shell’s name is an acronym for Bourne-Again SHell, a pun on the name of the Bourne shell that it replaces and the notion of being “born again”.


Cloud computing is the on-demand delivery of IT resources over the Internet, including data storage, servers, databases, networking, and software, with pay-as-you-go pricing.


Containerization is one of the technologies that allows developers to build cloud-native applications. It works with other cloud-native technologies, such as service mesh and APIs, to allow microservices to work cohesively in a cloud-native application. A container is a standard unit of software that packages up code and all its dependencies so the application runs quickly and reliably from one computing environment to another.


A content delivery network (CDN) is a group of geographically distributed servers that speed up the delivery of web content by bringing it closer to where users are.


Cloud native is the software approach of building, deploying, and managing modern applications in cloud computing environments. Modern companies want to build highly scalable, flexible, and resilient applications that they can update quickly to meet customer demands.

Cloud Native

CI/CD or CICD is the combined practices of continuous integration and continuous delivery or, less often, continuous deployment. They are sometimes referred to collectively as continuous development or continuous software development.


crictl is a command-line interface for CRI-compatible container runtimes. You can use it to inspect and debug container runtimes and applications on a Kubernetes node. crictl and its source are hosted in the cri-tools repository.


Colima means Containers in Lima, where Lima is also known as Linux on Mac.


Cloud services are infrastructure, platforms, or software that are hosted by third-party providers and made available to users through the internet.

Cloud Service

Docker is a set of platform-as-a-service (PaaS) products that use OS-level virtualization to deliver software in packages called containers.


DevOps is a combination of software development (dev) and operations (ops). It is defined as a software engineering methodology which aims to integrate the work of development teams and operations teams by facilitating a culture of collaboration and shared responsibility.


DevSecOps stands for development, security, and operations. It’s an approach to culture, automation, and platform design that integrates security as a shared responsibility throughout the entire IT lifecycle.


The Domain Name System is a hierarchical and distributed naming system for computers, services, and other resources in the Internet or other Internet Protocol networks.
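
To illustrate the hierarchical part of that definition, here is a minimal sketch that lists the zones a name belongs to, from the root down to the full name. The function name `zone_chain` is an assumption made for this example; it only manipulates strings and performs no real DNS lookup.

```python
def zone_chain(name):
    """Return the zones containing `name`, ordered from the root downward."""
    # Drop an optional trailing dot, then split the name into its labels.
    labels = name.rstrip(".").split(".")
    chain = ["."]  # the root zone sits at the top of the hierarchy
    # Walk from the rightmost label (the top-level domain) to the leftmost.
    for start in range(len(labels) - 1, -1, -1):
        chain.append(".".join(labels[start:]) + ".")
    return chain


print(zone_chain("www.example.com"))
# ['.', 'com.', 'example.com.', 'www.example.com.']
```

Each entry in the list is delegated by the one before it, which is what makes the naming system both hierarchical and distributed.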


A distributed system is a collection of independent components located on different machines that share messages with each other in order to achieve common goals.

Distributed Systems

Debugging is the process of finding and fixing errors or bugs in the source code of any software.


Software engineering is the branch of computer science that deals with the design, development, testing, and maintenance of software applications.


FinOps is an evolving cloud financial management discipline and cultural practice that enables organizations to get maximum business value by helping engineering, finance, technology and business teams to collaborate on data-driven spending decisions.


GitOps uses Git repositories as a single source of truth to deliver infrastructure as code. Submitted code goes through the CI process, while the CD process checks and applies requirements for things like security, infrastructure as code, or any other boundaries set for the application framework.


Go is a statically typed, compiled high-level programming language designed at Google by Robert Griesemer, Rob Pike, and Ken Thompson.


Apache Hadoop is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation.


The Hypertext Transfer Protocol is an application layer protocol in the Internet protocol suite model for distributed, collaborative, hypermedia information systems.


Infrastructure as Code (IaC) is the managing and provisioning of infrastructure through code instead of through manual processes.


Jenkins is an open source automation server. It helps automate the parts of software development related to building, testing, and deploying, facilitating continuous integration and continuous delivery.


Kubernetes is an open-source container orchestration system for automating software deployment, scaling, and management.


Linux is a family of open-source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991, by Linus Torvalds.


Logging is the act of keeping a log of events that occur in a computer system, such as problems, errors or just information on current operations.
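
As a small illustration, the sketch below uses Python's standard `logging` module to record events at different severity levels. The logger name `demo.service` and the messages are made up for this example, and the log is written to an in-memory buffer rather than a file so the result is easy to inspect.

```python
import io
import logging

# Collect log output in memory; a real service would usually log to a file or stderr.
stream = io.StringIO()
handler = logging.StreamHandler(stream)
handler.setFormatter(logging.Formatter("%(levelname)s %(name)s: %(message)s"))

logger = logging.getLogger("demo.service")  # hypothetical logger name
logger.setLevel(logging.INFO)               # ignore anything below INFO
logger.addHandler(handler)
logger.propagate = False                    # keep records out of the root logger

logger.info("service started")              # information on current operations
logger.debug("cache warmed")                # dropped: DEBUG is below INFO
logger.error("disk usage at %d%%", 91)      # a problem worth recording

log_text = stream.getvalue()
print(log_text, end="")
```

Only the INFO and ERROR events end up in the log, because the logger's level filters out the DEBUG message.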


A load balancer is a device that acts as a reverse proxy and distributes network or application traffic across a number of servers. Load balancers are used to increase capacity (concurrent users) and reliability of applications.
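
One simple way to distribute traffic is round robin, where each new request goes to the next server in a fixed rotation. The sketch below shows that idea only; the class name `RoundRobinBalancer` and the backend names are assumptions for this example, and real load balancers also handle health checks, weights, and connection state.

```python
from itertools import cycle


class RoundRobinBalancer:
    """Hand out backends in a fixed, repeating order."""

    def __init__(self, backends):
        backends = list(backends)
        if not backends:
            raise ValueError("at least one backend is required")
        self._rotation = cycle(backends)

    def pick(self):
        # Return the next backend in the rotation.
        return next(self._rotation)


balancer = RoundRobinBalancer(["app-1", "app-2", "app-3"])  # hypothetical backends
assignments = [balancer.pick() for _ in range(5)]
print(assignments)
# ['app-1', 'app-2', 'app-3', 'app-1', 'app-2']
```

Five requests are spread over three backends, so no single server receives all of the load.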


Lua is a lightweight, high-level, multi-paradigm programming language designed primarily for embedded use in applications.


A message queue is a form of asynchronous service-to-service communication used in serverless and microservices architectures.
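
The asynchronous part can be sketched with Python's standard `queue` and `threading` modules: a producer puts messages on the queue and carries on, while a consumer processes them on its own schedule. This is an in-process stand-in, not a real message broker, and the message texts and the `worker` function are assumptions for this example.

```python
import queue
import threading


def worker(inbox, results):
    """Consume messages until the None sentinel arrives."""
    while True:
        message = inbox.get()
        if message is None:      # sentinel: the producer is done
            inbox.task_done()
            break
        results.append(message.upper())  # stand-in for real processing
        inbox.task_done()


messages = queue.Queue()
processed = []
consumer = threading.Thread(target=worker, args=(messages, processed))
consumer.start()

# The producer enqueues work without waiting for it to be handled.
for text in ["order created", "order paid"]:
    messages.put(text)
messages.put(None)

messages.join()   # block until every queued item has been processed
consumer.join()
print(processed)
# ['ORDER CREATED', 'ORDER PAID']
```

With a single consumer the messages are handled in the order they were queued; the producer and consumer never call each other directly.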


A monitoring system is software that helps system administrators monitor their infrastructure. These tools monitor system devices, traffic, and applications, and sound the alarm in the event of malfunctions and disruptions.


A computer network is a set of computers sharing resources located on or provided by network nodes.


NoSQL stands for “not only SQL”, not “no SQL at all”.


An operating system (OS) is the program that, after being initially loaded into the computer by a boot program, manages all of the other application programs in a computer.


Observability is a measure of how well internal states of a system can be inferred from knowledge of its external outputs.


OpenShift supports the application development lifecycle, provisioning and managing container images and workloads into a total ecosystem.


OpenStack is an open source cloud computing infrastructure software project and is one of the three most active open source projects in the world.


Performance tuning improves the price to performance ratio for a system or set of services by reallocating the available computing, network, or storage resources.


Python is a high-level, general-purpose programming language.


Perl is a family of two high-level, general-purpose, interpreted, dynamic programming languages.


PHP is a general-purpose scripting language geared towards web development.


Puppet is a software configuration management tool which includes its own declarative language to describe system configuration.


Podman (the POD MANager) is a tool for managing containers and images, volumes mounted into those containers, and pods made from groups of containers.


Ruby is a dynamic, open source programming language with a focus on simplicity and productivity.


Redis is an open-source in-memory data store, used as a distributed key–value database, cache, and message broker, with optional durability.


Site reliability engineering is a set of principles and practices that applies aspects of software engineering to IT infrastructure and operations.


A shell is a program that takes commands from the keyboard and gives them to the operating system to perform.


System design is the process of defining the architecture, components, interfaces, and modules of a system, along with the data needed to implement them.

System Design

Serverless computing is a cloud computing execution model in which the cloud provider allocates machine resources on demand, taking care of the servers on behalf of their customers.


Zero Trust is a security framework requiring all users, whether in or outside the organization’s network, to be authenticated, authorized, and continuously validated for security configuration and posture before being granted or keeping access to applications and data.