#Kubernetes — blogs.social

CIO.com - The voice of IT leadership @cio.com.web.brid.gy

4d

Adopting AI models is easy — scaling them requires shared open standards

The AI market is as competitive as any I have seen. When organizations look to implement the latest AI model or agent platform, many skip over the infrastructure-building required for successful deployment. This instinct is understandable – teams want to move quickly, deliver business impact and avoid falling behind in a fast-paced market. But models and frameworks only deliver value over time if…

Read more →

Scaling with Datadog for SMBs @index.datadog.criticalcloud.ai.ap.brid.gy

Jun 4

How To Monitor Kubernetes Event Metrics in Datadog

Extend Kubernetes event history with Datadog, collect warning events via the Agent, and turn them into dashboards and alerts.

Read more →

Scaling with Datadog for SMBs @index.datadog.criticalcloud.ai.ap.brid.gy

Jun 4

How to Integrate Datadog with Cloud-Native Tools

How to connect Datadog to AWS, Azure, GCP, and Kubernetes; configure metrics, logs, dashboards, alerts, and automation.

Read more →

RCR Wireless News - Intelligence on All Things Wireless | RCR W… @rcrwireless.com.web.brid.gy

May 29

Softbank targets sovereign AI demand in Japan

The company’s AI Data Center GPU Cloud is intended for customers facing restrictions on transferring data overseas and those seeking infrastructure optimized specifically for the Japanese market In sum – what to know: Sovereign AI focus – Softbank said the…

BandaAncha:️ Noticias, guías y foros sobre fibra y móvil [Unoffi… @bandaancha.eu.web.brid.gy

May 25

El Kubernetes heredado de MásMóvil que está detrás de las webs de (casi) todas las marcas de MásOrange

El Kubernetes de MásOrange, ex-MásMóvil

Soy Bocchi94 y me conocerán por hilos como Un año con el router Zowee 5G de Simyo y aportaciones al foro como la imagen que sale al visitar la web fuerademercado.eu .

Hoy les vengo a hablar de la infraestructura que utiliza el grupo MásMóvil desde hace años y lentamente se está extendiendo a todo el grupo MásOrange. En algunos grupos de Telegram y por…

Read more →

United States @computerworld.com.web.brid.gy

May 26

Google adds open source Agent Executor to support AI agents in production

Google has introduced Agent Executor, an open source runtime aimed at helping enterprises run AI agents more reliably at scale, as attention shifts from building agent prototypes to managing the operational challenges of putting them into production.

To address those production-related challenges, the runtime, according to the company, comes with capabilities that are geared towards supporting…

Read more →

CIO.com - The voice of IT leadership @cio.com.web.brid.gy

May 26

Google adds open source Agent Executor to support AI agents in production

Google has introduced Agent Executor, an open source runtime aimed at helping enterprises run AI agents more reliably at scale, as attention shifts from building agent prototypes to managing the operational challenges of putting them into production.

To address those production-related challenges, the runtime, according to the company, comes with capabilities that are geared towards supporting…

Read more →

pandolin.io [Unofficial] @pandolin.io.web.brid.gy

May 25

Wer hält das Ruder

Ein Nachmittag auf dem Canonical Day Frankfurt, ein Pizzakonzern, eine Raumfahrtbehörde und die Frage, die das Wort Kubernetes seit der Antike mit sich trägt.

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 17

Helm is to Kubernetes what apt is to Ubuntu or npm is to Node.js. Instead of maintaining dozens of raw Kubernetes YAML files for every application, you package them into a Helm Chart with templated values. A helm install command deploys the full application stack with environment-specific configuration.

Chart Structure

myapp/
Chart.yaml # Chart metadata (name, version, description)
…

Read more →

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 17

ArgoCD is a Kubernetes controller that continuously watches a Git repository and ensures your cluster's actual state matches what's declared in that repo. If someone manually modifies a Kubernetes resource ("drift"), ArgoCD detects it and either alerts or auto-corrects. This GitOps model makes every deployment auditable, reversible, and reproducible.

GitOps Model

The GitOps workflow:

1.…

Read more →

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 17

Pulumi is an Infrastructure as Code platform that challenges Terraform's HCL-based approach by letting you write infrastructure definitions in languages you already know. Instead of learning a new DSL, you write a TypeScript program that creates an S3 bucket, configures IAM roles, and deploys an EKS cluster, with full access to loops, conditionals, functions, and packages.

How Pulumi Differs…

Read more →

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 17

HashiCorp Vault is the de facto secrets management solution for production infrastructure. Instead of storing database passwords in environment variables, AWS keys in CI/CD pipelines, and API tokens scattered across services, Vault centralizes secrets with fine-grained access control, audit logging, and dynamic credential generation.

Core Concepts

Secrets Engines, plugins that handle…

Read more →

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 17

Traefik (pronounced "traffic") is a reverse proxy built for dynamic environments. Where Nginx requires you to restart the process after every config change, Traefik watches your Docker socket or Kubernetes API and reconfigures itself in real time as containers start and stop. This makes it the natural choice for containerized microservices.

Core Concepts

EntryPoints, ports Traefik listens…

Read more →

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 17

WireGuard is a VPN protocol created by Jason Donenfeld and merged into the Linux kernel in 2020. It replaces OpenVPN and IPsec for most new deployments with a dramatically simpler design: cryptokey routing, a single modern cipher suite (ChaCha20, Poly1305, Curve25519, BLAKE2s), and a stateless design that is auditable, fast, and secure by default.

The Design Philosophy

WireGuard has no…

Read more →

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 17

Inference Endpoint

An inference endpoint is the serving layer for a trained model. After training (or downloading) an LLM, you need infrastructure to accept requests, run the forward pass, and return outputs at scale. That infrastructure, whether it's Hugging Face Inference Endpoints, AWS SageMaker, your own Vllm deployment, or a managed service like OpenAI, is the inference endpoint.

Request Flow

Client…

Read more →

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 17

vLLM (Virtual LLM) is an open-source inference engine from UC Berkeley that dramatically increases the throughput of serving large language models on GPU hardware. It was introduced in 2023 with PagedAttention, a novel memory management technique that treats the KV cache like virtual memory in an OS, reducing waste from up to 60–80% of GPU memory down to under 4%.

The Problem: KV Cache…

Read more →

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Availability Zone

An availability zone (AZ) is an isolated data centre within a cloud region. AZs in the same region share low-latency private network links but use physically separate buildings, power, cooling, and uplinks, so a fault in one AZ does not propagate to the others. AZs are the cloud's unit of fault isolation below the region level.

Why multi-AZ matters

Hardware failures. A single AZ can lose…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Service Mesh

A service mesh is an infrastructure layer that handles service-to-service communication for a microservices application: routing, retries, timeouts, mTLS encryption, authorization, traffic shifting, and observability. The mesh moves these cross-cutting concerns out of each service into a dedicated layer, typically implemented as a sidecar proxy or a kernel module per pod.

How it works

Most…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Helm is the package manager for Kubernetes. It packages a set of Kubernetes manifests into a chart, parameterised by values, that can be installed, upgraded, rolled back, and shared via repositories. Helm charts are the standard distribution format for installable Kubernetes applications.

Core concepts

Chart. A directory of YAML templates, a values.yaml, and metadata (Chart.yaml) describing…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Container Runtime

A container runtime is the software component that actually creates and runs containers from an OCI image. It manages namespaces, cgroups, mount points, capabilities, and the container process lifecycle. Higher-level tools (Docker, Kubernetes) delegate the final "run this container" step to a runtime.

Two levels of runtime

High-level runtime. Pulls images, manages the local image store and…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Podman is an open-source container engine compatible with the Docker CLI but architected without a long-running root daemon. Containers run as a regular user, in a rootless namespace, with a pod abstraction borrowed from Kubernetes. Podman is the default container engine on Red Hat Enterprise Linux and Fedora.

How it differs from Docker

Daemonless. No central dockerd service; each podman…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Docker Compose is a tool for defining and running multi-container applications on a single host using a declarative YAML file. It is the standard way to spin up a local development environment with a database, a cache, the application server, and any other supporting services in one command.

How it works

A docker-compose.yml (or compose.yaml) file declares services, networks, volumes, and…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

OCI

The Open Container Initiative (OCI) is an open governance body, hosted by the Linux Foundation, that maintains the standard specifications for container images, runtimes, and distribution. OCI specs are why a container built with one tool runs on another, and why Docker, Podman, Kubernetes, and cloud platforms can interoperate.

The three specs

Image Specification. Defines the on-disk format…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

containerd is an industry-standard container runtime daemon that manages the complete container lifecycle on a host: image pull and storage, container execution, snapshotting, and networking. It is the runtime that Kubernetes, Docker, and most container platforms call underneath.

How it fits

Container tooling is typically layered. Docker and Kubernetes are high-level interfaces; they delegate…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Serverless is a cloud execution model in which the provider runs and scales the underlying compute, and the customer is billed only for actual usage (request count, execution time, memory). Despite the name, servers still exist; they are simply invisible to the customer. "Serverless" usually refers to functions-as-a-service (FaaS), but increasingly includes serverless databases, queues, and…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Cloud computing is the model of delivering compute, storage, networking, and software as on-demand services over the internet, billed by usage rather than by capital purchase. Cloud providers operate the underlying hardware and abstract it behind APIs, so customers consume resources elastically without managing data centres.

Service models

IaaS (Infrastructure as a Service). Virtual…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Blue-Green Deployment is a release strategy that maintains two identical production environments and switches all traffic from one to the other when a new version is ready. The currently live environment is "blue"; the new version is staged in "green"; when green is verified, a single traffic switch makes green live and blue idle (and available as instant rollback).

How it works

Two complete…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

ArgoCD is an open-source declarative continuous-delivery tool for Kubernetes that runs inside the cluster and applies manifests from a Git repository. It is the most widely deployed GitOps controller in the Kubernetes ecosystem.

How it works

An Application object in ArgoCD points at a Git repo, a path inside it, and a target cluster and namespace. ArgoCD continuously compares the manifests in…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

GitOps is an operational pattern in which the desired state of infrastructure and applications is declared in a Git repository, and an automated controller continuously reconciles the running system toward that state. Git becomes the source of truth; deployments happen by merging pull requests, not by running ad-hoc commands.

Core principles

Declarative. The repo describes the desired…

Sahil Kapoor's Playbook @sahil.sahilkapoor.com.ap.brid.gy

May 12

Prometheus is an open-source time-series database and monitoring system that scrapes metrics from instrumented applications over HTTP, stores them locally, and exposes a powerful query language (PromQL) for dashboards and alerts. It is the de facto standard for cloud-native infrastructure monitoring.

How it works

Applications expose a /metrics endpoint in the Prometheus exposition format. A…