Adopting AI models is easy — scaling them requires shared open standards

The AI market is as competitive as any I have seen. When organizations look to implement the latest AI model or agent platform, many skip over the infrastructure-building required for successful deployment. This instinct is understandable – teams want to move quickly, deliver business impact and avoid falling behind in a fast-paced market. But models and frameworks only deliver value over time if…

Read more →
El Kubernetes heredado de MásMóvil que está detrás de las webs de (casi) todas las marcas de MásOrange

El Kubernetes de MásOrange, ex-MásMóvil

Soy Bocchi94 y me conocerán por hilos como Un año con el router Zowee 5G de Simyo y aportaciones al foro como la imagen que sale al visitar la web fuerademercado.eu .

Hoy les vengo a hablar de la infraestructura que utiliza el grupo MásMóvil desde hace años y lentamente se está extendiendo a todo el grupo MásOrange. En algunos grupos de Telegram y por…

Read more →
Google adds open source Agent Executor to support AI agents in production

Google has introduced Agent Executor, an open source runtime aimed at helping enterprises run AI agents more reliably at scale, as attention shifts from building agent prototypes to managing the operational challenges of putting them into production.

To address those production-related challenges, the runtime, according to the company, comes with capabilities that are geared towards supporting…

Read more →
Google adds open source Agent Executor to support AI agents in production

Google has introduced Agent Executor, an open source runtime aimed at helping enterprises run AI agents more reliably at scale, as attention shifts from building agent prototypes to managing the operational challenges of putting them into production.

To address those production-related challenges, the runtime, according to the company, comes with capabilities that are geared towards supporting…

Read more →

Helm is to Kubernetes what apt is to Ubuntu or npm is to Node.js. Instead of maintaining dozens of raw Kubernetes YAML files for every application, you package them into a Helm Chart with templated values. A helm install command deploys the full application stack with environment-specific configuration.

Chart Structure

myapp/
Chart.yaml # Chart metadata (name, version, description)

Read more →

ArgoCD is a Kubernetes controller that continuously watches a Git repository and ensures your cluster's actual state matches what's declared in that repo. If someone manually modifies a Kubernetes resource ("drift"), ArgoCD detects it and either alerts or auto-corrects. This GitOps model makes every deployment auditable, reversible, and reproducible.

GitOps Model

The GitOps workflow:

1.…

Read more →

Pulumi is an Infrastructure as Code platform that challenges Terraform's HCL-based approach by letting you write infrastructure definitions in languages you already know. Instead of learning a new DSL, you write a TypeScript program that creates an S3 bucket, configures IAM roles, and deploys an EKS cluster, with full access to loops, conditionals, functions, and packages.

How Pulumi Differs…

Read more →

HashiCorp Vault is the de facto secrets management solution for production infrastructure. Instead of storing database passwords in environment variables, AWS keys in CI/CD pipelines, and API tokens scattered across services, Vault centralizes secrets with fine-grained access control, audit logging, and dynamic credential generation.

Core Concepts

  • Secrets Engines, plugins that handle…
Read more →

Traefik (pronounced "traffic") is a reverse proxy built for dynamic environments. Where Nginx requires you to restart the process after every config change, Traefik watches your Docker socket or Kubernetes API and reconfigures itself in real time as containers start and stop. This makes it the natural choice for containerized microservices.

Core Concepts

  • EntryPoints, ports Traefik listens…
Read more →

WireGuard is a VPN protocol created by Jason Donenfeld and merged into the Linux kernel in 2020. It replaces OpenVPN and IPsec for most new deployments with a dramatically simpler design: cryptokey routing, a single modern cipher suite (ChaCha20, Poly1305, Curve25519, BLAKE2s), and a stateless design that is auditable, fast, and secure by default.

The Design Philosophy

WireGuard has no…

Read more →
Inference Endpoint

An inference endpoint is the serving layer for a trained model. After training (or downloading) an LLM, you need infrastructure to accept requests, run the forward pass, and return outputs at scale. That infrastructure, whether it's Hugging Face Inference Endpoints, AWS SageMaker, your own Vllm deployment, or a managed service like OpenAI, is the inference endpoint.

Request Flow

  1. Client…
Read more →

vLLM (Virtual LLM) is an open-source inference engine from UC Berkeley that dramatically increases the throughput of serving large language models on GPU hardware. It was introduced in 2023 with PagedAttention, a novel memory management technique that treats the KV cache like virtual memory in an OS, reducing waste from up to 60–80% of GPU memory down to under 4%.

The Problem: KV Cache…

Read more →
Availability Zone

An availability zone (AZ) is an isolated data centre within a cloud region. AZs in the same region share low-latency private network links but use physically separate buildings, power, cooling, and uplinks, so a fault in one AZ does not propagate to the others. AZs are the cloud's unit of fault isolation below the region level.

Why multi-AZ matters

  • Hardware failures. A single AZ can lose…
Service Mesh

A service mesh is an infrastructure layer that handles service-to-service communication for a microservices application: routing, retries, timeouts, mTLS encryption, authorization, traffic shifting, and observability. The mesh moves these cross-cutting concerns out of each service into a dedicated layer, typically implemented as a sidecar proxy or a kernel module per pod.

How it works

Most…

Helm is the package manager for Kubernetes. It packages a set of Kubernetes manifests into a chart, parameterised by values, that can be installed, upgraded, rolled back, and shared via repositories. Helm charts are the standard distribution format for installable Kubernetes applications.

Core concepts

  • Chart. A directory of YAML templates, a values.yaml, and metadata (Chart.yaml) describing…
Container Runtime

A container runtime is the software component that actually creates and runs containers from an OCI image. It manages namespaces, cgroups, mount points, capabilities, and the container process lifecycle. Higher-level tools (Docker, Kubernetes) delegate the final "run this container" step to a runtime.

Two levels of runtime

  • High-level runtime. Pulls images, manages the local image store and…

Podman is an open-source container engine compatible with the Docker CLI but architected without a long-running root daemon. Containers run as a regular user, in a rootless namespace, with a pod abstraction borrowed from Kubernetes. Podman is the default container engine on Red Hat Enterprise Linux and Fedora.

How it differs from Docker

  • Daemonless. No central dockerd service; each podman…

Docker Compose is a tool for defining and running multi-container applications on a single host using a declarative YAML file. It is the standard way to spin up a local development environment with a database, a cache, the application server, and any other supporting services in one command.

How it works

A docker-compose.yml (or compose.yaml) file declares services, networks, volumes, and…

OCI

The Open Container Initiative (OCI) is an open governance body, hosted by the Linux Foundation, that maintains the standard specifications for container images, runtimes, and distribution. OCI specs are why a container built with one tool runs on another, and why Docker, Podman, Kubernetes, and cloud platforms can interoperate.

The three specs

  • Image Specification. Defines the on-disk format…

containerd is an industry-standard container runtime daemon that manages the complete container lifecycle on a host: image pull and storage, container execution, snapshotting, and networking. It is the runtime that Kubernetes, Docker, and most container platforms call underneath.

How it fits

Container tooling is typically layered. Docker and Kubernetes are high-level interfaces; they delegate…

Serverless is a cloud execution model in which the provider runs and scales the underlying compute, and the customer is billed only for actual usage (request count, execution time, memory). Despite the name, servers still exist; they are simply invisible to the customer. "Serverless" usually refers to functions-as-a-service (FaaS), but increasingly includes serverless databases, queues, and…

Cloud computing is the model of delivering compute, storage, networking, and software as on-demand services over the internet, billed by usage rather than by capital purchase. Cloud providers operate the underlying hardware and abstract it behind APIs, so customers consume resources elastically without managing data centres.

Service models

  • IaaS (Infrastructure as a Service). Virtual…

Blue-Green Deployment is a release strategy that maintains two identical production environments and switches all traffic from one to the other when a new version is ready. The currently live environment is "blue"; the new version is staged in "green"; when green is verified, a single traffic switch makes green live and blue idle (and available as instant rollback).

How it works

Two complete…

ArgoCD is an open-source declarative continuous-delivery tool for Kubernetes that runs inside the cluster and applies manifests from a Git repository. It is the most widely deployed GitOps controller in the Kubernetes ecosystem.

How it works

An Application object in ArgoCD points at a Git repo, a path inside it, and a target cluster and namespace. ArgoCD continuously compares the manifests in…

GitOps is an operational pattern in which the desired state of infrastructure and applications is declared in a Git repository, and an automated controller continuously reconciles the running system toward that state. Git becomes the source of truth; deployments happen by merging pull requests, not by running ad-hoc commands.

Core principles

  • Declarative. The repo describes the desired…

Prometheus is an open-source time-series database and monitoring system that scrapes metrics from instrumented applications over HTTP, stores them locally, and exposes a powerful query language (PromQL) for dashboards and alerts. It is the de facto standard for cloud-native infrastructure monitoring.

How it works

Applications expose a /metrics endpoint in the Prometheus exposition format. A…

Page 1 Older →