Robert Kubiś

Platform & Tools Leader

Robert Kubiś

Observability · Automation · AI-enabled Operations · DevOps Platforms

I build and lead platform capabilities that make engineering teams faster, safer and more reliable.

platform-lead --focus="observability,automation,ai-ops"

About Me

Platform leadership for reliable engineering teams

I am a Platform & Tools leader focused on observability, automation, AI-enabled operations and reliable internal platforms. My background combines DevOps, cloud architecture, SRE practices and engineering leadership across platform, tooling, monitoring and operational support areas. Recently, my work has focused on observability platforms built around Grafana, Prometheus, VictoriaMetrics, Loki, Alloy, monitoring-as-code and Git-reviewed operational configuration. I explore practical AI for operations: deterministic telemetry summaries, meeting intelligence, human-in-the-loop workflows and safe read-only integrations. I like connecting technical platform work with real operational value: clearer ownership, fewer manual steps, better incident response and more useful signals for engineering teams.

Current focus

What I focus on now

Observability Platform

Grafana, Prometheus, VictoriaMetrics, Loki, alerting, ownership, monitoring-as-code and operational context during incidents.

AI-enabled Operations

AI-assisted reporting, deterministic signal collection, transcript intelligence, safe read-only integrations and human approval before action.

Automation & Orchestration

Repeatable operational workflows, Jira/PagerDuty patterns, request portals, GitOps-style reviews and reducing manual toil.

Engineering Leadership

Platform teams, stakeholder alignment, coaching, prioritization, 24/7 support awareness and translating technical debt into risk.

Selected work

Stories that represent my current direction

Platform & Tools Leadership

Leading platform streams across monitoring, automation, portals and operational support with a focus on ownership, reliable delivery and stakeholder clarity.

AI Observability Agent

Exploring safe AI-assisted operational reporting based on deterministic telemetry, structured summaries and read-only observability integrations.

Automation Engine

Reducing repeated manual operational work by turning support and incident-related patterns into safer, more repeatable workflows.

AI Meeting Intelligence

Using meeting transcripts to extract decisions, owners, follow-ups and possible Jira or GitHub issues with human-in-the-loop guardrails.

Capabilities

Skills grouped by platform capability

Platform Leadership

  • Team leadership and delegation
  • Stakeholder alignment
  • Roadmap and priority clarity
  • Operational ownership models

Observability & Reliability

  • Grafana, Prometheus, VictoriaMetrics
  • Loki, logs and operational signals
  • Actionable alerts and incident context
  • SLO and reliability thinking

Automation & GitOps

  • Terraform and Infrastructure as Code
  • Git-reviewed configuration
  • GitHub Actions and CI/CD
  • Repeatable operational workflows

AI for Operations

  • AI-assisted summaries and reports
  • Human-in-the-loop automation
  • Read-only tool integrations
  • Guardrails and safe workflows

Cloud & DevOps Foundation

  • AWS and cloud architecture
  • Docker and Kubernetes concepts
  • Linux system administration
  • Python and Bash automation

Collaboration & Delivery

  • Jira and Confluence
  • Scrum and Kanban
  • Incident follow-up and postmortems
  • Clear requirements and acceptance criteria
Hey! The hacker effects are just an easter egg now. The real story is platform leadership. 😉