Issue 01 · Sarajevo → Global
Available for select engagements

§ 001 — Introduction

Semsudin Sefić

AI Systems Architect
Principal QA & DevOps

I ship production-grade AI systems that actually work.

Multi-agent platforms, enterprise RAG, and the DevOps discipline to keep them standing. Eleven years turning prototypes into systems CTOs actually trust.

Currently building

  • Alva multi-agent platform @ Alfa Laval
  • AI-driven QA agents @ Mars Petcare
  • CTO @ Qafana.com

Book a call · See the work

11 yrs

In the field

50+

Engineers led

1000+

Tests shipped

750%

Pipeline speed-up

Shipped alongside

Enterprise · SaaS · MarTech · Manufacturing

Alfa Laval

Alva AI Platform

Mars Petcare

Kinship · AdoptAPet

APCOA

Core QA Transformation

ERS

Test Automation Architect

System Verification

Tech Lead · 50+ eng

Authority Partners

Test Automation Eng

Andela

Vetted network

§ 002 — Services

What I do

Three pillars. One thing: AI systems that ship, and stay shipped.

01

AI Systems Architecture

Multi-agent platforms that survive production.

ReAct agents on LlamaIndex, 10+ LLMs orchestrated through a single interface, and full RAG: document parsing, chunking, and pgvector search. End-to-end streaming. Proper observability via OpenTelemetry. The architecture decisions that separate a cool demo from a system your ops team doesn't hate.

LlamaIndex · FastAPI · RAG · pgvector · Azure OpenAI · Multi-LLM

Proof · Shipped Alva at Alfa Laval: React UI, .NET 8 orchestrator, Python agent, 10+ LLMs.
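For readers who want the "single interface" idea in concrete terms, here is a minimal sketch. It is not Alva's code: `LLMRouter`, `fake_gpt`, and `fake_claude` are hypothetical names, and the stub functions stand in for real provider SDK calls. It only illustrates one way callers can stay unaware of which model answers.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional

# A provider is any callable that turns a prompt into a completion string.
Provider = Callable[[str], str]


@dataclass
class LLMRouter:
    """Single entry point that dispatches to a named provider (hypothetical design)."""
    providers: Dict[str, Provider]
    default: str

    def complete(self, prompt: str, model: Optional[str] = None) -> str:
        # Fall back to the default model when the caller does not pick one.
        name = model or self.default
        if name not in self.providers:
            raise KeyError(f"unknown model: {name}")
        return self.providers[name](prompt)


# Stub providers: assumptions standing in for real SDK calls.
def fake_gpt(prompt: str) -> str:
    return f"[gpt] {prompt}"


def fake_claude(prompt: str) -> str:
    return f"[claude] {prompt}"


router = LLMRouter(providers={"gpt": fake_gpt, "claude": fake_claude}, default="gpt")
print(router.complete("ping"))
print(router.complete("ping", model="claude"))
```

Adding another model under this design means registering one more callable; the calling code does not change.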

02

AI-Driven QA

Tests that write themselves, heal themselves, and ship themselves.

Custom AI agents that auto-generate test plans from UI exploration, create Playwright tests from tickets, self-heal failing tests, and move tickets through the pipeline. Integrated into the dev cycle to catch regressions, OWASP Top 10 vulnerabilities, and performance bottlenecks before QA even touches the build.

Playwright · AI Agents · ISTQB AI Testing · Self-healing · Security scans

Proof · 1000+ tests across 6 Mars Petcare apps. Authenticated test pass rate 20% → 100%.
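"Self-healing" can mean several things. One common reading is a locator that falls back to alternative selectors when the primary one breaks, and records which one worked. The sketch below assumes that reading. `FakePage` and `HealingLocator` are hypothetical names, not Playwright APIs and not the agents described above; a real setup would wrap actual page queries.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class FakePage:
    """Stand-in for a browser page: maps selectors to element text (hypothetical)."""
    elements: Dict[str, str]

    def query(self, selector: str) -> Optional[str]:
        return self.elements.get(selector)


@dataclass
class HealingLocator:
    """Tries the primary selector first, then fallbacks, and records any heal."""
    primary: str
    fallbacks: List[str]
    healed_with: Optional[str] = None

    def resolve(self, page: FakePage) -> str:
        for selector in [self.primary, *self.fallbacks]:
            found = page.query(selector)
            if found is not None:
                if selector != self.primary:
                    # Remember the working selector so the test can be updated.
                    self.healed_with = selector
                return found
        raise LookupError(f"no selector matched for {self.primary}")


# The button's id changed in a release, but its test id still matches.
page = FakePage({"[data-testid=submit]": "Submit"})
locator = HealingLocator("#submit-btn", ["[data-testid=submit]", "text=Submit"])
print(locator.resolve(page), locator.healed_with)
```

The recorded `healed_with` value is what a follow-up step could use to propose a fix to the test source instead of silently passing.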

03

Enterprise DevOps

CI/CD pipelines that do the right thing by default.

Unified pipelines across 7 services with smart change detection and parallel deploys. Prod container registry isolation via image promotion. Bicep IaC with RBAC that actually works. Consolidated coverage reporting across .NET and Python. The boring infrastructure that makes the exciting stuff possible.

Azure DevOps · GitHub Actions · Bicep · Docker · Clean Architecture

Proof · System Verification: 300% stability gain, 750% faster test execution. Alva: unified 7-service pipeline.
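As a rough illustration of change detection with parallel deploys, the sketch below maps changed file paths to the services that own them and deploys only those, concurrently. The service names, path prefixes, and the `deploy` placeholder are all assumptions made for the example; the real pipeline lives in CI configuration, not in Python.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, Iterable, List, Set

# Hypothetical mapping of service name -> path prefixes it owns.
SERVICE_PATHS: Dict[str, List[str]] = {
    "ui": ["src/ui/"],
    "orchestrator": ["src/orchestrator/"],
    "agent": ["src/agent/"],
}
# Assumption: a change under a shared path affects every service.
SHARED_PATHS = ["infra/", "shared/"]


def affected_services(changed_files: Iterable[str]) -> Set[str]:
    affected: Set[str] = set()
    for path in changed_files:
        if any(path.startswith(p) for p in SHARED_PATHS):
            return set(SERVICE_PATHS)  # shared change: redeploy everything
        for service, prefixes in SERVICE_PATHS.items():
            if any(path.startswith(p) for p in prefixes):
                affected.add(service)
    return affected


def deploy(service: str) -> str:
    # Placeholder for a real deploy step.
    return f"deployed {service}"


def deploy_changed(changed_files: Iterable[str]) -> List[str]:
    targets = sorted(affected_services(changed_files))
    with ThreadPoolExecutor(max_workers=4) as pool:
        # pool.map runs deploys concurrently and preserves input order.
        return list(pool.map(deploy, targets))


print(deploy_changed(["src/ui/App.tsx", "README.md"]))
```

Files that match no prefix, such as the README here, trigger nothing, which is where the time savings come from.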

§ 003 — Selected Work

Case studies

The work behind the metrics.

01 / 03

Feb 2021 – Present

Manufacturing · Enterprise GenAI

AI Systems Architect

Alfa Laval

Architected Alva — a production multi-agent AI platform orchestrating 10+ LLMs through a unified chat interface. Built the full stack end-to-end: React UI, .NET 8 orchestrator, Python/FastAPI agent service, Azure infrastructure as code.

LlamaIndex · FastAPI · .NET 8 · React · PostgreSQL · pgvector · +4 more

10+

LLMs orchestrated

7

Services in pipeline

GB → MB

Docker context reduced

All

Architecture layers owned

Read case study →

02 / 03

Oct 2023 – Mar 2025

MarTech · Consumer

Senior QA Engineer · AI-Driven QA Lead

Mars Petcare — Kinship & AdoptAPet

Built a multi-app Playwright framework from scratch covering 6 projects across two Mars Petcare brands. Pioneered AI-driven QA agents that auto-generate test plans, create tests from work items, and self-heal failing suites.

Playwright · TypeScript · GitHub Actions · AWS S3 · Slack API · Iterable API · +4 more

1000+

Automated tests

6

Apps covered

20% → 100%

Auth pass rate

US + UK

Regions supported

Read case study →

03 / 03

Mar 2018 – Apr 2023

Enterprise QA Services

Tech Lead · Senior Test Automation Engineer

System Verification

Led QA delivery across 50+ engineers on multiple concurrent projects. Transformed inherited test solutions into stable, fast, modern pipelines — and built the team to run them.

C# / .NET · Selenium · Ranorex · Azure DevOps · TFS · Docker · +4 more

50+

Engineers led

+300%

Stability gain

+750%

Execution speed-up

-500%

Initial setup time

Read case study →

§ 004 — Toolbox

Stack

Not every tool. The right ones.

Eleven years of language-hopping and framework-chasing distilled into the stack I actually reach for when the stakes are real.

01

AI & Agents

  • LlamaIndex
  • FastAPI
  • Azure OpenAI
  • Anthropic Claude
  • OpenAI GPT-4o
  • Mistral
  • pgvector
  • Tavily
  • FLUX
  • ReAct agents
  • RAG
  • Streaming LLM

02

QA & Testing

  • Playwright
  • Cypress
  • Selenium
  • Appium
  • JMeter
  • LoadRunner
  • ISTQB AI Testing
  • OWASP scans
  • Self-healing tests
  • AI test generation
  • xUnit
  • pytest

03

DevOps & Infra

  • Azure DevOps
  • GitHub Actions
  • Jenkins
  • Bicep IaC
  • Docker
  • Azure App Service
  • Azure Functions
  • ACR
  • Application Insights
  • OpenTelemetry
  • Cloudflare
  • AWS Lambda

04

Backend & Data

  • C# / .NET 8
  • Python 3.12
  • TypeScript
  • Node.js
  • React
  • PostgreSQL
  • SQL Server
  • EF Core
  • Clean Architecture
  • REST · gRPC
  • Azure AD auth
  • Auth0

§ 005 — Contact

Let's talk

Got an AI platform to ship?

Whether you need a fractional CTO, a multi-agent architecture review, an enterprise QA transformation, or hands-on implementation — pick the fastest path.

Option A · Book instantly

30 min · Free

Option B · Send a brief

Async · < 24h

Quick facts

Location: Sarajevo, BiH · CET (UTC+1; UTC+2 in summer)
Engagement: Full-time · Contract · Fractional
Response: < 24 hours