Production AI engineer for operations that can&#x27;t fail.

Featured

Outcome-Driven Agent Evaluation (Hive)

Evaluation patterns for agents that must improve real outcomes.

Exploration and extension of the Hive framework for outcome-driven agent development, focusing on how teams iterate when success is measured by business results rather than single-turn benchmarks.

Public GitHub fork

Repository

Outcome loops vs. toy task accuracy

Lens

Research and internal eval experiments

Use

Python

Agent Frameworks

Evaluation Design

OSS

Apache 2.0

Featured

AIDC Barcode Toolkit

Barcode and AIDC building blocks for Claude Code.

Open-source toolkit that packages real-world AIDC workflows so Claude Code can generate, validate, and reason about barcode and labeling tasks with domain-correct defaults.

Public OSS on GitHub

Distribution

Developer velocity for AIDC-heavy features

Focus

Domain moat meets LLM-native tooling

Bridge

JavaScript

Claude Code

Barcode Standards

Label Workflows

MIT License

Developer Tools

New release

Shipped SaaS products

Production multi-tenant platforms — not prototypes. Latest: Soko ERP (500+ businesses across East Africa).

Soko case study

Soko ERP

Soko is a production multi-tenant ERP for East African SMBs — now live with 500+ businesses, 2.1M+ transactions, and 99.9% uptime. One platform for POS, multi-location inventory, HR/payroll, accounts, and reporting, with M-Pesa, offline-first sync, and multi-currency support (KES/UGX/TZS).

Votia

Votia is a production voting SaaS that lets organizations create voting sessions, register nominees, share links, and collect verified votes — with email-OTP voting and paid bulk voting via Paystack, plus per-session dashboards and analytics.

DaktariDesk

DaktariDesk is a production automation SaaS for Kenyan clinics that layers on top of Google Calendar to send WhatsApp appointment reminders, recall dormant patients, and request Google reviews — protecting revenue lost to no-shows.

Next.js

TypeScript

WhatsApp Business API

Google Calendar API

Supabase

Top ~3.8%

Opus 4.7 hackathon (~500 / 13,000+)

50+

Production POS / WMS rollouts

Flagship AI case studies

OSS

Agents, evals, AIDC toolkit

4+ yrs

Enterprise & field ops

Why work with me

Production AI + eval discipline for global teams — deep GitHub activity, production rollouts, not just demos

Agents, evals & LLM systems

Claude managed agents, MCP, RAG, multi-model orchestration, and outcome-driven evals — production patterns, not demo wrappers.

Enterprise integration

POS, WMS, ERP, M-Pesa, offline-first — 50+ production deployments. AI that fits existing stacks and adoption constraints.

Domain depth & AIDC

Barcode, RFID, warehouse ops — real operational edge cases that generic AI engineers miss after go-live.

Production AI systems

Architectures and outcomes from shipped agent work — evals, integrations, and domain constraints included.

Agentic reconciliation

Vision capture → stateful walk session → ERP lookup → variance classification. Long-horizon agents with resume/retry, not one-shot API calls.

Eval-driven iteration

Outcome loops and business scorecards instead of toy task accuracy. Iterate prompts, tools, and policies against operational impact.

RAG + enterprise glue

pgvector retrieval, MCP tool use, and multi-tenant Supabase — AI that plugs into existing ERP/WMS stacks with permissions and audit trails.

Case studies & work index

Top ~3.8%

Anthropic Built with Opus hackathon (~500 / 13,000+)

50+

Production POS / WMS / ERP rollouts

2 wks → ~40 min

Warehouse audit cycle (WaybillAgent)

Featured in

Industry coverage on AIDC, floriculture traceability, and East Africa enterprise tech.

Floriculture Magazine

How Krystal East Africa is Powering the Digital Shift in Flower Growing

Featured as Technical Sales Engineer at Krystal Scanning & Mobility Group, discussing supply chain automation and QR-based traceability systems for Kenya's $835M floriculture export industry.

Skills & stack

AI-first capabilities for hire or contract—backed by operations domain depth

AI & LLM engineering

• Claude API, managed agents, and long-horizon agent workflow design
• MCP tooling and multi-step orchestration
• RAG pipelines, grounding, and retrieval quality
• Multi-model orchestration and cost optimization
• Vision / OCR in messy real-world captures
• Production evals and reliability patterns

Full-stack & enterprise systems

• Next.js 14+, React 18, TypeScript
• Supabase/PostgreSQL with RLS and pgvector
• POS, WMS, and ERP systems with repeated production rollouts
• M-Pesa and payment integrations
• Multi-tenant SaaS and white-label platforms
• Offline-first architecture for field ops

Domain expertise & ops

• Barcode, RFID, and AIDC process literacy
• HR, Payroll, Inventory, and Production modules
• Warehouse operations and logistics automation
• East African enterprise rollouts across multiple sectors
• Remote-first delivery for global teams

Live sites

AI and OSS proof first, then enterprise and client deployments. Full case studies on the work index.

AI & OSS

WaybillAgent

AI-assisted warehouse audit workflow (hackathon flagship).

Live site waybill-agent Case study

Asset Zen

Asset operations product surface with AI-assisted workflows.

Live site newassetzenapp Case study

Joseph Rwanda (portfolio)

This site — production AI engineer positioning and case studies.

Live site josephrwandaportfolio

AIDC Barcode Toolkit

OSS barcode/AIDC primitives for Claude Code and agent-assisted logistics.

GitHub repo aidc-barcode-toolkit Case study

Outcome Agent Evals (Hive fork)

Outcome-driven agent evaluation patterns — public GitHub fork.

GitHub repo hivereviewbyjoe Case study

Enterprise & ops

Soko ERP

Production ERP for East Africa — 500+ businesses, POS, inventory, HR, M-Pesa, and offline sync.

Votia

Award-style voting SaaS — sessions, nominee registration, verified email-OTP votes, and paid bulk voting via Paystack.

DaktariDesk

Automated WhatsApp appointment reminders for Kenyan clinics — Google Calendar sync, patient recalls, and review requests.

Work with me View work Hiring in Kenya?

FarmTrace

Flower & herb traceability — grower, lot, and chain-of-custody workflows.

Live site

Origami Tech

Company site — AIDC consulting and enterprise systems.

Live site origamiwebsite

Client sites

Jumbo Greens

E-commerce — fresh produce and delivery.

Live site jumbogreens

Shot by Mark

Photography portfolio (Canada).

Live site shotbymarkv2

Maonera

Farm and brand site.

Live site maonera

Tikohub

Events platform.

Live site tikohub

Live build updates

Launches, architecture notes, and hiring signals on LinkedIn; public code and demos on GitHub—no synthetic feeds.

View LinkedIn

FAQ

AI engineering delivery, reliability, remote collaboration, and how engagements run

What kind of AI systems do you build?

Can you build custom Claude and MCP-based workflows?

How do you approach reliability for AI features in production?

Do you support existing products or only greenfield builds?

What does a first engagement usually look like?

How quickly can we move from prototype to production?

How does your AIDC and warehouse background help AI projects?

Are you a fit for Forward Deployed or AI Solutions Architect roles?

What timezone overlap do you offer for remote teams?

Are you open to AI engineering roles as well as consulting?

Do you work with teams outside Kenya?

Still deciding?

Email a short brief: problem, stack, timeline, and success metric—I'll reply with a realistic path.

Email Joseph

Let's build or hire

Open to AI engineering roles (remote or hybrid) and selective consulting for US, EU, and emerging-market teams. Share your stack, timezone overlap, timeline, and what done looks like.

Download resume

Direct contact

hi@josephrwanda.com

Start an AI Engineering Conversation

Share your product goals and context below. When you are done, pick one: send the inquiry straight to my inbox, or open WhatsApp with everything prefilled so you can send it there.

Joseph RwandaHire me

Remote · Production AI · UTC+3 (Nairobi)

Production AI engineer foroperations that can't fail.

Hiring an AI engineer in Kenya or East Africa? Dedicated profile with local proof points and deployment context.

View AI engineer in Kenya profile

Featured work

Flagship case studies covering agents, evals, vision, and OSS — with architecture, scorecards, and outcomes.

Featured

WaybillAgent

Walk the warehouse, Claude does the audit.

500 / ~13,000 (Top 3.8%)

Hackathon Cohort

2 weeks -> ~40 minutes

Audit Cycle

$2,000 scanner -> phone/glasses workflow

Device Shift

Claude Opus 4.7

Claude Managed Agents

TypeScript

Next.js

Supabase

Vercel

Featured

Outcome-Driven Agent Evaluation (Hive)

Evaluation patterns for agents that must improve real outcomes.

Exploration and extension of the Hive framework for outcome-driven agent development, focusing on how teams iterate when success is measured by business results rather than single-turn benchmarks.

Public GitHub fork

Repository

Outcome loops vs. toy task accuracy

Lens

Research and internal eval experiments

Use

Python

Agent Frameworks

Evaluation Design

OSS

Apache 2.0

Featured

AIDC Barcode Toolkit

Barcode and AIDC building blocks for Claude Code.

Open-source toolkit that packages real-world AIDC workflows so Claude Code can generate, validate, and reason about barcode and labeling tasks with domain-correct defaults.

Public OSS on GitHub

Distribution

Developer velocity for AIDC-heavy features

Focus

Domain moat meets LLM-native tooling

Bridge

JavaScript

Claude Code

Barcode Standards

Label Workflows

MIT License

Developer Tools

New release

Shipped SaaS products

Production multi-tenant platforms — not prototypes. Latest: Soko ERP (500+ businesses across East Africa).

Soko case study

Top ~3.8%

Opus 4.7 hackathon (~500 / 13,000+)

50+

Production POS / WMS rollouts

Flagship AI case studies

OSS

Agents, evals, AIDC toolkit

4+ yrs

Enterprise & field ops

Why work with me

Production AI + eval discipline for global teams — deep GitHub activity, production rollouts, not just demos

Agents, evals & LLM systems

Claude managed agents, MCP, RAG, multi-model orchestration, and outcome-driven evals — production patterns, not demo wrappers.

Enterprise integration

POS, WMS, ERP, M-Pesa, offline-first — 50+ production deployments. AI that fits existing stacks and adoption constraints.

Domain depth & AIDC

Barcode, RFID, warehouse ops — real operational edge cases that generic AI engineers miss after go-live.

Production AI systems

Architectures and outcomes from shipped agent work — evals, integrations, and domain constraints included.

Agentic reconciliation

Vision capture → stateful walk session → ERP lookup → variance classification. Long-horizon agents with resume/retry, not one-shot API calls.

Eval-driven iteration

Outcome loops and business scorecards instead of toy task accuracy. Iterate prompts, tools, and policies against operational impact.

RAG + enterprise glue

pgvector retrieval, MCP tool use, and multi-tenant Supabase — AI that plugs into existing ERP/WMS stacks with permissions and audit trails.

Case studies & work index

Top ~3.8%

Anthropic Built with Opus hackathon (~500 / 13,000+)

50+

Production POS / WMS / ERP rollouts

2 wks → ~40 min

Warehouse audit cycle (WaybillAgent)

Featured in

Industry coverage on AIDC, floriculture traceability, and East Africa enterprise tech.

Floriculture Magazine

How Krystal East Africa is Powering the Digital Shift in Flower Growing

Featured as Technical Sales Engineer at Krystal Scanning & Mobility Group, discussing supply chain automation and QR-based traceability systems for Kenya's $835M floriculture export industry.

Skills & stack

AI-first capabilities for hire or contract—backed by operations domain depth

AI & LLM engineering

• Claude API, managed agents, and long-horizon agent workflow design
• MCP tooling and multi-step orchestration
• RAG pipelines, grounding, and retrieval quality
• Multi-model orchestration and cost optimization
• Vision / OCR in messy real-world captures
• Production evals and reliability patterns

Full-stack & enterprise systems

• Next.js 14+, React 18, TypeScript
• Supabase/PostgreSQL with RLS and pgvector
• POS, WMS, and ERP systems with repeated production rollouts
• M-Pesa and payment integrations
• Multi-tenant SaaS and white-label platforms
• Offline-first architecture for field ops

Domain expertise & ops

• Barcode, RFID, and AIDC process literacy
• HR, Payroll, Inventory, and Production modules
• Warehouse operations and logistics automation
• East African enterprise rollouts across multiple sectors
• Remote-first delivery for global teams

Live sites

AI and OSS proof first, then enterprise and client deployments. Full case studies on the work index.

AI & OSS

WaybillAgent

AI-assisted warehouse audit workflow (hackathon flagship).

Live site waybill-agent Case study

Asset Zen

Asset operations product surface with AI-assisted workflows.

Live site newassetzenapp Case study

Joseph Rwanda (portfolio)

This site — production AI engineer positioning and case studies.

Live site josephrwandaportfolio

AIDC Barcode Toolkit

OSS barcode/AIDC primitives for Claude Code and agent-assisted logistics.

GitHub repo aidc-barcode-toolkit Case study

Outcome Agent Evals (Hive fork)

Outcome-driven agent evaluation patterns — public GitHub fork.

GitHub repo hivereviewbyjoe Case study

Enterprise & ops

Soko ERP

Production ERP for East Africa — 500+ businesses, POS, inventory, HR, M-Pesa, and offline sync.

Votia

Award-style voting SaaS — sessions, nominee registration, verified email-OTP votes, and paid bulk voting via Paystack.

DaktariDesk

Automated WhatsApp appointment reminders for Kenyan clinics — Google Calendar sync, patient recalls, and review requests.