Work • LotusNex

Delivery patterns

Described at the architecture level. Client details are not disclosed. Each pattern reflects a real system — scoped, built, hardened, and handed off in production.

Process AutomationOperations Workflow~8 weeks

Automated document data-entry pipeline for a high-volume accounting firm

A professional services firm processing high volumes of client financial documents faced a persistent manual bottleneck. Staff were transcribing structured data from incoming documents — invoices, statements, and tax filings — into internal systems by hand. The work was repetitive and rule-bounded, but error-prone and scaling with headcount rather than revenue. An architecture review confirmed process fit and data readiness before any build commitment. The goal was to automate extraction and entry end-to-end, with validation gates before any data reached the system of record.

Architecture approach

Document intake pipeline with format normalization across PDF, scanned, and digital-native inputs
Structured extraction with schema-grounded LLM output and per-field confidence scoring
Human review routing for low-confidence extractions — clear UI for review, approval, and correction
Write pipeline with dry-run mode, diff preview, and rollback on validation failure

Production properties

End-to-end audit trail from document receipt to system write
Validation layer catches format drift before extraction failures propagate
Human review queue with SLA visibility and exception reporting
Cost telemetry per document type with budget controls and alert thresholds

Delivery outcomePipeline shipped to production with extraction validated against a held-out document sample and the human review queue operational from day one. The validation layer caught format drift across three document variants before any failures reached the system of record. Operations staff reallocated from manual transcription to exception handling and advisory work.

Retrieval InfrastructureB2B SaaS~10 weeks

Permission-aware retrieval in multi-tenant systems

A B2B SaaS platform serving regulated-industry clients needed internal knowledge retrieval exposed to end users across multiple tenants. The client came with an existing retrieval prototype. An architecture review identified access control at the query layer — not retrieval quality — as the critical production risk before go-live. Documents carried role and tenant-level permissions that a standard RAG implementation would not respect. Post-filtering was insufficient: it couldn't prevent cross-tenant context from influencing generated responses. The system had to enforce boundaries at the index query layer, before any content reached the model.

Architecture approach

Ingestion pipeline with document normalization and permission metadata extraction at ingest time
Chunking strategy tuned to document structure and role-relevant content boundaries
RBAC-aware retrieval with identity-layer integration — access enforced at query, not filtered after
Offline evaluation harness with recall/precision baselines and adversarial query regression suite

Production properties

Tenant isolation enforced at index query time — zero cross-tenant document exposure
Retrieval audit trail aligned to compliance and data access requirements
Monitoring for recall drift and retrieval latency across tenant partitions
Runbooks and ownership documentation delivered at handoff

Delivery outcomeThe system shipped to production with permission boundaries validated under adversarial query testing and evaluation baselines established. Internal teams assumed full ownership with the ability to maintain and extend the retrieval pipeline without external support.

Agent OrchestrationB2B SaaS~12 weeks

Agentic workflow with human approval gates for an operations team

An operations team managing high-volume, multi-step workflows across several internal tools wanted to automate a class of repetitive decisions — while retaining human review for actions above a defined risk threshold. A workflow discovery scoped the automation boundary before architecture work began — defining what the agent could execute autonomously, what required approval, and what had to fail deterministically rather than degrade silently.

Architecture approach

Orchestration loop with explicit tool boundary definitions and least-privilege permissioning
Risk-tiered approval flow: auto-execute, human-in-loop, and hard-stop tiers
Tool-call logging and trace spans for full auditability
Evaluation test set covering edge cases, adversarial inputs, and boundary conditions

Production properties

Deterministic fallbacks for all failure modes — no silent degradation
Cost telemetry and per-workflow token budgets
Escalation path with clear notification and override mechanics
Regression checks integrated into CI pipeline

Delivery outcomeAgentic system deployed with approval gates validated against real workflow data, cost controls active from day one, and operations team trained on override and monitoring procedures.

Agent OrchestrationB2B SaaS~12 weeks

Automated lead research and CRM enrichment system for a B2B SaaS sales team

A B2B SaaS sales team was spending significant SDR time on manual prospect research — ICP qualification, company enrichment, contact data entry, and CRM hygiene. An architecture review scoped the research and enrichment loop before build began, defining what the agent could automate fully versus what required human approval before any CRM write or outreach trigger. The research loop was repetitive but judgment-dependent: some enrichment could be fully automated, but outreach decisions required human review.

Architecture approach

Orchestration loop covering prospect discovery, enrichment from multiple data sources, and ICP scoring
Tool integrations with research APIs, enrichment providers, and CRM write endpoints — all with least-privilege permissions
Human approval gate before CRM write and before outreach action triggers
Completeness and confidence scoring with graceful degradation when external sources fail

Production properties

All enrichment actions logged with source attribution for auditability
Idempotent CRM writes with duplicate detection and merge conflict handling
Per-prospect and per-source cost telemetry with budget controls
Runbooks for common failure modes — API downtime, low-confidence bulk batches, CRM rate limits

Delivery outcomeSystem deployed with CRM write gates validated against a live prospect sample and cost telemetry active from day one. Source attribution logging confirmed enrichment provenance on every record. Sales team trained on review procedures, confidence thresholds, and override paths. No post-handoff support required.

Platform & ReliabilityB2B SaaS~8 weeks

Production observability and deployment infrastructure for an AI system with no visibility

A team had shipped an AI-powered feature to production but had no alerting, no per-workflow cost visibility, and a deployment process that required manual steps with no rollback capability. The first time quality degraded, it was reported by a customer — not surfaced by the system. An architecture review confirmed the gap: the AI system was live but the operational infrastructure that should have surrounded it from day one was absent.

Architecture approach

Full-stack observability with structured logging, distributed traces, and per-workflow latency and error-rate dashboards
Alerting on model response quality signals, token spend anomalies, and downstream system failures — with runbooks attached to each alert
CI/CD pipeline with automated regression checks against a held-out evaluation set and one-command rollback capability
Per-workflow cost telemetry with budget alert thresholds before spend reaches critical levels

Production properties

Incidents surface in dashboards with runbooks — not in customer reports
Deployments are automated, regression-tested, and reversible
Per-workflow spend is visible and bounded before scale
Team owns the full operational stack at handoff with no external dependency

Delivery outcomeObservability infrastructure operational in the first week. First degradation alert fired and was resolved internally before any user reported an issue. Deployment pipeline adopted by the team with automated regression checks running on every merge. Cost telemetry active across all workflows with budget controls in place.

AI Security & HardeningB2B SaaS~6 weeks

Adversarial testing and access control hardening for a multi-tenant AI system

A team had built an AI assistant handling customer and internal data across multiple tenants. The system had never been tested adversarially — no prompt injection testing, no cross-tenant access validation, no review of how data sensitivity was handled at the retrieval layer. A Workflow Discovery assessment surfaced critical and high-severity security findings. The engagement scope was to test, document, and close the gaps before the system was exposed to a broader user base.

Architecture approach

Adversarial test suite covering direct prompt injection, instruction override attempts, and indirect injection via retrieved documents
Access control review at the query layer — validating that tenant boundaries held under adversarial retrieval queries, not just standard usage
Data sensitivity mapping — tracing PII, financial, and customer data flows through the AI stack to identify unguarded exposure points
Remediation implementation for all critical findings with regression tests to prevent recurrence

Production properties

All critical and high-severity findings resolved and verified via retesting before broader rollout
Adversarial test suite integrated into CI pipeline for ongoing regression
Access control validated at query time — not via model instructions that can be overridden
Findings report and remediation log delivered as part of handoff documentation

Delivery outcomeAll critical findings closed before broader user exposure. Remediation verified via retesting across the full adversarial suite. Findings report used internally to align stakeholders on ongoing security posture. Adversarial test suite adopted into the team's CI pipeline.