AI Architect

p5L6kj66F2EG9zMPo2khja
Remote · Pakistan · Telecommute

About this role

Job Summary:

As an AI Architect, you will build AI-native products. You’ll lead cross-functional Innovation Delivery Squads, owning outcomes end-to-end across web, mobile, AI agents, and streaming backends. You’re a hands-on technical leader who can scope, architect, staff, and ship, then run the product safely at scale.

Job Responsibilities:

  • Stand up and run squads (Discovery → Prototype → Product → Platform & SRE).
  • Design and ship RAG/agent systems: pick models (e.g., Anthropic Claude, OpenAI, Google, or open-weights like Llama/Mistral), define tools/functions, and choose retrieval (default Postgres + pgvector, scale to Weaviate/Qdrant/Pinecone when needed).
  • Operate AI safely: evals & guardrails, structured outputs (JSON/Schema), PII redaction, refusal policies, cost/latency budgets, and LLM observability.
  • Own delivery outcomes: SLOs, quality, cost, velocity; release with feature flags and canaries.
  • Be client-facing: discovery, scoping, SoW, roadmap, QBRs.
  • Hire/coach Tech Leads, EMs, and PMs; level up practices.

Job Requirements:

  • 8–12+ yrs engineering; 4+ yrs leading multi-team delivery; shipped production web/mobile systems at scale.
  • Shipped at least one production AI app using Claude/GPT/Gemini/Llama/Mistral, backed by retrieval (pgvector or a vector DB) and a basic eval/guardrail pipeline.
  • Implemented orchestration (LangGraph/DSPy or Temporal for durable workflows), rerankers (e.g., Cohere/Jina/Voyage), and prompt/tool versioning.
  • Built with modern cloud + data: serverless/K8s, Terraform, OpenTelemetry, feature flags/experimentation.
  • Excellent client communication and commercial sense (SoWs, staffing, utilization).
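
To give a rough sense of the "basic eval/guardrail pipeline" and structured-output items above, here is a minimal sketch using only the Python standard library. The field names (`answer`, `sources`), the email-only redaction rule, and both function names are assumptions made for illustration; they are not taken from any real system described in this posting, and a production PII filter would cover far more than email addresses.

```python
import json
import re

# Hypothetical schema: required field names mapped to expected Python types.
REQUIRED_FIELDS = {"answer": str, "sources": list}

# Illustrative pattern: matches email addresses only.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def redact_pii(text: str) -> str:
    """Replace email addresses with a fixed placeholder."""
    return EMAIL_RE.sub("[REDACTED_EMAIL]", text)


def parse_structured_output(raw: str) -> dict:
    """Parse a model reply as JSON and check it against REQUIRED_FIELDS.

    Raises ValueError if the reply is not a JSON object or a required
    field is missing or has the wrong type.
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"not valid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    for name, expected_type in REQUIRED_FIELDS.items():
        if not isinstance(data.get(name), expected_type):
            raise ValueError(f"field {name!r} missing or not {expected_type.__name__}")
    # Redact before the answer is logged or returned to a caller.
    data["answer"] = redact_pii(data["answer"])
    return data
```

A caller would typically treat `ValueError` as a signal to retry or fall back, which is where a refusal policy or a cost/latency budget would plug in.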

Tech stack (you have hands-on experience with):

  • Models: Anthropic Claude; OpenAI; Google; open-weights (Llama, Mistral).
  • Orchestration & agents: LangGraph (or DSPy) for graphs; Temporal for durable, long-running tasks and SLAs.
  • Retrieval: Postgres + pgvector (default); Weaviate/Qdrant/Pinecone when scale/ops require; hybrid search with OpenSearch/Typesense.
  • Embeddings / rerankers: OpenAI/Voyage/E5/BGE; Cohere/Jina/Voyage rerank.
  • Guardrails & evals: JSON/Pydantic schemas, red-team sets, promptfoo/Ragas/DeepEval; content/PII filters.
  • Observability: OpenTelemetry traces incl. prompt/tool spans; Langfuse/Arize Phoenix (or equivalent) + Sentry/Grafana.
  • App & data: Next.js 15 (RSC), TypeScript/Go/Python; Postgres; Kafka/Redpanda/NATS; dbt/lakehouse optional.
  • Ops: Cloud Run/ECS/K8s; Terraform/OpenTofu; GitHub Actions; LaunchDarkly/Unleash; Statsig/GrowthBook.
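
As an illustration of the "hybrid search" item in the retrieval line above, one common way to merge a vector-search ranking with a keyword-search ranking is reciprocal rank fusion (RRF). The sketch below assumes each backend has already returned an ordered list of document IDs. The function name, the sample IDs, and the choice of RRF itself are assumptions for illustration, not something this posting prescribes.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one ranking.

    Each document scores 1 / (k + rank) for every list it appears in
    (rank starts at 1). k=60 is a conventional default for RRF.
    Ties are broken by document ID so the output is deterministic.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results: one list from a vector index, one from a keyword index.
vector_hits = ["a", "b", "c"]
keyword_hits = ["b", "d", "a"]
fused = reciprocal_rank_fusion([vector_hits, keyword_hits])
```

Documents that appear in both lists ("a" and "b" here) rise above those found by only one backend, which is the usual motivation for fusing rankings before a reranker runs.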

Frequently Asked Questions

Is the salary disclosed for the AI Architect position at p5L6kj66F2EG9zMPo2khja?
The salary for this AI Architect role at p5L6kj66F2EG9zMPo2khja is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Is the AI Architect job at p5L6kj66F2EG9zMPo2khja remote?
Yes, this AI Architect position at p5L6kj66F2EG9zMPo2khja is remote, with team members based in Pakistan. You can work from home or anywhere in the supported regions.
How do I apply for the AI Architect position at p5L6kj66F2EG9zMPo2khja?
Click the "Apply Now" button on this page. You will be redirected to p5L6kj66F2EG9zMPo2khja's official application portal, hosted on Workable, where you can submit your application directly.
When was the AI Architect job at p5L6kj66F2EG9zMPo2khja posted?
This AI Architect position at p5L6kj66F2EG9zMPo2khja was posted on Feb 23, 2026. Apply as soon as possible, since early applications are often reviewed first.