Company OverviewSPACE44 builds and operates software systems for companies that need technology to work reliably in real, day-to-day operations.We work as long-term engineering partners, embedding experienced engineers into client environments and taking responsibility for execution, stability, and ongoing improvement of production systems. Our work goes beyond traditional staffing or project delivery — we operate the systems we help build.A growing part of our work focuses on operational AI. We design, integrate, and run AI systems inside our clients’ cloud or on-prem environments, ensuring they are secure, governed, and usable in everyday business processes. This includes deploying and operating PIPE44, our AI integration and governance platform, together with AI-enabled engineering teams.SPACE44 is a remote-first company. We value clear ownership, practical problem-solving, and engineers who are comfortable running real systems in production.Who We Are Looking For?We're looking for an AI Platform Operations Engineer to ensure the reliability, security, and governance of AI systems running on the PIPE44 platform in client environments.This is a platform-focused role – you protect the systems. You'll work on deployment, access control, monitoring, and incident response. You don't need to be an AI expert on day one – we're looking for someone with strong operational discipline who's comfortable learning AI/RAG tooling on the job.You'll be client-aware (able to communicate when needed) but not client-heavy. Your primary focus is keeping the platform running reliably and securely.What You'll DoDeploy and manage AI agents and multi-agent workflowsConfigure and enforce access control, permissions, and knowledge boundariesMaintain governance standards and audit trailsBuild and monitor observability (logs, metrics, alerts)Respond to incidents and manage failure modesEnsure platform integrity across client environmentsSupport rollout of new agents and controlled scalingDocument operational procedures and runbooksRequired:3+ years in production operations (DevOps, SRE, Platform Engineering, or similar)Strong systems thinking and operational disciplineExperience with monitoring, logging, and alerting toolsSolid understanding of access control and permissions managementComfortable with incident response and troubleshootingProcess-oriented with strong documentation habitsWillingness to learn AI/RAG systems on the jobNice to Have:Exposure to LLM-based systems or RAG pipelinesExperience with agent frameworks or workflow orchestrationBackground in regulated or compliance-sensitive environmentsFamiliarity with Kubernetes, Terraform, or cloud platformsWhat This Role Is NotNot a heavy infrastructure/cloud architect roleNot client-facing as primary responsibilityNot AI research or model trainingNot sales or pre-salesEmbark on your SPACE44 journey with a few simple steps:Hireflix Chat (20 min): Share how you work and what matters most to youHR Sync (45 min): Connect your experience with our goalsCode Challenge: Complete a 1-2 hour coding sessionPsychometric Insight: Take a 1-hour assessment exploring your strengths and working styleWorking Model and BenefitsFlexible Work Schedule: Enjoy a non-linear workday designed to enhance productivity and maintain a harmonious work-life balance, with core hours for team collaborationProfessional Growth: Access advanced training opportunities in data science and machine learning to boost your career prospectsInnovative Projects and Tools: Engage in cutting-edge projects using the latest tools and technologies within a progressive remote work environmentCompetitive Income: Receive a competitive income with regular performance reviews and potential raises every six monthsGlobal Team Dynamics: Collaborate with a diverse, international team that values openness and teamworkFully Remote Environment: Work from anywhere with normal business hours and flexibility, no on-call rotation, and no planned travel requirementsGradual Onboarding: Start part-time with a planned transition to full-time as you get up to speed with our systems and processes