About Zapata 

Zapata AI is the Industrial Generative AI company, revolutionizing how enterprises solve their hardest problems with its powerful suite of Generative AI software. By combining numerical and text-based solutions, Zapata AI empowers industrial-scale commercial, government and military/defense enterprises to leverage large language models and numerical generative models better, faster, and more efficiently, delivering solutions that drive growth, savings and unprecedented insight. With proprietary science and engineering techniques and the Orquestra® platform, Zapata AI is accelerating Generative AI’s impact in industry.

 

About the Role 

The Zapata AI Platform Team helps configure, monitor, and maintain the hosted cloud infrastructure for all Zapata initiatives. This includes configuring and securing the hosted architecture of Zapata’s AI/ML platform Orquestra™ and helping set up managed Kubernetes clusters and cloud provider (AWS/Azure/GCP) resources for research and development projects. We help educate teams on security best practices and find the best components for a solution. You will use a wide variety of open-source technologies and tools, including Kubernetes, ArgoCD and Crossplane. This role works closely with US-based teams, so the suitable candidate will need to be based in a time zone that overlaps with morning (AM) EST hours.

    

Key Responsibilities Include   

 

  • Architect and Build Distributed Systems: Design and develop robust, scalable distributed systems, drawing on a deep understanding of distributed-systems principles and trade-offs.
  • Mentor Team Members: Provide guidance and mentorship to junior and mid-level engineers through code reviews, architectural discussions, and technical training sessions. 
  • Collaborate with Global Teams: Work effectively with a distributed team of software developers from diverse cultural backgrounds, ensuring seamless communication and collaboration.
  • Maintain Documentation Standards: Establish and uphold documentation standards across the team, ensuring consistency, accuracy, and clarity in all technical documentation. 
  • Product Team Collaboration: Partner with the product team to make informed decisions that enhance the Orquestra® Platform.
         

Required Knowledge/Skills/Abilities     

  • BS or MS in Computer Science or a related field.
  • Experience running production Kubernetes clusters.
  • Experience with infrastructure provisioning and configuration using ArgoCD, Crossplane and Terraform.
  • Experience managing Kubernetes resources and deployments using tools like kubectl and Helm, and YAML manifests.
  • Experience with cloud providers such as AWS, Azure or GCP.
  • Experience with scalable networking technologies such as load balancers and firewalls.
  • Experience with observability tools like Prometheus and Grafana.  
  • Experience with Go (Golang) and TypeScript to contribute to Orquestra™ Platform development.
  • Scripting language experience (Bash, Python).
  • Good command of spoken and written English.
  • Comfortable with a multi-feature Git environment and related best practices.

 

Remote Job

Job Overview
Job Posted: 5 months ago
Job Type: Full Time
