Cloud Operations Manager

Remote
Full Time
Manager/Supervisor
Cloud Operations Manager
Salary Range $120k – $193k/yr
Location: Remote. Hiring only in: CA, TX, and FL


About Work Truck Solutions

Work Truck Solutions' culture combines strong leadership, collaboration, and fun, with incredible growth opportunities for our employees in a fast-paced work environment providing employee engagement, recognition, and development. Our software company is committed to innovation in the rapidly changing commercial vehicle market space. Our vision and culture allow employees to be recognized as thought leaders and thrive in their careers.

In addition to the job responsibilities and requirements, the following are essential to be a successful member of our team:

Curiosity: you seek knowledge, ask questions, and look for answers; you’re proactive and engaged

Perseverance: you hit a delay; you know this is your moment to figure things out and to shine

Innovation: you want to make things better, solve the puzzle, create something new

Flexibility: there’s a new opportunity; you’re ready to flip the script, grow and adapt


Job Overview

The Cloud Operations Manager is responsible for the health, scalability, security, and cost-effectiveness of the organization's Azure cloud infrastructure. This role acts as a modern System Administrator and a team leader, ensuring system availability and managing the complexities of cloud infrastructure, compliance, and disaster recovery. The manager is expected to build, mentor, and lead a high-performing team of Cloud Operations Engineers.

Key Responsibilities

The Cloud Operations Manager's duties span several critical areas:

Cloud Operations Management
  • Infrastructure Management & Provisioning: Oversee all cloud infrastructure and resources, including provisioning, performing regular patch management, and proactive capacity planning.
  • Monitoring & Incident Response: Establish comprehensive system observability and maintain alerting infrastructure; serve as the escalation point for major incidents, drive resolution, and champion thorough Root Cause Analysis (RCA).
  • Security & Compliance: Define and maintain a robust security posture by enforcing Identity & Access Management (IAM), completing security audits, ensuring data encryption, and managing audit logs for regulatory compliance.
  • Cost Optimization: Actively track cloud spend against budgets, direct the team in performing right-sizing and waste elimination, and optimize rates through reserved instances and savings plans (FinOps strategy).
  • Disaster Recovery & Continuity: Direct the implementation and regular testing of comprehensive disaster recovery and business continuity plans, including backup management and maintaining a High Availability (HA) architecture across multiple zones.
  • Automation & Tooling: Guide the team in building and maintaining automation and tooling, implementing Infrastructure as Code (IaC) practices, developing self-service provisioning portals, and scripting repetitive operational tasks.

Leadership and Team Development
  • Team Leadership: Manage a team of Cloud Operations Engineers, including recruitment, performance reviews, professional development, and day-to-day work prioritization.
  • Strategy & Roadmap: Define the strategic vision, roadmap, and operational goals for the Cloud Operations function in alignment with overall business objectives.
  • Process Improvement: Develop and enforce operational procedures, standards, and best practices to ensure reliable and efficient cloud infrastructure management.
  • Cross-Functional Collaboration: Act as the primary liaison between Cloud Operations, Development, and Product teams to ensure alignment on platform needs and release readiness.

Required Skills and Experience
  • Proven experience managing infrastructure on major cloud platforms (AWS, Azure, or GCP), with at least 2 years in a leadership or managerial capacity.
  • Expertise in Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
  • Strong understanding of network security, IAM, and compliance frameworks.
  • Demonstrated ability to reduce cloud costs through FinOps principles.
  • Experience in designing and testing Disaster Recovery and High Availability architectures.
  • Proficiency in scripting languages for operational automation.
  • Familiarity with tools like CloudWatch, Datadog, Jenkins, or similar systems.
  • A focus on system availability as the primary key metric (target uptime 99.99%).
  • Excellent communication, delegation, and personnel management skills.

Benefits:
  • Work on meaningful projects that shape the future of the commercial vehicle industry.
  • Competitive salary.
  • Fully remote Monday-Friday work week.
  • Comprehensive medical, dental, and 401k benefits, with complimentary life insurance.
  • Paid Time Off (PTO) and holidays.
  • Flexible scheduling, subject to manager’s approval.
  • Opportunity to work with a supportive and innovative team.

 
Share

Apply for this position

Required*
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

150
To comply with government Equal Employment Opportunity and/or Affirmative Action reporting regulations, we are requesting (but NOT requiring) that you enter this personal data. This information will not be used in connection with any employment decisions, and will be used solely as permitted by state and federal law. Your voluntary cooperation would be appreciated. Learn more.
Human Check*