Back to Jobs

Staff Site Reliability Engineer

Globality, Inc. Palo Alto, California, United States Full-time
$150,000
per year

Job Description

Globality was founded with a simple yet ambitious goal: to use AI to transform enterprise spending into a smarter, fairer process - creating more efficient and inclusive markets worldwide. Nearly a decade later, our AI-powered platform is redefining procurement as a guided, insight-led experience that’s easier for everyone, open to anyone, and better for business. 

At the forefront of enterprise AI, Globality optimizes spend, drives efficiency, and delivers measurable bottom-line impact. Our culture - rooted in Trust, Collaboration, and Innovation - fosters an environment where every individual feels valued and empowered. Bring your expertise, passion, and perspective. Together, we’re shaping the future of enterprise spending. 

Role Summary:

As a Staff Site Reliability Engineer, you’ll play a key role in shaping and scaling the technical backbone that powers Globality’s business-critical services. You’ll be hands-on in driving innovation, championing reliability, and ensuring that our infrastructure can scale with the pace of our growth.
 
We're looking for someone who thrives in complex technical environments, quickly gets up to speed with new technologies, communicates effectively across all levels of the organization, and brings deep expertise in cloud platforms (AWS, GCP, etc.). You should have a passion for building resilient, distributed systems and a strong understanding of the challenges developers face when deploying modern cloud-native applications.
 
This role is hybrid, with a minimum of four days per week onsite in our Palo Alto HQ. 

What you will be doing:

  • Design and develop automation for provisioning and managing infrastructure at scale
  • Collaborate with architects to implement foundational technologies, policies, and practices that support a high-velocity, secure, and compliant platform
  • Lead strategic initiatives and execute long-term roadmaps that span teams and departments
  • Build services with resilience, scalability, security, and observability at their core
  • Identify and mitigate security risks, continuously improving the safety of our systems
  • Foster strong cross-functional relationships to influence platform adoption, reduce operational toil, and stay ahead of upcoming initiatives
  • Drive a culture of engineering excellence through automation, iteration, and continuous improvement

What we are looking for:

  • 10+ years of experience designing, building, and operating scalable, distributed infrastructure
  • Strong debugging, problem-solving, and troubleshooting skills
  • Deep experience with disaster recovery, scale testing, and capacity planning
  • Well-versed in managing large-scale systems across public, private, or hybrid cloud environments
  • Excellent in writing clean, maintainable code in languages like Python, Go, Ruby, or Java
  • Relentless focus on automation and driving operational efficiency
  • Hands-on experience with tools like Ansible, Spinnaker, Terraform, or similar configuration and delivery platforms
  • Knowledge and experience deploying, supporting, and monitoring a variety of services and application stacks
  • Strong ownership desire, integrity, and a collaborative mindset

The anticipated annual base salary range for this position is $150,000 - $250,000. Actual compensation may vary based on factors such as experience, skills, and location. This information is provided in accordance with the California Equal Pay Act. We are an equal opportunity employer and participate in the E-Verify program. We are committed to building a diverse and inclusive workplace and do not discriminate on the basis of race, color, gender, sexual orientation, gender identity, religion, national origin, age, disability, or any other protected status. 

Company Information

Location: Not specified

Type: Not specified