Staff Site Reliability Engineer

Globality, Inc. • Palo Alto, California, United States • Full-time

$150,000 - $250,000 per year

Automation • DevOps • Infrastructure • Cloud Technologies • Site Reliability Engineer

Job Description

Globality was founded with a simple yet ambitious goal: to use AI to transform enterprise spending into a smarter, fairer process - creating more efficient and inclusive markets worldwide. Nearly a decade later, our AI-powered platform is redefining procurement as a guided, insight-led experience that’s easier for everyone, open to anyone, and better for business.

At the forefront of enterprise AI, Globality optimizes spend, drives efficiency, and delivers measurable bottom-line impact. Our culture - rooted in Trust, Collaboration, and Innovation - fosters an environment where every individual feels valued and empowered. Bring your expertise, passion, and perspective. Together, we’re shaping the future of enterprise spending.

Role Summary:

As a Staff Site Reliability Engineer, you’ll play a key role in shaping and scaling the technical backbone that powers Globality’s business-critical services. You’ll be hands-on in driving innovation, championing reliability, and ensuring that our infrastructure can scale with the pace of our growth.

We're looking for someone who thrives in complex technical environments, quickly gets up to speed with new technologies, communicates effectively across all levels of the organization, and brings deep expertise in cloud platforms (AWS, GCP, etc.). You should have a passion for building resilient, distributed systems and a strong understanding of the challenges developers face when deploying modern cloud-native applications.

This role is hybrid, with a minimum of four days per week onsite at our Palo Alto HQ.

What you will be doing:

Design and develop automation for provisioning and managing infrastructure at scale
Collaborate with architects to implement foundational technologies, policies, and practices that support a high-velocity, secure, and compliant platform
Lead strategic initiatives and execute long-term roadmaps that span teams and departments
Build services with resilience, scalability, security, and observability at their core
Identify and mitigate security risks, continuously improving the safety of our systems
Foster strong cross-functional relationships to influence platform adoption, reduce operational toil, and stay ahead of upcoming initiatives
Drive a culture of engineering excellence through automation, iteration, and continuous improvement

What we are looking for:

10+ years of experience designing, building, and operating scalable, distributed infrastructure
Strong debugging, problem-solving, and troubleshooting skills
Deep experience with disaster recovery, scale testing, and capacity planning
Well-versed in managing large-scale systems across public, private, or hybrid cloud environments
Skilled at writing clean, maintainable code in languages like Python, Go, Ruby, or Java
Relentless focus on automation and driving operational efficiency
Hands-on experience with tools like Ansible, Spinnaker, Terraform, or similar configuration and delivery platforms
Experience deploying, supporting, and monitoring a variety of services and application stacks
Strong sense of ownership, integrity, and a collaborative mindset

The anticipated annual base salary range for this position is $150,000 - $250,000. Actual compensation may vary based on factors such as experience, skills, and location. This information is provided in accordance with the California Equal Pay Act. We are an equal opportunity employer and participate in the E-Verify program. We are committed to building a diverse and inclusive workplace and do not discriminate on the basis of race, color, gender, sexual orientation, gender identity, religion, national origin, age, disability, or any other protected status.