Site Reliability Engineer
Job Description
About Remo
Remo is building a new standard of dementia care by fundamentally changing the care journey for individuals living with dementia and their caregivers (the dyad). As a virtual dementia care provider, our expert clinical team designs personalized, comprehensive care to serve people with dementia and caregiver needs (instead of a one-size-fits-all approach). We empower family caregivers by connecting them with a vibrant community of other caregivers, expert content, and tools to manage the entire dementia journey – from anywhere, at any time. Our mission is simple – to provide accessible, comprehensive, quality dementia care for every person who needs it.
About The Role
We are seeking a highly motivated and experienced Site Reliability Engineer to join our fast-paced healthcare startup. You will be instrumental in ensuring the reliability, performance, and scalability of our modern stack infrastructure. The ideal candidate will have extensive hands-on experience with full-stack development, infrastructure management, and a deep understanding of multi-cloud environments. Experience with FHIR and other medical technologies is highly preferred.
What You’ll Be Doing
Ensure the reliability, uptime, and performance of our production systems across multiple cloud environments.
Lead and manage incident response, post-mortem analysis, and continuous improvement initiatives.
Design, implement, and maintain robust monitoring and alerting systems using Datadog.
Develop and automate infrastructure and deployment processes using Terraform and CI/CD pipelines.
Collaborate closely with development teams to optimize code quality, deployment efficiency, and system architecture.
Help improve our top of the line AI tooling in GCP.
Provide hands-on support and troubleshooting for complex production issues, ensuring minimal downtime.
Contribute to the design and implementation of scalable, fault-tolerant, and secure systems.
Participate in code reviews, architectural discussions, and knowledge sharing sessions.
Manage time effectively and drive projects to completion with excellent project management skills.
Stay up-to-date with the latest technologies, best practices, and industry trends.
You May Be a Good Fit If You
Have at least 6 years of relevant experience as a SRE or Full Stack Engineer .
Have extensive hands-on experience with full-stack development using A React Framework, Node, and Typescript.
Have a strong proficiency in Graphql.
Are an expert in multi-cloud environments (AWS, GCP) and infrastructure management.
Have proven experience with incident response, root cause analysis, and problem resolution under pressure.
Have extensive Datadog experience with pipeline, dashboard, and alert experience.
Have hands-on experience with CI/CD pipelines and automation tools.
Are proficient in Infrastructure as Code (IaC) using Terraform.
Are an excellent team player, communicator and have strong problem-solving skills.
Have strong time management and project management abilities.
You’re The Ideal Candidate If You Have:
6+ years of experience in Full Stack Engineering.
Experience with FHIR and other medical data standards.
NextJS Experience.
Python with some Data Engineering experience.
Deep understanding of serverless architectures and AWS Lambdas.
Hasura Cloud and Hasura Enterprise Experience.
Prior experience working in a healthcare startup environment.
Knowledge of containerization and orchestration technologies (Kubernetes, Docker).
Database administration experience (PostgreSQL, MongoDB).
Familiarity with security best practices and compliance requirements.
Github CI/CD Pipeline optimization skills for Docker, NodeJS, and Vercel.
Expertise in observability tools, particularly Datadog, when it comes to implementation.
Technologies:
Kubernetes
Graphql
Hasura
NextJS
NodeJS
Typescript
Python
AWS Lambdas
Datadog
Terraform
CI/CD Pipelines
Medical
• 100% Company-paid medical premiums for you and your dependents with HSA options
• Dental and vision plans (50% company-paid premium on employee’s dental plan)
• Dependent care FSA
Financial
• 100% 401(k) match of up to 4%
• $80 / month stipend for cell and wifi
Time Off
• 20 days of PTO and 11 paid holidays
• 5 days sick leave
• 16 weeks fully paid parental leave for birthing parents and 8 weeks for non-birthing parents
• Bereavement leave and pregnancy loss leave
Opt-In Ancillary Options:
• Short-term and long-term disability insurance
• Life insurance
• Critical illness, accident, and hospital indemnity insurance
• Pet insurance
• Legal advice
• Rightway Health, clinical care navigator
• Employee Assistance Program
Remo aims to reduce health inequities by improving access to affordable, high-quality dementia care. Embracing diversity and equal opportunity are core to that mission--these principles shape our culture, the products we build, and the services we deliver. We celebrate a variety of backgrounds, perspectives, and skills, reflecting the diversity of the caregivers and patients we serve.
We use E-Verify to confirm the identity and employment eligibility of all new hires: Participation Poster (PDF), Right to Work Poster (PDF)
Company Information
Location: Jackson, WY
Type: Hybrid