Senior Site Reliability Engineer Job at Favor, Texas

  • Favor
  • Texas

Job Description

Favor’s Engineering team is responsible for the complex systems that make high-touch logistics happen in real time. This includes finding the perfect Runner (that’s what we call Favor delivery drivers), managing the communication between customers and Runners, keeping thousands of mobile applications in sync, and more. We are looking for a Senior Site Reliability Engineer to drive our cloud and configuration management and build, deploy, and monitor platforms. 

As a Senior Site Reliability Engineer, your job is to apply our company goals to our technology. Along with a team of other motivated engineers, your job will be to ensure world-class performance, efficiency, change management, monitoring, capacity planning, and emergency response capabilities. Your ultimate goal is to engineer reliable and performant solutions, increase system observability, minimize human interactions with production systems, accelerate customer value delivery, and communicate those best practices to others.

You will work closely with Engineering, Quality, Data, and Product Engineering teams to help define how we deploy and operate our products at scale. You must be a self-starter who thrives in a fast-paced, agile environment, shows an eagerness to learn, and introduces new technologies as the need arises. Most importantly, we need a technical leader who can prioritize, multi-task, and deliver scalable and reliable solutions.

What You'll Do
  • Assist in service disruption troubleshooting, remediation, and documentation
  • Attend Operational Review and Incident Review meetings
  • Maintain monitoring and alerting systems for Favor’s production services, including implementing and adjusting Service Level Objectives
  • Monitor the performance of production systems, recommend performance improvements, and assist in implementing them, including conducting and documenting Failure Mode and Effects Analyses (FMEA)
  • Automate operational toil and service recovery
  • Improve and iterate upon team processes
  • Provide mentorship to team members and developers
  • Engage and nurture development teams to be capable of maintaining services once they are live by measuring and monitoring availability, latency, and overall system health
  • Share an on-call rotation and be an escalation contact for service incidents

Skills You Have
  • 4+ years of Site Reliability experience with a recent focus on Kubernetes infrastructure
  • 4+ years of experience working with microservices and Service-Oriented Architectures (SOA) 
  • 4+ years of AWS experience 
  • 4+ years of experience in logging, metrics, monitoring, and alerting, preferably with tools such as OpsGenie, CloudWatch, Grafana
  • Expert understanding of Git and knowledge of coding patterns and their applicable uses to write secure, performant, testable code
  • Ability to design, deploy, and maintain production-scale distributed systems
  • Experience with automation/configuration management (Terraform, CloudFormation, CDK)
  • An understanding of system optimization issues

Who You Are
  • You understand lean and agile principles of software development and help uplevel the entire Engineering team in these areas
  • You are an expert at defining and communicating technical solutions and strategies
  • You are a force multiplier who can move an Engineering team forward through direct contributions and influence
  • You enjoy working with other engineers in a collaborative and iterative environment 
  • You have experience scaling systems and teams in a high-growth startup/medium-size company 
  • You communicate well with technical and non-technical stakeholders
  • You are comfortable working in a Linux/Unix environment
  • You are detail-oriented, with an organized thought process and the ability to act decisively under stressful conditions
  • You work well with others to solve problems
  • You have a self-motivated work process and excellent communication skills that allow you to identify areas for improvement and work with the appropriate team members to resolve them
  • You are a true full-stack engineer who can navigate and advise in all areas of the software lifecycle, including design, development, deployment, debugging, monitoring, and support

Life at Favor

Where you'll work: This role can be hybrid or remote, depending on the team member’s location in Texas. If you live in Austin, Texas, we ask that you work from home roughly three days per week and work at our HQ for the remaining work days. If you live in a different city in Texas, you will primarily work from home, with the opportunity to travel to Austin for company-wide events. No matter where you work best, we foster an inclusive and flexible environment to support our workforce.

Benefits: We offer premium health, vision, dental, and life insurance, alongside 401(k) options. We go beyond the basics, while also throwing in Favor delivery fee credit and H-E-B discounts! 

Paid time off (PTO): We offer unlimited PTO for salaried employees (that’s actually unlimited) and ample vacation time to all team members.

Learning and development: We encourage personal growth and education through Intern(al)ships and Learning Labs taught by Favor team members and external facilitators.

Community: Whether you’re an avid cyclist, dog lover, or Magic enthusiast, there’s a group for you here. We foster community through Employee Resource Groups (ERGs), company-wide events, happy hours, and regular connection opportunities.

Diversity, equity, and inclusion: At Favor, we believe that to be the best delivery app in Texas, we need to represent all Texans. We are committed to growing a team with different backgrounds, experiences, abilities, and perspectives, and we are an equal opportunity employer. We review all resumes and qualifications with an open mind and encourage you to apply if this role interests you!

In addition, as a candidate, if you require any accommodations throughout the recruitment process, simply let your recruiter know! Our talent acquisition team will work with you directly to ensure a smooth and delightful process.
