Site Reliability Engineer
Company: Qumulo Careers
Location: Vancouver
Posted on: January 24, 2025
|
|
Job Description:
About the company:Qumulo is the unstructured data platform to
store and manage exabyte-scale data anywhere - at the edge, in the
core data center and in the cloud. With unstructured data growing
in more locations faster than ever before, enterprises today need a
way to store, manage, and curate data simply and efficiently in any
location, on any platform. This is precisely what Qumulo was
founded to accomplish.At Qumulo, we are building an open and
collaborative culture where people can do their best work with
customers as our magnetic field. We act as owners, we share by
default, we are data driven and experimental and as an inclusive
workplace, we encourage and celebrate multiple points of view. As
part of our culture we believe diversity drives innovation.About
the position:As an SRE at Qumulo, you will help to develop
solutions that help to manage and monitor applications we use
internally and to support our customers. We manage our internal
build and test infrastructure which includes running multiple
builds and hundreds of thousands of tests continuously in both
on-prem environments and on the cloud (such as AWS and Azure Native
Qumulo Scalable File Service [ANQ]). This build and test
environment is a core part of our engineering processes, providing
continuous feedback to our engineering teams and allowing us to
deliver new product releases regularly throughout each year. We
also build and operate managed components of ANQ, delivering a
highly available service to customers and keeping the service up to
date with our latest features.We work across engineering, product
and customer success teams to identify opportunities to improve our
processes and ensure that our existing systems are available and
working as expected. We implement solutions that reduce work
through automation, providing scalable solutions that span our
on-prem and cloud environments. We help manage the operating
expense of running systems across multiple clouds. We help drive
down failures by providing frequent feedback to engineers on their
changes with high quality test analytics.Responsibilities:You will
collaborate with a team that identifies opportunities, plans new
features, and implements solutions. You will work with team members
to build a backlog and deliver solutions iteratively. You will
troubleshoot build and test failures, diagnosing problems that vary
from build time compilation failures to integration test failures
involving both virtual machine instances and Qumulo qualified
hardware. You will implement monitoring to ensure that systems are
working as expected and can raise alerts when problems are
detected.This position does include an on-call rotation which
requires availability to respond to critical incidents impairing
our owned applications.Technologies:
#J-18808-Ljbffr
Keywords: Qumulo Careers, Vancouver , Site Reliability Engineer, Professions , Vancouver, Washington
Click
here to apply!
|