Senior Site Reliability Engineer - Opportunity for Working Remotely
Paris, France
Posted 1 day ago

Job Description

The Elevator Pitch : Why will you enjoy this new opportunity?

The Reliability Engineering team is growing with a laser focus on bringing reliability engineering practices to our Tanzu customers.

We are a modern SRE team where the original focus on a Site has been augmented by a focus on the Customer; after all, not all workloads are websites.

We are looking for someone who views Reliability Engineering as a way of life and is interested in approaching everything they do with that mindset.

What is the primary need, technical challenge, and / or problem you will be responsible for?

Our customers need a partner to assist with establishing and meeting their reliability goals while building, running, and managing production workloads on VMware Tanzu, in particular Tanzu Kubernetes Grid (TKG).

Conversely, our sales, solutions, support, and product development teams have a partner in us that is suited to understand reliability engineering, our customers’ reliability goals, and associated challenges while building on or using our products and services.

As an engineer on this team, you will :

Exercise your advanced Linux systems and network engineering experience while coaching customers on how to optimally build, run and manage production workloads using Kubernetes-based systems.

Teach customers how to adopt reliability engineering practices such as observability, error budgets, blameless retrospectives, and chaos engineering.

Participate in on-call and interrupt rotations in the escalation path for TKG support and field teams.

Take a balanced approach to reduce operational load, where it’s cost-effective, through software and systems engineering.

Collaborate directly with customers both via video and in writing.

Share your learnings and experiences with your team and others.

Participate within a fully remote and distributed team.

Success in the Role : What are the performance goals over the first 6-12 months you will work toward completing?

Propose, define, and drive assigned projects to completion, being clear when tradeoffs are needed and when deadlines must be adjusted to accommodate higher-priority work.

Influence continuous improvements in our products by providing opinionated input in feature workstreams.

Demonstrate a commitment to the team SLOs.

Participate in training sessions, e.g. enabling support engineers across the globe to work with VMware’s customers on the front line.

Endeavor to complete the Certified Kubernetes Administrator (CKA) exam or similar.

What type of work will you be doing? What assignments, requirements, or skills will you be performing on a regular basis?

Contribute to reliability-related improvements and reliability features as part of VMware Tanzu services and the upstream Kubernetes ecosystem.

Partner with Program Managers to ensure appropriate technical details are understood and tracked, so that each time we work with partner teams, we bring depth and context.

Use our ticketing system to triage, prioritize, and engage with partner teams when they need our assistance.

What is leadership like for this role? What is the structure and culture of the team like?

Our culture is rooted in inclusion, collaboration, and growth. We are a tight-knit group of adaptive, self-starting, and mission-driven individuals with a passion for doing our best work and bringing our best selves.

We make sure to invest time to consider each other's perspectives and to appreciate our efforts and accomplishments. For real, we have an appreciation section during our team meeting! We also share what we have learned with each other, and we care for each other's well-being.

The hiring manager for this role is Gustavo Franco, Senior Engineering Manager. Gustavo recently joined VMware after managing CRE, GCP, Incident Response, Disaster Recovery, and other SRE teams at Google for six years.

Prior to this, Gustavo was an individual contributor SRE at Google for six years on Google Compute Engine, Google Cloud Storage, Google+, and other products.

Category : Engineering and Technology

Subcategory : Site Reliability

Experience : Manager and Professional

Full Time / Part Time : Full Time

Posted Date : 2021-05-19
