Job Description
Overview
This is a remote role that may be hired in several markets across the United States.
Come join a growing bank at the heart of the innovation, technology, green tech and life sciences space. We continue to expand our global footprint, and our banking technology is at the core of everything we do.
As a Site Reliability Engineer, you will be responsible for performance, reliability and availability of critical applications for First Citizens Bank.
Responsibilities
Be part of the team that owns the availability, performance and reliability of customer-facing systems
Drive adherence to SLOs through monitoring, alerting, and scaling
Software Development in an Enterprise Java Environment, including experience with Spring Boot and Python for CICD pipelines
Maintain, support and troubleshoot critical, large-scale application and infrastructure deployments
Dive deep into issues and outages to establish root causes and communicate them to your business partners
Aptitude for analyzing and troubleshooting application, operating system, networking, configuration and performance problems
Understanding of Site Reliability Engineering concepts and best practices
Experience executing system deployments (AWS, private cloud, OpenShift)
Design, document, and implement automated procedures
Experience automating system administrative tasks with scripting tools (Python or shell preferred)
Fundamental understanding of Internet networking protocols: TCP/IP, TLS, DNS, HTTP, SMTP
Extensive experience with monitoring and automation tools such as Ansible, Gitlab, Splunk, Grafana, Prometheus, etc.
Experience and Knowledge of Open Telemetry Protocol (OTel)
Be a culture champion for SRE best practices, leveraging the ability to communicate clearly with both technical and non-technical staff
Familiar with system hardening and security best practices
#LI-IK1
Qualifications
Bachelor’s Degree and 4 years of experience in Application Engineering OR High School Diploma or GED and 8 years of experience in Application Engineering
Preferred Area of Experience:
6+ years of experience in Application Engineering/Reliability background
4+ years of experience implementing / following SRE practices
Experience working in a large financial institution (or similar environment in scope and complexity)
Hands-on experience with deploying and maintaining systems in a containerized environment (public or private cloud)
Understand performance and availability requirements and have experience working with Software Engineering teams to define deployment, configuration and monitoring requirements
Ability to create meaningful metrics and alerting for service health monitoring
Reducing manual effort through automation with scripting
Skilled with configuration management and automation frameworks
Proficiency driving Root Cause Analyses to meaningful improvements
Leading troubleshooting efforts with production/non-production systems
Participating as part of a 24×7 on call rotation
This job posting is expected to remain active for 45 days from the initial posting date listed above. If it is necessary to extend this deadline, the posting will remain active as appropriate. Job postings may come down early due to business need or a high volume of applicants.
If hired in North Carolina, the base pay for this position is generally between $107,000 and $145,000. Actual starting base pay will be determined based on skills, experience, location, and other non-discriminatory factors permitted by law. For some roles, total compensation may also include variable incentives, bonuses, benefits, and/or other awards as outlined in the offer of employment.
Benefits are an integral part of total rewards and First Citizens Bank is committed to providing a competitive, thoughtfully designed and quality benefits program to meet the needs of our associates. More information can be found at https://jobs.firstcitizens.com/benefits.