First Citizens Bank hiring for SRE Technology Operations Lead (Raleigh, NC) jobs in Holly Springs, NC, US
Overview:
Come join a growing bank at the heart of the innovation, technology, green tech, and life sciences space. We continue to expand our global footprint, and our banking technology is at the core of everything we do. To continue this technology excellence, the Corporate Technology Operations Group is seeking a skilled professional for the role of Site Reliability Engineering (SRE) Technology Operations Lead.
As a Site Reliability Engineer, you will support applications by implementing SRE culture and ensuring that no SLAs, SLIs, or SLOs are missed for the Corporate Technology department. The ideal candidate is strong-willed and passionate about application uptime, maintaining 99.9% availability by expanding the existing observability platform and using automation where necessary.
Responsibilities:
This is an individual contributor role in which the candidate will:
- Implement the application infrastructure setup and deployment automation for applications in Azure cloud infrastructure as well as legacy infrastructure.
- Collaborate with other Site Reliability Engineers across the Corporate Technology organization and provide 24x7 application support, enabling our clients to have access to highly available, resilient, and performant applications.
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Implement industry standards and apply the definition of done to all existing and new applications.
- Develop solutions in close partnership with team members (development leads and systems analysts).
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement.
- Drive capacity analysis work and a performance engineering mentality.
- Implement and enhance existing observability via platforms such as Splunk and AppDynamics.
- Ensure that all on-premises and cloud applications achieve 99.9% availability.
- Focus on cloud development experience, but also bring forethought into scalability, performance, and reliability.
- Provide product support for our mission-critical applications.
- Improve upon existing processes and find and fill gaps for smooth operational standards.
- Team Leadership - Mentor and train team members. Use expertise to provide guidance, feedback, and direction on complex matters. Assist in communicating performance results and expected behaviors for success.
- Technical Operations - Set objectives for technical operations and implement the action plans necessary to achieve them. Conduct cost-benefit analysis to determine which operational alternatives offer a comparative advantage.
- Business Support - Develop and apply firmware, software, and hardware solutions to meet the company's technical requirements. Ensure the provision of technical equipment and materials required for operations. Act as the primary escalation point for troubleshooting urgent technical issues.
- Technical Expertise - Maintain up-to-date knowledge of Bank technologies, technical processes, functions, requirements, and industry standards. Organize training programs for department staff to enhance their skills and improve their job knowledge. Produce and present periodic reports on technical operations to management.
*This position will be 3 days in the office in Raleigh and 2 days remote.
Qualifications:
Bachelor's Degree and 6 years of experience in Technical work in Application Development, Server Administration, Information Security or Engineering OR High School Diploma or GED and 10 years of experience in Technical work in Application Development, Server Administration, Information Security or Engineering
Preferred Area of Experience: Financial Services Industry
Technical and preferred skills:
An ideal candidate will have a bachelor’s degree in computer science or information technology with 6-10 years of combined experience in Tech Ops Lead, SRE, Tech Project Delivery Lead, and DevOps roles, including performing BAU and application support.
What you bring:
- Hands-on experience in RDBMS architecture and performance tuning for systems such as Oracle and SQL Server.
- Strong organizational skills and Incident and Problem Management skills.
- Technical skills and tools knowledge: ServiceNow, Remedy, Oracle, SQL.
- 3+ years of experience with, and a deep understanding of, service design for cloud (AWS/Azure or similar) using containers, container orchestration (such as Kubernetes), configuration management, and Kafka.
- Operationally Focused: Passionate about monitoring, resiliency, uptime, performance, and automation.
- Performance driven: Passionate about capacity planning, load testing, and stress testing.
- 2+ years of deep experience with AWS (or similar cloud technologies)
- AWS/Azure Certified is a plus.
- Knowledge of scripting languages such as Python or shell is a plus.
- 5+ years of real-world deployment experience in core infrastructure technologies including compute, storage, networking, databases, security, and management.
- Understanding of Microservice design principles, patterns, and best practices
- Direct involvement in numerous end-to-end cloud migration projects.
- Terraform experience as it relates to cloud infrastructure setup.
- Database: RDS, DynamoDB. AWS networking and security groups and their underlying technologies (Route53, VPC, ALB, Security Groups). Knowledge of SQL and NoSQL database administration.
- Understanding of J2EE, RESTful web service design and implementation, Spring Boot, Spring Cloud, Eureka, Spring Security, JSON, YAML, Markdown, WSDL, XML, ANT, Maven, Python
- Working knowledge of application architectures, software development tools, and methodologies.
- Good knowledge of Linux internals and administration
- Automation Driver: Constantly look for automation opportunities using Ansible or similar technologies.
- Nice to have: Network configuration of Firewalls, VPN, Routers/Switches, and Load Balancers
- Nice to have: Knowledge of agile software development practices and release management.
- SRE Experience
- Experience in Incident & Problem Management processes with good exposure to troubleshooting.
- Preferred: knowledge of web development, JEE, and enterprise technologies such as JMS and JDBC.
- Experience suggesting application enhancements to support SLO/SLI/SLA criteria
- Nice to have: Experience setting up SLA/SLO/SLI for applications
- A good understanding of, and 3+ years of experience with, DevSecOps tooling (Jenkins, GitLab) and running CI/CD pipelines.
- Experience with GitLab templates, design, and CI/CD Pipelines.
- Hands-on experience with Git-flow.
- Experience implementing static and dynamic code testing to champion secure code in production.