Lead Site Reliability Engineer
3 years ago

Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.

Site Reliability Engineering (SRE) is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles.

What you'll do

  • Run the production environment by monitoring availability and taking a holistic view of system health

  • Build software and systems to manage platform infrastructure and applications

  • Improve reliability, quality, and time-to-market of our suite of software solutions

  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve

  • Provide primary operational support and engineering for multiple large distributed software applications

  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding

  • Partner with development teams to improve services through rigorous testing and release procedures

  • Participate in system design consulting, platform management, and capacity planning

  • Create sustainable systems and services through automation and uplifts

What experience you need

  • Bachelor's degree in Computer Science, Information Management, or another STEM major

  • 5-7+ years of experience developing and/or administering software

  • 5-7+ years of experience with continuous integration tools (Jenkins, SonarQube, Jira, Nexus, Confluence, Git, Maven, or Gradle)

  • 3+ years of Cloud experience

  • Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives are met

  • Demonstrable cross-functional knowledge of systems, storage, networking, security, and databases

  • System administration skills, including automation and orchestration of Linux/Windows using Chef, Puppet, Ansible, SaltStack, and/or containers (Docker, Kubernetes, etc.)

  • Proficiency with continuous integration and continuous delivery tooling and practices

  • Strong analytical and troubleshooting skills

  • Experience configuring, customizing, and extending monitoring tools (AppDynamics, Apica, Splunk, etc.)

What could set you apart

  • You have expertise in designing, analyzing, and troubleshooting large-scale distributed systems

  • You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive

  • You have experience managing infrastructure as code with tools such as Terraform or a comparable technology

  • You are passionate about automation, with a desire to eliminate toil whenever possible

  • You've built software or maintained systems in a highly secure, regulated or compliant industry

  • You thrive in a DevOps culture and have experience and a passion for working as part of a team

We offer comprehensive compensation and healthcare packages, 401k matching, paid time off, and organizational growth potential through our online learning platform with guided career tracks.

Are you ready to power your possible? Apply today, and get started on a path toward an exciting new career at Equifax, where you can make a difference!

Equifax is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

#LI-KC1 #LI-Hybrid





