Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.Site Reliability Engineering (SRE) is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. What you'll do
Work with teams across an organization and ensure core services reliability and keep an eye on capacity and performance.
Responsible for blameless postmortems and proactive identification of potential outages factor into iterative improvement
Participate in release cycles of our offerings, deploying code to integration, staging and production environments, integrating with continuous integration (CI) and continuous delivery (CD) tools, monitoring, and change management
Build Automation Work with Agile development teams to ensure smooth promotion of code, configuration and Docker images to production
Manage CI/CD pipelines ensure a high level of automation
You will maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health
You will automate system scalability and continually work to improve system resiliency, performance and efficiency
You will practice sustainable incident response as part of an on-call rotation and through blameless postmortems
What experience you need
Bachelor's Degree in Computer Science or related field, Information Management or in "STEM" Majors
3+ years of experience developing and/or administering software in public cloud
Experience with configuring, customizing, and extending monitoring tools (Appdynamics, Apica, Splunk etc.)
3+ years' experience in continuous integration tools (Jenkins, SonarQube, JIRA, Nexus, Confluence, GIT, Maven or Gradle)
2+ years experience working on containers (Docker, Kubernetes, etc.) and other related applications.
Experience working with Nginx, Tomcat, Redis, ElasticSearch etc.
Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives.
Experience in languages such as Python, Ruby, Bash, Java, Go, Perl, JavaScript and/or node.js is a plus
Demonstrable cross-functional knowledge with systems, storage, networking, security and databases
Proficiency with continuous integration and continuous delivery tooling and practices
Strong analytical and troubleshooting skills
What could set you apart
You've built software or maintained systems in a highly secure, regulated or compliant industry
You thrive in and have experience and passion for working within a DevOps culture and as part of a team
Hands on experience Configuring and Administering SCM(GIT), Build (CMake, Make files, Maven), Nexus, CI(Jenkins), CD Automation Tool
Experience with large scale cluster management systems (Mesos, Kubernetes)
Experience with Docker-based containers is a plus
Able to dive into any level of a modern internet service (schedulers, containers, Linux kernel, caching, object storage, distributed file systems, RDBMS, NoSQL, etc.)
We offer comprehensive compensation and healthcare packages, 401k matching, paid time off, and organizational growth potential through our online learning platform with guided career tracks.Are you ready to power your possible? Apply today, and get started on a path toward an exciting new career at Equifax, where you can make a difference!Equifax is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.