Fevrok logo
SRE | GSP Data Platforms
3 years ago

**Citi is seeking a highly motivated candidate as a reliability engineer for Global Spread Product Data platform team. The candidate will help to expand observability over the platform stack and be responsible for ensuring stability of critical platform messaging (e.g. Kafka) and storage (e.g. HBase, Elastic) services. The candidate would a dual responsibility of responding to production incidents that cannot be handled by L2 support and working on preventative measures such as improved observability, setting up rigorous performance testing in lower environments or designing and conducting chaos style exercises. The candidate should have experience in handling high volume data and distributed systems. The candidate handles complex problems independently and demonstrates analytical thinking. Finally, the candidate is expecting to be familiar with reading code and be able to delve into open source products and understand complex issues not covered by any documentation.**


**Key Responsibilities:**


**Works close with L2 support and application teams to debug and resolve incidents relating to the platform**


**Conducts root cause analysis of thorny issues with the full platform development team and prioritizes stability book of work**


**Develops and setups telemetry as well as automates solutions for operational challenges to stability (such as resource exhaustion problems, bad node recovery etc)**


**Designs and conducts stability exercises in production and lower environment including single node recovery up to whole data center fail over**


**Helps platform users understand capability and advices on solution design that leverages platform services effectively**


**Keeps up with industry best practices around reliability and observability and brings this back to the whole platform team**


**Participate actively in platform architecture discussions particularly with focus on reliability & supportability considerations**


**Participates in Sprint Planning, Tasking and Estimation of the assigned work**


**May occasionally work a non-standard shift including nights and/or weekends**


**Required Skills / Experience:**


**Minimum 5 years of hands on experience in building an enterprise scale distributed application using Core Java**


**Minimum 2 years of hands on experience with messaging technology such as Kafka, JMS/EMS, Solace or similar**


**Experience working with modern observability stack such as Splunk, ELK, Grafana, Prometheus, Promtail and solving thorny latency and throughput requirements**


**Experience in scripting in Python, shell or equvialent**


**Experience in OpenShift or Kubernetes is a plus**


**Experience with distributed database technologies and caches like Cassandra, HBase, Apache ignite or Hazelcast is a plus**


**Experience working in a Continuous Integration and Continuous Delivery environment and familiar with Jenkins, TeamCity, Code Quality Tools - SonarQube, etc.**


**Experience with streaming technologies like Apache Flink is a plus**


**Experienced in RDBMS and SQL/PLSQL is a plus**


**Understanding of the SDLC lifecycle for Agile methodologies**


**Excellent written and oral communication skills**


**Citi Canada is an equal opportunity employer. Accordingly, we will make accommodations to respond to the needs of people with disabilities (including, without limitation, physical and mental health disabilities) during the recruitment process and otherwise in accordance with law. Individuals who view themselves as Aboriginals, members of visible minority or racialized communities, and people with disabilities are encouraged to apply.**


-------------------------------------------------


**Job Family Group:**


Technology

-------------------------------------------------


**Job Family:**


Applications Development

------------------------------------------------------


**Time Type:**


Full time

------------------------------------------------------


Citi is an equal opportunity and affirmative action employer.


Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.


Citigroup Inc. and its subsidiaries ("Citi) invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review **Accessibility at Citi (https://www.citigroup.com/citi/accessibility/application-accessibility.htm)** .


View the "EEO is the Law (https://www.dol.gov/sites/dolgov/files/ofccp/regs/compliance/posters/pdf/eeopost.pdf) " poster. View the EEO is the Law Supplement (https://www.dol.gov/sites/dolgov/files/ofccp/regs/compliance/posters/pdf/OFCCP\_EEO\_Supplement\_Final\_JRF\_QA\_508c.pdf) .


View the EEO Policy Statement (http://citi.com/citi/diversity/assets/pdf/eeo\_aa\_policy.pdf) .


View the Pay Transparency Posting (https://www.dol.gov/sites/dolgov/files/ofccp/pdf/pay-transp\_%20English\_formattedESQA508c.pdf)
Citi is an equal opportunity and affirmative action employer.
Minority/Female/Veteran/Individuals with Disabilities/Sexual Orientation/Gender Identity.

©2025 Fevrok. All Rights Reserved.