Posting Title: IoT Cloud Senior Site Reliability Engineer (2 - 4 Years) (Grade 6)
JOB DESCRIPTION
What You'll Do
The Internet of Everything is a phenomenon driving new opportunities for Cisco and it's transforming our customers' businesses worldwide. We are pioneers and have been since the early days of connectivity.
As Cisco delivers the network that powers the Internet, we are connecting the unconnected. Your progressive ideas will affect everything from retail, healthcare, and entertainment to public and private sectors, and far beyond.
With roughly 10 billion connected things in the world now and over 50 billion estimated in the future, your career has exponential possibilities at Cisco.
Who You'll Work With?
Cisco IoT Cloud Infrastructure Engineering team is responsible for the development and operation of a cloud platform powering all Cisco hosted IoT SaaS products. Our team of cloud engineers work to design, develop, secure, automate and operate application infrastructure in a variety of public and private clouds, providing the IoT engineering group a consistent platform to deploy all the IoT SaaS application components.
In this role you will work in an Agile-based work environment as part of an SRE team tasked to design, build, integrate, deploy, and operate cloud services in a production environment, including but not limited to the following responsibilities:
- Maintain and operate tools as well as infrastructure-level services for cloud deployments and operations.
- Deploy and operate 24x7 production environments
- Participate in production support on-call rotation
- Maintain AWS Infrastructure as Code (IaC) components such as VPCs, EC2, S3, RDS, Route 53, KMS etc.
- Maintain, and extend logging stack like EFK
- Maintain cloud-based monitoring, alerting, and reporting CloudWatch, Prometheus, Grafana, PagerDuty
- Provide continual enhancements to maintain our cloud, information security and operation posture.
Who You Are?
- MS/CS or equivalent degree combined with 0 - 2 years of work experience
(or)
BS/CS or equivalent degree combined with 2 - 4 years of work experience
- Experience with operations-based software development, working with a preferred scripting language like Python, and/or Bash.
- Experience using containerization platforms like Docker and Kubernetes
- Experience managing public cloud infrastructure like AWS in a production environment
- Knowledge of standard internet services such as DNS, Load Balancers, HTTP/S, TLS, SAML, SSL etc.
- Experience administering Linux systems
- Working with configuration management tools in Linux and on AWS Terraform, Ansible, CloudFormation
- Expertise on CI/CD and tools like Git, Jenkins are a must.
- Handle seamless upgrades of infrastructure and services through automation
- Experience with logging and monitoring solutions such as Prometheus, Elastic Stack etc.
- Knowledge of cloud computing concepts including virtualization, web service APIs, distributed data storage (database, block, object, file), multi-tenancy, and metered usage patterns
- Familiar with incident and change management standard methodologies.
- Good understanding of backup/restore and disaster recovery standard methodologies
- Familiar with vulnerability management and remediation processes and tools
- Be a self-starter with high attention to details
- Passionate about automation in the areas of infrastructure as code, and configuration as code, that will help reduce toil and help drive improvements towards the manageability, availability, and reliability of the environment.
- Analytical: Able to see gaps and areas of improvement in process as well as technologies, providing recommendations and taking the initiative to fix issues.
Why Cisco?
#WeAreCisco, where each person is unique, but we bring our talents to work as a team and make a difference powering an inclusive future for all.
We embrace digital, and help our customers implement change in their digital businesses. Some may think were old (36 years strong) and only about hardware, but were also a software company. And a security company. We even invented an intuitive network that adapts, predicts, learns and protects. No other company can do what we do you cant put us in a box!
But Digital Transformation is an empty buzz phrase without a culture that allows for innovation, creativity, and yes, even failure (if you learn from it.)
Day to day, we focus on the give and take. We give our best, give our egos a break, and give of ourselves (because giving back is built into our DNA.) We take accountability, bold steps, and take difference to heart. Because without diversity of thought and a dedication to equality for all, there is no moving forward.
So, you have colorful hair? Dont care. Tattoos? Show off your ink. Like polka dots? Thats cool. Pop culture geek? Many of us are. Passion for technology and world changing? Be you, with us!