Job ID: JR0231331
Job Category: Software Engineering
Primary Location: Virtual, AZ US
Other Locations:
Job Type: Experienced Hire
Job Description
Intel's HPC Storage Architecture and Development team is growing. The Software Build and Release Engineer role, will contribute to developing and testing Intel's Exascale storage stack based on the open source Distributed Asynchronous Object Storage (DAOS) software. The end-to-end I/O stack must be optimized to prevent a storage bottleneck as throughput and IOPS requirements increase substantially, and demand will only increase in the coming years as our ability to collect data is ever-increasing. The Intel HPC Storage Architecture and Development team is seeking an experienced DevOps Software Development Engineer to support a software development infrastructure consisting of physical servers, VMs, Omni-Path and Infiniband networks, and applications used for CI.
In this role you will be engaged in the following job duties:
The application of systems analysis techniques and procedures, including consulting with users, to determine hardware, software or system functional specifications;
The design, development, documentation, analysis, creation, testing or modification of computer systems or programs, including prototypes, based on and related to user or system design specifications;
The design, documentation, testing, creation or modification of computer programs related to machine operating systems; or
A combination of the aforementioned duties, the performance of which requires the same level of skills.
Additional responsibilities include but not linted to:
Ensures the efficient planning, provisioning, installation/configuration, maintenance, and/or operations of the hardware and software infrastructure required to build, validate, and release a wide variety of hardware and software products and projects.
Works closely with development and quality teams to derive infrastructure design requirements, build, test, and automate tools appropriate to the project, and/or implements and maintains of those systems within the constraints imposed by Intel enterprise infrastructure (IT) and other governing bodies.
Owns the end-to-end delivery pipeline, including source code management, versioning/tagging strategy, component build and packaging, test automation tooling, release staging, acceptance and/or indicators, required security and IP scans, any third-party conformance tools, artifact storage and distribution, and disaster recovery planning.
Identifies opportunities and implements solutions for increased automation, reliability, and/or velocity within the pipeline through implementation of robust infrastructure telemetry, KPIs, and indicators, and by monitoring and applying industry best practices.
Architect and implement solutions and tools for maintaining and improving the infrastructure, applications, and services required to support the development of distributed storage software.
Develop and test Ansible code to install, configure and upgrade applications and services.
Write bash and python scripts for ad-hoc automation and monitoring.
Install, configure, and support a wide variety of hardware such as servers, switches, and disk storage devices. Involves racking, running cable, configuring server BIOS, etc. in the data center.
Install, configure and support Omni-Path and Infiniband high speed networks (switches, cabling, Fabric Manager)Perform BIOS and firmware upgrades.
Troubleshoot and resolve issues with bare metal provisioning via PXE and disk imaging.
Troubleshoot and resolve issues with KVM VMs, disk imaging, networking, etc.
Work closely with the CI team to quickly troubleshoot and resolve infrastructure issues.
Create and maintain infrastructure documentation, how-to articles.
The Ideal candidate should be willing to work independently, and as part of a distributed team.
Qualifications
You must possess the below minimum qualifications to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates.
Minimum Qualifications
Bachelor's degree in Computer Science or another closely related field. And/or prolonged course of study in a specialized field or equivalent experience.Â
Work experience must include the following:
Strong Linux System Administration experience, preferably with RedHat based distros
TCP IPv4 networking experience
Configuration and troubleshooting experience with SSH, DNS, DHCP, NFS
Git, GitHub and/or GitLab knowledge and experience
Bash and Python scripting
Preferred Qualifications:
Experience working in very large data centers
Support of multiple Linux distros - Fedora, CentOS 8.x, openSUSE 15.x, and Ubuntu 20.04
Installing and configuring Omni-Path and Infiniband networks
KVM/QEMU, libvirt experience
High Performance Computing (HPC) tools - Slurm, Powerman, Conman, ClusterShell, pdsh, Open MPI
Bare metal provisioning via PXE with Cobbler, or via cloud-init
Remote consoles and power management via IMPI
Centralized user authentication with FreeIPA, sssd, autofs
Testing Ansible roles with Molecule
Ansible AWX experience
Installing and configuring Jenkins
Zabbix experience
Nexus and Artifactory
HTTP load balancing using Corosync, Pacemaker, HAProxy
JIRA knowledge and experience for tracking work requests and Confluence for documentation
Inside this Business Group
The focus of Accelerated Computing Systems and Graphics (AXG) is to accelerate our execution in strategic growth areas of high-performance computing and graphics. AXG is chartered with delivering high performance computing and graphics solutions (IP, Software, Systems), for both integrated and discrete segments across client, enterprise and data center. Our mission is to make zeta-scale computing accessible to every human on the planet by the end of this decade and to entertain, educate and connect billions of people with buttery smooth visual experiences.
Intel strongly encourages employees to be vaccinated against COVID-19. Intel aligns to federal, state, and local laws and as a contractor to the U.S. Government is subject to government mandates that may be issued. Intel policies for COVID-19 including guidance about testing and vaccination are subject to change over time.
Posting Statement
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Annual Salary Range for jobs which could be performed in US, Colorado:
$113,500.00-$170,120.00
Benefits:
We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, bonuses, and benefit programs. Find more information about our Amazing Benefits here
Work Model for this Role
This role is available as fully home-based and generally would require you to attend Intel sites only occasionally based on business need.
USExperienced HireJR0231331VirtualAccelerated Computing Systems and Graphics (AXG)