SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.
SITE RELIABILITY ENGINEER (STARLINK)
Want to build the next era of the Internet? Want to develop the infrastructure necessary to develop advanced in-house silicon powering thousands of satellites in space and millions of user devices on Earth to bring high speed broadband to every corner of the world? SpaceX is looking for a Site Reliability Engineer to design, operate and scale the infrastructure we use to run develop ASICs for Starlink, a global internet service provider and the largest satellite constellation on orbit. We have no shortage of exciting problems and challenges. The ideal candidate will be flexible, possess broad skills across product operations and software development, and flourish in a fast-paced and challenging environment.
RESPONSIBILITIES:
Develop, deploy and manage core infrastructure such as physical servers, virtual machines, monitoring and storage
Closely collaborate with engineers to both support day-to-day requests and create long-term solutions
Support high performance computing cluster which includes upgrades, patches, tuning, expanding and troubleshooting jobs and infrastructure errors
Support continuous integration workflows with Bamboo and Jenkins
Support version control systems like Git, Subversion and SOS
Support network license servers and license monitoring
BASIC QUALIFICATIONS:
Bachelor€™s degree in computer science, information systems/IT, or an engineering discipline; OR 2+ years of professional experience in system administration, high performance computing, or site reliability engineering
2+ years of experience with Linux operating systems
Experience in Bash, Python, and/or other scripting languages
PREFERRED SKILLS AND EXPERIENCE:
Experience with high performance computing and workload managers (e.g. Slurm, LSF)
Experience automating Linux system administration, provisioning and configuration with tools like Puppet
Experience with containerization technologies (e.g. Docker, Kubernetes)
Experience with automatically managing dozens servers
Experience with ASIC design flows and tools
Focus on performance bottlenecks and performance improvement techniques
Excellent communications skills with the ability to communicate with customers, peers, management etc. in both formal and informal situations
Ability to quickly learn new tools and frameworks
Familiarity configuring remote desktop technologies (e.g. X, VNC)
Understanding of databases and data modeling
Strong networking knowledge of TCP/IP
ITAR REQUIREMENTS:
To conform to U.S. Government space technology export regulations, including the International Traffic in Arms Regulations (ITAR) you must be a U.S. citizen, lawful permanent resident of the U.S., protected individual as defined by 8 U.S.C. 1324b(a)(3), or eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here.
SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin/ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.
Applicants wishing to view a copy of SpaceX€™s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application/interview process should notify the Human Resources Department at (310) 363-6000.