Site Reliability Engineer

East Bay

Posted: 02/25/2019 Employment Type: Direct Hire Industry: IT Job Number: JOS000008155

Job description 

This company is building next-generation cloud infrastructure with a specific focus on enterprise workloads. We are looking for a seasoned systems automation and operations expert to make an impact on systems infrastructure management initiatives in a high-volume, fast-scaling environment.


- You will be responsible for the systems deployment, operations, and monitoring for our infrastructure, including design and development of infrastructure automation. 

- You will drive the reliability and supportability of the cloud service by creating a knowledge base and working closely with DevOps and the rest of the Engineering team.

- You will apply your advanced system architecture and administration skills, collaborating with engineering, product management, test, and automation teams to architect and develop strategic and tactical solutions.

- Create both high-level and detailed technical design documents for development of infrastructure automation 

- Design and improve automation and tools for systems management, including development of scripts and building tools to help engineering service owners 

- Dig into issues across hardware, drivers, the Linux kernel, and the full software stack to troubleshoot problems and improve system performance

Candidate background 

- 6+ years of experience in design and development of Datacenter and Cloud infrastructure automation 

- Prior successful experience working in an innovative, fast-paced startup with a high rate of flux. The candidate must demonstrate strong entrepreneurial spirit and vigor.

- Demonstrated proficiency in creating detailed technical design documents, facilitating design reviews, and executing design implementation projects.

- BS/MS degree in Computer Science or equivalent experience 

- Must be an excellent verbal and written communicator 

- Deep technical roots in data center technologies: 

- Understanding of data center networking topologies and commonly deployed architectures

- Proficiency in Python, Perl, or Ruby for operations scripts and text processing (Python strongly preferred) as well as automation tools like Ansible, Chef or Puppet 

- Experience with virtualization technologies, in particular the VMware product suite (vCenter, ESXi, vSAN, NSX), is required

- Experience with networking concepts (Layer 2/3, load balancers, VPN, network virtualization, BGP, OSPF) is a strong plus

- Understanding of DCIM tools is a strong plus.

- Experience with at least one cloud management platform such as VMware vCloud Air/VIO/VCF is a big plus 

- Understanding of DevOps practices for continuous development and delivery

- Understanding of Public Cloud infrastructure (AWS, Azure, Google Cloud) is a strong plus. 
