Job Description
What You’ll Do
Want to build something new? Join us as one of the first members of our SRE team and help shape its future. You’ll define processes, set best practices, and ensure scalability and reliability across our infrastructure. Working closely with R&D, you’ll design and implement solutions to support the high-throughput needs of our platform. You’ll have the opportunity to enhance the reliability and performance of our systems and make a significant impact as we continue to grow.
Responsibilities
Own the production infrastructure on AWS and Azure. Implement sustainable and scalable solutions with the goal of improving availability and performance
Help identify the root cause of every incident and prevent incidents from recurring
Alert on symptoms rather than on outages. Ensure every infrastructure and application alert is actionable or triggers self-healing automation
Work closely with R&D and Support, offering education and guidance on integration, support, and monitoring across the toolset
Everything-as-code approach: run our infrastructure with Ansible, Terraform, and Kubernetes
Document every action, turn it into a repeatable procedure, and then automate it
Focus on the system’s observability, availability, reliability, performance/latency, and monitoring
Take part in periodic on-call duty and emergency response
Required Skills
3+ years of experience as a DevOps engineer or SRE in a SaaS environment
Experience with coding languages – Python/JavaScript/Bash, or similar
3+ years of experience with alerting and monitoring systems such as DataDog / Splunk / New Relic / Prometheus, or similar
Experience working with Linux systems from kernel to shell and beyond
Experience with cloud platforms such as AWS / Google Cloud / Azure
Experience with configuration management tools such as Ansible/Chef/Puppet
Experience with Docker, Kubernetes and Helm
SCM – Git/Bitbucket/GitLab/Phabricator/Gerrit
Strong analytical and troubleshooting skills – ability to solve complex problems
Strong verbal and written communication skills and a collaborative mindset
Ability to dive into detail while understanding the big picture
Nice-to-have
Extensive DataDog experience, monitoring dashboard expert
Participation in Kubernetes migration projects
Previous experience as a C++ or Node.js developer
BSc in Computer Science or related technical certifications
Previous experience with cryptocurrencies or blockchains is a big advantage
For employees hired to work from our NYC HQ, Fireblocks is required by law to include a reasonable estimate of the compensation range for this role. This range is specific to New York City, and takes into consideration a wide range of factors that are reviewed when making a hiring decision, such as years of experience, skills, and other business needs. It is not typical for a candidate to be hired at or near the top of the pay range and each compensation decision is dependent on each individual case. A reasonable base salary range estimate for this position is $132,000 to $174,000. The base salary is one component of the total compensation package, which for some roles may include a target bonus, a very competitive equity grant, and very generous benefits.
While we believe competitive compensation is a critical part of your decision to join us, we hope you will also spend time considering why our mission and culture are right for you. We are creating something transformational here, and we hope you are as excited about the future as we are.