Senior Site Reliability Engineer

Mimecast Services Limited
Published
February 13, 2020
Location
Lexington, MA
Job Type
 

Description

Senior Site Reliability Engineer

US-MA-Lexington

Title: Senior Site Reliability Engineer

Job ID: 2020-4750
Type: Permanent Full Time
# of Openings: 1
Category: Engineering and R&D
Lexington, Massachusetts

Overview

The Messaging Services team runs and maintains the core services that accept and process all our customers e-mail. Providing software that is able to protect against phishing and impersonation attacks, right through ensuring attachments are safe, and preventing data leakage. If your software doesn't work, then customers don't get email - it's that simple.

We are looking for a site reliability engineer (Public Sector Program*) to help us develop the operational aspects of both e-mail delivery and processing aspects of our platform, implementing new ideas to ensure we protect our customers from outside attacks, and reputation damage from within. The team works broadly across a number of different areas, including:

  • Understanding e-mail and how to parse one
  • Content scanning
  • URL and domain security
  • Core performance in a highly scalable java application

Ideal candidates will share a passion for tackling complexity with simplicity and building platforms that can scale massively and are not afraid of exploring unconventional solutions. We want people who are going to complement the team, and bring new perspectives on our problems, or bring valuable prior experience; the right attitude to our challenges is key.

Responsibilities

What will this role accomplish?

  • Building and supporting analysis tools for our Cloud-security products.
  • Participating in development of new services within the Mimecast platform.
  • Developing, deploying and managing software across the full Continuous Delivery life-cycle.
  • Supporting and trouble-shooting, occasionally outside of regular office hours when needed.
  • Working closely with developers to optimise the e-mail delivery and processing systems, and make them operationally fit for purpose
  • Bring an operational perspective to the work of the team, suggesting ways to help the systems scale further, better and more easily.
  • Plan and negotiate with peers to meet deadlines.
  • Encourage others to perform at their best embracing 'best practice'.
  • Manage your own time and priorities, without the need for micro-management.

In 30/60/90 days, 6 mos., what will define this hire as successful:

  • 3 months - Becomes an effective member of the team, adding new functionality to existing services
  • 6 months - taking ownership of individual services, and setting technical direction
  • Ongoing - derives satisfaction of knowing that you're protecting and providing critical services to Mimecast customers 24/7

Qualifications

Hands on experience automating the deployment and monitoring of servers at a large scale
Hands on experience in configuration management of server farms (using tools such as mcollective, Puppet, Chef, Ansible etc).
Hands on experience with open source monitoring and graphing solutions such as Graphite, Grafana, Nagios, Zabbix, Sensu, Prometheus etc.
In depth expertise in Linux internals and system administration including configuration and troubleshooting.
Hands on experience with performance tuning of Linux OS (CentOS) in identifying bottlenecks such as disk I/O, memory, CPU and network issues.
Extensive experience with at least one scripting language apart from BASH (Ruby, Perl, Python).
Ability to analyze network behaviour, performance and application issues using standard tools.
Knowledge developing and debugging distributed applications
Understanding of continuous deployment using CI/CD tools like Jenkins, SCM (Git, SVN) along with code reviews
Knowledge of modern RESTful services.

Nice to have:
Extensive knowledge of email messaging and security, not just sending using SMTP
Administering Java applications (tuning, GC, memory)
Strong understanding of TCP/IP networking, including familiarity with concepts such as OSI stack
Experience of working to deliver a SaaS solution
Knowledge developing and debugging distributed applications
Experience of high throughput, low latency systems
Experience with application containerization technologies, e.g. Docker and Kubernetes
PI117994259

Apply
Drop files here browse files ...

Related Jobs

Senior Site Reliability Engineer   Lexington, MA new
February 21, 2020