Senior DevOps / Site Reliability Engineer

Job description

AuditBoard ranked #3 on Deloitte’s 2019 Technology Fast 500 list with a growth rate of 16,882%.

 

Who We Are

AuditBoard is a high-growth SaaS company in the financial technology space that is transforming the way organizations manage critical risk, audit and compliance initiatives. We believe in empowering enterprises to manage and control risk so that their businesses are able to thrive.

Designed by former chief audit executives, our enterprise cloud platform is purpose-built to automate and streamline activities in ways that align with how our thousands of users think and act daily. Clients range from pre-IPO organizations to Fortune 5,000 companies, including leading organizations such as Lululemon Athletica, WeWork, Activision Publishing, Lionsgate Entertainment Corp., TripAdvisor, Arthur J. Gallagher & Co., Intel, and Snap, among many others.

Who We Are Looking For

We’re looking for a highly motivated Senior DevOps/Site Reliability Engineer with experience designing and automating highly available and secure cloud infrastructure. We have a stateless containerized application architecture built on Kubernetes, hosted on multiple cloud platforms (AWS and Azure) and maintained via automation. In this role, you will be a close collaborator with all of our software engineers and a part of the software-development lifecycle. Our ideal candidate is an experienced software engineer who is willing to cross layer boundaries and dive deep to resolve production issues across our technology stack. This candidate also enjoys building scalable automation for reliable and secure web services.

AuditBoard is growing at an exceptional rate, and we’re a hard-working, energetic team that is passionate about our customers and believes that to be successful we should never stop learning. This candidate will be working as part of an extraordinary team that operates cloud infrastructure driving SaaS services and reliability for many of the biggest companies in the world.

Responsibilities

  • Maintain reliability for our production systems to exceed our SLA requirements, part of which is being in an on-call rotation for production issues

  • Understand system limitations and build observability tools to maintain infrastructure reliability and/or alert on potential production issues

  • Lead and architect cloud infrastructure to enable high-performance SaaS applications in the cloud. This calls for software design and development experience in cloud service orchestration (API-based control plane) with an emphasis on “infrastructure-as-a-service”

  • Having the flexibility to learn new technologies while continuously developing your skills will be key to your success. You will fit into our teams, be a fantastic collaborator, be comfortable giving and receiving feedback, and be able to thrive in a dynamic environment

  • Continue to grow automation for infrastructure provisioning, developer efficiency, and internal tooling efficiency

If this is you, we'd love to hear from you.

Requirements

  • 5+ years of related experience

  • Strong software development background with strong expertise in at least one programming language

  • Experience with Docker and running container orchestration systems (Kubernetes preferred)

  • Experience working with cloud service providers (AWS or Azure preferred)

  • Experience with Infrastructure as Code and other cloud automation tools (HashiCorp suite)

  • Experience building observability tooling, dashboards, and alerting for application performance and stability

  • Familiarity with building fully automated CI/CD pipelines

  • Understanding of monitoring, networking, and security (strong in one or more of these areas)

  • Strong familiarity with Linux

  • Participation in an on-call rotation for production-related issues to uphold our SLA with clients

Preferred:

  • Experience managing high-availability, multi-AZ, and multi-region environments

  • Experience architecting and building internal developer tooling

  • Experience working with and maintaining observability tooling such as Prometheus/Grafana

  • Experience working with distributed systems or microservice architectures

  • Up to date and active in the open source community

Why You’ll Love Life at AuditBoard

  • You’ll be launching a career at a well-funded, hyper-growth SaaS tech company

  • Free daily catered lunches

  • Stock options

  • Unlimited snacks and beverages

  • Free gym membership

  • Medical, dental, and vision coverage for full-time employees

  • 3 weeks of paid time off and 10 holidays per year

  • 401k to save for your future

  • Fun company and team outings - Work Hard Play Hard!