REMOTE: Senior Site Reliability Engineer

D340 - Platform and Infrastructure · Portland, Oregon
Department D340 - Platform and Infrastructure
Employment Type Full-Time
Minimum Experience Experienced

What drives you? Working on something you believe in? At Discogs, we believe in sharing music with the world. Our technical teams develop a product that opens the doors to music discovery.


This is a Remote position. Open to candidates in or willing to relocate to OR, WA, CA, ID, UT, AZ, TX USA.

Who We Are

Discogs is the largest crowd-sourced, community-driven database of recorded music information in the world. Every day, millions of people use the Discogs Marketplace to connect, learn about music, and buy and sell vinyl records, CDs, and cassettes. As Discogs continues to grow, we are looking for bright, dedicated, creative, and highly motivated people to help us realize our mission to serve the music fan in everyone. We are relatively small, so individual contributions can have a large impact. High value is placed on quality, critical thinking, and continuous improvement. Our teams work collaboratively but are distributed geographically and open-source tools are important to who we are and how we work. We value the experiences and skills each team member contributes to helping us serve our music community.

Who We’re Looking For

The Senior Site Reliability Engineer has wide latitude to automate and improve service reliability.  The role is also responsible for diagnosing, investigating, and resolving service issues.  The role will help teams build and adopt Service Level Objectives (SLOs) in order to broadly improve service reliability at the company.  They will also have a hand in supporting and improving both our technical infrastructure and the platform services built on it.

What You’ll Accomplish

  • Maintain organization cloud presence in AWS
  • Automate and deploy infrastructure configurations using Infrastructure as Code (IAC)
  • Implement monitoring, as well as metric and log collection
  • Assist other teams with capacity planning and infrastructure budgeting
  • Participate in evening/night and weekend on-call rotation
  • Continuously improve infrastructure by automating away repetitive tasks and toil
  • Build out functionality of the Discogs Platform used by other engineering teams
  • Debug issues in application code and related services
  • Demonstrate a consistent commitment to Discogs core values and operating principles.
  • Work with your team on planning and completing department goals that align with overall business objectives.
  • Be an effective communicator by listening carefully, asking questions, and being transparent, timely, and diplomatic across all levels of the organization.
  • Stay informed on what is happening within the business and help others understand business decisions and the company direction by positively representing the company view.
  • Provide technical knowledge, coach and mentor others in the department and company.
  • Stay informed on new technologies or processes within your specialization and implement them when necessary. 
  • Use analytic skills to communicate and drive decisions for your team based on available data.
  • In partnership with your manager, start to plan, evaluate, and improve the efficiency of your department to enhance speed, quality, efficiency, and output.

What You’ll Contribute

Must Have

  • 5+ years software development experience
  • 3 years of experience with AWS
  • 3 years of experience using Terraform to manage AWS resources
  • 2 years of experience with Kubernetes (EKS preferred)
  • 2 years of experience with a scripting language (e.g., Python, Bash)
  • 1 year of experience with cloud network configuration
  • 1 year of experience configuring CI/CD pipelines
  • 1 year of experience supporting 24/7 web applications
  • Experience configuring monitoring and alerting
  • Experience with Kubernetes configuration tools like Helm and Kustomize

Nice to Have

  • Experience with systems programming languages like Rust or Go
  • Experience implementing observability through code instrumentation
  • Experience with GitOps (e.g., Argo CD, Flux)
  • Experience migrating applications from an on-prem environment to the cloud
  • Experience with Change Data Capture and Kafka
  • Bachelor's degree in computer science or related field

What We Provide

  • Competitive compensation: salary, plus performance-related bonus program
  • 401(k) with employer match
  • 100% company-paid medical and dental insurance benefits for you and your dependents
  • 4 weeks paid vacation, increasing based on tenure
  • 18 weeks paid leave for birth moms
  • 8 weeks paid parental leave, including for adoption
  • Monthly wellness allowance
  • Annual professional and personal development allowance
  • Work from home office set-up and expense allowances 
  • Flexible work location opportunities 
  • Employer matching toward charitable contributions

What We Believe In

Discogs’ mission is to serve the music fan in everyone.  We represent a diverse and inclusive community, and we are committed to serving our community with innovative and creative solutions. We know that innovation happens best when varying perspectives are embraced and integrated. Our global team reflects our global community.

Discogs is an Equal Opportunity Employer.

Applicants needing accommodation to apply should contact us at 503-597-6340

If you apply for this role, you will be required to upload a resume, cover letter, and fill out a few questions regarding your application. Once submitted, our hiring team will review your application and contact you if you are selected for an interview. Whether you are successful or not, we will store your application and data in our system for a maximum period of one year from the application date in case another role becomes available that you are suitable for. If you have any questions or concerns about us storing this data and/or the period of time, please contact us at and we will respond to you within 30 days.

Thank You

Your application was submitted successfully.

  • Location
    Portland, Oregon
  • Department
    D340 - Platform and Infrastructure
  • Employment Type
  • Minimum Experience