Skip to main content

Senior Site Reliability Engineer

posted by: spj_bot

We are looking for an experienced and motivated Senior Site Reliability Engineer (SRE) to join our team. In this role, you will be responsible for the reliability, scalability, performance, and stability of our systems and applications. You will work closely with cross-functional teams to automate processes, improve infrastructure, and support continuous product delivery.

Requirements

- Minimum of 3 years of experience in a similar SRE role;

- Strong proficiency in monitoring, logging, alerting, cloud, platform, OS, CI/CD, repostorage, and management tools;

- Solid understanding of DevOps principles and practices;

- Bachelor's degree in Computer Science, Engineering, or a related field;

- Excellent problem-solving and troubleshooting skills;

- Strong communication and collaboration skills.

Responsibilities:

- Implement and maintain monitoring solutions using Prometheus, Victoria-Metrics, and Grafana to identify and address performance issues proactively;

- Manage logging infrastructure using Vector, ElasticSearch, and Kibana, ensuring efficient log collection, analysis, and visualization;

Job Skills

View the job post & apply

Sr. Site Reliability Engineer

posted by: spj_bot

Role Overview

We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of our production ecosystems, ensuring that our complex, data-driven AI platforms remain resilient, scalable, and highly performant. This role is a hybrid of software engineering and systems architecture, with a specialized focus on MLOps—bridging the gap between model development and production-grade reliability.

Job Skills

View the job post & apply

Senior Site Reliability Engineer (Azure)

posted by: spj_bot

Senior Site Reliability Engineer (Enterprise Platform)

Location: US - Open to Europe if happy to overlap with EST

Remote | Full-time

Compensation: $150K - $200K

Our client is seeking a Senior Site Reliability Engineer (Azure) to architect and scale a robust infrastructure foundation for a high-growth distributed systems platform. This position is critical for ensuring that the platform operates as a secure, scalable, and production-ready environment capable of supporting complex enterprise use cases and high reliability standards.

The successful candidate will take a lead role in designing infrastructure from first principles, bridging the gap between product requirements and technical execution. This is a high-impact opportunity for a seasoned engineer to build greenfield Azure environments and establish operational excellence across a global ecosystem.

Key Responsibilities

Job Skills

View the job post & apply

Site Reliability Engineer (Remote - Czechia)

posted by: spj_bot

About Jobgether:

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

One of our companies is currently looking for a Site Reliability Engineer in Czechia.

In this role, you will join a high-performing engineering team dedicated to maintaining and scaling critical infrastructure for cloud-based applications. You’ll take ownership of building and optimizing CI/CD pipelines, ensuring system performance, uptime, and reliability. With a strong focus on automation and scalability, you will collaborate closely with developers to integrate new services and implement best practices in system observability and security. If you thrive in dynamic, fast-paced environments and are passionate about building resilient systems, this opportunity is for you.

Accountabilities:

·         Design, build, and maintain scalable infrastructure using Terraform and Terragrunt

·         Optimize and manage AWS environments for cost-efficiency, security, and availability

·         Administer and scale Kafka and Confluent Cloud for real-time data streaming

·         Deploy and maintain Redis to support caching and high-speed data processing

·         Implement monitoring and alerting with Prometheus, Grafana, Alert Manager, and OpsGenie

Job Skills

View the job post & apply

Service Reliability Engineer ( Multiple locations)

posted by: spj_bot

About Jobgether

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

One of our companies is currently looking for a Service Reliability Engineer in Singapore, Philippines, Vietnam, India, or Malaysia.

As a Service Reliability Engineer, your primary mission is to ensure the stability, performance, and scalability of the company’s systems and services. You will serve as the highest level of technical escalation within the support team, working closely with product, tech, and data teams to resolve complex issues. Your role will involve providing technical support, troubleshooting, resolving system issues, and contributing to continuous improvement efforts. You will help ensure minimal downtime, a seamless customer experience, and drive innovation across the organization.

Accountabilities:

Job Skills

View the job post & apply

Senior Site Reliability Engineer ( Remote - US)

posted by: spj_bot

About Jobgether

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

One of our companies is currently looking for a Senior Site Reliability Engineer in United States.

As a Senior Site Reliability Engineer (SRE), you will play a key role in scaling, securing, and improving the cloud infrastructure of the organization. Your primary focus will be to ensure the reliability and scalability of systems by implementing proactive solutions and automating infrastructure management. You’ll work closely with engineering and platform teams to enhance the reliability of services, manage Kubernetes clusters, and optimize cloud resources. You will also be responsible for leading incident response, conducting post-incident reviews, and refining best practices to continuously improve the system's performance and security.

Accountabilities:

Job Skills

View the job post & apply

Site Reliability Engineer (Remote - Spain)

posted by: spj_bot

About Jobgether

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

One of our companies is currently looking for a Site Reliability Engineer in Spain.

This opportunity is perfect for a highly skilled Site Reliability Engineer who thrives in dynamic, growth-oriented environments. You will play a key role in enhancing platform stability, scalability, and security while driving automation across systems and processes. As part of a collaborative and innovative team, you’ll design resilient architectures, optimize deployment pipelines, and support teams across development, QA, and production environments. If you're passionate about DevOps culture, cloud-native architectures, and building highly available systems, this is your chance to shape the technical foundation of an international fintech platform.

Accountabilities:

Job Skills

View the job post & apply

Site Reliability Engineer (Slovakia Remote)

posted by: spj_bot

About Jobgether

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

One of our companies is currently looking for a Site Reliability Engineer in Slovakia.

As a Site Reliability Engineer, you’ll play a critical role in designing and maintaining scalable, secure, and resilient infrastructure. You'll partner closely with engineering teams to build CI/CD pipelines, automate core operational processes, and ensure the smooth operation of cloud-based platforms. This position offers the opportunity to work remotely from Slovakia, with access to the latest technologies like Kubernetes and AWS. Ideal candidates will thrive in a fast-paced SaaS environment and be passionate about automation, mentorship, and building tools that empower developers.

Accountabilities:

Job Skills

View the job post & apply

Senior Site Reliability Engineer - (Remote - Canada)

posted by: spj_bot

About Jobgether

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

One of our companies is currently looking for a Senior Site Reliability Engineer in Canada.

As a Senior Site Reliability Engineer, you will play a critical role in shaping the infrastructure that powers innovative, AI-driven software for the life sciences sector. You’ll work closely with platform, engineering, and product teams to build robust, scalable systems and support high-performance production environments. This position involves designing cloud-native systems, improving observability, automating key processes, and helping teams respond to and learn from production incidents. You’ll be part of a collaborative, remote-first culture committed to operational excellence, learning, and impact.

Accountabilities:

Job Skills

View the job post & apply

Senior Site Reliability Engineer - (Remote - Europe)

posted by: spj_bot

Jobgether has ALL remote jobs globally. We match you to roles where you're most likely to succeed and provide feedback on every application to help you learn. No more guesswork, application black holes, or recruiter ghosting in your job search.

For one of our clients, we are looking for a Senior Site Reliability Engineer, remotely from Europe.

As a Senior Site Reliability Engineer, you will be responsible for designing, maintaining, and optimizing reliable and scalable systems. You will track performance metrics, improve system reliability through automation, and ensure best practices for incident management. With your expertise in cloud services, container orchestration, and system performance, you will drive initiatives to enhance the efficiency and robustness of the infrastructure while collaborating closely with engineering teams to design systems built for high availability. This is a key role for someone passionate about building and maintaining resilient systems that ensure seamless operations at scale.

Accountabilities:

Job Skills

View the job post & apply
Subscribe to reliability engineer

SPJ is not just a platform; it's a transformative force in the maritime sector. We reinvent job discovery and collaboration, leveraging cutting-edge AI to create a space where careers thrive and innovations set sail.

Featured Posts