Senior Site Reliability Engineer
We are looking for an experienced and motivated Senior Site Reliability Engineer (SRE) to join our team. In this role, you will be responsible for the reliability, scalability, performance, and stability of our systems and applications. You will work closely with cross-functional teams to automate processes, improve infrastructure, and support continuous product delivery.
Requirements:
- At least 3 years of experience in an SRE or similar role;
- Strong proficiency with monitoring, logging, and alerting tools, as well as cloud, platform, OS, CI/CD, repository storage, and management tools;
- Solid understanding of DevOps principles and practices;
- Bachelor's degree in Computer Science, Engineering, or a related field;
- Excellent problem-solving and troubleshooting skills;
- Strong communication and collaboration skills.
Responsibilities:
- Implement and maintain monitoring solutions using Prometheus, VictoriaMetrics, and Grafana to proactively identify and address performance issues;
- Manage logging infrastructure using Vector, Elasticsearch, and Kibana, ensuring efficient log collection, analysis, and visualization;