Site Reliability Engineer @ Antal

Polsko


    What you need to succeed in this role:


    • 5+ years of experience in supporting or developing distributed systems (Java-based environments preferred).
    • Hands-on experience with monitoring and logging tools: Grafana, Prometheus, Loki, Splunk, etc.
    • Solid understanding of Unix/Linux systems, cloud infrastructure (GCP preferred), and databases (RDBMS).
    • Experience with CI/CD tooling, such as Ansible, Jenkins, GitHub Actions, and vulnerability management.
    • Familiarity with job scheduling tools (e.g., Control-M or equivalent).
    • Strong communication skills and ability to drive technical discussions with multiple support teams.
    • Experience working in Agile/Scrum teams.

    Site Reliability Engineer
    ? Kraków (Hybrid – minimum 2 days/week in the office)
    ? Employment type: B2B

    Are you looking for an opportunity to join a high-impact project in a global financial institution that invests heavily in cloud, AI, and DevOps? Were building a new Site Reliability Engineering (SRE) team in Kraków to support a mission-critical Counterparty Credit Risk (CCR) platform, and were looking for experienced engineers to join the journey.

    As part of this role, youll contribute to the stability, scalability, and observability of a high-volume, distributed platform operating on both Google Cloud Platform and on-prem infrastructure.


      What we offer:

      • The chance to build and shape a new SRE team supporting a critical platform for global risk management.
      • Work in a modern technology stack: Java, GCP, Apache Beam, Spring Boot, DevOps tooling.
      • Hybrid working model with at least 2 days/week in our Kraków office.
      • Stable, long-term project with excellent opportunities for growth and learning.

      ? Interested? Apply now and take the next step in your career with a team that’s redefining reliability at a global scale.

      ,[Ensure the reliability and high availability of production systems used in global credit risk management., Monitor, detect, and troubleshoot incidents in distributed systems running in cloud and hybrid environments., Implement observability tools (Grafana, Prometheus, Loki, etc.) and improve monitoring and alerting strategies., Lead root cause analysis (RCA) and post-incident reviews to improve resilience and operational efficiency., Collaborate with developers, DevOps engineers, and global support teams to implement SRE best practices., Contribute to CI/CD automation, deployment pipelines, and security/vulnerability remediation.] Requirements: Java, Grafana, Prometheus, Loki, Splunk, Unix, Linux, Cloud infrastructure, RDBMS, CI/CD, Ansible, Jenkins, GitHub Actions, GCP, Control-M

      Kategorie

      devops

      • Podrobné informace o nabídce práce
        Firma: Antal
        Lokalita: Práce v Polsku
        Odvětví práce: devops
        Pracovní pozice: Site Reliability Engineer @ Antal
        Směnnost práce fulltime - 40 hours per week
        Nástup do práce od: IHNED
        Nabízená mzda: neuvedeno
        Nabídka přidána: 10. 6. 2025
        Pracovní pozice aktivní
      Odpovědět na inzerát
          Buďte první, kdo se na danou nabídku práce přihlásí!

      Práce Site Reliability Engineer @ Antal: Často kladené otázky

      👉 V jakém městě se nabízí nabídka práce Site Reliability Engineer @ Antal?

      Práce je nabízena v lokalitě Kraków.

      👉 Jaká firma nabírá na tuto pozici?

      Tato nabídka práce je do firmy Antal.

      0.1568