Site Reliability Engineer with Java (Remote)

Description

Join EPAM as a remote Site Reliability Engineer specializing in Java.

In this role, you'll provide 24/7 on-call support for Java backend services, prepare and deploy patches, and assist in establishing top-of-the-line metrics and dashboards.

If you have 5-8 years of experience as a DevOps engineer or SRE, proficiency in Java, and experience with Amazon Web Services, including Amazon DynamoDB and Amazon ElastiCache, we'd love to hear from you.

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

Responsibilities

  • Provide follow-the-sun, 24/7 on-call support for all Java backend services currently owned by the customer's backend team, including ownership of API Gateway observability
  • Prepare and deploy patches for issues found in both the Java code and the related cloud infrastructure
  • Assist in establishing top-of-the-line metrics and dashboards that enable this group and the customer's backend team to quickly assess overall platform health
  • Assist in establishing and improving runbooks for all EOS Backend services
  • Assist in monitoring the SLOs of all involved backend services, submitting code changes that improve SLOs as errors occur

Requirements

  • 5-8 years of experience as a DevOps engineer or SRE
  • Proficiency in coding with Java
  • Experience with Amazon Web Services, including Amazon DynamoDB and Amazon ElastiCache
  • Experience troubleshooting complex systems efficiently using logs and telemetry to identify and resolve root causes
  • Able to communicate operational issues clearly and concisely in writing as part of live incident response
  • Motivated to track and improve SLOs across several systems through repeatable processes

We Offer

  • Career plan and real growth opportunities
  • Unlimited access to LinkedIn Learning solutions
  • International Mobility Plan within 25 countries
  • Constant training, mentoring, online corporate courses, eLearning and more
  • English classes with a certified teacher
  • Support for employee initiatives (algorithms club, Toastmasters, agile club, and more)
  • Enjoyable working environment (gaming room, napping area, amenities, events, sports teams, and more)
  • Flexible work schedule and dress code
  • Collaborate in a multicultural environment and share best practices from around the globe
  • Hired directly by EPAM and 100% on payroll
  • Statutory benefits (IMSS, INFONAVIT, 25% vacation bonus)
  • Insurance: life and major medical expenses, with dental and vision coverage (for the employee and direct family members)
  • 13% employee savings fund, capped at the legal limit
  • Grocery coupons
  • 30-day December bonus
  • Employee Stock Purchase Plan
  • 12 vacation days plus 4 floating days
  • Official Mexican holidays, plus 5 extra holidays (Maundy Thursday and Good Friday, November 2nd, December 24th and 31st)
  • Relocation bonus: transportation, 2 weeks of accommodation for you and your family, and more
  • Monthly non-taxable allowance for electricity and internet bills

Conditions

  • By applying to our role, you agree that your personal data may be used as set out in EPAM's Privacy Notice and Policy
