Compañía

Ford Motor CompanyVer más

addressDirecciónMéxico
CategoríaTecnologías de la información

Descripción del trabajo

Our Site Reliability Engineering (SRE) team enable modernization by providing robust SRE standards, IaC, monitoring tools powered by AI and easy-to-use dashboards. The resulting transparency of end-to-end performance provides a better view into how teams can proactively manage reliability and strategically apply automation.

As a SRE your role will combine software engineering and systems engineering disciplines to ensure that software systems are available, scalable, and maintainable This individual will play a pivotal role in shaping the evolving needs of our customers including development of Service Level Indicators and Objectives (SLI/SLO), best practices with associated templates, as well as automation to remove toil and facilitate adoption.


You'll have...

  • Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering or related field or a combination of education and equivalent work experience
  • 5 + years of experience programming with one or more of the following: Python, Go, Java/Scala, C or C++.
  • 3 + years of experience with APM or other monitoring tools such as Dynatrace, New Relic, ELK, Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog
  • 3 + years’ experience with J2EE, NoSQL/SQL Datastore, Spring Boot, GCP/AWS/Azure & Docker/K8 in developing multi-tier applications.
  • 2+ years of experience as a Site Reliability Engineer.

Even better, you may have...

  • Master’s Degree in Computer Science, Computer Engineering, Electrical Engineering or related field
  • Strong proficiency with Google Cloud and its library of services
  • Experience with automated test-driven development in CI/CD Pipelines
  • Thorough understanding of software development and agile programming
  • Understanding and ability to implement effective observability strategies to improve MTTD/R
  • Experience with RESTful APIs and microservices platforms
  • Working knowledge of the TCP/IP stack, internet routing and load balancing
  • Solve complex architecture/design & business problems, work to simplify, optimize, remove bottlenecks, etc.

You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!

As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder…or all of the above? No matter what you choose, we offer a work life that works for you,


What you'll do...

  • Have a strong background in software development and systems administration, as well as excellent problem-solving, troubleshooting, and communication skills
  • Leverage experience to safely perform destructive testing to seek and discover vulnerabilities
  • Architect, design and develop automation to improve resilience, recoverability, availability, and scalability of supported applications
  • Recognize, validate and evangelize emerging technologies and architectures that align with business objectives
  • Develop tooling to improve reliability, quality, and time-to-market for software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation
  • Collaborate with development teams to design, build, and operate scalable and resilient software systems using Cloud native principles
  • Proactively identify stability risks and work with engineering leadership to establish appropriate mitigation plans
  • Regularly review key technical metrics such as transactions errors, logging, response times, caching strategies, conversion/bounce rates, capacity and resource utilization
  • Establish error budgets by identifying the right SLOs, SLIs, and effectively drive their use to ensure maximum availability/uptime
  • Conduct performance analysis and optimization of new and in-production systems
  • Provide technical guidance and mentorship to other team members
  • Participate in incident response, support, recovery, and postmortem analysis
Refer code: 1056361. Ford Motor Company - El día anterior - 2024-03-22 15:53

Ford Motor Company

México
Empleos populares de Site Reliability Engineer en las principales ciudades

Compartir trabajos con amigos

Trabajos relacionados

Site Reliability Engineering (Sre)

Site Reliability Engineer Senior

Hsbc

México

3 Hace meses - visto

Senior Site Reliability Engineer

Sezzle

México

3 Hace meses - visto

Principal Site Reliability Engineer (Sre)

Mygwork

México

5 Hace meses - visto