Site Reliability Engineering (Sre)

Compañía	Ford Motor CompanyVer más
Dirección	México
Categoría	Tecnologías de la información

Descripción del trabajo

Our Site Reliability Engineering (SRE) team enable modernization by providing robust SRE standards, IaC, monitoring tools powered by AI and easy-to-use dashboards. The resulting transparency of end-to-end performance provides a better view into how teams can proactively manage reliability and strategically apply automation.

As a SRE your role will combine software engineering and systems engineering disciplines to ensure that software systems are available, scalable, and maintainable This individual will play a pivotal role in shaping the evolving needs of our customers including development of Service Level Indicators and Objectives (SLI/SLO), best practices with associated templates, as well as automation to remove toil and facilitate adoption.

You'll have...

Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering or related field or a combination of education and equivalent work experience
5 + years of experience programming with one or more of the following: Python, Go, Java/Scala, C or C++.
3 + years of experience with APM or other monitoring tools such as Dynatrace, New Relic, ELK, Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog
3 + years’ experience with J2EE, NoSQL/SQL Datastore, Spring Boot, GCP/AWS/Azure & Docker/K8 in developing multi-tier applications.
2+ years of experience as a Site Reliability Engineer.

Even better, you may have...

Master’s Degree in Computer Science, Computer Engineering, Electrical Engineering or related field
Strong proficiency with Google Cloud and its library of services
Experience with automated test-driven development in CI/CD Pipelines
Thorough understanding of software development and agile programming
Understanding and ability to implement effective observability strategies to improve MTTD/R
Experience with RESTful APIs and microservices platforms
Working knowledge of the TCP/IP stack, internet routing and load balancing
Solve complex architecture/design & business problems, work to simplify, optimize, remove bottlenecks, etc.

You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!

As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder…or all of the above? No matter what you choose, we offer a work life that works for you,

What you'll do...

Have a strong background in software development and systems administration, as well as excellent problem-solving, troubleshooting, and communication skills
Leverage experience to safely perform destructive testing to seek and discover vulnerabilities
Architect, design and develop automation to improve resilience, recoverability, availability, and scalability of supported applications
Recognize, validate and evangelize emerging technologies and architectures that align with business objectives
Develop tooling to improve reliability, quality, and time-to-market for software solutions
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation
Collaborate with development teams to design, build, and operate scalable and resilient software systems using Cloud native principles
Proactively identify stability risks and work with engineering leadership to establish appropriate mitigation plans
Regularly review key technical metrics such as transactions errors, logging, response times, caching strategies, conversion/bounce rates, capacity and resource utilization
Establish error budgets by identifying the right SLOs, SLIs, and effectively drive their use to ensure maximum availability/uptime
Conduct performance analysis and optimization of new and in-production systems
Provide technical guidance and mentorship to other team members
Participate in incident response, support, recovery, and postmortem analysis

Refer code: 1056361. Ford Motor Company - El día anterior - 2024-03-22 15:53

Site Reliability Engineering (Sre)

Ford Motor CompanyVer más

Descripción del trabajo

Trabajos relacionados

Site Reliability Engineering (Sre)

Site Reliability Engineer Senior

Senior Site Reliability Engineer

Principal Site Reliability Engineer (Sre)

Site Reliability Engineering (Sre)

Ford Motor CompanyVer más

Descripción del trabajo

Compartir trabajos con amigos

Trabajos relacionados

Site Reliability Engineering (Sre)

Site Reliability Engineer Senior

Senior Site Reliability Engineer

Principal Site Reliability Engineer (Sre)

Explore las búsquedas de empleo más populares en México

Estados principales

Principales ciudades

Principales títulos de trabajo

Trabajos mejor pagados