Stability Engineer/Expert - Technology Risks

0-2 years
12 days ago
Job Description

Responsibilities

1. Build out monitoring, alerting, and fault emergency-response capabilities for the company's related products to improve overall service stability.
2. Gain a deep understanding of business scenarios, formulate monitoring and emergency plans that fit the characteristics of the business, and see them through to implementation.
3. Take responsibility for alarm management and SLA design for related products, along with related work such as security and emergency response.
4. Participate in the design and implementation of stability tools or platforms, such as efficient monitoring and discovery, emergency collaboration, etc.

Qualifications

1. Experience in stability engineering and assurance for large-scale systems.
2. Strong interest in fault discovery and troubleshooting in large-scale distributed systems.
3. Solid computer science fundamentals and proficiency in a development language such as Python, Go, Java, or C++.
4. Bonus points: extensive experience building monitoring, alerting, and fault emergency-response systems.

About
Job Source: jobs.bytedance.com

ByteDance is a technology company operating a range of content platforms that inform, educate, entertain and inspire people across languages, cultures, and geographies.
Dedicated to building global platforms of creation and interaction, ByteDance now has a portfolio of applications available in over 150 markets and 75 languages, including TikTok, Helo, Vigo Video, Douyin, and Huoshan.
