Big Data SRE Operation and Maintenance Expert - Data Platform

This job is no longer accepting applications

  • Posted a month ago

Job Description

Responsibilities

1. Ensure data stability for the company's businesses, including Douyin, international short video, and advertising; improve the quality of data platform services; and ensure continuous business availability.
2. Drawing on operation and maintenance experience, tools, and platforms, respond quickly to online incidents and improve incident-handling efficiency; optimize the operation and maintenance system to improve service reliability and scalability, guarantee system SLAs, and advance operations automation.
3. Through continuous, all-round operations work (operation and maintenance specifications and best practices, monitoring and availability indicators, historical incidents, and so on), provide guidance for the design and selection of high-availability data architectures, systematically investigate hidden architectural risks, and drive the implementation of improvement projects.
4. Participate in the research and development, automation, and continuous iteration of the big data operation and maintenance platform, and guide the evolution of product operation and maintenance models toward digitalization and intelligence.

Qualifications

1. Bachelor's degree or above in a computer-related major.
2. 3 or more years of SRE operation and maintenance experience; familiar with building operation and maintenance systems and guaranteeing stability.
3. Familiar with Linux, networking, and other system operation and maintenance skills, with the ability to analyze operational problems, devise emergency solutions, and tune performance.
4. Familiar with at least one programming language, including but not limited to Shell, Python, Java, Scala, PHP, and Go.
5. Good communication and teamwork skills and strong self-motivation, with the ability to promote cross-team cooperation.
6. Experience troubleshooting big data stability problems, with a clear troubleshooting approach and the ability to locate problems quickly.

About Company

ByteDance is a technology company operating a range of content platforms that inform, educate, entertain and inspire people across languages, cultures, and geographies.
Dedicated to building global platforms of creation and interaction, ByteDance now has a portfolio of applications available in over 150 markets and 75 languages, including TikTok, Helo, Vigo Video, Douyin, and Huoshan.

Job ID: 105579121