Speech Synthesis Algorithm Engineer - Doubao Large Model

0-2 years
8 days ago
Job Description

Responsibilities

Established in 2023, the ByteDance Doubao Large Model Team is committed to developing the most advanced AI large model technology in the industry and to becoming a world-class research team that contributes to technological and social development. The team has a long-term vision and determination in the field of AI. Its research directions cover NLP, CV, speech, and more, and it has laboratories and research positions in China, Singapore, the United States, and other locations. Drawing on the platform's ample data, computing, and other resources, the team continues to invest in these fields. It has launched a self-developed general-purpose large model with multimodal capabilities, which supports 50+ downstream businesses such as Doubao, Coze, and Jimeng and is available to enterprise customers through Volcano Engine. The Doubao app is currently the AIGC application with the largest number of users in the Chinese market.

1. Familiar with front-end text analysis and processing techniques for speech synthesis
2. Familiar with common acoustic models and vocoders, with relevant development and research experience
3. Familiar with voice conversion algorithms and techniques
4. Familiar with building and optimizing general-purpose synthesis engines, with experience in cloud and on-device engine optimization

Qualifications

1. Industry experience in speech synthesis or natural language processing is preferred
2. Master's or PhD degree from a major university in AI, EE, CS, or a related major, with a focus on speech synthesis, natural language processing, or a related field
3. Proficient in one or more open-source frameworks such as TensorFlow or PyTorch
4. Proficient in C/C++, Python, and Shell programming, with a deep understanding of data structures and algorithm design
5. Preference will be given to candidates who have published papers at relevant international conferences or in mainstream journals (e.g., ICASSP, Interspeech)

JOB TYPE

Function

AI

Skills

general synthesis engine construction
acoustic models
vocoders
C/C++
voice conversion
on-device engine optimization
About
Job Source: jobs.bytedance.com

ByteDance is a technology company operating a range of content platforms that inform, educate, entertain and inspire people across languages, cultures, and geographies.
Dedicated to building global platforms of creation and interaction, ByteDance now has a portfolio of applications available in over 150 markets and 75 languages, including TikTok, Helo, Vigo Video, Douyin, and Huoshan.
