Speech Recognition Algorithm Engineer - Doubao Large Model

0-2 years
10 days ago
Job Description

Responsibilities

The ByteDance Doubao Large Model team was established in 2023. It is committed to developing the industry's most advanced large AI model technology, becoming a world-class research team, and contributing to technological and social development. The team has a long-term vision and commitment in AI. Its research covers NLP, CV, speech, and other areas, and it has laboratories and research positions in China, Singapore, the United States, and other locations. Backed by the platform's ample data, computing, and other resources, the team continues to invest in these fields. It has launched a self-developed general-purpose large model with multimodal capabilities that supports 50+ downstream businesses, such as Doubao, Coze, and Jimeng, and is available to enterprise customers through Volcano Engine. The Doubao app is currently the AIGC application with the largest number of users in the Chinese market.

1. Support the deployment of speech recognition technology across a variety of business scenarios inside and outside ByteDance, solve cutting-edge problems that arise during deployment, and continuously improve the performance of core speech recognition technology.
2. Build a core technology system for audio understanding, focusing on cutting-edge technologies and speech recognition algorithm performance, and pursue and explore the most advanced algorithms in the industry.

Qualifications

1. Familiarity with speech recognition algorithms, with practical experience in deploying speech recognition systems and optimizing their business performance.
2. Hands-on experience processing industrial-scale data and using massive datasets to optimize production models.
3. In-depth understanding of deep learning and extensive practical experience; familiarity with PyTorch, TensorFlow, Kaldi, and similar platforms; experience tuning end-to-end speech recognition frameworks (Transformer, RNN-T, LAS, CTC, etc.).
4. Good coding skills; familiarity with the Linux development environment and with C++ and Python.
5. Ability to work independently and collaborate well with the team.

Bonus points:
- Experience deploying and optimizing large-scale speech recognition systems in scenarios such as conferences and smart hardware
- Experience optimizing cutting-edge end-to-end speech recognition systems, and familiarity with end-to-end speech recognition algorithms such as RNN-T and Encoder-Decoder
- Experience optimizing speech recognition decoders and deploying them in practice
- Papers published at relevant international conferences or in mainstream journals (ICASSP, Interspeech, ASRU, IEEE/ACM Transactions, etc.)
- Top international rankings in speech-related or machine learning competitions, or awards in programming competitions such as ACM/NOI/IOI/TopCoder
- Participation in influential open source projects
- Good communication skills, a strong sense of ownership, and organizational and coordination skills; optimistic and self-reflective, with a strong ability to work under pressure

About
Job Source: jobs.bytedance.com

ByteDance is a technology company operating a range of content platforms that inform, educate, entertain and inspire people across languages, cultures, and geographies.
Dedicated to building global platforms of creation and interaction, ByteDance now has a portfolio of applications available in over 150 markets and 75 languages, including TikTok, Helo, Vigo Video, Douyin, and Huoshan.
