Responsibilities
1. Build models for multi-modal content understanding, content recognition, and content mining for the live broadcast business, improving content supply and growth on the live broadcast side.
2. Optimize and iterate on computer vision, audio, and text large models in live broadcast scenarios, including live broadcast room screen recognition and detection, text semantic understanding and summarization, and large-model intelligent assistants.
3. Explore cutting-edge technologies such as computer vision, multi-modality, and LLMs, and own the design, development, and optimization of algorithm models.
Qualifications
1. In-depth research in at least one area of computer vision, NLP, multi-modality, or deep learning, including but not limited to image and video understanding, detection, segmentation, action recognition, multi-modality, RAG, and few-shot learning.
2. Familiarity with training and deploying models in one or more frameworks such as PyTorch or TensorFlow, and an understanding of mixed-precision training, distributed training, TensorRT deployment, etc.
3. Strong model development and tuning skills; project experience in video content understanding or multi-modal retrieval is preferred, and awards in competitions such as Kaggle, COCO, ActivityNet, ICPC, or NOI/IOI are a plus.
4. Excellent comprehension, communication, and teamwork skills; proactive and enthusiastic.