Search by job, company or skills

J

Senior / Staff SLM & VLM Engineer Post-Training, Tool Calling & Agents

Fresher
Save
  • Posted 2 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Job Summary

We are looking for a highly capable engineer/researcher to lead the R&D of Small Language Models (SLMs) and Vision-Language Models (VLMs) for edge / low-latency and cost-efficient production scenarios. You will own the continuous pretraining, supervised instruction tuning (SFT), and compression/distillation pipelines, and work closely with platform teams to deliver reliable, measurable improvements in inference efficiency, tool-use success rate, and overall model quality.

Key Responsibilities

  • SLM/VLM Training: Continuous Pretraining & Instruction Tuning (SFT)
  • Conduct continuous pretraining and SFT for SLMs and VLMs to improve task performance and domain adaptation.
  • Build reproducible training workflows in PyTorch, including data processing, training, evaluation, and model versioning.
  • Compression, Distillation & Edge/Low-Latency Inference Optimization
  • Design and implement efficient compression strategies for SLM/VLM, including knowledge distillation, pruning, and quantization-oriented training or post-training optimization.
  • Optimize model serving and inference for low-latency / edge scenarios by improving throughput and cost-per-token via techniques such as quantization, caching/KV optimizations, batching strategies, and decoding-time optimizations.
  • Tool Calling System: Catalog, Routing, Validation, Fallback & Observability
  • Architect and implement a production-grade tool calling (function/tool calling) framework:
  • Tool cataloging and metadata/schema design
  • Tool selection/routing and argument construction
  • Parameter validation, result verification, and safe fallback/retry strategies
  • Call-chain tracing, monitoring, and observability to improve success rate and ROI
  • RL & Reward Modeling for Alignment and Tool-Use Reliability
  • Apply post-training methods such as PPO / DPO / GRPO-like optimization and reward modeling to align the model toward objectives including:
  • semantic understanding
  • tool-use success rate
  • content generation quality and consistency
  • Support both offline and online iteration loops, including policy evaluation, regression checks, and safe deployment gating.
  • Data Pipeline Automation (Collection, Cleaning, Curation)
  • Design automated pipelines for data collection, filtering, cleaning, de-duplication, labeling/weak supervision, and dataset version management to continuously improve training quality.
  • Ensure datasets support both SFT and preference/RL style post-training.
  • Rigorous Evaluation, Testing & Iteration
  • Build robust evaluation mechanisms: offline benchmarks, task suites for tool-use, regression tests, and reliability metrics.
  • Drive rapid iteration through A/B comparisons, ablations, and failure analysis, improving both quality and efficiency over time.

Required Qualifications

  • Strong software engineering skills in Python and C++, including experience building ML training/evaluation pipelines in PyTorch.
  • Hands-on experience in model efficiency and inference optimization (e.g., distillation, quantization, pruning, serving optimization).
  • Experience with high-performance computing and acceleration: CUDA and/or SIMD, profiling and performance tuning.
  • Ability to read and reproduce key ideas from recent papers and implement algorithms with strong experimental discipline.
  • Ability to communicate effectively in both Chinese (Mandarin) and English as the successful candidate will have to liaise with our counterparts in China.

, BE AWARE OF FRAUD: When applying for a job at Jabil you will be contacted via correspondence through our official job portal with a jabil.com e-mail address; direct phone call from a member of the Jabil team; or direct e-mail with a jabil.com e-mail address. Jabil does not request payments for interviews or at any other point during the hiring process. Jabil will not ask for your personal identifying information such as a social security number, birth certificate, financial institution, driver's license number or passport information over the phone or via e-mail. If you believe you are a victim of identity theft, contact your local police department. Any scam job listings should be reported to whatever website it was posted in.

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 149158437