[Remote] ML Engineer
Note: The job is a remote job and is open to candidates in USA. Talener is a global newswire and media organization whose content reaches over half the global population daily. They are seeking a Senior Machine Learning Engineer to build and optimize inference systems for processing millions of media assets across text, image, and video pipelines.
Responsibilities
- Building and optimizing inference systems that run in production at scale
- Working across text, image, and video pipelines
- Processing millions of media assets to power news intelligence products
- Profiling a transformer
- Rewriting its serving path for a 2-3x latency improvement
- Tuning an HNSW index
- Making smart infrastructure decisions on SageMaker instance selection to hit p95 targets at the lowest cost
- Partnering closely with MLOps, platform engineering, data scientists, and product teams
- Owning model performance, inference logic, and pipeline efficiency
Skills
- 5+ years building production ML inference systems
- Python - core to everything in this role
- PyTorch (TorchScript, ONNX, FastAPI/TorchServe) and TensorFlow (SavedModel, tf.data, XLA, TFLite) - both required
- Deep hands-on experience with transformer-based models (BERT family - DistilBERT, SBERT, etc.) in production
- Inference optimization at scale - quantization, distillation, compilation, kernel/profile-level performance work
- AWS infrastructure - EC2, Batch, Lambda, SageMaker across different media workload types
- Hybrid search architecture experience - BM25 + vector search + cross-encoder reranking
- Asynchronous processing systems - reliability, caching, deduplication, observability
- Data pipeline and workflow orchestration (Airflow or similar)
- Video frameworks - FFmpeg, large-scale frame-level inference
- Must have experience in the media industry
- Must have experience working with large amounts of data, including text, images and videos
- Experience with TransNetV2 or similar video shot boundary detection
- Familiarity with HuggingFace open source LLMs
- OpenAI API or other foundation model provider experience
- Hybrid CPU/GPU environment experience at scale
Benefits
- 15% bonus target
Company Overview
Apply To This Job