[Remote] ML Engineer

Remote, USA | Full-time | Posted 2026-06-16

Note: This is a remote position open to candidates in the USA. Talener is a global newswire and media organization whose content reaches over half the global population daily. They are seeking a Senior Machine Learning Engineer to build and optimize inference systems that process millions of media assets across text, image, and video pipelines.

Responsibilities

Building and optimizing inference systems that run in production at scale
Working across text, image, and video pipelines
Processing millions of media assets to power news intelligence products
Profiling a transformer and rewriting its serving path for a 2-3x latency improvement
Tuning an HNSW index
Making smart infrastructure decisions on SageMaker instance selection to hit p95 targets at the lowest cost
Partnering closely with MLOps, platform engineering, data scientists, and product teams
Owning model performance, inference logic, and pipeline efficiency

Skills

5+ years building production ML inference systems
Python - core to everything in this role
PyTorch (TorchScript, ONNX, FastAPI/TorchServe) and TensorFlow (SavedModel, tf.data, XLA, TFLite) - both required
Deep hands-on experience with transformer-based models (BERT family - DistilBERT, SBERT, etc.) in production
Inference optimization at scale - quantization, distillation, compilation, kernel/profile-level performance work
AWS infrastructure - EC2, Batch, Lambda, SageMaker across different media workload types
Hybrid search architecture experience - BM25 + vector search + cross-encoder reranking
Asynchronous processing systems - reliability, caching, deduplication, observability
Data pipeline and workflow orchestration (Airflow or similar)
Video frameworks - FFmpeg, large-scale frame-level inference
Must have experience in the media industry
Must have experience working with large amounts of data, including text, images and videos
Experience with TransNetV2 or similar video shot boundary detection
Familiarity with HuggingFace open source LLMs
OpenAI API or other foundation model provider experience
Hybrid CPU/GPU environment experience at scale

Benefits

15% bonus target

Company Overview

Talener is a staffing firm dedicated to finding great opportunities for technology professionals. It was founded in 2007, and is headquartered in New York, New York, USA, with a workforce of 11-50 employees. Its website is http://www.talener.com.
