img
Permanent

Data Scientist (ML, Speech, NLP & Multimodal Expertise)

Manchester
money-bag £120,000 per annum
25D119147685F3479451C36454E96E01
Posted Yesterday

Join to apply for the

Data Scientist (ML, Speech, NLP and Multimodal Expertise)

role at

TransPerfect1 day ago Be among the first 25 applicantsJoin to apply for the

Data Scientist (ML, Speech, NLP and Multimodal Expertise)

role at

TransPerfectGet AI-powered advice on this job and more exclusive features.We are looking to hire a Data Scientist with strong expertise in machine learning, speech and language processing, and multimodal systems. This role is essential to driving our product roadmap forward, particularly in building out our core machine learning systems and developing next-generation speech technologies.The ideal candidate will be capable of working independently while effectively collaborating with cross-functional teams. In addition to deep technical knowledge, we are looking for someone who is curious, experimental, and communicative.Key Responsibilities:· Create maintainable, elegant code and high-quality data products that are modeled, well-documented, and simple to use.· Build, maintain, and improve the infrastructure to extract, transform, and load data from a variety of sources using SQL, Azure, GCP and AWS technologies.· Perform statistical analysis of training datasets to identify biases, quality issues, and coverage gaps.· Implement automated evaluation pipelines that scale across multiple models and tasks.· Create interactive dashboards and visualization tools for model performance analysis.Additional Responsibilities:· Design and implement robust data ingestion pipelines for massive-scale text and speech corpora including automated data preprocessing and cleaning pipelines.· Create data validation frameworks and monitoring systems for dataset quality.· Develop sampling strategies for balanced and representative training data.· Implement comprehensive experiment tracking and hyperparameter optimization frameworks.· Conduct statistical analysis of training dynamics and convergence patterns.· Create automated model selection pipelines based on multiple evaluation criteria.· Design comprehensive benchmark suites with statistical significance testing.· Develop fairness metrics and bias detection systems.· Build real-time monitoring systems for model performance in production.· Implement feature drift detection and data quality monitoring.· Design feedback loops to capture user interactions and model effectiveness.· Create automated retraining pipelines based on performance degradation signals.·

Develop business metrics and ROI analysis for model deployments.Required Skills, Experience and QualificationsProgramming and Software Engineering·

Python (Expert Level) : Advanced proficiency in scientific computing stack (NumPy, Pandas, SciPy, Scikit-learn).·

Version Control : Git workflows, collaborative development, and code review processes.·

Software Engineering Practices : Testing frameworks, CI/CD pipelines, and production-quality code development.Machine Learning and Language Model Expertise·

Traditional Machine Learning and Deep Learning Knowledge:

Proficiency in classical ML algorithms (Naive Bayes, SVM, Random Forest, etc.) and Deep Learning architectures.·

Understanding of Transformer Architecture:

Attention mechanisms, positional encoding, and scaling laws.·

Training Pipeline Knowledge:

Data preprocessing for large corpora, tokenization strategies, and distributed training concepts.·

Evaluation Frameworks:

Experience with standard NLP benchmarks (GLUE, SuperGLUE, etc.) and custom evaluation design.·

Fine-tuning Techniques:

Understanding of PEFT methods, instruction tuning, and alignment techniques.·

Model Deployment:

Knowledge of model optimization, quantization, and serving infrastructure for large models.Collaboration and Adaptability· Strong communication skills are a must· Self-reliant but knows when to ask for help· Comfortable working in an environment where conventional development practices may not always apply:o PBIs (Product Backlog Items) may not be highly detailedo Experimentation will be necessaryo Ability to identify what’s important in completing a task or partial task and explain/justify their approacho Can effectively communicate ideas and strategies· Proactive and takes initiative rather than waiting for PBIs to be assigned when circumstances call for it· Strong interest in AI and its possibilities, a genuine passion for certain areas can provide that extra spark· Curious and open to experimenting with technologies or languages outside their comfort zoneMindset and Work Approach· Takes ownership when things don’t go as planned· Capable of working from high-level explanations and general guidance on implementations and final outcomes· Continuous, clear communication is crucial, detailed step-by-step instructions won’t always be available· Self-starter, self-motivated, and proactive in problem-solving· Enjoys exploring and testing different approaches, even in unfamiliar programming languagesAdditional Skills, Experience and Qualifications· Framework Proficiency: Scikit-learn, XGBoost, PyTorch (preferred) or TensorFlow for model implementation and experimentation.· MLOps Expertise: Model versioning, experiment tracking, model monitoring (MLflow, Weights and Biases), data monitoring and validation (Great Expectations, Prometheus, Grafana), and automated ML pipelines (GitHub CI/CD, Jenkins, CircleCI, GitLab etc.).· Statistical Modeling: Hypothesis testing, experimental design, causal inference, and Bayesian statistics.· Model Evaluation: Cross-validation strategies, bias-variance analysis, and performance metric design.· Feature Engineering: Advanced techniques for text, time-series, and multimodal data.· Big Data Technologies: Spark (PySpark), Hadoop ecosystem, and distributed computing frameworks (DDP, TP, FSDP).· Cloud Platforms: AWS (SageMaker, S3, EMR), GCP (Vertex AI, BigQuery), or Azure ML.· Database Systems: NoSQL databases (MongoDB, Elasticsearch), graph databases (Neo4j), and vector databases (Pinecone, Milvus, ChromaDB, FAISS etc.).· Data Pipeline Tools: Airflow, Prefect, or similar orchestration frameworks.By applying, I confirm I have read and accept TransPerfect''s Privacy Policy: https://www.transperfect.com/about/data-privacy-recruiting.Seniority level

Seniority level Mid-Senior levelEmployment type

Employment type Full-timeJob function

Job function Engineering and OtherIndustries Translation and Localization, Software Development, and Technology, Information and MediaReferrals increase your chances of interviewing at TransPerfect by 2xGet notified about new Data Scientist jobs in

Manchester Area, United Kingdom .Greater Manchester, England, United Kingdom 1 day agoManchester, England, United Kingdom 1 week agoManchester, England, United Kingdom 1 month agoManchester Area, United Kingdom 1 week agoManchester, England, United Kingdom 1 week agoManchester, England, United Kingdom 1 month agoManchester, England, United Kingdom 2 weeks agoData Scientist - Machine Learning/AWS - Manchester

Manchester Area, United Kingdom 2 days agoAltrincham, England, United Kingdom 1 month agoManchester, England, United Kingdom 3 weeks agoManchester, England, United Kingdom 3 weeks agoManchester, England, United Kingdom 3 weeks agoData Scientist (Machine Learning Observability and Governance)

Manchester, England, United Kingdom 1 month agoManchester, England, United Kingdom 2 weeks agoManchester, England, United Kingdom 1 day agoManchester Area, United Kingdom 3 weeks agoManchester Area, United Kingdom 3 weeks agoManchester, England, United Kingdom 1 week agoMachine Learning Applied Scientist (Machine Learning Observability and Governance)

Manchester, England, United Kingdom 1 month agoManchester, England, United Kingdom 1 day agoData Science and AI Delivery Lead for Commercial Domain

Manchester, England, United Kingdom 3 months agoManchester, England, United Kingdom 3 days agoManchester, England, United Kingdom 3 days agoManchester, England, United Kingdom 1 week agoManchester, England, United Kingdom 2 weeks agoManchester Area, United Kingdom 2 days agoManchester Area, United Kingdom $120,000.00-$180,000.00 2 weeks agoManchester, England, United Kingdom 1 day agoManchester, England, United Kingdom 1 week agoManchester, England, United Kingdom 1 month agoWe’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

#J-18808-Ljbffr

Other jobs of interest...

Square One Resources
KnutsfordYesterday
money-bag£111,800 per annum
Pets at Home
Wilmslow6 days ago
money-bagNegotiable
We Are Dcoded Limited
Manchester1 week ago
money-bag£90,000
Circle Group
Manchester1 week ago
money-bag£70,000
Searchworks Ltd
Salford2 weeks ago
money-bag£65,000
CV-Library
Manchester3 weeks ago
money-bagNegotiable
Circle Group
Manchester3 weeks ago
money-bag£70,000
Gerrard White
Manchester3 weeks ago
money-bagNegotiable

Perform a fresh search...

  • Create your ideal job search criteria by
    completing our quick and simple form and
    receive daily job alerts tailored to you!

Jobs. Straight to your inbox!