Sr. ML Engineer

We are seeking a Sr. ML Engineer, Generative AI Applications

Duties and Responsibilities:

Architect, build, maintain, and improve new and existing suite of GenAI applications and their underlying systems.
Automate machine learning pipelines, monitor performance and costs, and optimize models by using techniques such as LoRA/QLoRA.
Establish reusable frameworks to streamline model building, deployment and monitoring. Incorporate comprehensive monitoring, logging, tracing, and alerting mechanisms.
Build guardrails, compliance rules and oversight workflows into the GenAI application platform, such as establishing approval chains for model updates and staged rollout for production releases
Develop templates, guides and sandbox environments for easy onboarding of new contributors and experimentation with new techniques
Ensure development of user-facing applications in the GenAI application platform is easy and safe by enforcing rigorous validation testing before publishing user-generated models and implement a clear peer review process of applications
Use your entrepreneurial spirit to identify new opportunities to optimize business processes, improve consumer experiences, and prototype solutions to demonstrate value.
Work closely with data scientists and analysts to create and deploy new product features online and in mobile apps.
Contribute to and promote good software engineering practices across the team.
Mentor and educate team members to adopt best practices in writing and maintaining production machine learning code.
Actively contribute to and re-use community best practices.
Monitor, debug, track, and resolve production issues.
Work with project managers to ensure that projects proceed on time and on budget.
Collaborate with Technical Product Managers to ensure proper tracking of algorithmic performance KPIs and prioritize performance improvements based on effort and impact.
Complete other responsibilities as assigned.

Required Skills and Qualifications:

Minimum of 7 years’ post-secondary education or relevant work experience
Bachelor’s degree in mathematics, physics, computer science, engineering, statistics, or an equivalent technical discipline.
Minimum of 5 years’ software development experience with Python and SQL.
Minimum of 3 years’ experience building pipelines to deploy NLP and deep learning models into production in an AWS cloud environment
Minimum 3years’ experience using PyTorch, Tensorflow, or MXNet, along with optimizing code for GPU clusters
Retrieval-Augmented Generation (RAG) techniques – EITHER GraphRAG OR Contextual Retrieval
Experience building advanced workflows such as retrieval augmented generation, model chaining, dynamic prompting, PEFT/SFT, etc. using Langchain and similar tools
Experience establishing model guardrails and developing bias detection and mitigation techniques for AI applications using tools such as NeMo
Experience with various embedding models and setting up and tuning vector databases to improve performance of semantic search and retrieval systems
Understand the underlying fundamentals such as Transformers, Self-Attention mechanisms that form the theoretical foundation of LLMs
Experience working with a variety of relational SQL and NoSQL databases, big data tools: Hadoop, Spark, Kafka; a Linux environment; (AWS).
Knowledge of data pipeline and workflow management tools.
Expertise in standard software engineering methodology, e.g., unit testing, test automation, continuous integration, code reviews, design documentation.

Job Category: Engineering

Job Type: Contract

Job Location: Remote

Crossbridge Global Partners | Careers

Sr. ML Engineer

Duties and Responsibilities:

Required Skills and Qualifications:

Apply for this position