Sr. ML Engineer
We are seeking a Sr. ML Engineer, Generative AI Applications
Duties and Responsibilities:
- Architect, build, maintain, and improve new and existing suite of GenAI applications and their underlying systems.
- Automate machine learning pipelines, monitor performance and costs, and optimize models by using techniques such as LoRA/QLoRA.
- Establish reusable frameworks to streamline model building, deployment and monitoring. Incorporate comprehensive monitoring, logging, tracing, and alerting mechanisms.
- Build guardrails, compliance rules and oversight workflows into the GenAI application platform, such as establishing approval chains for model updates and staged rollout for production releases
- Develop templates, guides and sandbox environments for easy onboarding of new contributors and experimentation with new techniques
- Ensure development of user-facing applications in the GenAI application platform is easy and safe by enforcing rigorous validation testing before publishing user-generated models and implement a clear peer review process of applications
- Use your entrepreneurial spirit to identify new opportunities to optimize business processes, improve consumer experiences, and prototype solutions to demonstrate value.
- Work closely with data scientists and analysts to create and deploy new product features online and in mobile apps.
- Contribute to and promote good software engineering practices across the team.
- Mentor and educate team members to adopt best practices in writing and maintaining production machine learning code.
- Actively contribute to and re-use community best practices.
- Monitor, debug, track, and resolve production issues.
- Work with project managers to ensure that projects proceed on time and on budget.
- Collaborate with Technical Product Managers to ensure proper tracking of algorithmic performance KPIs and prioritize performance improvements based on effort and impact.
- Complete other responsibilities as assigned.
Required Skills and Qualifications:
- Minimum of 7 years’ post-secondary education or relevant work experience
- Bachelor’s degree in mathematics, physics, computer science, engineering, statistics, or an equivalent technical discipline.
- Minimum of 5 years’ software development experience with Python and SQL.
- Minimum of 3 years’ experience building pipelines to deploy NLP and deep learning models into production in an AWS cloud environment
- Minimum 3years’ experience using PyTorch, Tensorflow, or MXNet, along with optimizing code for GPU clusters
- Retrieval-Augmented Generation (RAG) techniques – EITHER GraphRAG OR Contextual Retrieval
- Experience building advanced workflows such as retrieval augmented generation, model chaining, dynamic prompting, PEFT/SFT, etc. using Langchain and similar tools
- Experience establishing model guardrails and developing bias detection and mitigation techniques for AI applications using tools such as NeMo
- Experience with various embedding models and setting up and tuning vector databases to improve performance of semantic search and retrieval systems
- Understand the underlying fundamentals such as Transformers, Self-Attention mechanisms that form the theoretical foundation of LLMs
- Experience working with a variety of relational SQL and NoSQL databases, big data tools: Hadoop, Spark, Kafka; a Linux environment; (AWS).
- Knowledge of data pipeline and workflow management tools.
- Expertise in standard software engineering methodology, e.g., unit testing, test automation, continuous integration, code reviews, design documentation.
Job Category: Engineering
Job Type: Contract
Job Location: Remote