Sr. ML Engineer

We are seeking a Sr. ML Engineer, Generative AI Applications

Duties and Responsibilities:
  • Architect, build, maintain, and improve new and existing suite of GenAI applications and their underlying systems.
  • Automate machine learning pipelines, monitor performance and costs, and optimize models by using techniques such as LoRA/QLoRA.
  • Establish reusable frameworks to streamline model building, deployment and monitoring. Incorporate comprehensive monitoring, logging, tracing, and alerting mechanisms.
  • Build guardrails, compliance rules and oversight workflows into the GenAI application platform, such as establishing approval chains for model updates and staged rollout for production releases
  • Develop templates, guides and sandbox environments for easy onboarding of new contributors and experimentation with new techniques
  • Ensure development of user-facing applications in the GenAI application platform is easy and safe by enforcing rigorous validation testing before publishing user-generated models and implement a clear peer review process of applications
  • Use your entrepreneurial spirit to identify new opportunities to optimize business processes, improve consumer experiences, and prototype solutions to demonstrate value.
  • Work closely with data scientists and analysts to create and deploy new product features online and in mobile apps.
  • Contribute to and promote good software engineering practices across the team.
  • Mentor and educate team members to adopt best practices in writing and maintaining production machine learning code.
  • Actively contribute to and re-use community best practices.
  • Monitor, debug, track, and resolve production issues.
  • Work with project managers to ensure that projects proceed on time and on budget.
  • Collaborate with Technical Product Managers to ensure proper tracking of algorithmic performance KPIs and prioritize performance improvements based on effort and impact.
  • Complete other responsibilities as assigned.
Required Skills and Qualifications:
  •  Minimum of 7 years’ post-secondary education or relevant work experience
  • Bachelor’s degree in mathematics, physics, computer science, engineering, statistics, or an equivalent technical discipline.
  • Minimum of 5 years’ software development experience with Python and SQL.
  • Minimum of 3 years’ experience building pipelines to deploy NLP and deep learning models into production in an AWS cloud environment
  • Minimum 3years’ experience using PyTorch, Tensorflow, or MXNet, along with optimizing code for GPU clusters
  • Retrieval-Augmented Generation (RAG) techniques – EITHER GraphRAG OR Contextual Retrieval
  • Experience building advanced workflows such as retrieval augmented generation, model chaining, dynamic prompting, PEFT/SFT, etc. using Langchain and similar tools
  • Experience establishing model guardrails and developing bias detection and mitigation techniques for AI applications using tools such as NeMo
  • Experience with various embedding models and setting up and tuning vector databases to improve performance of semantic search and retrieval systems
  • Understand the underlying fundamentals such as Transformers, Self-Attention mechanisms that form the theoretical foundation of LLMs
  • Experience working with a variety of relational SQL and NoSQL databases, big data tools: Hadoop, Spark, Kafka; a Linux environment; (AWS).
  • Knowledge of data pipeline and workflow management tools.
  • Expertise in standard software engineering methodology, e.g., unit testing, test automation, continuous integration, code reviews, design documentation.
Job Category: Engineering
Job Type: Contract
Job Location: Remote

Apply for this position

Allowed Type(s): .pdf, .doc, .docx