Shape the future of ML evaluation with innovative frameworks and tools. Collaborate remotely with top-tier professionals in a dynamic environment. Drive impactful strategies to enhance product reliability and model quality.
Applied Machine Learning Engineer – Evaluation
in Information Technology PermanentJob Detail
Job Description
Overview
- Lead the creation of advanced evaluation frameworks for machine learning models, ensuring robust performance and reliability.
- Collaborate with multidisciplinary teams to align evaluation strategies with product goals and real-world applications.
- Design scalable pipelines and tools to automate model evaluation processes, enhancing efficiency and accuracy.
- Implement innovative metrics and ground truth definitions to measure and improve model quality.
- Develop visualization tools and dashboards to monitor trends, regressions, and evaluation results effectively.
- Contribute to the continuous improvement of workflows, accelerating iteration cycles and enhancing signal quality.
- Work in a fully remote environment, fostering collaboration and innovation across diverse teams.
Key Responsibilities & Duties
- Define and implement evaluation metrics and ground truth for machine learning systems, ensuring clarity and reproducibility.
- Develop automated pipelines for model evaluation and integrate results into centralized systems.
- Create human-in-the-loop tools for error analysis and failure categorization.
- Build internal dashboards to visualize evaluation outcomes and monitor trends over time.
- Collaborate with machine learning, product, and research teams to ensure evaluations align with practical use cases.
- Optimize workflows to enhance efficiency, reliability, and scalability of evaluation processes.
- Influence and establish evaluation standards across teams to maintain consistency and quality.
Job Requirements
- Bachelor of Science degree in a relevant field is required.
- Minimum of 4 years of professional experience in applied machine learning or machine learning engineering.
- Proven expertise in designing and implementing machine learning evaluation frameworks in production environments.
- Proficiency in metrics design, ground truth construction, and automated evaluation pipelines.
- Experience with tools such as Weights & Biases for experiment tracking and analysis.
- Ability to develop internal tools and dashboards to support machine learning workflows.
- Preferred experience in natural language processing, computer vision, or multimodal model evaluation.
- ShareAustin: