Lead Quality Engineer - AI

Company: Wolters Kluwer

Location: Texas (Remote)

Type: Full-time

Level: lead

Remote: Yes

Posted: 2026-03-04

About this role

  • This is a Hybrid role requiring 2 days a week in a Wolters Kluwer office\*\*

We are seeking a
Lead
AI Quality Engineer
to ensure the quality, reliability, and trustworthiness of AI-powered product experiences in Wolters Kluwer Tax and Accounting. This role goes beyond validating that buttons click—you will design tests that confirm the system behaves correctly, measuring retrieval accuracy, citation correctness, and overall alignment of responses with user intent. You will be a key contributor in helping us deliver a system customers can trust.


Key Responsibilities:

  • Design and implement evaluation harnesses to measure retrieval accuracy, citation correctness, response quality, and overall system behavior
  • Develop automated tests for APIs, ingestion pipelines, and chat workflows
  • Collaborate with developers and product managers to define quality metrics (accuracy, latency, cost, hallucination rate)
  • Analyze logs, traces, and feedback signals to identify root causes of failures in AI-driven responses
  • Create regression suites to ensure changes to prompts, chunking, or embeddings don’t break existing behavior
  • Validate REST APIs and service integrations for resilience, correctness, and security
  • Contribute to observability by instrumenting metrics and dashboards for system performance
  • Participate in sprint planning and retrospectives, ensuring testability is built into features from day one

Key Requirements:

  • Bachelors Degree in Computer Science or equivalent
  • 5+ years of experience in software testing, quality engineering, or equivalent engineering roles with a focus on validation and reliability.
  • Experience with AI evaluation frameworks (e.g. LlamaIndex evals, OpenAI Evals, Ragas, TruLens, or custom harnesses)
  • Strong skills in Python testing frameworks (Pytest, unittest, or equivalent)
  • Experience testing web applications and APIs
  • Familiarity with AI/ML or non-deterministic system testing
  • Knowledge of CI/CD pipelines, ...

Create Your Job Alert

Other Lead Jobs

Other Jobs in Texas