AI Evaluation Engineer C++ / Rust / Systems
Review & validate AI benchmark tasks in C++/Rust repos. Run containerized builds & test suites, verify patches & solution, debug compilation & runtime failure. Assess task quality for correctness & reproducibility. Strong build-system & Linux needed.