WazifaME
VE

Senior Software Engineer In Test (Full-Stack/Python)

Amman, Jordan

About the role

Mindrift connects specialized professionals with project-based opportunities to test, evaluate, and enhance AI systems for leading technology companies. This is a project-based engagement rather than permanent employment, offering flexibility for experienced developers seeking challenging work. As a Senior Software Engineer in Test (Full-Stack/Python), you will create sophisticated coding test cases that rigorously challenge AI coding systems. Your primary responsibility involves developing realistic, comprehensive test scenarios that push AI capabilities to their limits through complex reasoning tasks and edge case validation. You will review and refine coding tasks based on production codebases with realistic scope and requirements, write comprehensive functional tests that validate actual end-to-end behavior, and craft "fair but hard" challenges where AI systems must work through scattered information across files and external sources. Additionally, you will analyze AI failures to understand model limitations versus strengths and iterate based on feedback from expert QA reviewers.

What we're looking for

  • Degree in Computer Science, Software Engineering or related fields
  • 5+ years in software development with primary expertise in Python (pytest, async/await, subprocess, file operations)
  • Background in full-stack development with equal focus on React-based interfaces and robust back-end systems
  • Experience writing functional and integration tests (not just running them)
  • Proficiency with Docker containers for running evaluations locally
  • Understanding of CI/CD processes, particularly GitHub Actions (triggers, labels, reading results)
  • English proficiency at B2 level or higher

The hiring process

  1. STEP 01

    Recruiter call

    30 min · Within 5 days

  2. STEP 02

    Hiring manager

    60 min · Following week

  3. STEP 03

    Working session

    90 min · Take-home + review

  4. STEP 04

    Final + offer

    2 hours · Loop with team

Average time to offer: ~21 days. Your application is timestamped at every step — see applications.

Similar roles

More jobs →