Senior Software Engineer in Test (Full-Stack/Python)
Amman, Jordan
About the role
Mindrift connects specialized professionals with project-based opportunities to test, evaluate, and enhance AI systems for leading technology companies. This is a project-based engagement rather than permanent employment, offering flexibility for experienced developers seeking challenging work.
As a Senior Software Engineer in Test (Full-Stack/Python), you will create sophisticated coding test cases that rigorously challenge AI coding systems. Your primary responsibility involves developing realistic, comprehensive test scenarios that push AI capabilities to their limits through complex reasoning tasks and edge case validation.
You will review and refine coding tasks based on production codebases with realistic scope and requirements, write comprehensive functional tests that validate actual end-to-end behavior, and craft "fair but hard" challenges where AI systems must piece together information scattered across files and external sources. You will also analyze AI failures to distinguish model limitations from strengths, and iterate based on feedback from expert QA reviewers.
What we're looking for
- Degree in Computer Science, Software Engineering, or a related field
- 5+ years in software development with primary expertise in Python (pytest, async/await, subprocess, file operations)
- Background in full-stack development with equal focus on React-based interfaces and robust back-end systems
- Experience writing functional and integration tests (not just running them)
- Proficiency with Docker containers for running evaluations locally
- Understanding of CI/CD processes, particularly GitHub Actions (triggers, labels, reading results)
- English proficiency at B2 level or higher
The hiring process
- Step 1: Recruiter call (30 min, within 5 days)
- Step 2: Hiring manager (60 min, following week)
- Step 3: Working session (90 min, take-home + review)
- Step 4: Final + offer (2 hours, loop with team)
Average time to offer: ~21 days. Your application is timestamped at every step.