You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you will be building the data, evaluation, and verification infrastructure for a competitive programming RL training environment. The role is focused on data engineering, sandboxed code execution, automated test generation, and grading systems rather than frontend or web development.
Due to the client’s time zone, we would appreciate a candidate who can work until 10:00 p.m.
Nice to have:


.png)
.png)












Since working with Acaisoft, our delivery velocity has increased at least 20 times. In the past months, we’ve done more developments than what had been done in the last four years in our company.
