You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you will work on generating tasks in Reinforcement Learning environments. We create environments for producing training data that can be used to train models.
The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.
Due to the client’s time zone, we would appreciate a candidate who can work 2 p.m. - 10 p.m.
This is a 100% remote position.
If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.
Steps in the recruitment process:
Nice to have:


.png)
.png)








Since working with Acaisoft, our delivery velocity has increased at least 20 times. In the past months, we’ve done more developments than what had been done in the last four years in our company.
