You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you’ll help develop advanced reinforcement learning (RL) environments and scalable evaluation systems that guide and shape the behavior of cutting-edge AI models. The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.
Due to the client’s time zone, we would appreciate a candidate who can work until 5:00 p.m., or occasionally until 6:00 p.m. If you prefer working slightly later hours, that’s perfectly okay with the client - but it’s not a requirement.
This is a 100% remote position, but if you enjoy working from an office, you’re warmly welcome to join us there too 😊
If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.
Since working with Acaisoft, our delivery velocity has increased at least 20 times. In the past months, we’ve done more developments than what had been done in the last four years in our company.