Senior

Senior Machine Learning / AI Engineer (RL)

Python

LangChain

Natalia Przybył

IT Recruiter

mail

Location

Remote

Experience

Senior

You will work with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you will generate tasks in Reinforcement Learning environments. We build these environments to produce data that can be used to train models.
The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.

Due to the client’s time zone, we are looking for a candidate who can work from 2 p.m. to 10 p.m.

This is a 100% remote position.

Join us and make a real impact!

If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned, and you’ll collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.

Steps in the recruitment process:

HR call (max 15 min.)
Technical skills assessment via discussion of a short case study (no coding required)
Technical interview with our client (max 30 min.)

What will be your responsibilities?

Designing and deploying large-scale, fault-tolerant AI inference services and distributed systems to support real-time voice agents.
Leveraging and orchestrating foundation models using tools such as LangChain and LangGraph, including state-of-the-art prompting, agent design, and Retrieval-Augmented Generation (RAG) techniques.
Working with Reinforcement Learning (RL) techniques and implementing Continuous Learning/Online Optimization systems for production.
Architecting solutions using microservices and asynchronous messaging technologies such as Kafka and Azure Service Bus (critical for high-volume, real-time interactions).

Requirements

6+ years of experience in Python software engineering.
At least 3 years in Machine Learning/Environment Engineering or Data Scientist roles.
Practical knowledge of AI frameworks (LangChain).
Extensive practical experience working with AI, including prompt engineering and vibe coding.
Experience working with business requirements (analysis, summarizing, responding to changes).

Nice to have:

Knowledge of Codex or Claude Code.
Experience integrating AI into existing systems.
Understanding of RL concepts: reward modeling, environment dynamics, verifiability, evaluation, and agent interaction loops.
Familiarity with instrumentation, metrics, and data pipelines for RL evaluation.

170 - 230 PLN

net per month - B2B

gross per month - Employment Contract

Our benefits

01.

Multisport

Because we believe in the Polish saying "clean body, clean soul", we give you access to the sports facilities you prefer.

02.

Private healthcare insurance

When you join Acaisoft, you will receive private health insurance. If you have a spouse or family, we can extend the insurance to them as well.

03.

Internal initiatives

At Acaisoft, we regularly organize internal events such as game or movie nights, bar-hopping, or sports.

04.

Top class equipment

Before your first day at work, our team will send you all the equipment you will need. Our standard equipment includes a 16" MacBook Pro with a keyboard, mouse, headphones, and external monitor(s).

Recruitment process

STEP 1

Application

After you apply, we will review your resume and get back to you. If your experience matches the requirements, we will schedule a quick screening call to get to know you.

STEP 2

Interview

After a successful screening, you will be invited to an interview with one of our delivery managers and a software engineer, who will ask you technical questions.

STEP 3

Decision

After the interviews, we will get back to you as soon as possible with feedback and, if all goes well, a job offer. The feedback process usually takes up to 5 business days.

See similar jobs

We are constantly looking for top talent all over the world. If you did not find a position that suits your skills and experience, do not hesitate to contact us directly. We will be happy to talk to you about your future with us.

Senior Python Engineer (AI evaluations platform)

Senior

Remote

Backend

160 – 200 PLN

+ VAT (B2B) monthly

Python

API

Python Engineer with C++ (AI project)

Senior

Remote

Backend

160 – 200 PLN

+ VAT (B2B) monthly

Python

C++

Python Engineer

Regular

Pune, India

Python

$14/hour

+ VAT (B2B) monthly

Python

Python-Focused Fullstack (+React)

Senior

Remote

Fullstack

140 - 210 PLN

+ VAT (B2B) monthly

Python

React

Got a question?

Join Acaisoft & let's start our adventure together!

Natalia Przybył

Recruiter

mail

Since working with Acaisoft, our delivery velocity has increased at least 20 times. In the past months, we’ve done more developments than what had been done in the last four years in our company.

Pablo Verano

Head of Product

Scoopr
