Sii Ukraine

SII POLAND

SII SWEDEN

Join us Contact us

Sii Ukraine

SII POLAND

SII SWEDEN

Back

Technologies & tools

We are currently looking for an ML Engineer to join our team working on a project for an international client that delivers news and information on a global scale. The team is developing a solution based on large language models (LLMs), specifically tailored for processing legal language.

We are seeking a person who will focus on collecting, organizing, and preparing high-quality textual datasets. Your primary responsibility will be end-to-end data handling: acquisition, transformation, and structuring of data for LLM training purposes.

If you're passionate about working with data at scale and want to contribute to cutting-edge LLM technology in the legal domain — we’d love to hear from you!

Your tasks

  • Collecting and processing legal-domain textual data for use in LLMs
  • Building and optimizing data pipelines in the AWS environment
  • Cleaning, filtering, and standardizing large-scale datasets
  • Integrating data from various sources – both structured (SQL) and unstructured (NoSQL)
  • Documenting processes, generating reports, and presenting findings and technical solutions
  • Collaborating closely with engineering and research teams to ensure high data quality

Requirements

  • At least 5 years of professional experience in software or data engineering roles
  • Strong programming skills, especially in Python
  • Solid knowledge of database systems: SQL and NoSQL
  • Proven experience working with AWS, with an emphasis on ML solutions (e.g., pipeline development, model training)
  • Ability to assess data quality in the context of Machine Learning
  • Excellent communication skills – both written and verbal – for reporting and presenting technical concepts

Job no. 250423-7168J

Technologies & tools

We are currently looking for an ML Engineer to join our team working on a project for an international client that delivers news and information on a global scale. The team is developing a solution based on large language models (LLMs), specifically tailored for processing legal language.

We are seeking a person who will focus on collecting, organizing, and preparing high-quality textual datasets. Your primary responsibility will be end-to-end data handling: acquisition, transformation, and structuring of data for LLM training purposes.

If you're passionate about working with data at scale and want to contribute to cutting-edge LLM technology in the legal domain — we’d love to hear from you!

Your tasks

  • Collecting and processing legal-domain textual data for use in LLMs
  • Building and optimizing data pipelines in the AWS environment
  • Cleaning, filtering, and standardizing large-scale datasets
  • Integrating data from various sources – both structured (SQL) and unstructured (NoSQL)
  • Documenting processes, generating reports, and presenting findings and technical solutions
  • Collaborating closely with engineering and research teams to ensure high data quality

Requirements

  • At least 5 years of professional experience in software or data engineering roles
  • Strong programming skills, especially in Python
  • Solid knowledge of database systems: SQL and NoSQL
  • Proven experience working with AWS, with an emphasis on ML solutions (e.g., pipeline development, model training)
  • Ability to assess data quality in the context of Machine Learning
  • Excellent communication skills – both written and verbal – for reporting and presenting technical concepts

Job no. 250423-7168J

Quick apply

Fill in the form in English please

ML Engineer

Work mode*

Select at least one option

Option was not selected

Attach CV*

Uploaded file:
  • file_icon Created with Sketch.

Acceptable files: doc, docx, pdf. (max 5MB)
Please submit your file in DOC, DOCX or PDF format
The upload size is limited to 5 MB
File is empty
File was not uploaded

At any time, you may withdraw your consent to the processing of personal data, but such withdrawal shall not affect the legal compliance of any processing of such data, which had occurred before you withdrew your consent. Detailed information on the processing of your personal data is specified in the Privacy Policy.

Sii Poland follows the Procedure for reporting law violations.

Create MySii account to follow your application's status
success

Your application has been submitted

We will contact you as soon as we review your CV

Processing...

Sorry, something went wrong and your message was not delivered

Refresh the page and try again. Contact us form, if problem occurs again

Benefits for you

Apply now Recommend a friend

Änderungen im Gange

Wir aktualisieren unsere deutsche Website. Wenn Sie die Sprache wechseln, wird Ihnen die vorherige Version angezeigt.

This content is available only in English version.

Are you sure you want to leave this page?

Цей контент доступний тільки в одній мовній версії.
Ви будете перенаправлені на головну сторінку.

Ви справді бажаєте залишити цю сторінку?