Allegro ML Research

AlleNoise - large-scale text classification benchmark dataset with real-world label noise

Alicja Rączkowska

Understand, translate, respond - Clever language modeling at Allegro

Aleksandra Osowska-Kurczab & Jacek Szczerbiński

Talk at Data Science Summit Machine Learning Edition - 14.06.2024

Dense Retrieval for Allegro Search Engine

Aleksandra Chrabrowa

Seminar at Warsaw.ai - Episode XXIII - 23.05.2024

Machine Translation at Allegro – Use your model wise but data wiser

Mikołaj Koszowski

Seminar at Warsaw.ai - Episode XXI - 18.01.2024

Building a vector search engine

Maciej Mościcki

MOPS Community

Architecting ML for Huge Impact and Scale

Szymon Jacoń

MOPS Community

The structure of customer service data @Allegro

Aleksandra Chrabrowa

GHOST Day: AMLC 2022

Blog

How to create a synthetic annotator? The process of developing a domain-specific LLM-as-a-Judge.

4 months ago

In this blogpost we want to introduce the topic of using a Large Language Model (LLM) as an evaluator — a novel approach to tackling…

Zuzanna Rękawek…

0 Comments

read post

Trust no one, not even your training data! Machine learning from noisy data

about 2 years ago

Label noise is ever-present in machine learning practice. Allegro datasets are no exception. We compared 7 methods for training classifiers robust to label…

Alicja Rączkowska…

0 Comments

read post

Turn-Based Offline Reinforcement Learning

about 3 years ago

This blogpost is the result of a research collaboration between the Allegro Machine Learning Research team and the Institute of Mathematics of the Polish Academy of…

Riccardo Belluzzo…

0 Comments

read post

Open-Source

AlleNoise

Large-scale text classification benchmark dataset with real-world label noise. It is meant to spark development of new robust classification methods.

Hugging Face Allegro

We contribute to the NLP community by publishing models and datasets to the Hugging Face Hub!

allms

Versatile and powerful library designed to streamline the process of querying large language models.

Simple and User-Friendly Interface,
Asynchronous Querying,
Automatic Retrying Mechanism,
Error Handling and Management,
Output Parsing

allRank

Framework for training neural Learning-to-Rank (LTR) models, featuring implementations of:

common pointwise, pairwise and listwise loss function,
fully connected and Transformer-like scoring function,
commonly used evaluation metrics like Normalized Discounted Cumulative Gain (NDCG) and Mean Reciprocal Rank (MRR},
click-models for experiments on simulated click-through data

KLEJ Benchmark

The KLEJ benchmark (Kompleksowa Lista Ewaluacji Językowych) is a set of nine evaluation tasks for the Polish language understanding. Key benchmark features:

It contains a diverse set of tasks from different domains and with different objectives,
Most tasks are created from existing datasets but we also release the new sentiment analysis dataset from an e-commerce domain.

HerBERT

HerBERT is a BERT-based language model trained on six different corpora for Polish language understanding. It achieves state-of-the-art results on multiple downstream tasks, including KLEJ Benchmark and Part-of-Speech tagging. We release both Base and Large variants of the model as a part of transformers library for anyone to use.

Publications

2025

RetrySQL: text-to-SQL training with retry data for self-correcting query generation

Authors: Alicja Rączkowska, Riccardo Belluzzo, Piotr Zieliński, Joanna Baran, Paweł Olszewski

Accepted at: arXiv

2024

AlleNoise - large-scale text classification benchmark dataset with real-world label noise

Authors: Alicja Rączkowska, Aleksandra Osowska-Kurczab, Jacek Szczerbiński, Kalina Jasinska-Kobus, Klaudia Nazarko

Accepted at: 28th International Conference on Artificial Intelligence and Statistics (AISTATS 2025)

2023

Improving Domain-Specific Retrieval by NLI Fine-Tuning

Authors: Roman Dušek, Aleksander Wawer, Christopher Galias, Lidia Wojciechowska

Accepted at: Proceedings of the 18th Conference on Computer Science and Intelligence Systems, FedCSIS 2023

2023

Going beyond research datasets: Novel intent discovery in the industry setting

Authors: Aleksandra Chrabrowa, Tsimur Hadeliya, Dariusz Kajtoch, Robert Mroczkowski, Piotr Rybak

Accepted at: Findings of the Association for Computational Linguistics: EACL 2023

2022

Evaluation of Transfer Learning for Polish with a Text-to-Text Model

Authors: Aleksandra Chrabrowa, Łukasz Dragan, Karol Grzegorczyk, Dariusz Kajtoch, Mikołaj Koszowski, Robert Mroczkowski, Piotr Rybak

Accepted at: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022)

2021

Allegro.eu Submission to WMT21 News Translation Task

Authors: Mikołaj Koszowski, Karol Grzegorczyk, Tsimur Hadeliya

Accepted at: Proceedings of the Sixth Conference on Machine Translation

2021

HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish

Authors: Robert Mroczkowski, Piotr Rybak, Alina Wróblewska, Ireneusz Gawlik

Accepted at: BSNLP, accepted long paper

2020

KLEJ: Comprehensive Benchmark for Polish Language Understanding

Authors: Piotr Rybak, Robert Mroczkowski, Janusz Tracz, Ireneusz Gawlik

Accepted at: ACL 2020, accepted long paper

2020

Context-Aware Learning to Rank with Self-Attention

Authors: Przemysław Pobrotyn, Tomasz Bartczak, Mikołaj Synowiec, Radosław Białobrzeski, Jarosław Bojar

Accepted at: SIGIR eCommerce Workshop 2020, contributed talk

2020

NeuralNDCG: Direct Optimisation of a Ranking Metric via Differentiable Relaxation of Sorting

Authors: Przemysław Pobrotyn, Radosław Białobrzeski

Accepted at: The 2021 SIGIR Workshop On eCommerce (SIGIR eCom ’21)