Follow
David Dale
Title
Cited by
Cited by
Year
Text detoxification using large pre-trained neural models
D Dale, A Voronov, D Dementieva, V Logacheva, O Kozlova, N Semenov, ...
arXiv preprint arXiv:2109.08914, 2021
362021
Paradetox: Detoxification with parallel data
V Logacheva, D Dementieva, S Ustyantsev, D Moskovskiy, D Dale, ...
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
342022
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ...
arXiv preprint arXiv:2308.11596, 2023
332023
Detecting and mitigating hallucinations in machine translation: Model internal workings alone do well, sentence similarity even better
D Dale, E Voita, L Barrault, MR Costa-jussà
arXiv preprint arXiv:2212.08597, 2022
272022
Methods for detoxification of texts for the russian language
D Dementieva, D Moskovskiy, V Logacheva, D Dale, O Kozlova, ...
Multimodal Technologies and Interaction 5 (9), 54, 2021
132021
A large-scale computational study of content preservation measures for text style transfer and paraphrase generation
N Babakov, D Dale, V Logacheva, A Panchenko
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
122022
A simple solution for the Taxonomy enrichment task: Discovering hypernyms using nearest neighbor search
DS Dale
Computational Linguistics and Intellectual Technologies: Proceedings of the …, 2020
102020
Halomi: A manually annotated benchmark for multilingual hallucination and omission detection in machine translation
D Dale, E Voita, J Lam, P Hansanti, C Ropers, E Kalbassi, C Gao, ...
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
92023
Estimation of nested and zero-inflated ordered probit models
D Dale, A Sirchenko
The Stata Journal 21 (1), 3-38, 2021
92021
Seamless: Multilingual Expressive and Streaming Speech Translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ...
arXiv preprint arXiv:2312.05187, 2023
82023
Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification.
D Dementieva, S Ustyantsev, D Dale, O Kozlova, N Semenov, ...
CSW@ VLDB, 35-49, 2021
82021
RUSSE-2022: Findings of the first Russian detoxification task based on parallel corpora
D Dementieva, I Nikishina, V Logacheva, A Fenogenova, D Dale, ...
Computational Linguistics and Intellectual Technologies, 2022
62022
Don’t Lose the Message While Paraphrasing: A Study on Content Preserving Style Transfer
N Babakov, D Dale, I Gusev, I Krotova, A Panchenko
International Conference on Applications of Natural Language to Information …, 2023
42023
SkoltechNLP at SemEval-2021 Task 5: Leveraging Sentence-level Pre-training for Toxic Span Detection
D Dale, I Markov, V Logacheva, O Kozlova, N Semenov, A Panchenko
42021
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
MR Costa-jussà, MC Meglioli, P Andrews, D Dale, P Hansanti, E Kalbassi, ...
arXiv preprint arXiv:2401.05060, 2024
12024
Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification
D Dementieva, D Moskovskiy, D Dale, A Panchenko
arXiv preprint arXiv:2311.13937, 2023
12023
The first neural machine translation system for the Erzya language
D Dale
arXiv preprint arXiv:2209.09368, 2022
12022
Studying the role of named entities for content preservation in text style transfer
N Babakov, D Dale, V Logacheva, I Krotova, A Panchenko
International Conference on Applications of Natural Language to Information …, 2022
12022
Towards Red Teaming in Multimodal and Multilingual Translation
C Ropers, D Dale, P Hansanti, GM Gonzalez, I Evtimov, C Wong, C Touret, ...
arXiv preprint arXiv:2401.16247, 2024
2024
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation
MR Costa-jussà, D Dale, M Elbayad, B Yu
arXiv preprint arXiv:2311.06532, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–20