A very comprehensive Classical Chinese (ancient Chinese) to modern Chinese parallel corpus
Updated Apr 21, 2024 - Python
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Multilingual sentence alignment using sentence embeddings
OpusFilter - Parallel corpus processing toolkit
Leeds University and King Saud University (LK) Hadith Corpus
Neural Machine Translation on the Nepali-English language pair
Multilingual and monolingual corpora focused on Caucasus languages, for Natural Language Processing (NLP)
An easy-to-use library for linguistically comparing a sentence and its words to another sentence, in the same language or a different one. Useful, for instance, for comparing a translation with the original text, finding differences and similarities between two translations, or seeing how a machine translation differs from a reference translation.
OPUS (opus.nlpl.eu) Python3 API
A Python application that generates parallel corpora for any language pair; can be used to train NMT (Neural Machine Translation) systems
🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as part of the winning `en-ru` submission to the WMT20 Biomedical Translation Task.
Pali Buddhist scriptures of 15 countries and their parallel corpus
Code to extract multilingual parallel corpus from Press Information Bureau (PIB) website.
Parallel corpus and multilingual machine translation system for the Pali Buddhist scriptures of 15 countries
Parallel corpus annotation and visualization
Extracting present perfects (and related forms) from parallel corpora
A simple and efficient tool for mining and aligning sentences with pre-trained models.
Creating (parallel) corpora from scratch using Uplug tooling
Odia Wikipedia monolingual corpus extraction