BanglaRegionalTextCorpus: A curated dataset for four regional bangla dialects with standard Bangla and English translation.
Ahmed MT +4 more
Contact and complexity in English varieties: The influence of speaker numbers on syntheticity and grammaticity.
Ehret K.
FeniVerse: A parallel corpus of Feni dialect, standard Bengali, and English.
Mahi MH, Khan AR, Hoque Z, Mojumdar MU.
Matrix-based pagerank control in hypergraphs for semantic text summaries.
Aleja D +3 more
An entropy-based study of Simplification in ChatGPT translations compared to neural machine translation and human translation across genres.
Yao G, Fan L.
Constructing the Corpus of Children's Video Media (CCVM): A new resource and guidelines for constructing comparable and reusable corpora.
Gowenlock A +3 more
ANCHOLIK-NER: A benchmark dataset for Bangla regional named entity recognition.
Paul B +7 more
Dataset on multiregional variations of Bangla language (BD-Dialect).
Rahman A, Muna NH, Prity MS.
Enhanced extractive text summarization framework for low-resourced Urdu language.
Nazir S +4 more

