low-resource-nlp

Here are 29 public repositories matching this topic...

adbar / simplemma

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

nlp tokenizer language-detection wordlist lemmatizer morphological-analysis lemmatiser tokenization lemmatization corpus-tools language-identification low-resource-nlp

Updated Jun 6, 2025
Python

cisnlp / GlotLID

💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

language-detection multlingual language-detector language-recognition glot lid language-identification language-classification language-identification-toolkit low-resource-languages language-detection-library language-identifier language-detection-lib langid low-resource-nlp glotcc glotlid

Updated Jun 5, 2025
Python

csebuetnlp / banglanmt

This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.

machine-translation neural-machine-translation parallel-corpus parallel-corpora bangla-nlp low-resource-languages bangla-machine-translation bangla-dataset-machine-translation emnlp-2020 low-resource-nlp low-resource-machine-translation

Updated Oct 23, 2024
Python

ljvmiranda921 / calamanCy

NLP pipelines for Tagalog using spaCy

nlp machine-learning natural-language-processing spacy computational-linguistics ner low-resource-languages low-resource-nlp

Updated Jul 20, 2025
Python

231sm / Reasoning_In_EE

Code and datasets for the ACL 2021 paper "OntoED: Low-resource Event Detection with Ontology Embedding"

information-extraction event-extraction low-resource low-resource-nlp ontoed

Updated Apr 19, 2022
Python

zjunlp / RAP

[SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction

Updated Apr 5, 2023
Python

KennethEnevoldsen / scandinavian-embedding-benchmark

A Scandinavian Benchmark for sentence embeddings

nlp benchmark natural-language-processing low-resource-nlp scandinavian

Updated May 23, 2025
Python

luciusssss / mc2_corpus

[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)

multilingual natural-language-processing corpus mongolian tibetan tibetan-nlp uyghur kazakh low-resource-languages low-resource-nlp

Updated Jun 16, 2025
Python

luciusssss / ZhuangBench

[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly

low-resource-languages zhuang low-resource-nlp large-language-models llm

Updated Mar 13, 2025
Python

StefanHeng / ProgGen

Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"

natural-language-processing named-entity-recognition data-generation few-shot-learning training-data-generation low-resource-nlp large-language-models efficient-nlp

Updated Mar 29, 2024
Python

csebuetnlp / banglaparaphrase

This repository contains the code, data, and associated models of the paper titled "BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset", accepted in Proceedings of the Asia-Pacific Chapter of the Association for Computational Linguistics: AACL 2022.

paraphrase-generation bangla-nlp low-resource-nlp bangla-paraphrase

Updated Nov 14, 2022
Python

AsifulNobel / Metsys

Chatbot Solution for Resource-Poor Languages. Contains code and data for Journal Article 'Focused domain contextual AI chatbot framework for resource poor languages'.

nlp website natural-language-processing neural-network chatbot django-application django-channels restful-api nlp-machine-learning bangla-nlp low-resource-languages bangla-ai low-resource-nlp resource-poor-languages customer-service-chatbot

Updated Jul 25, 2021
Python

nicolay-r / RuSentRel-Leaderboard

This is the official leaderboard for the RuSentRel-1.1 dataset, originally described in the paper arXiv:1808.08932.

benchmark sentiment-analysis leaderboard cnn neural-networks attention language-models attention-mechanism relation-extraction classifiers bilstm bert-model low-resource-nlp chatgpt

Updated Dec 28, 2023
Python

pnborchert / MultiRep

Efficient Information Extraction in Few-Shot Relation Classification through Contrastive Representation Learning. NAACL 2024.

information-extraction relation-extraction few-shot fewrel contrastive-learning low-resource-nlp

Updated Jun 18, 2024
Python

HenningBuhl / low-resource-machine-translation

This repository is an open-source collection of various low-resource machine translation experiments.

Updated May 23, 2023
Python

Lhtie / Bio-Domain-Transfer

Implementation of NAACL 2024 main conference paper: Named Entity Recognition Under Domain Shift via Metric Learning for Life Science

chemical pytorch information-extraction named-entity-recognition nltk biomedical knowledge-transfer few-shot contrastive-learning low-resource-nlp doamin-adaptation transformers-bert

Updated Jun 19, 2024
Python

EagleW / Chem-FINESE

Official implementation of the EACL Findings 2024 paper: Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction

transformers pytorch information-extraction datasets reconstruction chemical-data few-shot few-shot-learning constrastive-learning low-resource-nlp large-language-models chemical-information-extraction

Updated Mar 18, 2024
Python

ruoyuxie / noisy_parallel_data_alignment

Enhanced awesome-align for low-resource languages and noise simulation: https://arxiv.org/abs/2301.09685

ocr noise word-aligner word-alignment noisy-data ocr-text low-resource-languages nueral-machine-translation low-resource-nlp

Updated Mar 4, 2023
Python

chschroeder / self-training-for-sample-efficient-active-learning

Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models (EMNLP 2024)

active-learning low-resource-nlp llms active-learning-in-nlp

Updated Nov 2, 2024
Python

Rui0828 / Learning-From-Mistakes-Prompting

LoResMT@ACL 2024: Learning-From-Mistakes Prompting for Indigenous Language Translation – A feedback-driven approach to enhance low-resource translation.

natural-language-processing machine-translation low-resource few-shot-learning low-resouce-language low-resource-nlp low-resource-machine-translation in-context-learning chain-of-thought

Updated Dec 6, 2024
Python
