#Natural Language Processing# General technology for enabling AI capabilities with LLMs and MLLMs
📃 Language-model-based sentence scoring library
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language M...
Scholar Copilot is an intelligent academic writing assistant that enhances the research writing process through AI-powered text completion and citation suggestions
#Natural Language Processing# Bangla-Bert is a pretrained BERT model for the Bengali language
#Large Language Models# The LM Contamination Index is a manually created database of contamination evidence for LMs.
Korean text normalization and language preparation package for LMs in Kaldi-based ASR systems
#Natural Language Processing# 🐍 Python library for n-gram models in ARPA format
Code for the experiments in our EMNLP 2021 paper "Open Aspect Target Sentiment Classification with Natural Language Prompts"
#Natural Language Processing# Embeddings: State-of-the-art text representations for natural language processing tasks; an initial version of the library, focused on the Polish language
The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)
Automatically extracts NT and LM hashes from Windows memory dumps, based on Volatility.