#自然语言处理#🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
#自然语言处理#NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
#下载器#Content-Addressable Data Synchronization Tool
翻译 - 内容可寻址数据同步工具
An extensible Java framework for building event-driven applications that break up XML and non-XML data into chunks for data integration
#自然语言处理#A package for parsing PDFs and analyzing their content using LLMs.
A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.
#大语言模型#The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
#大语言模型#A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
#自然语言处理#a modular multimodal framework for ai applications
webpack 2, react hotloader 3, react router v4, code splitting and more
📑 Split Laravel jobs into multiple separate job chunks
An asynchronous event-driven HTTP client based on netty.
#自然语言处理#Грамматический Словарь Русского Языка (+ английский, японский, etc)
Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in Javascript. Ships with extensive tests, a fuzz test and a benchmark.
#自然语言处理#Labelling Sequential Data in Natural Language Processing with R - using CRFsuite
#大语言模型#🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows