GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub

编程语言

”chunking“ 的搜索结果

chunking_evaluation
@brandonstarxel

This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and incl...

Python338
4 个月前

相关主题

人工智能大语言模型自然语言处理ragchunking机器学习

Google   Bing   GitHub

late-chunking
@jina-ai

Code for explaining and evaluating late chunking (chunked pooling)

Python408
6 个月前
fine-uploader存档
@FineUploader

Multiple file upload plugin with image previews, drag and drop, progress bars. S3 and Azure support, image scaling, form support, chunking, resume, pause, and tons of other features.

JavaScriptweb-developmentfilesVanilla JavaScript
JavaScript8.17 k
7 年前
Netflix奈飞
rend
Netflix奈飞@Netflix

A memcached proxy that manages data chunking and L1 / L2 caches

Go1.19 k
6 年前
chonkie
@chonkie-inc

🦛 CHONK your texts with Chonkie ✨ — The no-nonsense RAG chunking library

ragretrieval-systems
Python1.62 k
7 小时前
The restic backup program
chunker
The restic backup program@restic

Implementation of Content Defined Chunking (CDC) in Go

Go330
2 年前
Meta-Chunking
@IAAR-Shanghai

Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception

Python224
24 天前
ACT
@Shaka-Labs

Action Chunking Transformer implementation for low cost robot

Jupyter Notebook306
1 年前
zarr-developers/zarr-python
zarr-python
@zarr-developers

An implementation of chunked, compressed, N-dimensional arrays for Python.

Python
Python1.73 k
17 小时前
video-splitter
@c0decracker

Simple Python script to split video into equal length chunks or chunks of equal size, duration, etc.

Python488
2 个月前
conlleval
@sighsmile

conlleval in Python (script for chunking/NER evaluation)

Python126
1 年前
chonkie-ai/chonkie
chonkie
@chonkie-ai

#自然语言处理#🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

人工智能chunkingragtext-processing自然语言处理
Python2.87 k
3 个月前
NLP-CHUNKING
@CaiRugou
内容违规,已屏蔽
12
1 年前
Alex Greene
WikiQuiz
Alex Greene@alexgreene

Generates a quiz for a Wikipedia page using parts of speech and text chunking.

nltkwikipedia
JavaScript803
5 年前
react-dynamic-route-loading-es6
@ModusCreateOrg

Auto chunking and dynamic loading of routes with React Router and Webpack 2

splittingWebpackReact
JavaScript295
2 年前
compress
@ning

High-performance, streaming/chunking Java LZF codec, compatible with standard C LZF package

Java255
5 个月前
vectordb
@kagisearch

#大语言模型#A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.

人工智能大语言模型机器学习
Python732
9 个月前
chonkie-ts
@chonkie-inc

🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library

TypeScript人工智能rag
TypeScript245
7 天前
LetoReader
@Axym-Labs

A free self-hostable speed reader. Highly customizable. Implements chunking (RSVP), pacing and highlighting. Modern UI and local-storage only.

BlazorOpen Sourcereading自托管LocalStorage
HTML209
8 个月前
neural_sequence_labeling
@26hzhang

A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.

TensorflowPythonsequence-labelingpos-taggerchunking
Python234
7 年前
unstructured
@Unstructured-IO

#自然语言处理#Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to...

深度学习document-parsing机器学习自然语言处理OCR
HTML11.77 k
3 小时前
embedditor
@IngestAI

#自然语言处理#⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation wi...

embeddings大语言模型vector-databasevector-searchvectorization
PHP226
2 年前
ColiVara
@tjmlabs

Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has state of the art retrieval performance on both text and visual do...

Python1.14 k
2 个月前
loading...