#自然语言处理#STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)
Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of different layouts in a declarative way.
#自然语言处理#Programming language for symbolic computation with unusual combination of pattern matching features: Tree patterns, associative patterns and expressions embedded in patterns.
#自然语言处理#A dataset for extracting information from repair manuals
#自然语言处理#Implementation of the semi-structured inference model in our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data.
#自然语言处理#Repository containing code for the NAACL 2021 paper (Incorporating External Knowledge to Enhance Tabular Reasoning)
Endoscopic and Pathological data extraction for various endo-pathological data extraction
An ActiveModel extension to model your semi-structured data using embedded associations
Urban Dict spelling variant dataset. Source code of How to Evaluate Word Representations of Informal Domain?
#自然语言处理#This repository contains the official code for the paper : Realistic Data Augmentation Framework for Enhancing Tabular Reasoning (Findings-EMNLP, 2022).
Schema inference for semistructured data using Formal Concept Analysis
#自然语言处理#A semi-automatic web-based annotation tool for MyFixit dataset :
#自然语言处理#Implementation of the semi-structured inference model in our ACL 2023 paper: INFOSYNC: Information Synchronization across Multilingual Semi-structured Tables.
#自然语言处理#Web-based workflow management system that computes candidate tool workflows given input file(s) and the user's requirements regarding the output. Afterwards, runs a workflow selected by the user from ...
Java Standalone application for querying XML documents with requests with preferences (GTPs requests with preferences)
Framework to manipulate semi structured documents and extract data from them
Eloquent Serialized LOB is a trait for Laravel Eloquent models that allows Serialized LOB pattern