A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.
翻译 - (PyTorch)不平衡数据集采样器,用于对低频率类进行过采样和对高频率类进行欠采样。
Microarchitectural exploitation and other hardware attacks.
State-of-the-art neural cardinality estimators for join queries
TIP2022 Adaptive Boosting (AdaBoost) for Domain Adaptation ? :woman_shrugging: Why not ! 🙆♀️
Implémentation d'un modèle de scoring (OpenClassrooms | Data Scientist | Projet 7)
Adaptive data sampling and transmission in a wireless sensor node as a function of energy reserves
Generating realistic test data or simulating load with authentic, dynamic data using the Gatling framework and JavaFaker
This project aims to analyze the citation network of arXiv papers. We use Python to clean the data and create a Neo4j network to visualize and analyze the citation relationships between arXiv papers.
#自然语言处理#Some nlp utils file I write to reuse
Code and Data for paper: Variation across Scales: Measurement Fidelity under Twitter Data Sampling (ICWSM '20)
Process of data preparaton in R.
#计算机科学#Here is Task 5: Credit card fraud detection using machine learning, for my data science internship with Codsoft
A Python package for flexible subset selection for data visualization.
A method for sampling a balanced dataset from biased signals by leveraging statistical distributions derived from the data.
#计算机科学#Here is Task 5: Credit card fraud detection using machine learning, for my data science internship with Codsoft
This repository contains experiments on data wrangling techniques, focusing on methods for handling missing values, filtering, aggregation, and more.