Hetionet: an integrative network of disease
Updated Apr 3, 2023 - HTML
R package for high-dimensional data analysis and integration with O2PLS.
An R package to download and merge labeled single-cell RNA-seq data from the PanglaoDB database into a Seurat object.
The PyDI framework provides methods for end-to-end data integration. The framework covers all steps of the integration process, including schema matching, data translation, entity matching, and data fusion. The framework offers traditional string-based methods as well as modern LLM- and embedding-based techniques for these tasks.
Serene Data Integration Platform
Software suite for marker gene identification and cell type integration from single-cell RNA-sequencing data
The Mannheim Data Integration Benchmark (MaDI-Bench) is an end-to-end benchmark for tabular data integration. It provides integration tasks across five domains, covering schema matching, value normalization, entity matching, and data fusion. It supports difficulty variants and measuring step-wise as well as end-to-end performance.
Integrating gene expression and biological knowledge for drug discovery and repurposing
This repo was created to support two publications. Please read the README.
Undergraduate final project (README needs updating). A scientific paper will be included soon.
R-based ML pipeline for customer status prediction with multi-source data integration and SMOTE-enhanced modeling.
Survey Hub is a versatile platform for creating and managing forms and templates. It features a user-friendly drag-and-drop editor, a library of customizable templates, and comprehensive real-time analytics. Perfect for both simple and complex survey needs. Enhance your survey management experience with ease. Check out the project on GitHub.
A workflow to integrate ecological monitoring data from different sources
Examples showing usage of CloverDX.
Geospatial and statistical analysis of mortality risk across U.S. National Park units using Python, SQL, and reproducible data science workflows.
YAML-driven REST API integration test framework with schema validation, data integrity checks, chained tests, and HTML reporting. Built for enterprise data integration testing.
Reproducible transcriptomic data integration workflow for multi-cohort and cross-platform datasets including annotation, normalization, batch correction and ML-ready feature preparation.
A lightweight Docker-based wrapper for the official TMLink image that adds n8n automation workflows and a simple agents UI to handle registration, CSV upload/approval, and record linkage with multimedia compatibility.