A general purpose Snakemake workflow and MrBiomics module to perform unsupervised analyses (dimensionality reduction & cluster analysis) and visualizations of high-dimensional data.
Updated Sep 12, 2025 - Python
Pepelka is a MATLAB toolbox for data clustering and visualization.
A Python package that implements a distance-based extension of the adjusted Rand index for the supervised validation of two cluster analysis solutions.
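For orientation, here is a minimal sketch of the standard (not the distance-based) adjusted Rand index, computed from pair counts in the contingency table of two labelings. It is not taken from the package above; the function name `adjusted_rand_index` and its handling of degenerate inputs are assumptions made for illustration.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Standard adjusted Rand index between two labelings of the same items.

    Illustrative sketch only: this is the classic pair-counting form,
    not the distance-based extension described above.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same items")
    n = len(labels_a)
    if n < 2:
        # No pairs to compare; treat as perfect agreement (assumption).
        return 1.0

    # Pairs of items placed together in both labelings.
    contingency = Counter(zip(labels_a, labels_b))
    sum_cells = sum(comb(count, 2) for count in contingency.values())

    # Pairs of items placed together within each labeling separately.
    sum_a = sum(comb(count, 2) for count in Counter(labels_a).values())
    sum_b = sum(comb(count, 2) for count in Counter(labels_b).values())

    expected = sum_a * sum_b / comb(n, 2)  # agreement expected by chance
    max_index = (sum_a + sum_b) / 2        # upper bound on agreement
    if max_index == expected:
        # Degenerate case, e.g. both labelings are a single cluster.
        return 1.0
    return (sum_cells - expected) / (max_index - expected)
```

Because the index only counts co-clustered pairs, it does not depend on how clusters are named: `adjusted_rand_index([0, 0, 1, 1], [1, 1, 0, 0])` scores the two labelings as identical.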
This folder contains code for the paper titled "Information maximization-based clustering of histopathology images using deep learning".
Clustering algorithms, validation and interpretation, with some advanced topics in robust statistics. Real case applications. My R scripts to solve some university exams ;) - Work in progress
This repository contains the code for the conference paper titled "Feature extraction and unsupervised clustering of histopathological images of pancreatic cancer using information maximization".
A data-driven customer segmentation project using hierarchical (Ward.D2) and k-means clustering on retail survey data. The analysis applies z-score normalization, Euclidean distance and cluster validation (NbClust) to identify four distinct segments and translate insights into strategic targeting recommendations using the McKinsey GE Matrix.
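The preprocessing step named in that description (z-score normalization followed by Euclidean distance) can be sketched as below. This is a hypothetical illustration, not code from the project: the helper names `zscore_columns` and `euclidean` are invented here, and the sample standard deviation (n - 1 denominator) is an assumption.

```python
from math import sqrt
from statistics import mean, stdev


def zscore_columns(rows):
    """Scale each column to mean 0 and sample standard deviation 1.

    Assumes at least two rows; constant columns are mapped to 0.0.
    """
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    stds = [stdev(col) for col in columns]  # sample SD (n - 1), an assumption
    return [
        [(value - m) / s if s > 0 else 0.0
         for value, m, s in zip(row, means, stds)]
        for row in rows
    ]


def euclidean(u, v):
    """Euclidean distance between two equal-length numeric vectors."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# Two survey variables on very different scales (made-up values).
raw = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
scaled = zscore_columns(raw)
distance = euclidean(scaled[0], scaled[2])
```

Without the scaling step, the second column would dominate every distance simply because its values are larger. After scaling, both variables contribute equally to the distances that the hierarchical and k-means steps operate on.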
Internship project: Decomposition of somatic mutation profiles into mutational signatures
This repository contains the source code associated with the paper titled "Implementation of a conditional latent diffusion-based generative model to synthetically create unlabeled histopathological images".