quick_intNMF code from 01/12/23
Code for inNMF (abstract below). This is a reference of the code base at the point of submission to NAR.
Single cell multi-modal technologies provide powerful means to simultaneously profile cellular states. These are now being employed to study gene regulatory mechanisms in a variety of biological systems. Tailored computational methods for integration and analysis of these data are much-needed with desirable properties in terms of efficiency - to cope with high dimensionality of the data, interpretability - for downstream biological discovery and hypothesis generation, and flexibility - to easily incorporate future modalities. Existing methods cover some but not all of the desirable properties for effective integration and analysis of these data.\\
Here we present a highly efficient method, intNMF, for representation and integration of single cell multi-modal data using joint non-negative matrix factorisation which can facilitate discovery of linked regulatory topics in each modality. We provide thorough benchmarking using large publicly available datasets against five popular existing methods. intNMF performs comparably against the current state-of-the-art across and range of metrics, and provides advantages in terms of computational efficiency and interpretability of discovered regulatory topics in the original feature space. We illustrate this enhanced interpretability in providing insights into cell state changes associated with Alzheimer’s disease.
intNMF is available as a Python package with extensive documentation and use-cases at https://github.com/wmorgans/quick_intNMF