Data integration by fuzzy similarity-based hierarchical clustering-Reference-Cited by-同舟云学术

Data integration by fuzzy similarity-based hierarchical clustering

Published:2020-08 Issue:S10 Volume:21 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Ciaramella Angelo,Nardone Davide,Staiano Antonino

Abstract

Abstract Background High throughput methods, in biological and biomedical fields, acquire a large number of molecular parameters or omics data by a single experiment. Combining these omics data can significantly increase the capability for recovering fine-tuned structures or reducing the effects of experimental and biological noise in data. Results In this work we propose a multi-view integration methodology (named FH-Clust) for identifying patient subgroups from different omics information (e.g., Gene Expression, Mirna Expression, Methylation). In particular, hierarchical structures of patient data are obtained in each omic (or view) and finally their topologies are merged by consensus matrix. One of the main aspects of this methodology, is the use of a measure of dissimilarity between sets of observations, by using an appropriate metric. For each view, a dendrogram is obtained by using a hierarchical clustering based on a fuzzy equivalence relation with Łukasiewicz valued fuzzy similarity. Finally, a consensus matrix, that is a representative information of all dendrograms, is formed by combining multiple hierarchical agglomerations by an approach based on transitive consensus matrix construction. Several experiments and comparisons are made on real data (e.g., Glioblastoma, Prostate Cancer) to assess the proposed approach. Conclusions Fuzzy logic allows us to introduce more flexible data agglomeration techniques. From the analysis of scientific literature, it appears to be the first time that a model based on fuzzy logic is used for the agglomeration of multi-omic data. The results suggest that FH-Clust provides better prognostic value and clinical significance compared to the analysis of single-omic data alone and it is very competitive with respect to other techniques from literature.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/s12859-020-03567-6.pdf

Reference21 articles.

1. Camastra F, Di Taranto MD, Staiano A. Statistical and computational methods for genetic diseases: An overview. Comput Math Meth Med. 2015; 2015(Article ID 954598):1–8.

2. Serra A, Fratello M, Fortino V, Raiconi G, Tagliaferri R, Greco D. Mvda: a multi-view genomic data integration methodology. BMC Bioinformatics. 2015; 16(261):1–13.

3. Rappoport N, Shamir R. Multi-omic and multi-view clustering algorithms: review and cancer benchmark. Nucleic Acids Res. 2018; 46(20):10546–62.

4. Reddy CK, Aggarwal CC. Data Clustering. Boca Raton: Chapman and Hall/CRC; 2016.

5. Camastra F, Ciaramella A, Son LH, Riccio A, Staiano A. Fuzzy similarity-based hierarchical clustering for atmospheric pollutants prediction. LNCS. 2019; 11291:123–33.

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Differential evolutionary optimization fuzzy entropy for gland segmentation based on breast mammography imaging;Journal of Radiation Research and Applied Sciences;2024-09

2. Advance computational tools for multiomics data learning;Biotechnology Advances;2024-09

3. Unsupervised Learning for Characterizing Type IV Secreted Effectors;2024 4th International Conference on Applied Artificial Intelligence (ICAPAI);2024-04-16

4. Graded Mean Integration Representation and Intuitionistic Fuzzy Weighted Arithmetic Mean for Similarity Measures in Case-Based Reasoning;International Journal of Fuzzy Systems;2024-04-08

5. Improving Real-Time Data Streams Performance on Autonomous Surface Vehicles using DataX;2024 32nd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP);2024-03-20