Akshay, Akshay; Katoch, Mitali; Shekarchizadeh, Navid; Abedi, Masoud; Sharma, Ankush; Burkhard, Fiona C.; Adam, Rosalyn M; Monastyrskaya, Katia; Hashemi Gheinani, Ali (4 July 2023). Machine Learning Made Easy (MLme): A Comprehensive Toolkit for Machine Learning-Driven Data Analysis. (bioRxiv). Cold Spring Harbor Laboratory 10.1101/2023.07.04.546825
|
Text
Machine_Learning_Made_Easy__MLme__A_Comprehensive_Toolkit_for_Machine_Learning_Driven_Data_Analysis.pdf - Published Version Available under License Creative Commons: Attribution (CC-BY). Download (1MB) | Preview |
BACKGROUND
Machine learning (ML) has emerged as a vital asset for researchers to analyze and extract valuable information from complex datasets. However, developing an effective and robust ML pipeline can present a real challenge, demanding considerable time and effort, thereby impeding research progress. Existing tools in this landscape require a profound understanding of ML principles and programming skills. Furthermore, users are required to engage in the comprehensive configuration of their ML pipeline to obtain optimal performance.
RESULTS
To address these challenges, we have developed a novel tool called Machine Learning Made Easy (MLme) that streamlines the use of ML in research, specifically focusing on classification problems at present. By integrating four essential functionalities, namely Data Exploration, AutoML, CustomML, and Visualization, MLme fulfills the diverse requirements of researchers while eliminating the need for extensive coding efforts. To demonstrate the applicability of MLme, we conducted rigorous testing on six distinct datasets, each presenting unique characteristics and challenges. Our results consistently showed promising performance across different datasets, reaffirming the versatility and effectiveness of the tool. Additionally, by utilizing MLme's feature selection functionality, we successfully identified significant markers for CD8+ naive (BACH2), CD16+ (CD16), and CD14+ (VCAN) cell populations.
CONCLUSION
MLme serves as a valuable resource for leveraging machine learning (ML) to facilitate insightful data analysis and enhance research outcomes, while alleviating concerns related to complex coding scripts. The source code and a detailed tutorial for MLme are available at https://github.com/FunctionalUrology/MLme.