The browser you are using is not supported by this website. All versions of Internet Explorer are no longer supported, either by us or Microsoft (read more here: https://www.microsoft.com/en-us/microsoft-365/windows/end-of-ie-support).

Please use a modern browser to fully experience our website, such as the newest versions of Edge, Chrome, Firefox or Safari etc.

LU

Emma Hammarlund

Research team manager

LU

AlphaML : A clear, legible, explainable, transparent, and elucidative binary classification platform for tabular data

Author

  • Ahmad Nasimian
  • Saleena Younus
  • Özge Tatli
  • Emma U. Hammarlund
  • Kenneth J. Pienta
  • Lars Rönnstrand
  • Julhash U. Kazi

Summary, in English

Leveraging the potential of machine learning and recognizing the broad applications of binary classification, it becomes essential to develop platforms that are not only powerful but also transparent, interpretable, and user friendly. We introduce alphaML, a user-friendly platform that provides clear, legible, explainable, transparent, and elucidative (CLETE) binary classification models with comprehensive customization options. AlphaML offers feature selection, hyperparameter search, sampling, and normalization methods, along with 15 machine learning algorithms with global and local interpretation. We have integrated a custom metric for hyperparameter search that considers both training and validation scores, safeguarding against under- or overfitting. Additionally, we employ the NegLog2RMSL scoring method, which uses both training and test scores for a thorough model evaluation. The platform has been tested using datasets from multiple domains and offers a graphical interface, removing the need for programming expertise. Consequently, alphaML exhibits versatility, demonstrating promising applicability across a broad spectrum of tabular data configurations.

Department/s

  • StemTherapy: National Initiative on Stem Cells for Regenerative Therapy
  • LUCC: Lund University Cancer Centre
  • Division of Translational Cancer Research
  • Molecular Cancer Research
  • EpiHealth: Epidemiology for Health
  • Molecular Evolution
  • Stem Cell Center

Publishing year

2024-01

Language

English

Publication/Series

Patterns

Volume

5

Issue

1

Document type

Journal article

Publisher

Cell Press

Topic

  • Bioinformatics (Computational Biology)

Keywords

  • deep tabular learning
  • drug sensitivity prediction
  • DSML 3: Development/Pre-production: Data science output has been rolled out/validated across multiple domains/problems
  • ensemble learning
  • explainable AI
  • feature selection
  • hyperparameter optimization
  • machine learning
  • precision medicine
  • TabNet
  • XGBoost

Status

Published

Research group

  • Molecular Cancer Research
  • Molecular Evolution

ISBN/ISSN/Other

  • ISSN: 2666-3899