AI Alone Isn’t Enough: Researchers Call for Mathematical Modeling in Cancer Predictions

By merging the strengths of AI and mathematical modeling, and championing transparent, ethical data sharing, University of Maryland researchers unveil a new path toward safer, smarter, and more personalized cancer treatment.

Image Credit: raker / Shutterstock

With the advent of artificial intelligence (AI), predictive medicine is becoming an important part of healthcare, especially in cancer treatment. Predictive medicine uses algorithms and data to help doctors understand how a cancer might continue to grow or react to specific drugs, making it easier to target precision treatment for individual patients.

While AI is important in this work, researchers from the University of Maryland School of Medicine (UMSOM) say it should not be relied on exclusively. Instead, AI should be combined with other methods, such as traditional mathematical modeling, for the best outcomes.

In a commentary published in Nature Biotechnology, Elana Fertig, PhD, Director of the Institute for Genome Sciences (IGS) and Professor of Medicine at UMSOM, and Daniel Bergman, PhD, an IGS scientist, argue that mathematical modeling has been underestimated and underused in precision medicine to date.

All health computational models need three key components to work: datasets, equations, and software. After generating data, they leverage it to improve early diagnoses, discover new treatments, and aid understanding of diseases. 

In a second commentary in Cell Reports Medicine, Dr. Fertig and IGS colleagues Dmitrijs Lvovs, PhD, Anup Mahurkar, PhD, and Owen White, PhD, address how to share health data and methods to create reproducible science ethically.

Together, the two commentaries set a foundational approach to generating, analyzing, and ethically sharing data to benefit patients and science.

Explaining the argument of the Nature Biotechnology commentary, Dr. Fertig said, "AI and mathematical models differ dramatically in how they arrive at an outcome. AI models first must be trained with existing data to make an outcome prediction, while mathematical models are directed to answer a specific question using both data and biological knowledge."

That means that when data is sparse, as it often is in newer cancer treatments such as immunotherapy, AI can overgeneralize, resulting in biased or inaccurate outcomes that other scientists cannot reproduce. On the other hand, mathematical modeling uses known biological mechanisms, learned from scientific experiments, to explain how it arrived at an outcome.

"For example, with a mathematical model, we could create virtual cancer cells and healthy cells and write a program that would mimic how those cells interact and evolve inside of a tumor with different types of treatments," said Dr. Bergman, Assistant Professor at IGS and UMSOM's Department of Pharmacology, Physiology, and Drug Development. "At this time, AI cannot give us that type of specificity."

The authors state that in addition to using both models in "computational immunotherapy," using a breadth of populations and making publicly available datasets are critical for the most accurate outcomes.

"Data breadth and accuracy are key. Artifacts in the dataset, or even a simple typo in computer code, can throw off the accuracy of either type of model," added Dr. Fertig. "Therefore, for any analysis pipeline to work correctly, it must be reproducible and that can only be assured by open science-giving access to other researchers whose work can confirm the models will get the right treatment to the right patient."

However, reproducibility remains a critical challenge in science. In a 2016 article in Nature, more than 1500 scientists were surveyed, and more than 70% of researchers said they have tried and failed to reproduce another scientist's experiments, and more than half have failed to reproduce their own experiments. 

"Reproducible research enables investigators to verify that the findings are accurate, reduce biases, promote scientific integrity, and build trust," explained Dmitrijs Lvovs, PhD, Research Associate at IGS and first author on the Cell Reports Medicine commentary. "Because data science is computationally driven, all results should be transparent andautomatically reproducible from the same dataset if the analysis code is readily available through open science."

While that sounds simple enough, and there are best practices in place, the authors argue that the challenge is how to share data while protecting patient privacy and blocking unauthorized data breaches. Genomic data, when combined with personal health information (PHI), could lead to the re-identification of patients, a privacy violation. 

The authors say that creating ethical open science data sharing means: 1. Getting detailed informed consent from patients; 2. Ensuring data quality when collecting and processing data by mitigating errors; 3. Harmonizing and standardizing data collected from disparate sources; 4. Using and creating resources and platforms, such as multiomic, clinical, public health, and drug discovery repositories; and 5. Working with vetted pipelines, such as open-source analysis tools and software platforms.

"Ethical and responsible data sharing democratizes research, supports the advancement of AI, and informs public health policies," said Dr. Lvovs. "With ethical and responsible data sharing, the biomedical research community can maximize the benefits of shared data, accelerate discovery, and improve human health."

Source:
Journal references:

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of AZoAi.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.