Emory's Chatbot Makes Molecular Modeling as Simple as Chatting

A revolutionary chatbot from Emory University transforms molecular simulations, empowering students and scientists to visualize complex chemistry in motion without writing a single line of code.

Research: Chatbot-assisted quantum chemistry for explicitly solvated molecules. Image Credit: Emory UniversityResearch: Chatbot-assisted quantum chemistry for explicitly solvated molecules. Image Credit: Emory University

Advanced computational software is streamlining quantum chemistry research by automating many of the processes of running molecular simulations. However, the complicated design of these software packages often limits their use to theoretical chemists trained in specialized computing techniques.

A new web platform developed at Emory University overcomes this limitation with a user-friendly chatbot.

The chatbot guides non-experts through a multistep process for setting up molecular simulations and visualizing molecules in solution. It enables any chemist, including undergraduate chemistry majors, to configure and execute complex quantum mechanical simulations through chatting.

The free, publicly available platform, AutoSolvateWeb, operates primarily on cloud infrastructure, further expanding access to sophisticated computational research tools.

The journal Chemical Science published a proof-of-concept for AutoSolvateWeb, marking a significant step forward in integrating AI into education and scientific research.

AutoSolvateWeb is geared to set up simulations for a particular chemical to be dissolved (a solute) and a substance to dissolve it in (a solvent), resulting in a solution (a solvate).

The simulations are delivered in the form of 3D movies.

"It's a bit like a microscope, giving you an atomic-level view of molecules interacting in a solution," says Fang Liu, Emory assistant professor of chemistry, who led the development of AutoSolvateWeb.

The broad accessibility of AutoSolvateWeb makes it a valuable tool for creating large, high-quality datasets addressing the behaviors of molecules in solution. Such datasets provide a foundation for applying machine-learning techniques to drive innovations in everything from renewable energy to human health. 

"Our goal is to help speed up scientific discovery," says Fangning Ren, co-author of the Chemical Science paper and an Emory PhD chemistry student.

Rohit Gadde, a former Emory research specialist, is the paper's first author. Additional co-authors include Lechen Dong, Emory graduate student of chemistry; Yao Wang, Emory assistant professor of chemistry; Sreelaya Devaguptam, a former Emory visiting scholar; and Rajat Mittal, a former graduate research assistant at Clemson University.

AutoSolvateWeb Tutorial

Automating complex tasks

As a theoretical chemist, Liu leads a team specializing in computational chemistry, which includes modeling and deciphering molecular properties and reactions in the solution phase.

Before running a quantum chemistry program for a molecule in solution, it's necessary to determine the geometry of the solute molecule and the location and orientation of the surrounding solvent molecules through molecular simulation. Setting up and running these simulations is complicated and time-consuming, limiting how often researchers can perform such calculations.

In 2022, the Liu group developed a way to automate many of these calculations with a system it dubbed AutoSolvate. That system reduced the hundreds of lines of code that a computational chemist needs to enter into a supercomputer to run a simulation to just a few lines.

In addition to the command-line interface geared toward more experienced theoretical chemists, AutoSolvate included an intuitive graphical interface suitable for graduate students learning to run simulations.

AutoSolvateWeb builds on this foundation.

Expanding access

By operating primarily on cloud infrastructure, AutoSolvateWeb overcomes hardware configuration challenges, further flattening the learning curve for sophisticated computational research. The chatbot communicates via natural language rather than computer code on the front end, while AutoSolvateWeb automates the software processes on the backend.

"Chemists can spend less time learning to write computer code so they can focus more of their efforts on specific problems that they want to solve," Liu explains. "We also want to enable students to run simulations by themselves so that they can understand the dynamics of molecules in solution more fully."

Rather than a large language model (LLM) chatbot, like ChatGPT, the AutoSolvateWeb chatbot is primarily rules-based. It doesn't converse like a real human over various subjects but is geared to specific tasks, similar to chatbots used for customer services like online banking.

The chatbot prompts a user to type in the name of a molecule of interest, such as caffeine, and then select a solvent to dissolve the caffeine in, such as water. The system taps data from PubChem, the world's largest collection of freely accessible, online chemical information assembled by the National Institutes of Health.

The chatbot guides the user step-by-step through the cloud environment, seamlessly integrating multiple open-source software programs needed for the workflow. Once all the proper parameters are calculated through the automated process, AutoSolvateWeb submits the results to a National Science Foundation supercomputer to create the simulation.

The supercommuter returns a trajectory file. The user can download this file and use open-source software to turn it into a 3D movie of their requested simulation.

Seeing is believing - and understanding

AutoSolvateWeb is poised to enhance how chemistry is taught.

"As computers become more and more powerful, they become more important to scientific research," Ren says. "Undergraduate chemistry students need to get familiar with computer simulations so they can keep pace with advances in how research is done."

He cites solvatochromism, a technique for analyzing the makeup of chemicals in a liquid, as an example of the power of computer simulations in education.

Undergraduate students typically learn about solvatochromism in lab experiments by dissolving a solute known as Riechart's dye in different solvents. The solution turns blue, red, green, or yellow depending on how the solute molecules absorb light.

The simplest explanation for this phenomenon is that the color variations are due to variations in the polarity of a solvent. Changes in polarity stabilize the ground state of a molecule differently, which in turn affects a molecule's absorption peak along the wavelength of light.

What's trickier to explain are exceptions to this rule. Sometimes, solvents of similar polarities produce different colors because of the way hydrogen bonds form between the solute and the solvent.

"To fully understand how hydrogen bonding plays a special role in this situation the students need to run a computer simulation," Liu says. "Seeing is believing. You need to look directly at the structure in motion so that you can understand things at the microscopic scale."

She says such detailed visualizations help students learn to think critically, so they can go beyond memorizing concepts in textbooks and make and analyze their own discoveries.

"In science we don't want to just understand what is happening," Ren adds. "We want to know why it is happening."

Small molecules, big data

Liu and her colleagues are now working to expand the range of chemical systems that AutoSolvateWeb can simulate, going beyond limitations such as using single organic molecules as the solute. They are also enhancing the platform's ability to generate data and store and freely exchange that data across the chemistry community in an open-source format.

The researchers hope that their pioneering work to democratize computational chemistry research will inspire similar initiatives across the natural sciences. Their ultimate goal, Ren explains, is to help connect AI across various domains of basic science, boosting the power of interdisciplinary research.

Sources:
Journal reference:

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of AZoAi.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.