AI Gets Chatty in the Kitchen: Georgia Tech's ChattyChef Uses Natural Language Processing to Help Users Cook

Download PDF Copy

Reviewed by Joel ScanlonJul 7 2023

Artificial intelligence (AI) can help people shop, plan, and write -; but not cook. It turns out humans aren't the only ones who have a hard time following step-by-step recipes in the correct order, but new research from the Georgia Institute of Technology's College of Computing could change that.

Image Credit: Ollyy / Shutterstock

Researchers created a dataset called ChattyChef, which uses natural language processing models that can help a user cook a recipe. Using the open-source large language model GPT-J, ChattyChef's dataset of cooking dialogues follows recipes with the user.

*Important notice: arXiv publishes preliminary scientific reports that are not peer-reviewed and, therefore, should not be regarded as definitive, used to guide development decisions, or treated as established information in the field of artificial intelligence research.

The researchers presented their AI in the paper "Improved Instruction Ordering in Recipe-Grounded Conversation," presented at the 61st annual meeting of the Association for Computational Linguistics.

Although other researchers have theorized about the possibility of an AI chef, Georgia Tech's work pushes the field forward. "We are one of the first research teams to analyze the challenges of using large language models for building an AI chef," said Duong Le, a Ph.D. student in the School of Interactive Computing.

A conversation snippet of the cooking instructional dialogue task with good and bad system responses and the corresponding error type of each bad response.

Most attempts at using language models for cooking fail because GPT-J doesn't understand what the user wants to do next or the user intent and has difficulty tracking how far the user is in the recipe -; what the researchers call the "state of the conversation." It also can't easily answer clarification questions, like about ingredient amounts or cooking times.

For example, maybe someone is trying to cook hashbrowns. The AI tells them to melt butter in the pan and add the potatoes. The user then asks about the next step. A bad bot might jumble the order and tell them to serve the hashbrown even though they haven't finished cooking it yet. Or a user asks a follow-up question about how long to cook the hashbrown, and AI won't be precise enough, instead giving a general time and not specifying the cooking time for each side.

With this in mind, the researchers ensured their model had two key features:

User intent detection to determine the user's current intent within a fixed set of possibilities, such as "Ask for next instruction" or "Ask for details about ingredients."
Instruction state tracking to identify which recipe step the user is on, which works with 80% accuracy.

The combined information from these features supports the third innovation of ChattyChef -; response generation. User intent helps generate the best response to answer a user's question. The instruction state selects the most relevant parts of the recipe rather than including the entire recipe to avoid confusing the user or burdening them with extra steps as they are cooking.

The ChattyChef dataset is built off WikiHow recipes with positive ratings and fewer than eight steps. The researchers crowdsourced people to role-play how they might use ChattyChef to determine what instructions would be best to include in the dataset.

The researchers believe the innovations of ChattyChef could be used in many domains besides cooking, such as repair manuals or software documentation.

This research is supported in part by the National Science Foundation awards IIS-2112633 and IIS-2052498.

Source:

Georgia Institute of Technology

Journal reference:

Preliminary scientific report. Le, Duong, et al. "Improved Instruction Ordering in Recipe-Grounded Conversation." ArXiv, 2023, /abs/2305.17280. https://arxiv.org/abs/2305.17280

Posted in: AI Research News