Unlocking Better AI Performance by Rethinking Decision-Making in Deep Learning Layers

Deep Learning (DL) performs classification tasks using a series of layers. To effectively execute these tasks, local decisions are performed progressively along the layers. But can we perform an all-encompassing decision by choosing the most influential path to the output rather than performing these decisions locally?

Study: Enhancing the accuracies by performing pooling decisions adjacent to the output layer. Image Credit: cono0430 / ShutterstockStudy: Enhancing the accuracies by performing pooling decisions adjacent to the output layer. Image Credit: cono0430 / Shutterstock

In an article published today in the journal Scientific Reports, researchers from Bar-Ilan University in Israel answer this question with a resounding "yes." By updating the most influential paths to the output, preexisting deep architectures have been improved.

"One can think of it as two children who wish to climb a mountain with many twists and turns. One of them chooses the fastest local route at every intersection while the other uses binoculars to see the entire path ahead and picks the shortest and most significant route, just like Google Maps or Waze. The first child might get a head start, but the second will end up winning," said Prof. Ido Kanter of Bar-Ilan's Department of Physics and Gonda (Goldschmied) Multidisciplinary Brain Research Center, who led the research.

"This discovery can pave the way for better enhanced AI learning, by choosing the most significant route to the top," added Yarden Tzach, a PhD student and one of the key contributors to this work.

This exploration of a deeper comprehension of AI systems by Prof. Kanter and his experimental research team, led by Dr. Roni Vardi, aims to bridge between the biological world and machine learning, thereby creating an improved, advanced AI system. To date, they have discovered evidence for efficient dendritic adaptation using neuronal cultures, as well as how to implement those findings in machine learning, showing how shallow networks can compete with deep ones, and finding the mechanism underlying successful deep learning.

Enhancing existing architectures using global decisions can pave the way for improved AI, which can improve its classification tasks without the need for additional layers. 

VGG16 and A-VGG16 architectures. (A) VGG16 architecture (13 (3×3)(3×3) CLs and in between 5 (2×2)(2×2) MP operators, followed by 3 FC layers) for the CIFAR10 database consisting of 32×3232×32 RGB inputs. A CL is defined by its square filters with dimension K and depth D, (K,D)(K, D). (B) A-VGG16 architecture for CIFAR10 inputs consisting of 7 (3×3)(3×3) CLs, (4×4)(4×4) average pooling (AP), 6 (3×3)(3×3) CLs, (2×2)(2×2) MP and 3 FC layers.VGG16 and A-VGG16 architectures. (A) VGG16 architecture (13 (3×3)(3×3) CLs and in between 5 (2×2)(2×2) MP operators, followed by 3 FC layers) for the CIFAR10 database consisting of 32×3232×32 RGB inputs. A CL is defined by its square filters with dimension K and depth D, (K,D)(K, D). (B) A-VGG16 architecture for CIFAR10 inputs consisting of 7 (3×3)(3×3) CLs, (4×4)(4×4) average pooling (AP), 6 (3×3)(3×3) CLs, (2×2)(2×2) MP and 3 FC layers.

 

 
Journal reference:
  • Meir, Yuval, et al. "Enhancing the Accuracies by Performing Pooling Decisions Adjacent to the Output Layer." Scientific Reports, vol. 13, no. 1, 2023, pp. 1-8, https://doi.org/10.1038/s41598-023-40566-y, https://www.nature.com/articles/s41598-023-40566-y

Article Revisions

  • Jun 25 2024 - Fixed broken journal link

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of AZoAi.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
Predicting CO2 Solubility in Ionic Liquids Using Deep Learning