AI is employed in image processing to enhance and manipulate images through various techniques like denoising, super-resolution, and image restoration. Deep learning models and algorithms enable improved image quality, object recognition, and advanced image editing capabilities for a wide range of applications including photography, medical imaging, and computer vision.
Researchers present an advanced robotic prototype for litchi harvesting equipped with a cutting-edge visual system. The system integrates the YOLOv8-Seg model for litchi segmentation, binocular stereo-vision for picking point localization, and an intelligent algorithm for obstruction removal, showcasing promising capabilities for autonomous litchi picking.
This study introduces an innovative framework for early plant disease diagnosis, leveraging fog computing, IoT sensor technology, and a novel GWO algorithm. The hybrid approach, incorporating deep learning models like AlexNet and GoogleNet, coupled with modified GWO for feature selection, demonstrates superior performance in plant disease identification.
Researchers present DEEPPATENT2, an extensive dataset containing over two million technical drawings derived from design patents. Addressing the limitations of previous datasets, DEEPPATENT2 provides rich semantic information, including object names and viewpoints, offering a valuable resource for advancing research in diverse areas such as 3D image reconstruction, image retrieval for technical drawings, and multimodal generative models for innovation.
Researchers have introduced the All-Analog Chip for Combined Electronic and Light Computing (ACCEL), a groundbreaking technology that significantly improves energy efficiency and computing speed in vision tasks. ACCEL's innovative approach combines diffractive optical analog computing and electronic analog computing, eliminating the need for Analog-to-Digital Converters (ADCs) and achieving low latency.
Researchers delved into the ethical and legal aspects of integrating machine learning in defense systems. They conducted a comprehensive analysis, using a case study and identified challenges, emphasizing the need for robust legal and ethical frameworks in this transformative field.
This study introduces a novel approach to autonomous vehicle navigation by leveraging machine vision, machine learning, and artificial intelligence. The research demonstrates that it's possible for vehicles to navigate unmarked roads using economical webcam-based sensing systems and deep learning, offering practical insights into enhancing autonomous driving in real-world scenarios.
This article delves into the use of deep convolutional neural networks (DCNN) to detect and differentiate synthetic cannabinoids based on attenuated total reflectance Fourier-transform infrared (ATR-FTIR) spectra. The study demonstrates the effectiveness of DCNN models, including a vision transformer-based approach, in classifying and distinguishing synthetic cannabinoids, offering promising applications for drug identification and beyond.
The paper introduces the ODEL-YOLOv5s model, designed to address the challenges of obstacle detection in coal mines using deep learning target detection algorithms. This model improves detection accuracy, real-time responsiveness, and safety for driverless electric locomotives in the challenging coal mine environment. It outperforms other target detection algorithms, making it a promising solution for obstacle identification in coal mines.
Researchers have developed an enhanced YOLOv8 model for detecting wildfire smoke using images captured by unmanned aerial vehicles (UAVs). This approach improves accuracy in various weather conditions and offers a promising solution for early wildfire detection and monitoring in complex forest environments.
Researchers revisit generative models' potential to enhance visual data comprehension, introducing DiffMAE—a novel approach that combines diffusion models and masked autoencoders (MAE). DiffMAE demonstrates significant advantages in tasks such as image inpainting and video processing, shedding light on the evolving landscape of generative pre-training for visual data understanding and recognition.
Explore the cutting-edge advancements in image processing through reinforcement learning and deep learning, promising enhanced accuracy and real-world applications, while acknowledging the challenges that lie ahead for these transformative technologies.
Researchers have developed the FFMKO algorithm, a powerful tool for the early detection of Sudden Decline Syndrome (SDS) in date palm trees. By combining image enhancement, thresholding, and clustering techniques, this algorithm achieved an impressive accuracy rate of over 94%, offering a promising solution to combat the devastating effects of SDS on date palm crops.
This paper explores the integration of artificial intelligence (AI) and computer vision (CV) technologies in addressing urban expansion challenges, particularly in optimizing container movement within seaports. Through a systematic review, it highlights the significant role of AI and CV in sustainable parking ecosystems, offering valuable insights for enhancing seaport management and smart city development.
Researchers conduct a systematic review of AI techniques in otitis media diagnosis using medical images. Their findings reveal that AI significantly enhances diagnostic accuracy, particularly in primary care and telemedicine, with an average accuracy of 86.5%, surpassing the 70% accuracy of human specialists.
Researchers have successfully employed the MegaDetector open-source object detection model to automate cross-regional wildlife and visitor monitoring using camera traps. This innovation not only accelerates data processing but also ensures accurate and privacy-compliant monitoring of wildlife-human interactions.
Researchers present LightSpaN, a streamlined Convolutional Neural Network (CNN)-based solution for swift and accurate vehicle identification in intelligent traffic monitoring systems powered by the Internet of Things (IoT). This innovative approach outperforms existing methods with an average accuracy of 99.9% for emergency vehicles, contributing to reduced waiting and travel times.
This paper explores how the fusion of big data and artificial intelligence (AI) is reshaping product design in response to heightened consumer preferences for customized experiences. The study highlights how these innovative methods are breaking traditional design constraints, providing insights into user preferences, and fostering automation and intelligence in the design process, ultimately driving more competitive and intelligent product innovations.
Researchers have introduced a novel Two-Stage Induced Deep Learning (TSIDL) approach to accurately and efficiently classify similar drugs with diverse packaging. By leveraging pharmacist expertise and innovative CNN models, the method achieved exceptional classification accuracy and holds promise for preventing medication errors and safeguarding patient well-being in real-time dispensing systems.
Researchers present a novel approach utilizing a residual network (ResNet-18) combined with AI to classify cooling system faults in hydraulic test rigs with 95% accuracy. As hydraulic systems gain prominence in various industries, this innovative method offers a robust solution for preventing costly breakdowns, paving the way for improved reliability and efficiency.
The study delves into the integration of deep learning, discusses the dataset, and showcases the potential of AI-driven fault detection in enhancing sustainable operations within hydraulic systems.
The article introduces SliDL, a powerful Python library designed to simplify and streamline the analysis of high-resolution whole-slide images (WSIs) in digital pathology. With deep learning at its core, SliDL addresses challenges in managing image annotations, handling artifacts, and evaluating model performance. From automatic tissue detection to comprehensive model evaluation, SliDL bridges the gap between conventional image analysis and the intricate world of WSI analysis.
Terms
While we only use edited and approved content for Azthena
answers, it may on occasions provide incorrect responses.
Please confirm any data provided with the related suppliers or
authors. We do not provide medical advice, if you search for
medical information you must always consult a medical
professional before acting on any information provided.
Your questions, but not your email details will be shared with
OpenAI and retained for 30 days in accordance with their
privacy principles.
Please do not ask questions that use sensitive or confidential
information.
Read the full Terms & Conditions.