A large language model (LLM) is an artificial intelligence system trained on vast amounts of text data. Using deep learning techniques, it can understand natural language queries and generate coherent, contextually relevant, human-like responses.
Aleph Alpha has introduced the Pharia-1-LLM-7B models, optimized for concise, multilingual responses with domain-specific applications in automotive and engineering. The models include safety features and are available for non-commercial research.
Lumos, a multimodal AI system, integrates on-device scene text recognition (STR) to improve question answering capabilities. This innovation balances high-quality text recognition with optimized performance, advancing real-world applications for smart assistants.
MIT researchers introduced SigLLM, using large language models for efficient anomaly detection in time-series data. Their approach, particularly the Detector method, offers a promising alternative to deep learning models, reducing complexity and cost in equipment monitoring.
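At a high level, the forecast-and-compare idea behind the Detector variant can be sketched as follows: a window of readings is serialized into text, a language model predicts the next value, and points whose forecast error is unusually large are flagged. The sketch below stubs out the LLM call and uses an illustrative prompt format, window size, and threshold; none of these are the authors' exact choices.

```python
# Rough sketch of a forecast-then-compare anomaly detector in the spirit of
# SigLLM's "Detector" method. The LLM call is a stand-in placeholder; prompt
# format, window size, and threshold are illustrative assumptions.

from statistics import mean, stdev

def serialize(window):
    """Turn a numeric window into the comma-separated text an LLM would see."""
    return ", ".join(f"{x:.1f}" for x in window)

def llm_forecast(prompt: str) -> float:
    """Placeholder for an LLM asked to predict the next value; it simply
    repeats the last number in the prompt so the sketch runs offline."""
    return float(prompt.rsplit(",", 1)[-1])

def detect_anomalies(series, window=8, z_thresh=3.0):
    residuals, flags = [], []
    for t in range(window, len(series)):
        prompt = serialize(series[t - window:t])
        predicted = llm_forecast(prompt)
        residuals.append(abs(series[t] - predicted))
        # Flag the point if its forecast error is far outside the typical error so far.
        if len(residuals) > 2:
            typical, spread = mean(residuals[:-1]), stdev(residuals[:-1]) or 1e-9
            if residuals[-1] > typical + z_thresh * spread:
                flags.append(t)
    return flags

# The injected spike at index 30 shows up in the flags.
print(detect_anomalies([1.0] * 30 + [9.0] + [1.0] * 10))
```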
Researchers introduced "Thermometer," a novel calibration method for large language models (LLMs) that balances accuracy and computational efficiency while improving calibration across diverse tasks. This method proved effective in maintaining reliable probabilistic forecasts, essential for deploying LLMs in critical applications like medical diagnosis and showed strong adaptability to new tasks and datasets.
A recent study explored the use of a large language model-based voice-enabled digital intelligent assistant in manufacturing assembly processes. It found that while the system effectively reduced cognitive load and improved product quality, it did not significantly impact lead times.
Meta's Llama 3, a 405B-parameter transformer with a 128K-token context window, matches GPT-4 in performance across various tasks. The model emphasizes data quality and training efficiency, and while image, video, and speech capabilities have been integrated experimentally, those multimodal extensions still require further development before a widespread release.
Researchers introduced an adaptive backdoor attack method to steal private data from pre-trained large language models (LLMs). This method, tested on models like GPT-3.5-turbo, achieved a 92.5% success rate. By injecting triggers during model customization and activating them during inference, attackers can extract sensitive information, underscoring the need for advanced security measures.
The article introduces LiveBench, an innovative benchmark designed to mitigate test set contamination and biases inherent in current large language model (LLM) evaluations. Featuring continuously updated questions from recent sources, LiveBench automates scoring based on objective values and offers challenging tasks across six categories: math, coding, reasoning, data analysis, instruction following, and language comprehension.
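The scoring idea, checking answers against objective ground-truth values rather than asking another LLM to judge, can be illustrated with a minimal sketch; the answer-extraction rule and normalization below are assumptions for illustration, not LiveBench's actual graders.

```python
# Illustrative sketch of ground-truth scoring in the style LiveBench describes:
# responses are compared against known objective values instead of being graded
# by another LLM. The answer-extraction regex is an illustrative assumption.

import re

def extract_final_answer(response: str) -> str:
    """Take the last number-like token in the model's response as its answer."""
    matches = re.findall(r"-?\d+(?:\.\d+)?", response)
    return matches[-1] if matches else ""

def score(response: str, ground_truth: str) -> int:
    return int(extract_final_answer(response) == ground_truth)

print(score("Adding 17 and 25 gives 42.", "42"))        # 1
print(score("The result should be 41, I think.", "42"))  # 0
```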
Researchers presented advanced statistical tests and multi-bit watermarking to differentiate AI-generated text from natural text. The proposed tests offer robust theoretical guarantees and low false-positive rates, and the study compared watermark effectiveness on classical NLP benchmarks and developed sophisticated detection schemes.
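To illustrate the flavor of such a detection test, the toy sketch below uses a keyed hash to split the vocabulary into a "green" and a "red" half and applies a one-sided z-test to the fraction of green tokens; the hash construction, the 0.5 split, and the threshold are illustrative choices rather than the paper's scheme.

```python
# Toy green-list watermark detector: under natural text, each token lands in the
# keyed "green" half of the vocabulary with probability ~0.5, so the green count
# follows a binomial null distribution and a one-sided z-test flags watermarked
# text. All constants here are illustrative.

import hashlib
import math

def is_green(prev_token: str, token: str, key: str = "secret") -> bool:
    digest = hashlib.sha256(f"{key}|{prev_token}|{token}".encode()).digest()
    return digest[0] % 2 == 0  # keyed pseudo-random split of the vocabulary

def watermark_z_score(tokens, gamma=0.5):
    greens = sum(is_green(a, b) for a, b in zip(tokens, tokens[1:]))
    n = len(tokens) - 1
    return (greens - gamma * n) / math.sqrt(gamma * (1 - gamma) * n)

tokens = "the quick brown fox jumps over the lazy dog".split()
z = watermark_z_score(tokens)
print(f"z = {z:.2f}  ->  watermarked? {z > 4}")  # natural text stays well below the threshold
```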
Researchers introduced an entropy-based uncertainty estimator to tackle false and unsubstantiated outputs in large language models (LLMs) like ChatGPT. This method detects confabulations by assessing meaning, improving LLM reliability in fields like law and medicine.
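Conceptually, the estimator samples several answers to the same question, groups them by meaning, and measures the entropy of that grouping: many mutually inconsistent meanings suggest a confabulation. The sketch below replaces the paper's entailment-based clustering with a crude normalized string match, so it is only a schematic illustration.

```python
# Conceptual sketch of an entropy-over-meanings confabulation check: sample
# several answers, cluster them by meaning, and compute the entropy of the
# cluster distribution. The normalized exact-match grouping is a deliberate
# simplification of the paper's semantic clustering.

import math
from collections import Counter

def meaning_key(answer: str) -> str:
    """Stand-in for semantic clustering: lowercase and strip punctuation."""
    return "".join(ch for ch in answer.lower() if ch.isalnum() or ch == " ").strip()

def semantic_entropy(sampled_answers):
    clusters = Counter(meaning_key(a) for a in sampled_answers)
    total = sum(clusters.values())
    return -sum((c / total) * math.log(c / total) for c in clusters.values())

consistent = ["Paris.", "paris", "Paris"]
confabulated = ["1912", "1905.", "around 1921"]
print(semantic_entropy(consistent), semantic_entropy(confabulated))  # low vs. high
```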
Researchers have developed an advanced method to augment large language models (LLMs) with domain-specific knowledge for E-learning, significantly improving their performance in generating accurate and contextually relevant content.
Researchers introduced a private agent that leverages private deliberation and deception, achieving higher long-term payoffs in multi-player games than its public counterpart. Using the partially observable stochastic game framework, in-context learning, and chain-of-thought prompting, the study highlights the potential of advanced communication strategies to improve AI performance in competitive and cooperative scenarios.
Researchers explored whether ChatGPT-4's personality traits can be assessed and influenced by user interactions, aiming to enhance human-computer interaction. Using Big Five and MBTI frameworks, they demonstrated that ChatGPT-4 exhibits measurable personality traits, which can be shifted through targeted prompting, showing potential for personalized AI applications.
Researchers compared the efficiency of AI-based extraction of ecological data with human review, highlighting advantages in speed and accuracy while noting challenges with quantitative information.
This study demonstrated the potential of T5 large language models (LLMs) to translate between drug molecules and their indications, aiming to streamline drug discovery and enhance treatment options. Using datasets from ChEMBL and DrugBank, the research showcased initial success, particularly with larger models, while identifying areas for future improvement to optimize AI's role in medicine.
In a Nature Machine Intelligence paper, researchers unveiled ChemCrow, an advanced LLM chemistry agent that autonomously tackles complex tasks in organic synthesis and materials design. By integrating GPT-4 with 18 expert tools, ChemCrow excels in chemical reasoning, planning syntheses, and guiding drug discovery, outperforming traditional LLMs and showcasing its potential to transform scientific research.
Researchers explored methods for detecting traces of training data in large language models (LLMs), highlighting the efficacy of watermarking techniques over conventional approaches such as membership inference attacks. By illuminating the key factors that influence the detection of this "radioactivity" (the traceable influence of training data), the study contributes to understanding and mitigating the risks of model contamination during fine-tuning.
ROUTERBENCH introduces a benchmark for analyzing large language model (LLM) routing systems, enabling cost-effective and efficient navigation through diverse language tasks. Insights from this evaluation provide guidance for optimizing LLM applications across domains.
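The routing problem the benchmark targets can be pictured as choosing, per query, the cheapest model expected to clear a quality bar. The sketch below illustrates such a policy; the model names, quality estimates, prices, and difficulty heuristic are invented placeholders, not ROUTERBENCH data.

```python
# Minimal sketch of the cost/quality routing problem ROUTERBENCH evaluates:
# given per-model quality estimates and prices, a router picks the cheapest
# model expected to clear a quality bar. All names, costs, and scores below
# are invented placeholders.

MODELS = {
    # name: (estimated quality on this task type, cost per 1K tokens in $)
    "small-fast-model": (0.62, 0.0004),
    "mid-size-model": (0.78, 0.002),
    "frontier-model": (0.90, 0.03),
}

def route(task_difficulty: float, quality_floor: float = 0.75):
    """Pick the cheapest model whose estimated quality, discounted by task
    difficulty, still clears the floor; fall back to the best model otherwise."""
    viable = [(cost, name) for name, (q, cost) in MODELS.items()
              if q * (1 - 0.3 * task_difficulty) >= quality_floor]
    if viable:
        return min(viable)[1]
    return max(MODELS, key=lambda name: MODELS[name][0])

print(route(task_difficulty=0.1))  # easy prompt -> cheaper mid-size model
print(route(task_difficulty=0.9))  # hard prompt -> frontier model
```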
In a paper submitted to arXiv, researchers introduced LLM3, a Task and Motion Planning (TAMP) framework that uses large language models (LLMs) to seamlessly integrate symbolic task planning and continuous motion generation. LLM3 leverages pre-trained LLMs to propose action sequences and generate action parameters iteratively, significantly reducing the need for domain-specific interfaces and manual effort.
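The iterative structure described above can be sketched as a propose-check-refine loop in which the LLM proposes actions with continuous parameters, a motion planner checks feasibility, and failure reasons are fed back into the prompt. Both the LLM and the planner in the sketch below are stubbed placeholders, and the prompt format is an assumption rather than LLM3's interface.

```python
# Schematic of a propose-check-refine loop: the LLM proposes an action sequence
# with continuous parameters, a motion planner checks feasibility, and
# infeasibility feedback is appended to the prompt for the next proposal.
# Both the LLM and the planner here are stand-in placeholders.

def llm_propose(prompt: str, attempt: int):
    """Placeholder for the LLM: returns a pick-and-place plan whose grasp
    height increases once collision feedback appears in the prompt."""
    grasp_z = 0.02 if "collision at z=" not in prompt else 0.10 + 0.01 * attempt
    return [("pick", {"object": "block", "z": grasp_z}),
            ("place", {"object": "block", "x": 0.4, "y": 0.2})]

def motion_planner_feasible(action):
    """Placeholder feasibility check: grasps below z=0.05 collide with the table."""
    name, params = action
    return not (name == "pick" and params.get("z", 1.0) < 0.05)

def plan(task: str, max_iters: int = 5):
    prompt = f"Task: {task}\n"
    for attempt in range(max_iters):
        proposal = llm_propose(prompt, attempt)
        failures = [a for a in proposal if not motion_planner_feasible(a)]
        if not failures:
            return proposal
        # Feed the motion-level failure back so the next proposal adjusts its parameters.
        prompt += f"Feedback: collision at z={failures[0][1]['z']:.2f} during {failures[0][0]}\n"
    return None

print(plan("move the block onto the tray"))
```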
This study, published in Nature, delves into the performance of GPT-4, an advanced language model, in graduate-level biomedical science examinations. While showcasing strengths in answering diverse question formats, GPT-4 struggled with figure-based and hand-drawn questions, raising crucial considerations for future academic assessment design amidst the rise of AI technologies.