Tech News Directory

Research

Arxiv • 4 hours ago

Relational graph-driven differential denoising and diffusion attention fusion for multimodal conversation emotion recognition

arXiv:2603.25752v1 Announce Type: new Abstract: In real-world scenarios, audio and video signals are often subject to environmental noise and limited acquisition conditions, resulting in extracted features containing excessive noise. Furthermore, there is an imbalance in data quality and informatio

Research

Arxiv • 4 hours ago

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

arXiv:2603.25804v1 Announce Type: new Abstract: Vision-Language Models (VLMs) have demonstrated impressive capabilities in code generation across various domains. However, their ability to replicate complex, multi-panel visualizations from real-world data remains largely unassessed. To address this

Research

Arxiv • 4 hours ago

Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI

arXiv:2603.25821v1 Announce Type: new Abstract: We present Doctorina MedBench, a comprehensive evaluation framework for agent-based medical AI based on the simulation of realistic physician-patient interactions. Unlike traditional medical benchmarks that rely on solving standardized test questions,

Research

Arxiv • 4 hours ago

Gradient-Informed Training for Low-Resource Multilingual Speech Translation

arXiv:2603.25836v1 Announce Type: new Abstract: In low-resource multilingual speech-to-text translation, uniform architectural sharing across languages frequently introduces representation conflicts that impede convergence. This work proposes a principled methodology to automatically determine laye

Research

Arxiv • 4 hours ago

Methods for Knowledge Graph Construction from Text Collections: Development and Applications

arXiv:2603.25862v1 Announce Type: new Abstract: Virtually every sector of society is experiencing a dramatic growth in the volume of unstructured textual data that is generated and published, from news and social media online interactions, through open access scholarly communications and observatio

Research

Arxiv • 4 hours ago

Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio

arXiv:2603.25926v1 Announce Type: new Abstract: Soft context compression reduces the computational workload of processing long contexts in LLMs by encoding long context into a smaller number of latent tokens. However, existing frameworks apply uniform compression ratios, failing to account for the

Research

Arxiv • 4 hours ago

Can Small Models Reason About Legal Documents? A Comparative Study

arXiv:2603.25944v1 Announce Type: new Abstract: Large language models show promise for legal applications, but deploying frontier models raises concerns about cost, latency, and data privacy. We evaluate whether sub-10B parameter models can serve as practical alternatives by testing nine models acr

Research

Arxiv • 4 hours ago

When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models

arXiv:2603.25960v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly deployed in medical settings, yet their sensitivity to prompt formatting remains poorly characterized. We evaluate MedGemma (4B and 27B parameters) on MedMCQA (4,183 questions) and PubMedQA (1,000 question

Research

Arxiv • 4 hours ago

MemoryCD: Benchmarking Long-Context User Memory of LLM Agents for Lifelong Cross-Domain Personalization

arXiv:2603.25973v1 Announce Type: new Abstract: Recent advancements in Large Language Models (LLMs) have expanded context windows to million-token scales, yet benchmarks for evaluating memory remain limited to short-session synthetic dialogues. We introduce \textsc{MemoryCD}, the first large-scale,

Research

Arxiv • 4 hours ago

Toward Culturally Grounded Natural Language Processing

arXiv:2603.26013v1 Announce Type: new Abstract: Recent progress in multilingual NLP is often taken as evidence of broader global inclusivity, but a growing literature shows that multilingual capability and cultural competence come apart. This paper synthesizes over 50 papers from 2020--2026 spannin

Research

Arxiv • 4 hours ago

AgentCollab: A Self-Evaluation-Driven Collaboration Paradigm for Efficient LLM Agents

arXiv:2603.26034v1 Announce Type: new Abstract: Autonomous agents powered by large language models (LLMs) perform complex tasks through long-horizon reasoning and tool interaction, where a fundamental trade-off arises between execution efficiency and reasoning robustness. Models at different capabi

Research

Arxiv • 4 hours ago

Retrieval-Augmented Generation Based Nurse Observation Extraction

arXiv:2603.26046v1 Announce Type: new Abstract: Recent advancements in Large Language Models (LLMs) have played a significant role in reducing human workload across various domains, a trend that is increasingly extending into the medical field. In this paper, we propose an automated pipeline design

Research

Arxiv • 4 hours ago

I Want to Believe (but the Vocabulary Changed): Measuring the Semantic Structure and Evolution of Conspiracy Theories

arXiv:2603.26062v1 Announce Type: new Abstract: Research on conspiracy theories has largely focused on belief formation, exposure, and diffusion, while paying less attention to how their meanings change over time. This gap persists partly because conspiracy-related terms are often treated as stable

Research

Arxiv • 4 hours ago

IndoBERT-Relevancy: A Context-Conditioned Relevancy Classifier for Indonesian Text

arXiv:2603.26095v1 Announce Type: new Abstract: Determining whether a piece of text is relevant to a given topic is a fundamental task in natural language processing, yet it remains largely unexplored for Bahasa Indonesia. Unlike sentiment analysis or named entity recognition, relevancy classificat

Research

Arxiv • 4 hours ago

LLM Benchmark-User Need Misalignment for Climate Change

arXiv:2603.26106v1 Announce Type: new Abstract: Climate change is a major socio-scientific issue shapes public decision-making and policy discussions. As large language models (LLMs) increasingly serve as an interface for accessing climate knowledge, whether existing benchmarks reflect user needs i

Research

Arxiv • 4 hours ago

Clash of the models: Comparing performance of BERT-based variants for generic news frame detection

arXiv:2603.26156v1 Announce Type: new Abstract: Framing continues to remain one of the most extensively applied theories in political communication. Developments in computation, particularly with the introduction of transformer architecture and more so with large language models (LLMs), have natura

Research

Arxiv • 4 hours ago

ClinicalAgents: Multi-Agent Orchestration for Clinical Decision Making with Dual-Memory

arXiv:2603.26182v1 Announce Type: new Abstract: While Large Language Models (LLMs) have demonstrated potential in healthcare, they often struggle with the complex, non-linear reasoning required for accurate clinical diagnosis. Existing methods typically rely on static, linear mappings from symptoms

Research

Arxiv • 4 hours ago

Sparse Auto-Encoders and Holism about Large Language Models

arXiv:2603.26207v1 Announce Type: new Abstract: Does Large Language Model (LLM) technology suggest a meta-semantic picture i.e. a picture of how words and complex expressions come to have the meaning that they do? One modest approach explores the assumptions that seem to be built into how LLMs capt

Research

Arxiv • 4 hours ago

Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents

arXiv:2603.26233v1 Announce Type: new Abstract: As Large Language Model (LLM) agents are increasingly deployed in open-ended domains like software engineering, they frequently encounter underspecified instructions that lack crucial context. While human developers naturally resolve underspecificatio

Research

Arxiv • 4 hours ago

GS-BrainText: A Multi-Site Brain Imaging Report Dataset from Generation Scotland for Clinical Natural Language Processing Development and Validation

arXiv:2603.26235v1 Announce Type: new Abstract: We present GS-BrainText, a curated dataset of 8,511 brain radiology reports from the Generation Scotland cohort, of which 2,431 are annotated for 24 brain disease phenotypes. This multi-site dataset spans five Scottish NHS health boards and includes b