arXiv cs.CL

arXiv cs.CL

arXiv cs.CL publishes articles covering LLM, AI. A trusted source for AI and technology insights.

Profile generated by AI for Anything

All98 Videos Shorts Articles97

What is Good? Extracting and Testing Implicit Theories of Literary Quality from LLM Reasoning Traces

22h

Human-in-the-Loop Large Language Model Framework for Identification of Cutaneous Immune-Related Adverse Events

22h

The original title is "null" so I need to create a headline from the summary. The key facts are:

22h

More Is Not More: What Matters for Diversity in LLM Opinions?

22h

Knowledge Injection Exists in MoE? Exploring Expert-Aware Contrast Decoding in MoE for Mitigating LLMs'Hallucinations

22h

Is MoE Routing a Huffman Code? Discovering the Frequency-Diversity Law in Chain-of-Thought

22h

Position: Natural Language Should Not Fully Replace Formal Languages

22h

Skill-Contracted Agents for Evidence-Aware Materials Literature Analysis

22h

Moir: Let the Model Direct Its Own Story for Robust Cross-Domain Knowledge Editing

22h

LLM-INSTRUCT at UZH Shared Task 2026: Constraint-Aware Retrieval and Selective Debate for Paragraph-Level Argument Mining

22h

When Reasoning Narrows the Move: Diversity Collapse in LLM Game Play

1d

On the Computational Complexity of Structural Generalization

1d

Multi-Mask Diffusion Language Models for Few-Step Generation

1d

Scaling Laws for Hypernetwork-Based Knowledge Injection in Large Language Models

1d

Reference-Free Evaluation of Reasoning in Open-Ended Question Answering

1d

SLPO: Scaling Latent Reasoning via a Surrogate Policy

1d

Adaptive Capitulation: A Structural Failure Mode of LLM Responses in Vulnerability Contexts

1d

Lightweight Person-Place Relation Extraction from Historical Newspapers with Dependency Graphs and Proximity Features

1d

Stateful Guardrails for Multi-Turn LLM Systems: A Conversational Risk Accumulation Framework

1d

Task Competence Is Not Instruction Following: Evaluating Instruction-Conflicting Behavior in Small Language Models

1d

Convolution for Large Language Models

2d

Building a European Multilingual Evaluation Dataset: The MMLU Localisation Project within the EMT Network

2d

Relay-Bench: Evaluating LLMs on Multi-Domain Reasoning Chains

2d

Decoding EEG Signals to Explore Next-Word Predictability in the Human Brain

2d