arXiv cs.LG
Blog
80posts
0followers
arXiv cs.LG publishes articles covering LLM, AI, analysis, data. A trusted source for AI and technology insights.

Rotation-Preserving Supervised Fine-Tuning
27d

QuIDE: Mastering the Quantized Intelligence Trade-off via Active Optimization
27d

LEAP: Unlocking dLLM Parallelism via Lookahead Early-Convergence Token Detection
27d

Interpretable EEG Microstate Discovery via Variational Deep Embedding: A Systematic Architecture Search with Multi-Quadrant Evaluation
27d

$\xi$-DPO: Direct Preference Optimization via Ratio Reward Margin
27d

Adaptive scheduling steers diffusion L
27d

Hierarchical Multi-Scale Graph Neural Networks: Scalable Heterophilous Learning with Oversmoothing and Oversquashing Mitigation
27d

TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment
27d

Structural Interpretations of Protein Language Model Representations via Differentiable Graph Partitioning
27d

Vertex-Softmax: Tight Transformer Verification via Exact Softmax Optimization
27d

Statistical Inference and Quality Measures of KV Cache Quantisations Inspired by TurboQuant
28d

Distributional Reinforcement Learning via the Cram\'er Distance
28d

Path-Based Gradient Boosting for Graph-Level Prediction
28d

Reinforcement learning for inverse structural design and rapid laser cutting of kirigami prototypes
28d

The Safety-Aware Denoiser for Text Diffusion Models
28d

Feature Repulsion and Spectral Lock-in: An Empirical Study of Two-Layer Network Grokking
28d

Do Foundation Model Embeddings Improve Cross-Country Crop Yield Generalisation? A Leave-One-Country-Out Evaluation in Sub-Saharan Africa
28d

Geometry-free prediction of inertial lift forces in microfluidic devices using deep learning
28d

BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models
28d

TTCD:Transformer Integrated Temporal Causal Discovery from Non-Stationary Time Series Data
28d

Toeplitz MLP Mixers are Low Complexity, Information-Rich Sequence Models
29d

A Wasserstein GAN-based climate scenario generator for risk management and insurance: the case of soil subsidence
29d

ESA satellite telemetry anomaly detection
29d

On the Role of Strain and Vorticity in Numerical Integration Error for Flow Matching
29d