💦 FULL SET: Trc/publications - High Quality
Publications
2025
December
Agile Flight Emerges from Multi-Agent Competitive Racing
Pasumarti, Bianchi & Loquercio
Stronger Normalization-Free Transformers
Chen et al.
Hauck, Kabel & Gauger
Hyperparameter Transfer Enables Consistent Gains of Matrix-Preconditioned Optimizers Across Scales
Qiu et al.
EventQueues: Autodifferentiable spike event queues for brain simulation on AI accelerators
Landsmeer et al.
Hasan, Yildiz & Kamruzzaman
Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
Maheswaran et al.
Defense That Attacks: How Robust Models Become Better Attackers
Awad, Akrm & Gomaa
November
Code-Switch Pretraining for Improved Cross-Lingual Alignment in Low-Resource Languages
Visser, Grobler & Dunaiski
Scaling Behavior of Encoder Language Models in Low-Resource Settings
Visser, Grobler & Dunaiski
Flow Map Distillation Without Data
Tong et al.
TempoPFN: Towards Synthetic Pre-training of Linear RNNs for Zero-shot Time Series Forecasting
Moroshan et al.
Geospatial Open Building Stack
Jindal & Kumar
October
Relative Scaling Laws for LLMs
Held et al.
Saleh & Biltawi
HalleluBERT: Let every token that has meaning bear its weight
Scheible-Schmitt
SindBERT, the Sailor: Charting the Seas of Turkish NLP
Scheible-Schmitt & Schweter
September
Comparative Analysis of Chemical Structure String Representations for Neural Machine Translation
Rajan, Zielesny & Steinbeck
Cheminformatics Microservice V3: a web portal for chemical structure manipulation and analysis
Rajan et al.
CayleyPy Growth: Efficient growth computations and hundreds of new conjectures on Cayley graphs
Chervov et al.
Artificial Neural Networks and Machine Learning
Senn et al.
How to Scale Second-Order Optimization
Chen et al.
Martingale Posterior Neural Networks for Fast Sequential Decision Making
Duran-Martin et al.
Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies
Chalumeau et al.
Memory-Enhanced Neural Solvers for Routing Problems
Chalumeau et al.
On Evaluating LLM Alignment by Evaluating LLMs as Judges
Liu, Liu & Cohan
Muthamilselvan, Vaithilingam & Palaniappan
Pre-training under infinite compute
Kim et al.
PortBERT: Navigating the Depths of Portuguese Language Models
Scheible-Schmitt, He & Mendes
Srilakshmi, Tituss & Palaniappan
Almeida, Nogueira & Pedrini
Creating a Large Clean Web Corpus for Turkish
Uzun et al.
August
Democracy-in-Silico: Institutional Design as Alignment in AI-Governed Polities
Srinivasan & Patapati
Azam & Singh
July
Discovering and using Spelke segments
Venkatesh et al.
Topographic Vision Transformers
Shah & Yamins
Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models
Wu et al.
Marek et al.
June
Bimodal masked language modeling for bulk RNA-seq and DNA methylation representation learning
Gélard et al.
MeshSlice: Efficient 2D Tensor Parallelism for Distributed DNN Training
Nam, Gerogiannis & Torrellas
Multipole Attention for Efficient Long Context Reasoning
Hooper et al.
Scalable Generalized Bayesian Online Neural Network Training for Sequential Decision Making
Duran-Martin et. al
A Stable Whitening Optimizer for Efficient Neural Network Training
Franz, Levine & Abbeel
Cheminformatics Microservice V-3: A Web Portal for Chemical Structure Manipulation and Analysis
Rajan et al.
de Kock, Pretorius & Shock
May
Predicting gene expression using millions of yeast promoters reveals cis-regulatory logic
Dash & Bornelöv
Efficient and Unbiased Sampling from Boltzmann Distributions via Variance-Tuned Diffusion Models
Zhang, Midgely & Hernández-Lobato