gn00029914

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

Group Sequence Policy Optimization

upvoted a paper about 4 hours ago

Training-Free Long-Context Scaling of Large Language Models

liked a model about 4 hours ago

ServiceNow-AI/Apriel-1.6-15b-Thinker

View all activity

Organizations

upvoted 3 papers about 4 hours ago

upvoted a collection about 4 hours ago

Apriel-1.6-15B-Thinker

Collection

3 items • Updated Dec 16, 2025 • 7

upvoted an article about 5 hours ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Dec 9, 2025

•

upvoted 3 papers 6 days ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 93

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published 9 days ago • 35

SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement

Paper • 2410.20285 • Published Oct 26, 2024 • 1

upvoted 2 collections 6 days ago

Steering Reasoning VLAs

Collection

Steering Reasoning VLA in robotics manipulation https://www.arxiv.org/abs/2510.16281 • 2 items • Updated 15 days ago • 1

Nvidia reward models GGUF

Collection

4 items • Updated Nov 3, 2025 • 1

upvoted 10 papers 11 days ago

AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models

Paper • 2304.06364 • Published Apr 13, 2023 • 3

Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries

Paper • 2409.12640 • Published Sep 19, 2024 • 3

WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects

Paper • 2502.12404 • Published Feb 18, 2025 • 4

Program Synthesis with Large Language Models

Paper • 2108.07732 • Published Aug 16, 2021 • 5

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24, 2025 • 47

T5Gemma 2: Seeing, Reading, and Understanding Longer

Paper • 2512.14856 • Published Dec 16, 2025 • 3

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Paper • 1910.10683 • Published Oct 23, 2019 • 16

ForecastPFN: Synthetically-Trained Zero-Shot Forecasting

Paper • 2311.01933 • Published Nov 3, 2023 • 1

Chronos: Learning the Language of Time Series

Paper • 2403.07815 • Published Mar 12, 2024 • 48

Chronos-2: From Univariate to Universal Forecasting

Paper • 2510.15821 • Published Oct 17, 2025 • 22

gn00029914

AI & ML interests

Recent Activity

Organizations

gn00029914's activity

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance