Alex Rigler

aldaleri

36 246

https://choochoo.cc

AI & ML interests

systems, security & governance

Recent Activity

upvoted a paper 2 days ago

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

liked a model 4 days ago

deepreinforce-ai/Ornith-1.0-35B-GGUF

liked a model 4 days ago

nationaldesignstudio/rampart

View all activity

Organizations

upvoted a paper 2 days ago

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

Paper • 2606.28733 • Published 7 days ago • 140

upvoted 3 papers 5 days ago

upvoted a paper 10 days ago

Tmax: A simple recipe for terminal agents

Paper • 2606.23321 • Published 12 days ago • 14

upvoted an article 12 days ago

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

BenjaminB, sayakpaul, hubnemo, kashif

•

16 days ago

• 71

upvoted a paper 18 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 250

upvoted an article 19 days ago

Article

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

nvidia

•

29 days ago

• 12

upvoted an article about 1 month ago

Article

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

ibm-granite

•

May 14

• 33

upvoted 2 papers about 2 months ago

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26, 2024 • 14

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 126

upvoted an article 2 months ago

Article

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

lightonai

•

Apr 21

• 42

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 910

upvoted 3 collections 4 months ago

Mistral Small 4

Collection

A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated Mar 16 • 75

BitNet

Collection

🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated May 1, 2025 • 62

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 23 items • Updated 22 days ago • 333

upvoted an article 4 months ago

Article

Introducing Storage Buckets on the Hugging Face Hub

Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner

•

Mar 10

• 195

upvoted 3 papers 4 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 196

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 201

Alex Rigler

AI & ML interests

Recent Activity

Organizations

aldaleri's activity

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

Welcome Gemma 4: Frontier multimodal intelligence on device

Introducing Storage Buckets on the Hugging Face Hub