robbie

robb-0

AI & ML interests

i like semiotics and hermeneutics, happens that I train image LoRAs (and in secret fine-tune LLMs.) Billy is my teddy doggy 🐶🐕🦊🧸

Recent Activity

liked a model 2 days ago

Laxhar/noobai-XL-1.1

updated a model 2 days ago

robb-0/miami_beach_hologram_2

updated a dataset 3 days ago

robb-0/miami_beach_hologram2_dataset

View all activity

Organizations

robb-0's activity

upvoted a paper about 1 month ago

Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models

Paper • 2504.10615 • Published Apr 14 • 1

upvoted a collection about 1 month ago

Granite 4.0 Language Models

Collection

2 items • Updated May 2 • 13

upvoted a paper about 2 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 142

upvoted a collection 2 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 522

upvoted a paper 2 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 52

upvoted a collection 2 months ago

— UI is a good thing 💅 —

Collection

cool spaces with a cool UI, what could be better? • 5 items • Updated May 5 • 20

upvoted an article 2 months ago

Article

I Clicked “I Agree”, But What Am I Really Consenting To?

•

Mar 26

• 24

upvoted 2 collections 2 months ago

My Bookmarks

Collection

149 items • Updated 6 days ago • 4

Spaces for LLM / VLM / NLP

Collection

1113 items • Updated about 5 hours ago • 10

upvoted 5 papers 3 months ago

Model Hubs and Beyond: Analyzing Model Popularity, Performance, and Documentation

Paper • 2503.15222 • Published Mar 19 • 1

The AI Community Building the Future? A Quantitative Analysis of Development Activity on Hugging Face Hub

Paper • 2405.13058 • Published May 20, 2024 • 2

SpaceByte: Towards Deleting Tokenization from Large Language Modeling

Paper • 2404.14408 • Published Apr 22, 2024 • 7

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27, 2024 • 11

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published Feb 20 • 26

upvoted a collection 3 months ago

Foundation Text-Generation Models Below 360M Parameters

Collection

Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 36 items • Updated Apr 6 • 32

upvoted a paper 3 months ago

Finch: Prompt-guided Key-Value Cache Compression

Paper • 2408.00167 • Published Jul 31, 2024 • 18

upvoted a collection 3 months ago

Hallucination

Collection

14 items • Updated Jun 10, 2024 • 8

upvoted 3 papers 3 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 165

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

Paper • 2503.08686 • Published Mar 11 • 19

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 84