- FAT5 (Flash Attention T5) report ⚡ — English version of the blog post introducing the FAT5 model
- The Ultra-Scale Playbook 🌌 — The ultimate guide to training LLMs on large GPU clusters