Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
44
54
113
Junlin Zhou
jlzhou
Follow
reubenrouse's profile picture
almatrx's profile picture
21world's profile picture
5 followers
·
43 following
edwardzjl
AI & ML interests
None yet
Recent Activity
reacted
to
Narsil
's
post
with 😎
about 12 hours ago
Me: This function is too slow. Find a faster algorithm. Cursor: Hold my beer. Me: *Slacking off with colleagues* Cursor: Ping. Me: 🤯
reacted
to
Akhil-Theerthala
's
post
with ❤️
8 days ago
I'm excited to announce that I've just released the newest versions of my Kuvera models and the expanded Personal Finance Reasoning dataset on Hugging Face! What's new: I've expanded the Personal Finance Reasoning Dataset, which now includes 18.9k samples of real-world financial questions paired with detailed, empathetic answers. The previous generation pipeline was also streamlined with better psychological context and response validations. I've also released new Kuvera models trained on this improved dataset: - Kuvera-4B & 8B: These are my upgraded non-reasoning models, fine-tuned to provide practical financial advice. I've specifically trained the 8B model to better understand the user's emotional context. - Kuvera-12B: A first experimental reasoning model focused on the query resolution. As the sole person working on this project, this release is a noticeable step forward from my previous work, offering more powerful and nuanced tools for financial AI. I am actively looking to collaborate with others who are passionate about analyzing and improving the quality of personal finance advice generated by large language models. If this sounds like you, please reach out! You can check these out on the following links: Models: - https://huggingface.co/Akhil-Theerthala/Kuvera-8B-qwen3-v0.2.1 - https://huggingface.co/Akhil-Theerthala/Kuvera-4B-unsloth-gemma3 - https://huggingface.co/Akhil-Theerthala/kuvera-12B-v0.2.0-unsloth-gemma3 Dataset: - https://huggingface.co/datasets/Akhil-Theerthala/Kuvera-PersonalFinance-V2.1 P.S. The paper on the framework used to generate these models along with the detailed evaluation of the main 8B model's responses is going to be released soon!
upvoted
an
article
8 days ago
How to generate text: using different decoding methods for language generation with Transformers
View all activity
Organizations
Articles
2
Article
3
Distributed SFT with trl and DeepSpeed Part 2: Scaling Locally
Article
4
Distributed SFT with trl and DeepSpeed Part 1: Starting Locally
View all Articles
Papers
1
arxiv:
2307.08674
models
2
Sort: Recently updated
jlzhou/Qwen2.5-3B-Infinity-Instruct-0625
Text Generation
•
3B
•
Updated
Feb 8
•
8
jlzhou/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Feb 5
•
1
datasets
0
None public yet