Junlin Zhou

jlzhou

·

edwardzjl

AI & ML interests

None yet

Recent Activity

reacted to Narsil's post with 😎 about 12 hours ago

Me: This function is too slow. Find a faster algorithm.
Cursor: Hold my beer.
Me: *Slacking off with colleagues*
Cursor: Ping.
Me: 🤯

reacted to Akhil-Theerthala's post with ❤️ 8 days ago

I'm excited to announce that I've just released the newest versions of my Kuvera models and the expanded Personal Finance Reasoning dataset on Hugging Face!

What's new: I've expanded the Personal Finance Reasoning Dataset, which now includes 18.9k samples of real-world financial questions paired with detailed, empathetic answers. The previous generation pipeline was also streamlined with better psychological context and response validations.

I've also released new Kuvera models trained on this improved dataset:
- Kuvera-4B & 8B: These are my upgraded non-reasoning models, fine-tuned to provide practical financial advice. I've specifically trained the 8B model to better understand the user's emotional context.
- Kuvera-12B: A first experimental reasoning model focused on the query resolution.

As the sole person working on this project, this release is a noticeable step forward from my previous work, offering more powerful and nuanced tools for financial AI. I am actively looking to collaborate with others who are passionate about analyzing and improving the quality of personal finance advice generated by large language models. If this sounds like you, please reach out!

You can check these out on the following links:

Models:
- https://huggingface.co/Akhil-Theerthala/Kuvera-8B-qwen3-v0.2.1
- https://huggingface.co/Akhil-Theerthala/Kuvera-4B-unsloth-gemma3
- https://huggingface.co/Akhil-Theerthala/kuvera-12B-v0.2.0-unsloth-gemma3

Dataset:
- https://huggingface.co/datasets/Akhil-Theerthala/Kuvera-PersonalFinance-V2.1

P.S. The paper on the framework used to generate these models along with the detailed evaluation of the main 8B model's responses is going to be released soon!

upvoted an article 8 days ago

How to generate text: using different decoding methods for language generation with Transformers

Organizations

Articles 2

Article

3

Distributed SFT with trl and DeepSpeed Part 2: Scaling Locally

Article

4

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

Papers 1

arxiv:2307.08674

Models 2

jlzhou/Qwen2.5-3B-Infinity-Instruct-0625

Text Generation • 3B • Updated Feb 8 • 8

jlzhou/ppo-LunarLander-v2

Reinforcement Learning • Updated Feb 5 • 1

Datasets 0

None public yet