Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Akhil Theerthala's picture
1

Akhil Theerthala

Akhil-Theerthala
·
  • akhil-theerthala

AI & ML interests

None yet

Recent Activity

replied to burtenshaw's post 1 day ago
Here’s a notebook to make Gemma reason with GRPO & TRL. I made this whilst prepping the next unit of the reasoning course: In this notebooks I combine together google’s model with some community tooling - First, I load the model from the Hugging Face hub with transformers’s latest release for Gemma 3 - I use PEFT and bitsandbytes to get it running on Colab - Then, I took Will Browns processing and reward functions to make reasoning chains from GSM8k - Finally, I used TRL’s GRPOTrainer to train the model Next step is to bring Unsloth AI in, then ship it in the reasoning course. Links to notebook below. https://colab.research.google.com/drive/1Vkl69ytCS3bvOtV9_stRETMthlQXR4wX?usp=sharing
updated a dataset 1 day ago
Akhil-Theerthala/Personal-Finance-Queries
published a dataset 8 days ago
Akhil-Theerthala/Personal-Finance-Queries
View all activity

Organizations

None yet

models

None public yet

datasets 2

Akhil-Theerthala/Personal-Finance-Queries

Preview • Updated 1 day ago • 62

Akhil-Theerthala/PersonalFinance-CoTR-5K

Viewer • Updated 25 days ago • 5.02k • 77 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs