Pengchong Jin
Use approximate=tahn for GeLU
The official PyTorch implementation of Google's Gemma models
Python
main