This repository contains an implementation of Google's Paligemma model in PyTorch.
-
PyTorch-based implementation
-
Modular configuration for vision transformer models
-
Easily extensible for research and experimentation
-
Contains the implementation of SigLip.
You can modify or extend the model in siglip.py.
-
siglip.py: Contains the Siglip model implementation. -
requirements.txt: Python dependencies. -
pyproject.toml: Project metadata and dependencies.
This project is for research and educational purposes only. Please refer to Google's original Paligemma license for usage restrictions.