Efficient vision foundation models for high-resolution generation and perception.
-
Updated
Sep 5, 2025 - Python
Efficient vision foundation models for high-resolution generation and perception.
This is a warehouse for EfficientViT-pytorch-model, can be used to train your dataset
End-to-end MLOps pipeline for multimodal e-commerce product classification (text + image) — ingestion, training, inference and monitoring.
Pretraining the EfficientViT-B4 model on the ImageNet-1k dataset
Add a description, image, and links to the efficientvit topic page so that developers can more easily learn about it.
To associate your repository with the efficientvit topic, visit your repo's landing page and select "manage topics."