Would also love to see some throughput numbers on basic VM setup.
Edit: there are some latency numbers in the paper https://arxiv.org/pdf/2507.18546
If you're looking for a zero-shot classifier, tasksource is in a similar vein.
https://huggingface.co/tasksource/ModernBERT-large-nli
https://github.com/urchade/GLiNER
Would also love to see some throughput numbers on basic VM setup.
Edit: there are some latency numbers in the paper https://arxiv.org/pdf/2507.18546
If you're looking for a zero-shot classifier, tasksource is in a similar vein.
https://huggingface.co/tasksource/ModernBERT-large-nli
https://github.com/urchade/GLiNER