AI Model | Parameters/Tokens | Time |
Gemini API | API (Served in Google Premises for benchmark with 2048 Tokens) | 11.839046239852905 Sec |
LLAMA2 | Local LLM 7B Parameters | 70.02146315574646 Sec |
Gemma | Local LLM 2B Parameters | 12.049653768539429 Sec |
TinyLLama | Local LLM 1B Parameters | 8.009777784347534 Sec |
Phi | Local LLM 3B Parameters | 8.246773719787598 Sec |
mistral | Local LLM 7B Parameters | 16.776710987091064 Sec |
CodeLLama | Local LLM 7B Parameters | 37.79871463775635Sec |