Compares two large language models, Gemini-2.5-Flash and llama-3.3-7b, on the same prompt across a set of coding tasks, and evaluates the generated code on metrics such as runtime, memory usage, and lines of code.
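As a rough illustration of how runtime, memory usage, and lines of code could be measured for a piece of generated code, here is a minimal Python sketch. It is not taken from the project: the helper names `count_lines_of_code` and `evaluate_snippet`, the placeholder labels `model_a` and `model_b`, and the two sample snippets are all hypothetical, and the project may measure these metrics differently.

```python
import time
import tracemalloc


def count_lines_of_code(source: str) -> int:
    """Count non-blank lines that are not pure comments (hypothetical helper)."""
    return sum(
        1
        for line in source.splitlines()
        if line.strip() and not line.strip().startswith("#")
    )


def evaluate_snippet(source: str) -> dict:
    """Run a code string once and record runtime, peak memory, and line count."""
    namespace: dict = {}
    tracemalloc.start()
    start = time.perf_counter()
    try:
        # exec runs arbitrary code: only use it on trusted or sandboxed input.
        exec(source, namespace)
    finally:
        runtime_s = time.perf_counter() - start
        _, peak_bytes = tracemalloc.get_traced_memory()
        tracemalloc.stop()
    return {
        "runtime_s": runtime_s,
        "peak_memory_kib": peak_bytes / 1024,
        "lines_of_code": count_lines_of_code(source),
    }


# Hypothetical outputs from two models for the same task (sum of squares).
candidates = {
    "model_a": "total = 0\nfor i in range(1000):\n    total += i * i\n",
    "model_b": "# one-liner\ntotal = sum(i * i for i in range(1000))\n",
}

results = {name: evaluate_snippet(code) for name, code in candidates.items()}
for name, metrics in results.items():
    print(name, metrics)
```

A single run like this is noisy. Repeating each snippet several times and taking the median runtime would give steadier numbers, and `tracemalloc` only tracks memory allocated by Python itself, so it understates usage for code that relies on native extensions.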

Streamlit App

Task File

Evaluation Results (CSV file)