Pip compatible CodeBLEU metric implementation available for linux/macos/win
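A minimal usage sketch, assuming this is the `codebleu` package on PyPI (the repo's actual API may differ): compute CodeBLEU between a reference and a candidate Python snippet.

```python
from codebleu import calc_codebleu  # assumption: pip install codebleu

# Compare a candidate implementation against a reference; CodeBLEU blends
# n-gram, weighted n-gram, AST, and data-flow match scores.
result = calc_codebleu(
    references=["def add(a, b):\n    return a + b"],
    predictions=["def add(a, b):\n    return b + a"],
    lang="python",
    weights=(0.25, 0.25, 0.25, 0.25),  # one weight per component score
)
print(result["codebleu"])
```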
Industrial-level evaluation benchmarks for coding LLMs across the full life cycle of AI-native software development (an enterprise-grade code LLM evaluation suite, with more benchmarks being released on an ongoing basis)
Backend for automated evaluation of programming tasks in higher education
The SF Code Evaluator
An open-source Python library for code encryption, decryption, and safe evaluation built on Python's ast module, with support for allowlisted functions, variables, and imports, execution timeouts, and blocked attribute access.
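A minimal sketch of the technique this description names (allowlist-based safe evaluation via the ast module); the names `ALLOWED_NODES`, `ALLOWED_NAMES`, and `safe_eval` are illustrative, not this library's API, and timeouts are omitted.

```python
import ast

# Only these node types may appear in the parsed expression; anything
# else (e.g. ast.Attribute, so attribute access is blocked) is rejected.
ALLOWED_NODES = (
    ast.Expression, ast.BinOp, ast.UnaryOp, ast.Call,
    ast.Name, ast.Load, ast.Constant,
    ast.Add, ast.Sub, ast.Mult, ast.Div, ast.USub,
)
# Names resolve only against this allowlist, never real builtins.
ALLOWED_NAMES = {"min": min, "max": max, "abs": abs}

def safe_eval(expr: str):
    tree = ast.parse(expr, mode="eval")
    for node in ast.walk(tree):
        if not isinstance(node, ALLOWED_NODES):
            raise ValueError(f"disallowed syntax: {type(node).__name__}")
        if isinstance(node, ast.Name) and node.id not in ALLOWED_NAMES:
            raise ValueError(f"disallowed name: {node.id}")
    # Empty __builtins__ prevents reaching eval, __import__, etc.
    return eval(compile(tree, "<safe>", "eval"), {"__builtins__": {}}, ALLOWED_NAMES)

print(safe_eval("max(1, 2) + abs(-3)"))  # 5
```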
Gowlin: an open-source, secure autograder for LLM agent development and evaluation
Python library to interact synchronously and asynchronously with tio.run
SocratiQ AI uses the Socratic method of teaching to guide users through learning, asking questions that prompt critical thinking and problem-solving rather than providing direct answers.
Artha is a code evaluation system developed with Django and Django REST Framework that uses Judge0 as the code execution engine.
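For context, a hedged sketch of how a Judge0-backed evaluator submits code using the public Judge0 CE REST API; Artha's own endpoints, models, and authentication are not shown, and the instance URL below is an assumption.

```python
import requests

JUDGE0_URL = "https://ce.judge0.com"  # assumption: a public or self-hosted instance

payload = {
    "source_code": "print(sum(map(int, input().split())))",
    "language_id": 71,  # Python 3 in Judge0 CE's language table
    "stdin": "2 3",
}

# wait=true blocks until the sandboxed run finishes and returns the result
resp = requests.post(
    f"{JUDGE0_URL}/submissions?base64_encoded=false&wait=true",
    json=payload,
    timeout=30,
)
result = resp.json()
print(result.get("stdout"), result.get("status"))
```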
Python toolkit for automated evaluation and benchmarking of code efficiency, performance, and resource usage. Easily analyze, compare, and score scripts or code snippets in a fast, modular CLI workflow.
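A generic sketch of the kind of measurement such a toolkit performs, timing via timeit and peak memory via tracemalloc; the `benchmark` helper is hypothetical, not this project's CLI.

```python
import timeit
import tracemalloc

def benchmark(snippet: str, setup: str = "pass", runs: int = 1000) -> dict:
    # Average wall-clock time over `runs` executions.
    elapsed = timeit.timeit(snippet, setup=setup, number=runs)
    # Peak memory allocated during a single execution, with the setup
    # code run first in a shared namespace.
    ns: dict = {}
    exec(setup, ns)
    tracemalloc.start()
    exec(snippet, ns)
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return {"avg_seconds": elapsed / runs, "peak_bytes": peak}

print(benchmark("sorted(range(1000), reverse=True)"))
```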