Pre-built Docker Images Support - We merged PR #8, which enables instant use of pre-built Docker images, significantly reducing setup time and improving the evaluation ...
Abstract: This study evaluates the performance of six prominent Large Language Models (LLMs) on graduate entrance exam multiple-choice mathematics questions in computer science, computer engineering, ...
Abstract: Tools based on the use of Large Language Models (LLMs) have improved the computer programming teaching process, automated feedback processes, facilitated program repair, and enabled ...