Substance-vs-Style

Overview

This repository contains the code and data for the paper "Substance Beats Style: Why Beginning Students Fail to Code with LLMs".

Running Experiments

Substitution Experiment Workflow

We first apply a tagging process to every prompt in the first/last success/failure subsets of StudentEval. For example, given a prompt "Outputs if the number input is even", the tagged prompt becomes "$Returns:prints$ if the number $parameter:input$ is even". This process is semi-automated.
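The tag format in the example above can be read back with a short regular expression. The following is a minimal sketch, not the repository's own parser: it assumes every tag has the shape $category:word$ as in the example, and the names TAG_PATTERN, extract_tags and strip_tags are hypothetical.

```python
import re

# Assumed tag shape, inferred from the example above: $category:word$
TAG_PATTERN = re.compile(r"\$([^:$]+):([^$]+)\$")


def extract_tags(tagged_prompt):
    """Return (category, word) pairs in order of appearance."""
    return TAG_PATTERN.findall(tagged_prompt)


def strip_tags(tagged_prompt):
    """Replace every $category:word$ span with just the word."""
    return TAG_PATTERN.sub(lambda m: m.group(2), tagged_prompt)


example = "$Returns:prints$ if the number $parameter:input$ is even"
print(extract_tags(example))  # [('Returns', 'prints'), ('parameter', 'input')]
print(strip_tags(example))    # prints if the number input is even
```

Keeping the category and the surface word as separate capture groups makes it possible to select words by category later without re-tagging the prompt.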

a. mutated_dataset_builder/main.py is a rule-based script that creates a preliminary tagged dataset, nuprl-staging/studenteval_tagged_prompts.
b. We transform this dataset into the file tagged_prompts_for_edits by running json_to_yaml.py, then edit the file manually.
c. We map these edits back to a new split of the tagged dataset.

We then run the bash script bin/prepare_subst.sh on the validated dataset to produce splits of substituted data based on the target word and the replacement value. First create a directory subst_experiments, where the dataset will be stored in jsonl format.

./bin/prepare_subst.sh CATEGORY ORIGINAL

The first argument is the category; the second argument is the replacement value.

e.g. ./bin/prepare_subst.sh "return" "output" replaces all occurrences of words tagged with category 'return' with the matching word form of 'output' (i.e. returns -> outputs, returning -> outputting).

e.g. ./bin/prepare_subst.sh "loop through" "go through"
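To illustrate what a category substitution does to a single tagged prompt, here is a minimal sketch. It is not the logic of bin/prepare_subst.sh: the function substitute, the explicit forms table, the example prompt, and the choice to untag words of other categories are all assumptions made for illustration. In particular, the word-form mapping is written out by hand here, whereas the script derives the correct variation itself.

```python
import re

# Assumed tag shape: $category:word$
TAG_PATTERN = re.compile(r"\$([^:$]+):([^$]+)\$")


def substitute(tagged_prompt, category, forms):
    """Swap words tagged with `category` using the `forms` lookup table.

    Words in other categories, and words with no entry in `forms`,
    keep their original text; the tag markup is removed either way.
    """
    def swap(match):
        tag_category, word = match.group(1), match.group(2)
        if tag_category != category:
            return word
        return forms.get(word.lower(), word)

    return TAG_PATTERN.sub(swap, tagged_prompt)


# Hypothetical hand-written mapping of word forms for "return" -> "output".
forms = {"return": "output", "returns": "outputs", "returning": "outputting"}
prompt = "$return:returns$ the sum of the $parameter:list$"
print(substitute(prompt, "return", forms))  # outputs the sum of the list
```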
We run the generation script bin/run_generation.sh on the substituted datasets. First create a directory generation_experiments to store the generated model results. e.g. ./bin/run_generation.sh "return" "output" creates a directory return_output, with a subdirectory completions_jsons storing all the json.gz files.
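The json.gz files can be inspected with the Python standard library alone. This is a minimal sketch under the assumption that each file holds a single gzip-compressed JSON document; the function name load_completions is hypothetical, and the structure of the loaded objects is not specified here.

```python
import gzip
import json
from pathlib import Path


def load_completions(completions_dir):
    """Load every *.json.gz file in a directory, keyed by file name."""
    results = {}
    for path in sorted(Path(completions_dir).glob("*.json.gz")):
        # "rt" opens the gzip stream in text mode so json.load can read it.
        with gzip.open(path, "rt", encoding="utf-8") as f:
            results[path.name] = json.load(f)
    return results
```

For the example above, the call might look like load_completions("generation_experiments/return_output/completions_jsons"), assuming return_output is created inside generation_experiments.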

Student Trajectories Experiment Workflow

Follow instructions in eval_scripts/README.md to get stderr/stdout outputs for StudentEval completions, saved as a dataset.
Use student_trajectories/parse_graph.py to turn the dataset into student trajectory graphs (saved as .yaml files).
Use student_trajectories/alternating_automata.py to turn the graphs into alternating automata (saved as .dot files), which can be rendered into viewable .pdf files using Graphviz. For an interactive .html prompt, use student_trajectories/plot_graph.py.
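Rendering the .dot files to .pdf uses the Graphviz dot command with its -Tpdf and -o options. The sketch below batches that call over a directory; it assumes Graphviz is installed and on the PATH, and the helper names dot_to_pdf_command and render_all are hypothetical, not part of the repository.

```python
import subprocess
from pathlib import Path


def dot_to_pdf_command(dot_path):
    """Build the Graphviz command that renders one .dot file to a .pdf."""
    dot_path = Path(dot_path)
    pdf_path = dot_path.with_suffix(".pdf")
    return ["dot", "-Tpdf", str(dot_path), "-o", str(pdf_path)]


def render_all(directory):
    """Render every .dot file in `directory`; raises if dot fails."""
    for dot_path in sorted(Path(directory).glob("*.dot")):
        subprocess.run(dot_to_pdf_command(dot_path), check=True)
```

Each .pdf is written next to its .dot source, so a call such as render_all("automata") (a hypothetical output directory) leaves the rendered files alongside the graphs.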

Use of AI assistants

Copilot was used in this project.
