Name of Project: OpenBugger: a tool for injecting bugs into Python scripts
Proposal in one sentence: Create a tool to automatically bug code that synthetically generates debugging instructions to tune OpenAssistant
Description of the project and what problem it is solving:
The OpenBugger project GitHub - furlat/OpenBugger: Code to create bugged python scripts for OpenAssistant Training, maintained by https://twitter.com/Cyndesama aims at generating large amount of synthetic conversations about debugging code between a user and an assistant. This data will be used as a part of Open-Assistant training set.
To achieve this the bugger starts from a working code snippet and injects it with errors of several different types (currently ~30 syntax and logic error types ), finally the diads of working and bugged code are used to generate a conversation:
e.g: User:"Is this code snippet correct?: {bugged code, bug_type, num_bugs}:
Assistant: "No the code snippet is not correct, it has {num_bugs} erorrs of {bug_type}
User: “Can you fix the code?”
Assistant: “Sure, I will be glad to do it: {orginal_code}”
Grant Deliverables:
Python package at GitHub - furlat/OpenBugger: Code to create bugged python scripts for OpenAssistant Training, maintained by https://twitter.com/Cyndesama that will be used for OpenAssistant (demo version is already in main branch here Create openbugger_example.ipynb by furlat · Pull Request #418 · LAION-AI/Open-Assistant · GitHub)
The main deliverable is exstending the current code to include Runtime errors, Type errors, Name errors, Import errors and Indentation errors as well as the scripts to map from tuples of (code/bugged_code) to conversation.
Squad:
Name: Tommaso Furlanello
Twitter handle: @cyndesama
Discord handle: iriden#1609