Generate Python or SQL code from natural language prompts
Run GAIA benchmark agent and submit all answers
Check if the server is running successfully
Generate code and answers using an AI assistant