ProgramBench: Can Language Models Rebuild Programs from Scratch?
Researchers used ProgramBench to test whether language models can rebuild programs from scratch. The results showed that some models can generate functional code, but with limitations. These findings matter for AI development and for potential applications in software development, and engineers should keep both the capabilities and the limitations of language models in mind when relying on them for code generation.