AST-Aware Code Chunking for RAG: Why Text Splitting Fails on Code
Splitting code files at arbitrary token boundaries cuts functions in half and destroys their semantic meaning. AST-aware chunking instead splits along the code's own structure, such as function and class boundaries, which improves retrieval.
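To make the contrast concrete, here is a minimal sketch using Python's standard `ast` module. It is an illustration under stated assumptions rather than a production chunker: the sample `SOURCE` string and the helper names `naive_chunks` and `ast_chunks` are hypothetical, the naive splitter counts characters as a stand-in for tokens, and only top-level statements are treated as chunk boundaries.

```python
import ast

# Hypothetical sample file used only for this illustration.
SOURCE = '''import math

def area(radius):
    """Return the area of a circle."""
    return math.pi * radius ** 2

class Greeter:
    def greet(self, name):
        return f"Hello, {name}"
'''


def naive_chunks(text, size):
    """Split text every `size` characters, ignoring code structure."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def ast_chunks(text):
    """Return one chunk per top-level statement, using AST line spans."""
    lines = text.splitlines()
    chunks = []
    for node in ast.parse(text).body:
        # Decorators sit above the `def`/`class` line, so include them.
        decorators = getattr(node, "decorator_list", [])
        start = min([node.lineno] + [d.lineno for d in decorators])
        # lineno is 1-indexed and end_lineno is inclusive (Python 3.8+).
        chunks.append("\n".join(lines[start - 1:node.end_lineno]))
    return chunks


if __name__ == "__main__":
    print("Fixed-size chunks:")
    for chunk in naive_chunks(SOURCE, 40):
        print(repr(chunk))

    print("AST-aware chunks:")
    for chunk in ast_chunks(SOURCE):
        print(repr(chunk))
```

With a 40-character window, the fixed-size splitter cannot return the `area` function in one piece, while every AST-aware chunk is a complete statement that parses on its own. A real pipeline would likely also need to handle oversized functions, nested definitions, comments between statements, and languages other than Python, for example through a multi-language parser.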