Natural Language to T-Sql issue: sqlcoder-7b-2 fails on complex T-SQL joins & date logic (offline, 40GB GPU)

MihirK2201 · December 16, 2025, 11:13am

Project Overview

I am building a Natural Language → T-SQL system for Microsoft SQL Server (T-SQL).

Expected behavior:
If a user asks a natural-language question (e.g.,
“How many users are using smartphones last month?”),
the system should generate a valid and logically correct T-SQL query.

Constraints

Maximum GPU memory: 40 GB
Deployment: Local GPU inference only
No internet access after training (fully offline deployment)
This restricts model size and external API usage

Current Architecture

LLM: defog/sqlcoder-7b-2
Fine-tuning: ~2,500 complex SQL queries
- Multi-table JOINs
- Aggregations
- Date logic
Schema Handling (RAG):
- Tables and column descriptions stored separately
- Embedded using MiniLM
- Retrieved via cosine similarity
Generation Flow:
1. User NL query
2. Retrieve relevant schema context
3. Inject schema into prompt
4. Generate T-SQL

What Works

Simple queries
Single-table queries
WHERE / GROUP BY / HAVING
Basic aggregations

Issue

For complex queries involving:

Multiple JOINs
SQL Server date functions (DATEADD, DATEDIFF, CONVERT)
Cross-table business logic

the model often:

Chooses incorrect JOIN paths
Misses required tables
Hallucinates columns
Produces SQL Server–invalid date syntax
Generates logically incorrect queries

This happens despite fine-tuning and schema grounding.

Questions

Is this mainly a 7B model limitation for complex for this project?
Would explicitly injecting foreign-key relationships / join graphs into the prompt help?
Is a query-planning stage (join planning → filters → final SQL) recommended?
Any best practices for T-SQL–specific correctness?
Given offline + 40 GB GPU constraints, would:
- Larger quantized models
- Multi-stage planners
- Rule-based join resolution + LLM
  be more reliable?
Are there any open-source or production-grade Natural Language to Sql architectures that handle complex joins reliably under similar constraints?

Goal

To generate correct, production-ready T-SQL for complex NL queries under offline and 40 GB GPU constraints.

Thanks in advance for any guidance or references!

John6666 · December 16, 2025, 4:18pm

While the small size of the model seems to be one factor, it appears that even large models may struggle to solve the problem unless the task is broken down.

Topic		Replies	Views
I need someone to help me with my llm project Beginners	1	33	September 2, 2025
How to pass table structure to LLM model Intermediate	2	1739	May 1, 2024
Mistral 7b gives suggestion questions and queries while generating sql query Beginners	0	69	August 7, 2024
I am trying to build one text-to-sql with huggingface chatdb/natural-sql-7b model, it seems it is getting stuck every time and not generating any result. here is my code. Another problem is its notworking with "cuda". It's showing "torch is not compiled w Models	3	58	October 30, 2024
Best way to text-to-sql Models	1	3381	September 24, 2024