Technology Hive

“Smart Coach” Helps LLMs Switch Between Text and Code

by Adam Smith – Tech Writer & Blogger
July 17, 2025
in Artificial Intelligence (AI)

Introduction to Large Language Models

Large language models (LLMs) excel at textual reasoning: they can read a document, understand its context, and give a logical answer about its contents. Yet the same models often struggle with even simple math problems, because textual reasoning is usually a poor fit for computational or algorithmic tasks. Some LLMs can generate code in a language such as Python to handle symbolic queries, but they don’t always know when to use code or what kind of code would work best.
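To make the contrast concrete, here is a minimal sketch of why code suits a symbolic query such as multiplication. The operands and the name `multiply_exact` are chosen purely for illustration and do not come from the research.

```python
def multiply_exact(a: int, b: int) -> int:
    """Multiply two integers exactly; Python ints have arbitrary precision."""
    return a * b

# Two 12-digit operands, chosen arbitrarily for this example.
a = 123_456_789_123
b = 987_654_321_987
product = multiply_exact(a, b)

# Check the result by exact division instead of trusting a hand-computed value.
assert product % a == 0 and product // a == b
print(product)
```

A model that reasons digit by digit in text can slip on a carry, while the interpreter returns the exact product every time.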

The Need for Guidance

LLMs, it seems, may need a coach to steer them toward the best technique. This is where CodeSteer comes in: a smart assistant, developed by MIT researchers, that guides an LLM to switch between code and text generation until it correctly answers a query. CodeSteer, itself a smaller LLM, automatically generates a series of prompts to iteratively steer a larger LLM. After each round, it reviews the model’s current and previous answers and provides guidance on how to fix or refine the solution, until it deems the answer correct.

How CodeSteer Works

The researchers found that augmenting a larger LLM with CodeSteer boosted its accuracy on symbolic tasks, like multiplying numbers, playing Sudoku, and stacking blocks, by more than 30 percentage points. CodeSteer works in conjunction with the larger LLM. It first reviews a query and determines whether text or code suits the problem and, if code, which sort of code would be best. Then it generates a prompt for the larger LLM, telling it to use a coding method or textual reasoning to answer the query.
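As an illustration of the kind of program a model might be steered toward for a puzzle such as Sudoku, here is a minimal sketch of backtracking search on a small 4x4 grid. The grid size, the names `allowed` and `solve`, and the sample puzzle are assumptions made for this example; the article does not describe the code CodeSteer actually elicits.

```python
from typing import List

Grid = List[List[int]]  # 0 marks an empty cell


def allowed(grid: Grid, r: int, c: int, v: int) -> bool:
    """Return True if v can go at (r, c) under row, column, and 2x2 box rules."""
    if v in grid[r]:
        return False
    if any(grid[i][c] == v for i in range(4)):
        return False
    br, bc = 2 * (r // 2), 2 * (c // 2)  # top-left corner of the 2x2 box
    return all(grid[i][j] != v
               for i in range(br, br + 2)
               for j in range(bc, bc + 2))


def solve(grid: Grid) -> bool:
    """Fill the grid in place using depth-first search with backtracking."""
    for r in range(4):
        for c in range(4):
            if grid[r][c] == 0:
                for v in range(1, 5):
                    if allowed(grid, r, c, v):
                        grid[r][c] = v
                        if solve(grid):
                            return True
                        grid[r][c] = 0  # undo the choice and try the next value
                return False  # no value fits this cell: backtrack
    return True  # no empty cell left


puzzle = [
    [1, 0, 0, 0],
    [0, 0, 3, 0],
    [0, 4, 0, 0],
    [0, 0, 0, 2],
]
solved = solve(puzzle)
for row in puzzle:
    print(row)
```

The search enforces every constraint mechanically, which is exactly the bookkeeping that is easy to get wrong when reasoning in prose.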

Benefits of CodeSteer

The larger model follows this prompt to answer the query and sends the result back to CodeSteer, which reviews it. If the answer is not correct, CodeSteer keeps prompting the LLM to try alternatives that might fix the problem, such as incorporating a search algorithm or a constraint into its Python code, until the answer is correct. This advance could improve the problem-solving capabilities of LLMs on complex tasks that are especially difficult to solve with textual reasoning alone, such as generating paths for robots in uncertain environments or scheduling shipments in an international supply chain.
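The review-and-reprompt cycle described above can be sketched as a simple loop. This is a toy under stated assumptions, not the researchers' implementation: `run_steering`, `toy_task_model`, and `toy_steerer` are hypothetical names, both "models" are plain Python functions standing in for LLMs, and the acceptance rule (accept only an exact integer) is invented for the demonstration.

```python
from typing import Callable, List, Tuple

# Hypothetical interfaces for this sketch.
TaskModel = Callable[[str, str], str]                   # (query, guidance) -> answer
Steerer = Callable[[str, List[str]], Tuple[bool, str]]  # (query, answers) -> (accepted, next guidance)


def run_steering(query: str, task_model: TaskModel, steerer: Steerer,
                 max_rounds: int = 5) -> Tuple[str, int]:
    """Alternate between answering and reviewing until accepted or out of rounds.

    Assumes max_rounds >= 1.
    """
    history: List[str] = []
    _, guidance = steerer(query, history)  # initial guidance, before any answer
    for _ in range(max_rounds):
        history.append(task_model(query, guidance))
        accepted, guidance = steerer(query, history)
        if accepted:
            break
    return history[-1], len(history)


def toy_task_model(query: str, guidance: str) -> str:
    """Stand-in for the larger LLM: exact when told to use code, vague otherwise."""
    if "code" in guidance:
        left, right = query.split("*")
        return str(int(left) * int(right))
    return "roughly a million"


def toy_steerer(query: str, history: List[str]) -> Tuple[bool, str]:
    """Stand-in for the steering model: accepts only an exact integer answer."""
    if not history:
        return False, "Answer with textual reasoning."
    if history[-1].isdigit():
        return True, ""
    return False, "Write and run Python code instead."


answer, rounds = run_steering("347 * 2891", toy_task_model, toy_steerer)
print(answer, rounds)
```

In this toy run the first round uses text and is rejected, and the second round switches to code and is accepted. The `max_rounds` cap is a design choice for the sketch so the loop always terminates.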

Tackling Complex Tasks

As the researchers designed CodeSteer, they couldn’t find suitable symbolic datasets to fine-tune and test the model, since many existing benchmarks don’t indicate whether a given query is best solved with text or code. So they gathered a corpus of 37 complex symbolic tasks, including spatial reasoning, mathematics, order reasoning, and optimization, and built their own dataset, called SymBench. They implemented a fine-tuning approach that leverages SymBench to maximize the performance of CodeSteer.

Results and Future Directions

In their experiments, CodeSteer outperformed all nine baseline methods they evaluated and boosted average accuracy from 53.3 percent to 86.4 percent. It maintains similar performance even on unseen tasks, and on a variety of LLMs. In addition, a general-purpose model augmented with CodeSteer can achieve higher accuracy than state-of-the-art models designed to focus on complex reasoning and planning, while requiring much less computation. The researchers want to streamline CodeSteer to speed up its iterative prompting process and study how to effectively fine-tune a unified model with the ability to switch between textual reasoning and code generation.
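As a quick check on the reported averages, the short calculation below separates the absolute gain (in percentage points) from the relative gain (as a percent of the baseline). The variable names are illustrative; only the two accuracy figures come from the article.

```python
baseline = 53.3  # average accuracy without CodeSteer, in percent
steered = 86.4   # average accuracy with CodeSteer, in percent

# Absolute gain, measured in percentage points.
abs_gain = round(steered - baseline, 1)

# Relative gain, measured as a percent of the baseline accuracy.
rel_gain = round(100 * (steered - baseline) / baseline, 1)

print(f"absolute gain: {abs_gain} percentage points")  # 33.1
print(f"relative gain: {rel_gain} percent")            # 62.1
```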

Conclusion

CodeSteer is a significant advancement in the field of large language models, as it enables LLMs to improve their problem-solving capabilities for complex tasks. By guiding LLMs to switch between code and text generation, CodeSteer can help LLMs achieve higher accuracy and efficiency. This technology has the potential to be applied to a wide range of tasks, from generating paths for robots to scheduling shipments in an international supply chain.

FAQs

Q: What is CodeSteer?
A: CodeSteer is a smart assistant developed by MIT researchers that guides a large language model (LLM) to switch between code and text generation until it correctly answers a query.
Q: How does CodeSteer work?
A: CodeSteer works in conjunction with a larger LLM, reviewing a query and determining whether text or code is suitable for the problem, and generating a prompt for the larger LLM to use a coding method or textual reasoning to answer the query.
Q: What are the benefits of CodeSteer?
A: CodeSteer can improve the problem-solving capabilities of LLMs for complex tasks, achieving higher accuracy and efficiency.
Q: What is SymBench?
A: SymBench is a dataset of 37 complex symbolic tasks, including spatial reasoning, mathematics, order reasoning, and optimization, built by the researchers to fine-tune and test CodeSteer.
Q: What are the future directions of CodeSteer?
A: The researchers want to streamline CodeSteer to speed up its iterative prompting process and study how to effectively fine-tune a unified model with the ability to switch between textual reasoning and code generation.

Adam Smith – Tech Writer & Blogger

Adam Smith is a passionate technology writer with a keen interest in emerging trends, gadgets, and software innovations. With over five years of experience in tech journalism, he has contributed insightful articles to leading tech blogs and online publications. His expertise covers a wide range of topics, including artificial intelligence, cybersecurity, mobile technology, and the latest advancements in consumer electronics. Adam excels in breaking down complex technical concepts into engaging and easy-to-understand content for a diverse audience. Beyond writing, he enjoys testing new gadgets, reviewing software, and staying up to date with the ever-evolving tech industry. His goal is to inform and inspire readers with in-depth analysis and practical insights into the digital world.

© Copyright 2025. All Rights Reserved by Technology Hive.
