Bridging the gap between LLMs and symbolic reasoning

Bridging the gap between LLMs and symbolic reasoning


Researchers have introduced a novel approach called natural language embedded programs (NLEPs) to improve the numerical and symbolic reasoning capabilities of large language models (LLMs). The technique involves prompting LLMs to generate and execute Python programs to solve user queries, then output solutions in natural language.

While LLMs like ChatGPT have demonstrated impressive performance on various tasks, they often struggle with problems requiring numerical or symbolic reasoning.

NLEPs follow a four-step problem-solving template: calling necessary packages, importing natural language representations of required knowledge, implementing a solution-calculating function, and outputting results as natural language with optional data visualisation.

This approach offers several advantages, including improved accuracy, transparency, and efficiency. Users can investigate generated programs and fix errors directly, avoiding the need to rerun entire models for troubleshooting. Additionally, a single NLEP can be reused for multiple tasks by replacing certain variables.

bybit

The researchers found that NLEPs enabled GPT-4 to achieve over 90% accuracy on various symbolic reasoning tasks, outperforming task-specific prompting methods by 30%

Beyond accuracy improvements, NLEPs could enhance data privacy by running programs locally, eliminating the need to send sensitive user data to external companies for processing. The technique may also boost the performance of smaller language models without costly retraining.

However, NLEPs rely on a model’s program generation capability and may not work as well with smaller models trained on limited datasets. Future research will explore methods to make smaller LLMs generate more effective NLEPs and investigate the impact of prompt variations on reasoning robustness.

The research, supported in part by the Center for Perceptual and Interactive Intelligence of Hong Kong, will be presented at the Annual Conference of the North American Chapter of the Association for Computational Linguistics later this month.

(Photo by Alex Azabache)

See also: Apple is reportedly getting free ChatGPT access

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

Tags: ai, artificial intelligence, development, large language models, llm, natural language, nlep



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *

Pin It on Pinterest

CryptoKorner
Changelly
CryptoKorner
Bridging the gap between LLMs and symbolic reasoning
bybit
Fiverr
Mixture-of-recursions delivers 2x faster inference—Here's how to implement it
Brain made up of dollar symbols as Google releases the stable version of Gemini 2.5 Flash-Lite and they've essentially created a model that's designed to be the workhorse for developers who need to build things at scale without breaking the bank.
Top 15+ Most Affordable Proxy Providers 2025
How CrowdStrike's 78-minute outage reshaped enterprise cybersecurity
bitcoin
ethereum
bnb
xrp
cardano
solana
dogecoin
polkadot
shiba-inu
dai
Free book
Ledger
Bitcoin, Stocks And Altcoins Move Toward New Houses
Mixture-of-recursions delivers 2x faster inference—Here's how to implement it
Online Pastor Indicted for $3.4M Crypto Scam
Solana Rises 20% in a Week, But Analyst Warns of LUNA-Like Breakdown Ahead
Max Keiser Blasts Trump’s $2B Bitcoin Play: 'He’s Front Running Americans'
Bitcoin, Stocks And Altcoins Move Toward New Houses
Mixture-of-recursions delivers 2x faster inference—Here's how to implement it
Online Pastor Indicted for $3.4M Crypto Scam
Solana Rises 20% in a Week, But Analyst Warns of LUNA-Like Breakdown Ahead
ar
zh-CN
nl
en
fr
de
it
pt
ru
es
en
bitcoin
ethereum
xrp
tether
bnb
solana
usd-coin
dogecoin
staked-ether
cardano
bitcoin
ethereum
xrp
tether
bnb
solana
usd-coin
dogecoin
staked-ether
cardano