• 2025.12.15 (Mon)
  • All articles
  • LOGIN
  • JOIN
Global Economic Times
APEC2025KOREA가이드북
  • Synthesis
  • World
  • Business
  • Industry
  • ICT
  • Distribution Economy
  • Well+Being
  • Travel
  • Eco-News
  • Education
  • Korean Wave News
  • Opinion
  • Arts&Culture
  • Sports
  • People & Life
  • Column
    • Cho Kijo Column
    • Lee Yeon-sil Column
    • Ko Yong-chul Column
    • Cherry Garden Story
  • Photo News
  • New Book Guide
MENU
 
Home > Synthesis

South Korean AI Models Flunk College Entrance Math Exams, Lagging Far Behind Global Leaders

Yim Kwangsoo Correspondent / Updated : 2025-12-15 07:01:13
  • -
  • +
  • Print

(C) Seeking Alpha


SEOUL— A recent performance comparison of South Korea's leading large language models (LLMs), often dubbed "National AI" contenders, revealed a significant gap in mathematical problem-solving ability compared to their international counterparts. The domestic models largely failed to achieve passing grades on standardized mathematics tests, including the highly challenging Suneung (College Scholastic Ability Test).

A research team led by Professor Kim Jong-rak of Sogang University's Department of Mathematics conducted the rigorous assessment. They tested five major South Korean LLMs—Upstage’s Solar Pro-2, LG AI Research’s Exaone 4.0.1, Naver’s HCX-007, SK Telecom’s A.X 4.0 (72B), and NCSOFT’s lightweight model Llama Varco 8B Instruct—against five frontier international models, including GPT-5.1, Gemini 3 Pro Preview, Claude Opus 4.5, Grok 4.1 Fast, and DeepSeek V3.2.

Rigorous Testing Methodology

The researchers administered a total of 50 mathematics problems across two categories:

Suneung (CSAT) Math (20 Problems): The 20 questions were selected as the most difficult from the common subjects, Probability and Statistics, Calculus, and Geometry sections of the highly competitive South Korean CSAT.
Essay-Type/Advanced Math (30 Problems): This set comprised questions from the entrance exams of 10 domestic universities, 10 questions from the Indian university entrance examination, and 10 questions from the mathematics section of the graduate school entrance exam for the University of Tokyo's Faculty of Engineering.
In the initial test comprising the 20 Suneung and 30 essay-type problems, the performance disparity was stark. International models consistently scored high, ranging from 76 to 92 points. In sharp contrast, the South Korean models struggled immensely. Only Solar Pro-2 managed a score of 58 points, while the others languished in the 20s. NCSOFT's Llama Varco 8B Instruct recorded the lowest score, a mere 2 points.

The research team noted that even after designing the domestic models to use Python as a tool to enhance problem-solving accuracy beyond simple inference, the results remained discouraging.

Second Test: EntropyMath Dataset Confirms Lag

The researchers conducted a second test using a proprietary dataset they developed called 'EntropyMath,' which features 100 questions of varying difficulty, from university-level to professorial research standards. Ten selected questions from this set were presented to the 10 AI models.

The results mirrored the first test: International models achieved scores between 82.8 and 90 points, whereas the domestic models were significantly lower, ranging from 7.1 to 53.3 points.

In a third attempt, where the models were given three chances to solve a problem for a correct answer, the international models again demonstrated dominance. Grok 4.1 Fast achieved a perfect score, and the rest of the overseas models scored 90 points. The best-performing domestic model, Solar Pro-2, scored 70 points, followed by Exaone at 60 points. The other domestic contenders, HCX-007, A.X 4.0, and Llama Varco 8B Instruct, recorded 40, 30, and 20 points, respectively.

Call for Improvement and Future Plans

"There was a lot of inquiry about why there was no evaluation of the five domestic sovereign AI models on Suneung problems, so our team conducted this test," Professor Kim explained. "It confirmed that the level of domestic models is significantly behind that of the overseas frontier models."

The research team acknowledged that the domestic models tested were based on existing public versions and plan to conduct a re-evaluation once the updated, dedicated "National AI" versions from each team are officially released.

Professor Kim also announced the launch of a dedicated mathematics leaderboard based on the EntropyMath dataset, with the goal of expanding it to an international standard. He added that the team will improve their proprietary problem-generation algorithms and pipelines to create specialized datasets for domains beyond mathematics, including science, manufacturing, and culture, to contribute to the performance enhancement of domain-specific AI models.

The study was jointly supported by Sogang University's Institute of Mathematical Sciences and Data Science (IMDS) and Deep Fountain.

[Copyright (c) Global Economic Times. All Rights Reserved.]

  • #globaleconomictimes
  • #micorea
  • #mykorea
  • #nammidonganews
  • #singaporenewsk
  • #Samsung
  • #Daewoo
  • #Hyosung
  • #Apple
  • #korea
Yim Kwangsoo Correspondent
Yim Kwangsoo Correspondent

Popular articles

  • North Korea Publicly Executes ‘Big-Hand’ Business Couple Over ‘Arrogance’ and Anti-State Charges

  • Kim Whanki's Abstraction Fetches $8.4 Million in New York, Securing Second Highest Price for Korean Art

  • Global Derivatives Market Grinds to Halt as Cooling Failure Cripples CME Group

I like it
Share
  • Facebook
  • X
  • Kakaotalk
  • LINE
  • BAND
  • NAVER
  • https://www.globaleconomictimes.kr/article/1065563947796469 Copy URL copied.
Comments >

Comments 0

Weekly Hot Issue

  • South Korea Launches $115 Million Export Voucher Program to Boost SME Global Reach
  • Extension Granted for '2026 Honors for SME Contributors' Application
  • 44% of Recent Construction Projects Report Deficits, Industry Survey Finds
  • KRX Temporarily Slashes Stock Trading Fees by 20-40% to Counter ATS Rival
  • Lotte Mart Launches Major Imported Fruit Discount Event Amid High Prices
  • Korean Drone Exports Soar 58% to $28 Million, Expanding Global Footprint

Most Viewed

1
Choi Bun-do, Chairman of PTV Group, Assumes Presidency of the Korean Chamber of Commerce and Industry in South Central Vietnam
2
From Court to Content: French Tennis Star Océane Dodin Trades Racquet for OnlyFans, Eyes $5M in a Year
3
Lee Dismisses Vice Minister Amid Allegations of Misconduct and Vetting Gaps
4
NVIDIA Lobby Succeeds? U.S. Bill Expected to Drop AI Chip Export Restrictions
5
US Layoffs Surge: Over 1.17 Million Job Cuts Announced in First 11 Months of 2025
광고문의
임시1
임시3
임시2

Hot Issue

South Korean AI Models Flunk College Entrance Math Exams, Lagging Far Behind Global Leaders

KRX Temporarily Slashes Stock Trading Fees by 20-40% to Counter ATS Rival

Israel Condemns Australia After Sydney Shooting, Citing 'Fueling' of Anti-Semitism

Lotte Mart Launches Major Imported Fruit Discount Event Amid High Prices

Let’s recycle the old blankets in Jeju Island’s closet instead of incinerating them.

Global Economic Times
korocamia@naver.com
CEO : LEE YEON-SIL
Publisher : KO YONG-CHUL
Registration number : Seoul, A55681
Registration Date : 2024-10-24
Youth Protection Manager: KO YONG-CHUL
Singapore Headquarters
5A Woodlands Road #11-34 The Tennery. S'677728
Korean Branch
Phone : +82(0)10 4724 5264
#304, 6 Nonhyeon-ro 111-gil, Gangnam-gu, Seoul
Copyright © Global Economic Times All Rights Reserved
  • 에이펙2025
  • APEC2025가이드북TV
  • 독도는우리땅
Search
Category
  • All articles
  • Synthesis
  • World
  • Business
  • Industry
  • ICT
  • Distribution Economy
  • Well+Being
  • Travel
  • Eco-News
  • Education
  • Korean Wave News
  • Opinion
  • Arts&Culture
  • Sports
  • People & Life
  • Column 
    • 전체
    • Cho Kijo Column
    • Lee Yeon-sil Column
    • Ko Yong-chul Column
    • Cherry Garden Story
  • Photo News
  • New Book Guide
  • Multicultural News
  • Jobs & Workers