Global Economic Times

South Korean AI Models Flunk College Entrance Math Exams, Lagging Far Behind Global Leaders

Yim Kwangsoo Correspondent / Updated : 2025-12-15 07:01:13


SEOUL: A recent performance comparison of South Korea's leading large language models (LLMs), often dubbed "National AI" contenders, found that they trail their international counterparts by a wide margin in mathematical problem-solving. The domestic models largely failed to reach passing scores on mathematics exam problems, including questions from the highly challenging Suneung (College Scholastic Ability Test, or CSAT).

A research team led by Professor Kim Jong-rak of Sogang University's Department of Mathematics conducted the assessment. The team tested five major South Korean LLMs (Upstage’s Solar Pro-2, LG AI Research’s Exaone 4.0.1, Naver’s HCX-007, SK Telecom’s A.X 4.0 (72B), and NCSOFT’s lightweight model Llama Varco 8B Instruct) against five frontier international models: GPT-5.1, Gemini 3 Pro Preview, Claude Opus 4.5, Grok 4.1 Fast, and DeepSeek V3.2.

Rigorous Testing Methodology

The researchers administered a total of 50 mathematics problems across two categories:

Suneung (CSAT) Math (20 Problems): These questions were selected as the most difficult from the common-subject, Probability and Statistics, Calculus, and Geometry sections of the highly competitive South Korean CSAT.

Essay-Type/Advanced Math (30 Problems): This set comprised 10 questions drawn from the entrance exams of 10 domestic universities, 10 questions from the Indian university entrance examination, and 10 questions from the mathematics section of the graduate school entrance exam for the University of Tokyo's Faculty of Engineering.

In the initial test, comprising the 20 Suneung and 30 essay-type problems, the performance disparity was stark. The international models scored between 76 and 92 points. The South Korean models, by contrast, struggled. Solar Pro-2 scored highest among them at 58 points, while the others scored in the 20s or lower; NCSOFT's Llama Varco 8B Instruct recorded the lowest score, just 2 points.

The research team noted that the results remained discouraging even after the domestic models were set up to use Python as a tool, a step intended to improve problem-solving accuracy beyond what simple inference achieves.

Second Test: EntropyMath Dataset Confirms Lag

The researchers conducted a second test using a proprietary dataset they developed called 'EntropyMath,' which features 100 questions of varying difficulty, from university-level to professorial research standards. Ten selected questions from this set were presented to the 10 AI models.

The results mirrored the first test: the international models achieved scores between 82.8 and 90 points, whereas the domestic models scored significantly lower, ranging from 7.1 to 53.3 points.

In a third round, in which the models were given three chances to reach a correct answer on each problem, the international models again dominated. Grok 4.1 Fast achieved a perfect score, and the rest of the overseas models scored 90 points. The best-performing domestic model, Solar Pro-2, scored 70 points, followed by Exaone at 60 points. The other domestic contenders, HCX-007, A.X 4.0, and Llama Varco 8B Instruct, recorded 40, 30, and 20 points, respectively.
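The article does not spell out how the three-chance scores were calculated. One plausible reading, consistent with every reported score being a multiple of 10 on a 10-question set, is that each problem carries equal weight and counts as solved if any of the three attempts is correct. The Python sketch below illustrates that assumed rule only; the function name `score_with_retries` and the sample outcomes are hypothetical and are not taken from the research team's materials.

```python
from typing import Sequence


def score_with_retries(
    attempts: Sequence[Sequence[bool]], max_attempts: int = 3
) -> float:
    """Return a 0-100 score under an ASSUMED rule: a problem counts as
    solved if any of its first `max_attempts` tries is correct, and all
    problems carry equal weight."""
    if not attempts:
        raise ValueError("need at least one problem")
    solved = sum(1 for tries in attempts if any(tries[:max_attempts]))
    return 100.0 * solved / len(attempts)


# Hypothetical outcomes for 10 problems (True = correct attempt).
# These values are invented for illustration, not measured results.
example = [
    [True],
    [False, True],
    [False, False, True],
    [True],
    [False, False, False],
    [True],
    [False, True],
    [False, False, False],
    [True],
    [False, False, False],
]

print(score_with_retries(example))  # 70.0
```

Under this assumed rule, a model that eventually solves 7 of 10 problems scores 70 points, regardless of whether it needed one try or three.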

Call for Improvement and Future Plans

"There was a lot of inquiry about why there was no evaluation of the five domestic sovereign AI models on Suneung problems, so our team conducted this test," Professor Kim explained. "It confirmed that the level of domestic models is significantly behind that of the overseas frontier models."

The research team acknowledged that the domestic models tested were existing public versions and said it plans to conduct a re-evaluation once each team officially releases its updated, dedicated "National AI" version.

Professor Kim also announced the launch of a dedicated mathematics leaderboard based on the EntropyMath dataset, with the goal of expanding it to an international standard. He added that the team will improve their proprietary problem-generation algorithms and pipelines to create specialized datasets for domains beyond mathematics, including science, manufacturing, and culture, to contribute to the performance enhancement of domain-specific AI models.

The study was jointly supported by Sogang University's Institute of Mathematical Sciences and Data Science (IMDS) and Deep Fountain.

[Copyright (c) Global Economic Times. All Rights Reserved.]
