Google updates Gemini 2.0 Flash Thinking; benchmark scores now rival OpenAI o1 | Blognone

By: lew

on 22 January 2025 - 12:44

Google has released Gemini 2.0 Flash Thinking version exp-01-21, an update to the previous version from last December. The new version improves sharply on advanced benchmarks.

This version scores 73.3% on AIME (math) and 74.2% on GPQA Diamond (science), compared with OpenAI o1 at 78% on AIME and 76% on GPQA. Although it still trails, Gemini 2.0 Flash Thinking runs fairly fast, accepts up to 1 million tokens of input, and can run code automatically.
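To make "still trails" concrete, the gap can be worked out from the figures quoted above. The following is only a small illustrative sketch: the dictionary layout and the `gap_in_points` helper are hypothetical names introduced here, and the numbers are the ones reported in this article, not independently verified.

```python
# Scores as reported in the article (percent). The structure and names
# below are illustrative assumptions, not part of any official API.
scores = {
    "AIME": {"gemini_flash_thinking": 73.3, "openai_o1": 78.0},
    "GPQA Diamond": {"gemini_flash_thinking": 74.2, "openai_o1": 76.0},
}


def gap_in_points(benchmark: str) -> float:
    """Return how many percentage points o1 leads by on a benchmark."""
    row = scores[benchmark]
    # Round to one decimal place to hide floating-point noise.
    return round(row["openai_o1"] - row["gemini_flash_thinking"], 1)


for name in scores:
    print(f"{name}: o1 leads by {gap_in_points(name)} points")
```

By these reported numbers the gap is larger on the math benchmark than on the science benchmark.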

The feature set of this think-before-answering model is not yet on par with the regular models: it still lacks automatic web search and JSON output. For now the model is available only in Google AI Studio.

Source - @demishassabis

Comments

By: Fzo

on 22 January 2025 - 13:05 #1331892

Google has been holding the top of the leaderboard for a long stretch this round. I've been trying exp-01-21 since last night and it really is good, but personally I still love deepseek.

WE ARE THE 99%

By: panurat2000

on 22 January 2025 - 14:01 #1331899

"ทำได้ดีขึ้นอย่างก้าวกระโด" ("improves by leaps and bounds")

Typo: ก้าวกระโด => ก้าวกระโดด

By: Iamz

on 22 January 2025 - 14:03 #1331900

I really don't like graphs that don't start from 0.

By: hisoft

on 22 January 2025 - 21:49 #1331923

Flash is also the one that's faster than the regular model, right?

By: tontpong

on 23 January 2025 - 01:47 #1331928 Reply to:1331923

You mean it's not just that it has fewer parameters, but that it has also been tuned for efficiency/performance?

By: hisoft

on 23 January 2025 - 07:44 #1331935 Reply to:1331928

I mean that it trades quality for speed.

By: lew

on 23 January 2025 - 09:36 #1331946 Reply to:1331923

If you look at Google's own model card, it turns out to be about on par with Gemini Pro.

Its speed is probably about the same as the original Flash, but it spends extra time thinking. In the output, it can also tag whether each piece of text is thinking or the actual output.

lewcpe.com , @wasonliw

By: hisoft

on 23 January 2025 - 10:11 #1331955 Reply to:1331946

I mean that if they built a Thinking model on Gemini Pro, without the emphasis on speed that Gemini Flash has, the scores might go even further than this.

Copyright Notice

Creative Commons Attribution 3.0 ©2004-2011 Blognone Crew
