AlphaGo vs. Humanity

And the Computer Beat the Human Again!

Computer and Informatics Society (Bilgisayar ve Bilişim Topluluğu)

Asst. Prof. Dr. İzzet PEMBECİ - 08.04.2016
http://github.com/pembeci/alphago_sunum

Talk Outline

Why are we here?
Where did we come from? (Go, history of AI, games and AI)
What just happened? (AlphaGo, Deep Learning)
Where are we going? (SKYNET!!!!)

Why Are We Here?

DeepMind: founded in 2010, acquired by Google in 2014 ($650 million)

Demis Hassabis
Greek Cypriot & Singaporean

Shane Legg
New Zealand

Mustafa Suleyman
???

The Game of Go

China (4th century BC, war, art), Korea (5th-7th centuries), Japan (7th century, schools), the West (1900s)
Turkey (1990s, Alpar Kılınç, Shibumi/Trevanian, Türkiye Go Oyuncuları Derneği, the Turkish Go Players Association)
Philosophical dimensions
Simple rules
Players of different strengths (30k → 1k → 1d → 9d → 1p → 9p)
How do I learn: the web, problems, books, Go servers (IGS, KGS, clients)

Basic Rules of Go

Chess-like mechanics: two players, black and white stones of equal value, moves are played on the points (intersections)
Rule 1 (liberty): Every stone (or group of stones) must breathe through adjacent empty points; stones that cannot breathe die.
Rule 2 (ko): The game (board) may not return to an earlier position.
The other concepts derive from these two
Scoring: surrounded territory + captured stones
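
Rule 1 can be sketched in a few lines of code. The snippet below is only an illustration, assuming a hypothetical board encoding that is not part of the slides: a list of strings where "." is an empty point, "B" a black stone and "W" a white stone. It flood-fills a group of connected stones and collects its liberties; a group with no liberties is captured.

```python
def group_and_liberties(board, row, col):
    """Flood-fill from (row, col); return the connected group and its liberties."""
    color = board[row][col]
    if color == ".":
        raise ValueError("no stone at the given point")
    size = len(board)
    group, liberties = set(), set()
    stack = [(row, col)]
    while stack:
        r, c = stack.pop()
        if (r, c) in group:
            continue
        group.add((r, c))
        # Only the four orthogonal neighbours count in Go.
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if not (0 <= nr < size and 0 <= nc < size):
                continue
            if board[nr][nc] == ".":
                liberties.add((nr, nc))      # an empty neighbour is a liberty
            elif board[nr][nc] == color:
                stack.append((nr, nc))       # same colour: part of the group
    return group, liberties


def is_captured(board, row, col):
    """A group dies when it has no liberties left (Rule 1)."""
    _, liberties = group_and_liberties(board, row, col)
    return not liberties


# Hypothetical 5x5 position, for illustration only.
board = [
    ".B...",
    "BWB..",
    ".B.WW",
    ".....",
    ".....",
]
print(is_captured(board, 1, 1))                   # the surrounded white stone
print(len(group_and_liberties(board, 2, 3)[1]))   # liberties of the two-stone white group
```

The lone white stone at (1, 1) is surrounded on all four sides and has no liberties, while the two-stone white group on the right edge still breathes through five empty points.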

Go Wisdom - Strategy

U.S. Army’s Strategic Studies Institute - Learning from the Stones: A Go Approach to Mastering China’s Strategic Concept of Shi:

In this monograph, the author uses the ancient game of Go as a metaphor for the Chinese approach to strategy. He shows that this is very different than the linear method that underlies American strategy. By better understanding Go, he argues, American strategists could better understand Chinese strategy.

Go Wisdom - Proverbs

Give your opponent what he wants.
He who rides on a tiger can never get off.
A rich man should not pick quarrels.
Greed for the win takes the win away.
Don't try to win. Try not to lose.
You can play Go but don't let Go play you.
Never wrestle with a pig.
Make a fist before striking.
Play urgent moves before big moves.
Make a feint to the East while attacking in the West.
Play fast, die fast. Play slow, die slow.

Artificial Intelligence (AI)

A long-established field
Interdisciplinary
Strong AI, Weak AI
Still finding its way

History of AI

1950 - Turing Test
1950 - Claude Shannon: analysis of chess as a search problem
1951 - Programs that play checkers and chess
1956 - Dartmouth College summer AI conference
1959 - MIT AI Lab
1961 - Symbolic computing, calculus problems
1965 - ELIZA, the first chatbot
1966 - Machine translation is not working. Natural Language Processing (NLP) goes dormant.
1969 - Minsky & Papert, Perceptrons. Artificial neural networks go dormant.
1970s - The long AI winter

History of AI

1980s - Expert systems, commercialization
1986 - Artificial neural networks are revived. Backpropagation (1974)
1991 - DART, 1991 Gulf War
1994 - Chinook, checkers (English draughts) champion
1995 - No Hands Across America, 4,500 km, autonomous except for throttle and brakes
1997 - Deep Blue defeats Kasparov.
1997 - Othello world champion
1998 - Furby, robots enter homes
1998 - TD-Gammon, backgammon at near world-champion level
2005 - Honda, ASIMO robot
2009 - Google, self-driving car
2011 - IBM Watson, Jeopardy! champion
2013 - HRP2 (Japan/Google): drive a vehicle, walk over debris, climb a ladder, remove debris, walk through doors, cut through a wall, close valves and connect a hose
2016 - AlphaGo

Making Computers Play Games

A testbed for AI: hard and fun
Well defined, few rules, many possibilities (search space)
Uncertainty (opponent, chance), time limits, incomplete information
Done: Checkers (Chinook, world champion 1994), Chess (Deep Blue vs. Kasparov 1997, Deep Fritz vs. Kramnik 2006), Othello (Logistello, world champion 1997), Scrabble (Maven, world champion 1998), Backgammon (TD-Gammon, 1998, nearly world champion)
New targets: Bridge, Poker, Go, board games (Settlers of Catan), MMORPGs, General Game Playing

Game Trees

How big can they get: table

Exponential Growth

Complexity Class	N = 10	N = 100	N = 1,000	...	N = 1,000,000
O(1)	1x10^-7 seconds	1x10^-7 seconds	1x10^-7 seconds	...	1x10^-7 seconds
O(log₂N)	3.3x10^-7 seconds	6.6x10^-7 seconds	10x10^-7 seconds	...	20x10^-7 seconds
O(N)	1x10^-7 seconds	1x10^-6 seconds	1x10^-5 seconds	...	1x10^-3 seconds
O(N²)	1x10^-6 seconds	1x10^-4 seconds	1x10^-2 seconds	...	2.7 hours
O(N³)	1x10^-5 seconds	1x10^-2 seconds	10 seconds	...	3x10³ years
O(2^N)	1x10^-5 seconds	4x10²¹ centuries	forgetaboutit	...	forgetaboutit
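
The last row is the one that matters for game trees: with roughly b legal moves per position and games lasting about d moves, a full game tree has on the order of b^d leaf positions. The sketch below estimates that size as a power of ten. The branching factors and depths are commonly cited rough approximations (chess: b ≈ 35, d ≈ 80; Go: b ≈ 250, d ≈ 150); they are assumptions added here for illustration and do not appear in the slides.

```python
import math


def log10_tree_size(branching_factor, depth):
    """Return log10(b ** d), i.e. the exponent of the tree size as a power of ten."""
    return depth * math.log10(branching_factor)


# Approximate, commonly cited figures (assumptions, not from the slides).
games = {"chess": (35, 80), "Go": (250, 150)}

for name, (b, d) in games.items():
    exponent = log10_tree_size(b, d)
    print(f"{name}: roughly 10^{exponent:.0f} positions in the full game tree")
```

Working with logarithms keeps the numbers readable; the point is only the gap of a few hundred orders of magnitude between the two games, not the exact exponents.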

Evaluation Function

f(P) = 200(K-K') + 9(Q-Q') + 5(R-R') + 3(B-B'+N-N') + (P-P') - 0.5(D-D'+S-S'+I-I') + 0.1(M-M') + ...
K, Q, R, B, N, P are the number of white kings, queens, rooks, bishops, knights and pawns on the board.
D, S, I are doubled, backward and isolated white pawns.
M represents white mobility (measured, say, as the number of legal moves available to White)
Primed symbols (K', Q', ..., M') are the same quantities for Black.
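
The formula translates almost directly into code. This is a minimal sketch, assuming each side is described by a hypothetical dictionary of counts keyed by the same letters as in the formula; extracting those counts from a real chess position is left out.

```python
# Material weights from the formula above.
MATERIAL_WEIGHTS = {"K": 200, "Q": 9, "R": 5, "B": 3, "N": 3, "P": 1}
PAWN_STRUCTURE_KEYS = ("D", "S", "I")   # doubled, backward, isolated pawns


def evaluate(white, black):
    """Score a position from White's point of view; positive favours White."""
    score = sum(w * (white[p] - black[p]) for p, w in MATERIAL_WEIGHTS.items())
    score -= 0.5 * sum(white[k] - black[k] for k in PAWN_STRUCTURE_KEYS)
    score += 0.1 * (white["M"] - black["M"])
    return score


# Hypothetical counts: White is a rook up, Black has one doubled pawn,
# and White has two more legal moves available.
white = {"K": 1, "Q": 1, "R": 2, "B": 2, "N": 2, "P": 8, "D": 0, "S": 0, "I": 0, "M": 20}
black = {"K": 1, "Q": 1, "R": 1, "B": 2, "N": 2, "P": 8, "D": 1, "S": 0, "I": 0, "M": 18}
print(evaluate(white, black))
```

In this example the rook contributes 5, Black's doubled pawn 0.5 and the mobility edge 0.2, so the position scores about 5.7 in White's favour.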

Partial Search (Minimax, Alpha-Beta pruning)
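
A minimal sketch of depth-limited minimax with alpha-beta pruning follows. The `children` and `evaluate` callbacks and the toy tree are hypothetical, chosen only for illustration: nested lists stand for positions and numbers for statically evaluated leaves. A real engine would generate moves in `children` and call an evaluation function at the search horizon.

```python
import math


def alphabeta(node, depth, alpha, beta, maximizing, children, evaluate):
    """Depth-limited minimax with alpha-beta pruning."""
    kids = children(node)
    if depth == 0 or not kids:
        return evaluate(node)            # static evaluation at the horizon
    if maximizing:
        value = -math.inf
        for child in kids:
            value = max(value, alphabeta(child, depth - 1, alpha, beta,
                                         False, children, evaluate))
            alpha = max(alpha, value)
            if alpha >= beta:
                break                    # the minimizer would never allow this line
        return value
    value = math.inf
    for child in kids:
        value = min(value, alphabeta(child, depth - 1, alpha, beta,
                                     True, children, evaluate))
        beta = min(beta, value)
        if alpha >= beta:
            break                        # the maximizer already has something better
    return value


# Toy tree with hypothetical leaf values.
tree = [[3, 5, 6], [2, 9], [1, 2]]
visited = []                             # leaves that were actually evaluated


def children(node):
    return node if isinstance(node, list) else []


def evaluate(node):
    visited.append(node)
    return node


best = alphabeta(tree, 2, -math.inf, math.inf, True, children, evaluate)
print(best, visited)
```

On this tree the search returns 3 and evaluates only the leaves 3, 5, 6, 2 and 1; the leaves 9 and 2 at the end of the second and third branches are pruned because those branches can no longer beat the first one.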

Go and Search

Deep Learning

Multilayer artificial neural networks
Mathematical/algorithmic flaws were fixed
Computers got faster. Parallelization, GPUs.
Convolutional networks. Vision, image understanding.
Applications: object recognition, Visual Question Answering, fonts, natural language, a Word2Vec application, music, DeepArt
Details (introduction to ANNs and Deep Learning)
Other resources (courses, libraries)
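
To make "multilayer" concrete, here is a tiny two-layer network written in plain Python. It is only a sketch: the weights are picked by hand (an assumption for illustration), whereas real deep learning systems learn them from data with backpropagation. The network computes XOR, the classic function a single-layer perceptron cannot represent.

```python
def relu(x):
    """Rectified linear unit, a common hidden-layer activation."""
    return max(0.0, x)


def dense(inputs, weights, biases, activation):
    """One fully connected layer: activation(W @ inputs + b), in plain Python."""
    return [
        activation(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def xor_net(x1, x2):
    # Hidden layer: h1 = relu(x1 + x2), h2 = relu(x1 + x2 - 1)
    hidden = dense([x1, x2], [[1.0, 1.0], [1.0, 1.0]], [0.0, -1.0], relu)
    # Output layer (linear): out = h1 - 2 * h2
    (out,) = dense(hidden, [[1.0, -2.0]], [0.0], lambda v: v)
    return out


for a in (0, 1):
    for b in (0, 1):
        print(a, b, xor_net(a, b))
```

The hidden layer is what makes the difference: without it, no choice of weights separates the inputs (0, 1) and (1, 0) from (0, 0) and (1, 1).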

AlphaGo Architecture (Deep Learning, Reinforcement Learning)

AlphaGo Architecture (Monte Carlo Tree Search)
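
The four steps of Monte Carlo Tree Search (selection, expansion, simulation, backpropagation) can be sketched on a toy game. This is not AlphaGo's implementation: it assumes a hypothetical stand-in game (Nim: players alternately take 1 to 3 stones, and whoever takes the last stone wins), uses the plain UCB1 rule for selection, and uses uniformly random rollouts, where AlphaGo instead guides the search with neural networks.

```python
import math
import random


class Node:
    def __init__(self, stones, parent=None, move=None):
        self.stones = stones          # stones left; the player to move takes 1-3
        self.parent = parent
        self.move = move              # move that led from the parent to this node
        self.children = []
        self.untried = [m for m in (1, 2, 3) if m <= stones]
        self.visits = 0
        self.wins = 0.0               # wins for the player who moved INTO this node


def ucb1(child, parent_visits, c=1.4):
    """Balance exploitation (win rate) against exploration (few visits)."""
    return (child.wins / child.visits
            + c * math.sqrt(math.log(parent_visits) / child.visits))


def rollout(stones, rng):
    """Random playout; True if the player to move at `stones` ends up winning."""
    mover_is_start = True
    while stones > 0:
        stones -= rng.randint(1, min(3, stones))
        if stones == 0:
            return mover_is_start     # whoever took the last stone wins
        mover_is_start = not mover_is_start
    return False                      # no stones left: the previous player already won


def mcts(root_stones, iterations=3000, seed=0):
    rng = random.Random(seed)
    root = Node(root_stones)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend through fully expanded nodes using UCB1.
        while not node.untried and node.children:
            node = max(node.children, key=lambda ch: ucb1(ch, node.visits))
        # 2. Expansion: add one unexplored move as a new child.
        if node.untried:
            move = node.untried.pop()
            child = Node(node.stones - move, parent=node, move=move)
            node.children.append(child)
            node = child
        # 3. Simulation: play randomly to the end of the game.
        to_move_wins = rollout(node.stones, rng)
        # 4. Backpropagation: flip the result at every level on the way up.
        result = 0.0 if to_move_wins else 1.0
        while node is not None:
            node.visits += 1
            node.wins += result
            result = 1.0 - result
            node = node.parent
    # The most visited move is the most trusted one.
    return max(root.children, key=lambda ch: ch.visits).move


print(mcts(5))
```

With five stones the sound move is to take one, leaving the opponent a multiple of four; the search should settle on it because the other two moves let the opponent take everything that is left.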

Where Are We Going?

AlphaGo: hybrid; vision and language applications; plenty of attention.
General AI: the problem is still not well defined. We are still tinkering with Logos.
Science is a human activity. Easy things first.
Technology is only a tool (neither good nor bad)???
AI will take the jobs. Universal Basic Income.

Where Are We Going? Beyond AI.

Facebook knows you better.
Internet of Things. Security. Big Brother.
CRISPR gene editing. Bioethics.