David Silver (computer scientist)

514: 400: 183: 1796: 1776: 1870: 1670: 367: 305:

games directly from pixels. Silver led the AlphaGo project, culminating in the first program to defeat a top professional player in the full-size game of Go.

803: 918: 1840: 415:; Igor Babuschkin; Wojciech M Czarnecki; et al. (30 October 2019). "Grandmaster level in StarCraft II using multi-agent reinforcement learning". 310: 1512: 1860: 317:, which used the same AI to learn to play Go from scratch (learning only by playing itself and not from human games) before learning to play 275: 83: 593: 1028: 826:"ACM Prize in Computing Awarded to AlphaGo Developer: David Silver Recognized for Breakthrough Advances in Computer Game-Playing" 911: 1875: 1855: 1701: 750:; Chris J. Maddison; et al. (27 January 2016). "Mastering the game of Go with deep neural networks and tree search". 668: 1802: 1353: 1090: 875: 829: 481: 638: 1241: 1048: 904: 118: 1569: 1850: 1756: 1696: 1294: 328:

Silver is among the most published members of staff at Google DeepMind, with over 200,000 citations and has an

232: 1845: 1289: 978: 527: 509: 564: 1731: 1128: 1085: 1038: 1033: 1782: 1078: 1004: 355: 195: 29: 1406: 1341: 942: 850: 1807: 1665: 1304: 1135: 958: 283: 204: 136: 1865: 1706: 963: 1751: 1736: 1389: 1384: 1284: 1152: 933: 106: 49: 271:, where he was CTO and lead programmer, receiving several awards for technology and innovation. 1835: 1711: 1471: 1190: 1185: 411: 348: 294: 248: 208: 114: 88: 1741: 1726: 1691: 1379: 1279: 1147: 825: 690:; et al. (25 February 2015). "Human-level control through deep reinforcement learning". 240: 54: 1609: 1830: 1761: 1716: 1162: 1107: 953: 948: 220: 73: 399: 8: 1336: 1314: 1063: 1058: 1016: 968: 685: 513: 391: 182: 286:. His lectures on Reinforcement Learning are available on YouTube. Silver consulted for 1721: 1299: 1787: 1775: 1579: 1231: 1102: 1095: 854: 777: 769: 717: 709: 585: 545: 442: 434: 613: 1532: 1522: 1329: 1123: 1073: 1068: 1011: 999: 785: 761: 752: 725: 701: 692: 537: 450: 426: 417: 110: 259:(co-authored with Sylvain Gelly) was one of the strongest Go programs as of 2009. 1645: 1589: 1411: 1053: 973: 742: 359: 287: 200: 132: 309:

subsequently received an honorary 9 Dan Professional Certification; and won the

1619: 1584: 1574: 1399: 1157: 983: 669:"RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning" 395: 336: 268: 236: 140: 482:"David Silver: The unsung hero and intellectual powerhouse at Google DeepMind" 430: 1824: 1564: 1544: 1461: 1140: 773: 713: 599: 549: 505: 438: 412: 298: 267:

After graduating from university, Silver co-founded the video games company

1650: 1481: 896: 781: 721: 646: 589: 446: 251:, where he co-introduced the algorithms used in the first master-level 9×9 789: 729: 572:

Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence

454: 235:, graduating in 1997 with the Addison-Wesley award, and having befriended 1871:

Fellows of the Association for the Advancement of Artificial Intelligence

1746: 1517: 1426: 1421: 1043: 1021: 122: 765: 705: 1640: 1599: 1594: 1507: 1416: 1324: 1236: 1216: 1635: 1604: 1502: 1346: 1309: 1246: 1200: 1195: 1180: 747: 314: 252: 216: 69: 153: 1537: 1369: 541: 325:

in the same way, to higher levels than any other computer program.

279: 851:"Royal Society elects outstanding new Fellows and Foreign Members" 1660: 1497: 1451: 1374: 1274: 1269: 1221: 672: 529:

Reinforcement Learning and Simulation-Based Search in Computer Go

363: 329: 306: 212: 154:

Reinforcement learning and simulation-based search in computer Go

65: 239:

whilst at Cambridge. Silver returned to academia in 2004 at the

1675: 1655: 1527: 1319: 147: 173: 1476: 1456: 1446: 1441: 1436: 1431: 1394: 1226: 584: 322: 318: 302: 804:"Google DeepMind AlphaGo in U.K. Wins Innovation Grand Prix" 606: 1466: 614:"What the AI Behind AlphaGo Can Teach Us About Being Human" 467:

The Cambridge University List of Members up to 31 July 1998

368:

Association for the Advancement of Artificial Intelligence

562: 244: 255:

programs and graduated in 2009. His version of program

351:

for breakthrough advances in computer game-playing.

565:"Achieving Master Level Play in 9 × 9 Computer Go" 199:(born 1976) is a principal research scientist at 1822: 290:from its inception, joining full-time in 2013. 912: 475: 473: 926: 276:Royal Society University Research Fellowship 84:Royal Society University Research Fellowship 313:for innovation. He then led development of 919: 905: 679: 595:Artificial Intelligence: A Modern Approach 512: 470: 398: 301:, including a program that learns to play 181: 293:His recent work has focused on combining 499: 1861:Academics of University College London 1823: 525: 262: 1841:Alumni of Christ's College, Cambridge 900: 536:(PhD thesis). University of Alberta. 342: 1757:Generative adversarial network (GAN) 563:Sylvain Gelly; David Silver (2008). 405: 387: 385: 383: 686:Volodymyr Mnih; Koray Kavukcuoglu; 278:in 2011, and subsequently became a 13: 823: 736: 14: 1887: 479: 380: 366:. He was elected a Fellow of the 16:Computer scientist and researcher 1795: 1794: 1774: 868: 843: 817: 796: 661: 631: 358:(FRS) for his contributions to 1707:Recurrent neural network (RNN) 1697:Differentiable neural computer 578: 556: 519: 461: 1: 1752:Variational autoencoder (VAE) 1712:Long short-term memory (LSTM) 979:Computational learning theory 510:Mathematics Genealogy Project 373: 1876:Fellows of the Royal Society 1856:University of Alberta alumni 1732:Convolutional neural network 354:In 2021, Silver was elected 347:Silver was awarded the 2019 226: 7: 1727:Multilayer perceptron (MLP) 356:Fellow of the Royal Society 233:Christ's College, Cambridge 41:1976 (age 47–48) 10: 1892: 1803:Artificial neural networks 1717:Gated recurrent unit (GRU) 943:Differentiable programming 671:. 13 May 2015 – via 1770: 1684: 1628: 1557: 1490: 1362: 1262: 1255: 1209: 1173: 1136:Artificial neural network 1116: 992: 959:Automatic differentiation 932: 431:10.1038/S41586-019-1724-Z 284:University College London 207:. He has led research on 205:University College London 168: 164: 146: 137:University College London 128: 102: 95: 79: 61: 45: 37: 23: 964:Neuromorphic engineering 927:Differentiable computing 394:publications indexed by 1737:Residual neural network 1153:Artificial Intelligence 107:Artificial intelligence 50:University of Cambridge 876:"Elected AAAI Fellows" 526:Silver, David (2009). 349:ACM Prize in Computing 295:reinforcement learning 249:reinforcement learning 209:reinforcement learning 115:Reinforcement learning 89:ACM Prize in Computing 1851:Go (game) researchers 1692:Neural Turing machine 1280:Human image synthesis 639:"CSML | David Silver" 274:Silver was awarded a 241:University of Alberta 55:University of Alberta 1846:Computer programmers 1783:Computer programming 1762:Graph neural network 1337:Text-to-video models 1315:Text-to-image models 1163:Large language model 1148:Scientific computing 954:Statistical manifold 949:Information geometry 1129:In-context learning 969:Pattern recognition 766:10.1038/NATURE16961 706:10.1038/NATURE14236 486:businessinsider.com 335:of 93 according to 263:Career and research 203:and a professor at 1722:Echo state network 1610:Jürgen Schmidhuber 1305:Facial recognition 1300:Speech recognition 1210:Software libraries 343:Awards and honours 1818: 1817: 1580:Stephen Grossberg 1553: 1552: 760:(7587): 484–489. 700:(7540): 529–533. 586:Stuart J. Russell 425:(7782): 350–354. 311:Cannes Lion award 189: 188: 97:Scientific career 1883: 1808:Machine learning 1798: 1797: 1778: 1533:Action selection 1523:Self-driving car 1330:Stable Diffusion 1295:Speech synthesis 1260: 1259: 1124:Machine learning 1000:Gradient descent 921: 914: 907: 898: 897: 891: 890: 888: 886: 872: 866: 865: 863: 861: 855:royalsociety.org 847: 841: 840: 838: 836: 821: 815: 814: 812: 810: 800: 794: 793: 740: 734: 733: 683: 677: 676: 665: 659: 658: 656: 654: 649:on 24 April 2021 645:. Archived from 635: 629: 628: 626: 624: 610: 604: 603: 598:(3rd ed.). 582: 576: 575: 569: 560: 554: 553: 523: 517: 516: 503: 497: 496: 494: 492: 477: 468: 465: 459: 458: 409: 403: 402: 389: 198: 185: 180: 177: 175: 160: 111:Machine learning 32: 21: 20: 1891: 1890: 1886: 1885: 1884: 1882: 1881: 1880: 1866:Google DeepMind 1821: 1820: 1819: 1814: 1766: 1680: 1646:Google DeepMind 1624: 1590:Geoffrey Hinton 1549: 1486: 1412:Project Debater 1358: 1256:Implementations 1251: 1205: 1169: 1112: 1054:Backpropagation 988: 974:Tensor calculus 928: 925: 895: 894: 884: 882: 874: 873: 869: 859: 857: 849: 848: 844: 834: 832: 822: 818: 808: 806: 802: 801: 797: 741: 737: 684: 680: 667: 666: 662: 652: 650: 637: 636: 632: 622: 620: 612: 611: 607: 583: 579: 567: 561: 557: 524: 520: 504: 500: 490: 488: 478: 471: 466: 462: 410: 406: 390: 381: 376: 360:Deep Q-Networks 345: 288:Google DeepMind 265: 243:to study for a 229: 219:and co-lead on 201:Google DeepMind 194: 172: 158: 139: 135: 133:Google Deepmind 121: 117: 113: 109: 87: 72: 68: 53: 46:Alma mater 33: 28: 26: 17: 12: 11: 5: 1889: 1879: 1878: 1873: 1868: 1863: 1858: 1853: 1848: 1843: 1838: 1833: 1816: 1815: 1813: 1812: 1811: 1810: 1805: 1792: 1791: 1790: 1785: 1771: 1768: 1767: 1765: 1764: 1759: 1754: 1749: 1744: 1739: 1734: 1729: 1724: 1719: 1714: 1709: 1704: 1699: 1694: 1688: 1686: 1682: 1681: 1679: 1678: 1673: 1668: 1663: 1658: 1653: 1648: 1643: 1638: 1632: 1630: 1626: 1625: 1623: 1622: 1620:Ilya Sutskever 1617: 1612: 1607: 1602: 1597: 1592: 1587: 1585:Demis Hassabis 1582: 1577: 1575:Ian Goodfellow 1572: 1567: 1561: 1559: 1555: 1554: 1551: 1550: 1548: 1547: 1542: 1541: 1540: 1530: 1525: 1520: 1515: 1510: 1505: 1500: 1494: 1492: 1488: 1487: 1485: 1484: 1479: 1474: 1469: 1464: 1459: 1454: 1449: 1444: 1439: 1434: 1429: 1424: 1419: 1414: 1409: 1404: 1403: 1402: 1392: 1387: 1382: 1377: 1372: 1366: 1364: 1360: 1359: 1357: 1356: 1351: 1350: 1349: 1344: 1334: 1333: 1332: 1327: 1322: 1312: 1307: 1302: 1297: 1292: 1287: 1282: 1277: 1272: 1266: 1264: 1257: 1253: 1252: 1250: 1249: 1244: 1239: 1234: 1229: 1224: 1219: 1213: 1211: 1207: 1206: 1204: 1203: 1198: 1193: 1188: 1183: 1177: 1175: 1171: 1170: 1168: 1167: 1166: 1165: 1158:Language model 1155: 1150: 1145: 1144: 1143: 1133: 1132: 1131: 1120: 1118: 1114: 1113: 1111: 1110: 1108:Autoregression 1105: 1100: 1099: 1098: 1088: 1086:Regularization 1083: 1082: 1081: 1076: 1071: 1061: 1056: 1051: 1049:Loss functions 1046: 1041: 1036: 1031: 1026: 1025: 1024: 1014: 1009: 1008: 1007: 996: 994: 990: 989: 987: 986: 984:Inductive bias 981: 976: 971: 966: 961: 956: 951: 946: 938: 936: 930: 929: 924: 923: 916: 909: 901: 893: 892: 867: 842: 816: 795: 735: 678: 660: 630: 605: 577: 555: 542:10.7939/R39D8T 518: 498: 469: 460: 404: 396:Google Scholar 378: 377: 375: 372: 344: 341: 337:Google scholar 269:Elixir Studios 264: 261: 237:Demis Hassabis 231:He studied at 228: 225: 187: 186: 170: 166: 165: 162: 161: 150: 144: 143: 141:Elixir Studios 130: 126: 125: 123:Computer Games 104: 100: 99: 93: 92: 81: 77: 76: 63: 62:Known for 59: 58: 47: 43: 42: 39: 35: 34: 27: 24: 15: 9: 6: 4: 3: 2: 1888: 1877: 1874: 1872: 1869: 1867: 1864: 1862: 1859: 1857: 1854: 1852: 1849: 1847: 1844: 1842: 1839: 1837: 1836:Living people 1834: 1832: 1829: 1828: 1826: 1809: 1806: 1804: 1801: 1800: 1793: 1789: 1786: 1784: 1781: 1780: 1777: 1773: 1772: 1769: 1763: 1760: 1758: 1755: 1753: 1750: 1748: 1745: 1743: 1740: 1738: 1735: 1733: 1730: 1728: 1725: 1723: 1720: 1718: 1715: 1713: 1710: 1708: 1705: 1703: 1700: 1698: 1695: 1693: 1690: 1689: 1687: 1685:Architectures 1683: 1677: 1674: 1672: 1669: 1667: 1664: 1662: 1659: 1657: 1654: 1652: 1649: 1647: 1644: 1642: 1639: 1637: 1634: 1633: 1631: 1629:Organizations 1627: 1621: 1618: 1616: 1613: 1611: 1608: 1606: 1603: 1601: 1598: 1596: 1593: 1591: 1588: 1586: 1583: 1581: 1578: 1576: 1573: 1571: 1568: 1566: 1565:Yoshua Bengio 1563: 1562: 1560: 1556: 1546: 1545:Robot control 1543: 1539: 1536: 1535: 1534: 1531: 1529: 1526: 1524: 1521: 1519: 1516: 1514: 1511: 1509: 1506: 1504: 1501: 1499: 1496: 1495: 1493: 1489: 1483: 1480: 1478: 1475: 1473: 1470: 1468: 1465: 1463: 1462:Chinchilla AI 1460: 1458: 1455: 1453: 1450: 1448: 1445: 1443: 1440: 1438: 1435: 1433: 1430: 1428: 1425: 1423: 1420: 1418: 1415: 1413: 1410: 1408: 1405: 1401: 1398: 1397: 1396: 1393: 1391: 1388: 1386: 1383: 1381: 1378: 1376: 1373: 1371: 1368: 1367: 1365: 1361: 1355: 1352: 1348: 1345: 1343: 1340: 1339: 1338: 1335: 1331: 1328: 1326: 1323: 1321: 1318: 1317: 1316: 1313: 1311: 1308: 1306: 1303: 1301: 1298: 1296: 1293: 1291: 1288: 1286: 1283: 1281: 1278: 1276: 1273: 1271: 1268: 1267: 1265: 1261: 1258: 1254: 1248: 1245: 1243: 1240: 1238: 1235: 1233: 1230: 1228: 1225: 1223: 1220: 1218: 1215: 1214: 1212: 1208: 1202: 1199: 1197: 1194: 1192: 1189: 1187: 1184: 1182: 1179: 1178: 1176: 1172: 1164: 1161: 1160: 1159: 1156: 1154: 1151: 1149: 1146: 1142: 1141:Deep learning 1139: 1138: 1137: 1134: 1130: 1127: 1126: 1125: 1122: 1121: 1119: 1115: 1109: 1106: 1104: 1101: 1097: 1094: 1093: 1092: 1089: 1087: 1084: 1080: 1077: 1075: 1072: 1070: 1067: 1066: 1065: 1062: 1060: 1057: 1055: 1052: 1050: 1047: 1045: 1042: 1040: 1037: 1035: 1032: 1030: 1029:Hallucination 1027: 1023: 1020: 1019: 1018: 1015: 1013: 1010: 1006: 1003: 1002: 1001: 998: 997: 995: 991: 985: 982: 980: 977: 975: 972: 970: 967: 965: 962: 960: 957: 955: 952: 950: 947: 945: 944: 940: 939: 937: 935: 931: 922: 917: 915: 910: 908: 903: 902: 899: 881: 877: 871: 856: 852: 846: 831: 827: 824:Ormond, Jim. 820: 805: 799: 791: 787: 783: 779: 775: 771: 767: 763: 759: 755: 754: 749: 745: 739: 731: 727: 723: 719: 715: 711: 707: 703: 699: 695: 694: 689: 682: 674: 670: 664: 648: 644: 640: 634: 619: 615: 609: 601: 600:Prentice Hall 597: 596: 591: 587: 581: 573: 566: 559: 551: 547: 543: 539: 535: 531: 530: 522: 515: 511: 507: 502: 487: 483: 476: 474: 464: 456: 452: 448: 444: 440: 436: 432: 428: 424: 420: 419: 414: 413:Oriol Vinyals 408: 401: 397: 393: 388: 386: 384: 379: 371: 369: 365: 361: 357: 352: 350: 340: 338: 334: 332: 326: 324: 320: 316: 312: 308: 304: 300: 299:deep learning 296: 291: 289: 285: 281: 277: 272: 270: 260: 258: 254: 250: 246: 242: 238: 234: 224: 222: 218: 214: 210: 206: 202: 197: 193: 184: 179: 171: 167: 163: 156: 155: 151: 149: 145: 142: 138: 134: 131: 127: 124: 120: 116: 112: 108: 105: 101: 98: 94: 90: 85: 82: 78: 75: 71: 67: 64: 60: 56: 51: 48: 44: 40: 36: 31: 22: 19: 1651:Hugging Face 1615:David Silver 1614: 1263:Audio–visual 1117:Applications 1096:Augmentation 941: 883:. Retrieved 879: 870: 858:. Retrieved 845: 833:. Retrieved 819: 807:. Retrieved 798: 757: 751: 744:David Silver 743: 738: 697: 691: 688:David Silver 687: 681: 663: 651:. Retrieved 647:the original 642: 633: 621:. Retrieved 617: 608: 594: 590:Peter Norvig 580: 571: 558: 533: 528: 521: 506:David Silver 501: 491:26 September 489:. Retrieved 485: 480:Shead, Sam. 463: 422: 416: 407: 392:David Silver 353: 346: 330: 327: 292: 273: 266: 256: 230: 192:David Silver 191: 190: 176:.davidsilver 152: 129:Institutions 96: 25:David Silver 18: 1831:1976 births 1799:Categories 1747:Autoencoder 1702:Transformer 1570:Alex Graves 1518:OpenAI Five 1422:IBM Watsonx 1044:Convolution 1022:Overfitting 534:ualberta.ca 1825:Categories 1788:Technology 1641:EleutherAI 1600:Fei-Fei Li 1595:Yann LeCun 1508:Q-learning 1491:Decisional 1417:IBM Watson 1325:Midjourney 1217:TensorFlow 1064:Activation 1017:Regression 1012:Clustering 374:References 1671:MIT CSAIL 1636:Anthropic 1605:Andrew Ng 1503:AlphaZero 1347:VideoPoet 1310:AlphaFold 1247:MindSpore 1201:SpiNNaker 1196:Memristor 1103:Diffusion 1079:Rectifier 1059:Batchnorm 1039:Attention 1034:Adversary 885:3 January 790:Q28005460 774:1476-4687 748:Aja Huang 730:Q27907579 714:1476-4687 643:ucl.ac.uk 618:Wired.com 550:575410609 455:Q72988805 439:1476-4687 370:in 2022. 315:AlphaZero 227:Education 221:AlphaStar 217:AlphaZero 74:AlphaStar 70:AlphaZero 1779:Portals 1538:Auto-GPT 1370:Word2vec 1174:Hardware 1091:Datasets 993:Concepts 786:Wikidata 782:26819042 726:Wikidata 722:25719670 592:(2009). 451:Wikidata 447:31666705 280:lecturer 119:Planning 1661:Meta AI 1498:AlphaGo 1482:PanGu-Σ 1452:ChatGPT 1427:Granite 1375:Seq2seq 1354:Whisper 1275:WaveNet 1270:AlexNet 1242:Flux.jl 1222:PyTorch 1074:Sigmoid 1069:Softmax 934:General 835:2 April 830:acm.org 673:YouTube 508:at the 364:AlphaGo 307:AlphaGo 213:AlphaGo 169:Website 66:AlphaGo 1676:Huawei 1656:OpenAI 1558:People 1528:MuZero 1390:Gemini 1385:Claude 1320:DALL-E 1232:Theano 860:8 June 809:27 May 788: 780: 772: 753:Nature 728: 720: 712: 693:Nature 653:27 May 623:17 May 548: 453: 445: 437: 418:Nature 333:-index 159:(2009) 157: 148:Thesis 103:Fields 91:(2019) 86:(2011) 80:Awards 1742:Mamba 1513:SARSA 1477:LLaMA 1472:BLOOM 1457:GPT-J 1447:GPT-4 1442:GPT-3 1437:GPT-2 1432:GPT-1 1395:LaMDA 1227:Keras 568:(PDF) 323:shogi 319:chess 303:Atari 297:with 211:with 57:(PhD) 1666:Mila 1467:PaLM 1400:Bard 1380:BERT 1363:Text 1342:Sora 887:2024 880:AAAI 862:2021 837:2020 811:2017 778:PMID 770:ISSN 718:PMID 710:ISSN 655:2017 625:2016 546:OCLC 493:2020 443:PMID 435:ISSN 362:and 321:and 257:MoGo 52:(BA) 38:Born 1407:NMT 1290:OCR 1285:HWR 1237:JAX 1191:VPU 1186:TPU 1181:IPU 1005:SGD 762:doi 758:529 702:doi 698:518 538:doi 427:doi 423:575 282:at 247:on 245:PhD 196:FRS 178:.uk 174:www 30:FRS 1827:: 878:. 853:. 828:. 784:. 776:. 768:. 756:. 746:; 724:. 716:. 708:. 696:. 641:. 616:. 588:; 570:. 544:. 532:. 484:. 472:^ 449:. 441:. 433:. 421:. 382:^ 339:. 253:Go 223:. 215:, 920:e 913:t 906:v 889:. 864:. 839:. 813:. 792:. 764:: 732:. 704:: 675:. 657:. 627:. 602:. 574:. 552:. 540:: 495:. 457:. 429:: 331:h

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

Knowledge

David Silver (computer scientist)

Index