Knowledge

David Silver (computer scientist)

Source đź“ť

514: 400: 183: 1796: 1776: 1870: 1670: 367: 305:
games directly from pixels. Silver led the AlphaGo project, culminating in the first program to defeat a top professional player in the full-size game of Go.
803: 918: 1840: 415:; Igor Babuschkin; Wojciech M Czarnecki; et al. (30 October 2019). "Grandmaster level in StarCraft II using multi-agent reinforcement learning". 310: 1512: 1860: 317:, which used the same AI to learn to play Go from scratch (learning only by playing itself and not from human games) before learning to play 275: 83: 593: 1028: 826:"ACM Prize in Computing Awarded to AlphaGo Developer: David Silver Recognized for Breakthrough Advances in Computer Game-Playing" 911: 1875: 1855: 1701: 750:; Chris J. Maddison; et al. (27 January 2016). "Mastering the game of Go with deep neural networks and tree search". 668: 1802: 1353: 1090: 875: 829: 481: 638: 1241: 1048: 904: 118: 1569: 1850: 1756: 1696: 1294: 328:
Silver is among the most published members of staff at Google DeepMind, with over 200,000 citations and has an
232: 1845: 1289: 978: 527: 509: 564: 1731: 1128: 1085: 1038: 1033: 1782: 1078: 1004: 355: 195: 29: 1406: 1341: 942: 850: 1807: 1665: 1304: 1135: 958: 283: 204: 136: 1865: 1706: 963: 1751: 1736: 1389: 1384: 1284: 1152: 933: 106: 49: 271:, where he was CTO and lead programmer, receiving several awards for technology and innovation. 1835: 1711: 1471: 1190: 1185: 411: 348: 294: 248: 208: 114: 88: 1741: 1726: 1691: 1379: 1279: 1147: 825: 690:; et al. (25 February 2015). "Human-level control through deep reinforcement learning". 240: 54: 1609: 1830: 1761: 1716: 1162: 1107: 953: 948: 220: 73: 399: 8: 1336: 1314: 1063: 1058: 1016: 968: 685: 513: 391: 182: 286:. His lectures on Reinforcement Learning are available on YouTube. Silver consulted for 1721: 1299: 1787: 1775: 1579: 1231: 1102: 1095: 854: 777: 769: 717: 709: 585: 545: 442: 434: 613: 1532: 1522: 1329: 1123: 1073: 1068: 1011: 999: 785: 761: 752: 725: 701: 692: 537: 450: 426: 417: 110: 259:(co-authored with Sylvain Gelly) was one of the strongest Go programs as of 2009. 1645: 1589: 1411: 1053: 973: 742: 359: 287: 200: 132: 309:
subsequently received an honorary 9 Dan Professional Certification; and won the
1619: 1584: 1574: 1399: 1157: 983: 669:"RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning" 395: 336: 268: 236: 140: 482:"David Silver: The unsung hero and intellectual powerhouse at Google DeepMind" 430: 1824: 1564: 1544: 1461: 1140: 773: 713: 599: 549: 505: 438: 412: 298: 267:
After graduating from university, Silver co-founded the video games company
1650: 1481: 896: 781: 721: 646: 589: 446: 251:, where he co-introduced the algorithms used in the first master-level 9Ă—9 789: 729: 572:
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence
454: 235:, graduating in 1997 with the Addison-Wesley award, and having befriended 1871:
Fellows of the Association for the Advancement of Artificial Intelligence
1746: 1517: 1426: 1421: 1043: 1021: 122: 765: 705: 1640: 1599: 1594: 1507: 1416: 1324: 1236: 1216: 1635: 1604: 1502: 1346: 1309: 1246: 1200: 1195: 1180: 747: 314: 252: 216: 69: 153: 1537: 1369: 541: 325:
in the same way, to higher levels than any other computer program.
279: 851:"Royal Society elects outstanding new Fellows and Foreign Members" 1660: 1497: 1451: 1374: 1274: 1269: 1221: 672: 529:
Reinforcement Learning and Simulation-Based Search in Computer Go
363: 329: 306: 212: 154:
Reinforcement learning and simulation-based search in computer Go
65: 239:
whilst at Cambridge. Silver returned to academia in 2004 at the
1675: 1655: 1527: 1319: 147: 173: 1476: 1456: 1446: 1441: 1436: 1431: 1394: 1226: 584: 322: 318: 302: 804:"Google DeepMind AlphaGo in U.K. Wins Innovation Grand Prix" 606: 1466: 614:"What the AI Behind AlphaGo Can Teach Us About Being Human" 467:
The Cambridge University List of Members up to 31 July 1998
368:
Association for the Advancement of Artificial Intelligence
562: 244: 255:
programs and graduated in 2009. His version of program
351:
for breakthrough advances in computer game-playing.
565:"Achieving Master Level Play in 9 Ă— 9 Computer Go" 199:(born 1976) is a principal research scientist at 1822: 290:from its inception, joining full-time in 2013. 912: 475: 473: 926: 276:Royal Society University Research Fellowship 84:Royal Society University Research Fellowship 313:for innovation. He then led development of 919: 905: 679: 595:Artificial Intelligence: A Modern Approach 512: 470: 398: 301:, including a program that learns to play 181: 293:His recent work has focused on combining 499: 1861:Academics of University College London 1823: 525: 262: 1841:Alumni of Christ's College, Cambridge 900: 536:(PhD thesis). University of Alberta. 342: 1757:Generative adversarial network (GAN) 563:Sylvain Gelly; David Silver (2008). 405: 387: 385: 383: 686:Volodymyr Mnih; Koray Kavukcuoglu; 278:in 2011, and subsequently became a 13: 823: 736: 14: 1887: 479: 380: 366:. He was elected a Fellow of the 16:Computer scientist and researcher 1795: 1794: 1774: 868: 843: 817: 796: 661: 631: 358:(FRS) for his contributions to 1707:Recurrent neural network (RNN) 1697:Differentiable neural computer 578: 556: 519: 461: 1: 1752:Variational autoencoder (VAE) 1712:Long short-term memory (LSTM) 979:Computational learning theory 510:Mathematics Genealogy Project 373: 1876:Fellows of the Royal Society 1856:University of Alberta alumni 1732:Convolutional neural network 354:In 2021, Silver was elected 347:Silver was awarded the 2019 226: 7: 1727:Multilayer perceptron (MLP) 356:Fellow of the Royal Society 233:Christ's College, Cambridge 41:1976 (age 47–48) 10: 1892: 1803:Artificial neural networks 1717:Gated recurrent unit (GRU) 943:Differentiable programming 671:. 13 May 2015 – via 1770: 1684: 1628: 1557: 1490: 1362: 1262: 1255: 1209: 1173: 1136:Artificial neural network 1116: 992: 959:Automatic differentiation 932: 431:10.1038/S41586-019-1724-Z 284:University College London 207:. He has led research on 205:University College London 168: 164: 146: 137:University College London 128: 102: 95: 79: 61: 45: 37: 23: 964:Neuromorphic engineering 927:Differentiable computing 394:publications indexed by 1737:Residual neural network 1153:Artificial Intelligence 107:Artificial intelligence 50:University of Cambridge 876:"Elected AAAI Fellows" 526:Silver, David (2009). 349:ACM Prize in Computing 295:reinforcement learning 249:reinforcement learning 209:reinforcement learning 115:Reinforcement learning 89:ACM Prize in Computing 1851:Go (game) researchers 1692:Neural Turing machine 1280:Human image synthesis 639:"CSML | David Silver" 274:Silver was awarded a 241:University of Alberta 55:University of Alberta 1846:Computer programmers 1783:Computer programming 1762:Graph neural network 1337:Text-to-video models 1315:Text-to-image models 1163:Large language model 1148:Scientific computing 954:Statistical manifold 949:Information geometry 1129:In-context learning 969:Pattern recognition 766:10.1038/NATURE16961 706:10.1038/NATURE14236 486:businessinsider.com 335:of 93 according to 263:Career and research 203:and a professor at 1722:Echo state network 1610:JĂĽrgen Schmidhuber 1305:Facial recognition 1300:Speech recognition 1210:Software libraries 343:Awards and honours 1818: 1817: 1580:Stephen Grossberg 1553: 1552: 760:(7587): 484–489. 700:(7540): 529–533. 586:Stuart J. Russell 425:(7782): 350–354. 311:Cannes Lion award 189: 188: 97:Scientific career 1883: 1808:Machine learning 1798: 1797: 1778: 1533:Action selection 1523:Self-driving car 1330:Stable Diffusion 1295:Speech synthesis 1260: 1259: 1124:Machine learning 1000:Gradient descent 921: 914: 907: 898: 897: 891: 890: 888: 886: 872: 866: 865: 863: 861: 855:royalsociety.org 847: 841: 840: 838: 836: 821: 815: 814: 812: 810: 800: 794: 793: 740: 734: 733: 683: 677: 676: 665: 659: 658: 656: 654: 649:on 24 April 2021 645:. Archived from 635: 629: 628: 626: 624: 610: 604: 603: 598:(3rd ed.). 582: 576: 575: 569: 560: 554: 553: 523: 517: 516: 503: 497: 496: 494: 492: 477: 468: 465: 459: 458: 409: 403: 402: 389: 198: 185: 180: 177: 175: 160: 111:Machine learning 32: 21: 20: 1891: 1890: 1886: 1885: 1884: 1882: 1881: 1880: 1866:Google DeepMind 1821: 1820: 1819: 1814: 1766: 1680: 1646:Google DeepMind 1624: 1590:Geoffrey Hinton 1549: 1486: 1412:Project Debater 1358: 1256:Implementations 1251: 1205: 1169: 1112: 1054:Backpropagation 988: 974:Tensor calculus 928: 925: 895: 894: 884: 882: 874: 873: 869: 859: 857: 849: 848: 844: 834: 832: 822: 818: 808: 806: 802: 801: 797: 741: 737: 684: 680: 667: 666: 662: 652: 650: 637: 636: 632: 622: 620: 612: 611: 607: 583: 579: 567: 561: 557: 524: 520: 504: 500: 490: 488: 478: 471: 466: 462: 410: 406: 390: 381: 376: 360:Deep Q-Networks 345: 288:Google DeepMind 265: 243:to study for a 229: 219:and co-lead on 201:Google DeepMind 194: 172: 158: 139: 135: 133:Google Deepmind 121: 117: 113: 109: 87: 72: 68: 53: 46:Alma mater 33: 28: 26: 17: 12: 11: 5: 1889: 1879: 1878: 1873: 1868: 1863: 1858: 1853: 1848: 1843: 1838: 1833: 1816: 1815: 1813: 1812: 1811: 1810: 1805: 1792: 1791: 1790: 1785: 1771: 1768: 1767: 1765: 1764: 1759: 1754: 1749: 1744: 1739: 1734: 1729: 1724: 1719: 1714: 1709: 1704: 1699: 1694: 1688: 1686: 1682: 1681: 1679: 1678: 1673: 1668: 1663: 1658: 1653: 1648: 1643: 1638: 1632: 1630: 1626: 1625: 1623: 1622: 1620:Ilya Sutskever 1617: 1612: 1607: 1602: 1597: 1592: 1587: 1585:Demis Hassabis 1582: 1577: 1575:Ian Goodfellow 1572: 1567: 1561: 1559: 1555: 1554: 1551: 1550: 1548: 1547: 1542: 1541: 1540: 1530: 1525: 1520: 1515: 1510: 1505: 1500: 1494: 1492: 1488: 1487: 1485: 1484: 1479: 1474: 1469: 1464: 1459: 1454: 1449: 1444: 1439: 1434: 1429: 1424: 1419: 1414: 1409: 1404: 1403: 1402: 1392: 1387: 1382: 1377: 1372: 1366: 1364: 1360: 1359: 1357: 1356: 1351: 1350: 1349: 1344: 1334: 1333: 1332: 1327: 1322: 1312: 1307: 1302: 1297: 1292: 1287: 1282: 1277: 1272: 1266: 1264: 1257: 1253: 1252: 1250: 1249: 1244: 1239: 1234: 1229: 1224: 1219: 1213: 1211: 1207: 1206: 1204: 1203: 1198: 1193: 1188: 1183: 1177: 1175: 1171: 1170: 1168: 1167: 1166: 1165: 1158:Language model 1155: 1150: 1145: 1144: 1143: 1133: 1132: 1131: 1120: 1118: 1114: 1113: 1111: 1110: 1108:Autoregression 1105: 1100: 1099: 1098: 1088: 1086:Regularization 1083: 1082: 1081: 1076: 1071: 1061: 1056: 1051: 1049:Loss functions 1046: 1041: 1036: 1031: 1026: 1025: 1024: 1014: 1009: 1008: 1007: 996: 994: 990: 989: 987: 986: 984:Inductive bias 981: 976: 971: 966: 961: 956: 951: 946: 938: 936: 930: 929: 924: 923: 916: 909: 901: 893: 892: 867: 842: 816: 795: 735: 678: 660: 630: 605: 577: 555: 542:10.7939/R39D8T 518: 498: 469: 460: 404: 396:Google Scholar 378: 377: 375: 372: 344: 341: 337:Google scholar 269:Elixir Studios 264: 261: 237:Demis Hassabis 231:He studied at 228: 225: 187: 186: 170: 166: 165: 162: 161: 150: 144: 143: 141:Elixir Studios 130: 126: 125: 123:Computer Games 104: 100: 99: 93: 92: 81: 77: 76: 63: 62:Known for 59: 58: 47: 43: 42: 39: 35: 34: 27: 24: 15: 9: 6: 4: 3: 2: 1888: 1877: 1874: 1872: 1869: 1867: 1864: 1862: 1859: 1857: 1854: 1852: 1849: 1847: 1844: 1842: 1839: 1837: 1836:Living people 1834: 1832: 1829: 1828: 1826: 1809: 1806: 1804: 1801: 1800: 1793: 1789: 1786: 1784: 1781: 1780: 1777: 1773: 1772: 1769: 1763: 1760: 1758: 1755: 1753: 1750: 1748: 1745: 1743: 1740: 1738: 1735: 1733: 1730: 1728: 1725: 1723: 1720: 1718: 1715: 1713: 1710: 1708: 1705: 1703: 1700: 1698: 1695: 1693: 1690: 1689: 1687: 1685:Architectures 1683: 1677: 1674: 1672: 1669: 1667: 1664: 1662: 1659: 1657: 1654: 1652: 1649: 1647: 1644: 1642: 1639: 1637: 1634: 1633: 1631: 1629:Organizations 1627: 1621: 1618: 1616: 1613: 1611: 1608: 1606: 1603: 1601: 1598: 1596: 1593: 1591: 1588: 1586: 1583: 1581: 1578: 1576: 1573: 1571: 1568: 1566: 1565:Yoshua Bengio 1563: 1562: 1560: 1556: 1546: 1545:Robot control 1543: 1539: 1536: 1535: 1534: 1531: 1529: 1526: 1524: 1521: 1519: 1516: 1514: 1511: 1509: 1506: 1504: 1501: 1499: 1496: 1495: 1493: 1489: 1483: 1480: 1478: 1475: 1473: 1470: 1468: 1465: 1463: 1462:Chinchilla AI 1460: 1458: 1455: 1453: 1450: 1448: 1445: 1443: 1440: 1438: 1435: 1433: 1430: 1428: 1425: 1423: 1420: 1418: 1415: 1413: 1410: 1408: 1405: 1401: 1398: 1397: 1396: 1393: 1391: 1388: 1386: 1383: 1381: 1378: 1376: 1373: 1371: 1368: 1367: 1365: 1361: 1355: 1352: 1348: 1345: 1343: 1340: 1339: 1338: 1335: 1331: 1328: 1326: 1323: 1321: 1318: 1317: 1316: 1313: 1311: 1308: 1306: 1303: 1301: 1298: 1296: 1293: 1291: 1288: 1286: 1283: 1281: 1278: 1276: 1273: 1271: 1268: 1267: 1265: 1261: 1258: 1254: 1248: 1245: 1243: 1240: 1238: 1235: 1233: 1230: 1228: 1225: 1223: 1220: 1218: 1215: 1214: 1212: 1208: 1202: 1199: 1197: 1194: 1192: 1189: 1187: 1184: 1182: 1179: 1178: 1176: 1172: 1164: 1161: 1160: 1159: 1156: 1154: 1151: 1149: 1146: 1142: 1141:Deep learning 1139: 1138: 1137: 1134: 1130: 1127: 1126: 1125: 1122: 1121: 1119: 1115: 1109: 1106: 1104: 1101: 1097: 1094: 1093: 1092: 1089: 1087: 1084: 1080: 1077: 1075: 1072: 1070: 1067: 1066: 1065: 1062: 1060: 1057: 1055: 1052: 1050: 1047: 1045: 1042: 1040: 1037: 1035: 1032: 1030: 1029:Hallucination 1027: 1023: 1020: 1019: 1018: 1015: 1013: 1010: 1006: 1003: 1002: 1001: 998: 997: 995: 991: 985: 982: 980: 977: 975: 972: 970: 967: 965: 962: 960: 957: 955: 952: 950: 947: 945: 944: 940: 939: 937: 935: 931: 922: 917: 915: 910: 908: 903: 902: 899: 881: 877: 871: 856: 852: 846: 831: 827: 824:Ormond, Jim. 820: 805: 799: 791: 787: 783: 779: 775: 771: 767: 763: 759: 755: 754: 749: 745: 739: 731: 727: 723: 719: 715: 711: 707: 703: 699: 695: 694: 689: 682: 674: 670: 664: 648: 644: 640: 634: 619: 615: 609: 601: 600:Prentice Hall 597: 596: 591: 587: 581: 573: 566: 559: 551: 547: 543: 539: 535: 531: 530: 522: 515: 511: 507: 502: 487: 483: 476: 474: 464: 456: 452: 448: 444: 440: 436: 432: 428: 424: 420: 419: 414: 413:Oriol Vinyals 408: 401: 397: 393: 388: 386: 384: 379: 371: 369: 365: 361: 357: 352: 350: 340: 338: 334: 332: 326: 324: 320: 316: 312: 308: 304: 300: 299:deep learning 296: 291: 289: 285: 281: 277: 272: 270: 260: 258: 254: 250: 246: 242: 238: 234: 224: 222: 218: 214: 210: 206: 202: 197: 193: 184: 179: 171: 167: 163: 156: 155: 151: 149: 145: 142: 138: 134: 131: 127: 124: 120: 116: 112: 108: 105: 101: 98: 94: 90: 85: 82: 78: 75: 71: 67: 64: 60: 56: 51: 48: 44: 40: 36: 31: 22: 19: 1651:Hugging Face 1615:David Silver 1614: 1263:Audio–visual 1117:Applications 1096:Augmentation 941: 883:. Retrieved 879: 870: 858:. Retrieved 845: 833:. Retrieved 819: 807:. Retrieved 798: 757: 751: 744:David Silver 743: 738: 697: 691: 688:David Silver 687: 681: 663: 651:. Retrieved 647:the original 642: 633: 621:. Retrieved 617: 608: 594: 590:Peter Norvig 580: 571: 558: 533: 528: 521: 506:David Silver 501: 491:26 September 489:. Retrieved 485: 480:Shead, Sam. 463: 422: 416: 407: 392:David Silver 353: 346: 330: 327: 292: 273: 266: 256: 230: 192:David Silver 191: 190: 176:.davidsilver 152: 129:Institutions 96: 25:David Silver 18: 1831:1976 births 1799:Categories 1747:Autoencoder 1702:Transformer 1570:Alex Graves 1518:OpenAI Five 1422:IBM Watsonx 1044:Convolution 1022:Overfitting 534:ualberta.ca 1825:Categories 1788:Technology 1641:EleutherAI 1600:Fei-Fei Li 1595:Yann LeCun 1508:Q-learning 1491:Decisional 1417:IBM Watson 1325:Midjourney 1217:TensorFlow 1064:Activation 1017:Regression 1012:Clustering 374:References 1671:MIT CSAIL 1636:Anthropic 1605:Andrew Ng 1503:AlphaZero 1347:VideoPoet 1310:AlphaFold 1247:MindSpore 1201:SpiNNaker 1196:Memristor 1103:Diffusion 1079:Rectifier 1059:Batchnorm 1039:Attention 1034:Adversary 885:3 January 790:Q28005460 774:1476-4687 748:Aja Huang 730:Q27907579 714:1476-4687 643:ucl.ac.uk 618:Wired.com 550:575410609 455:Q72988805 439:1476-4687 370:in 2022. 315:AlphaZero 227:Education 221:AlphaStar 217:AlphaZero 74:AlphaStar 70:AlphaZero 1779:Portals 1538:Auto-GPT 1370:Word2vec 1174:Hardware 1091:Datasets 993:Concepts 786:Wikidata 782:26819042 726:Wikidata 722:25719670 592:(2009). 451:Wikidata 447:31666705 280:lecturer 119:Planning 1661:Meta AI 1498:AlphaGo 1482:PanGu-ÎŁ 1452:ChatGPT 1427:Granite 1375:Seq2seq 1354:Whisper 1275:WaveNet 1270:AlexNet 1242:Flux.jl 1222:PyTorch 1074:Sigmoid 1069:Softmax 934:General 835:2 April 830:acm.org 673:YouTube 508:at the 364:AlphaGo 307:AlphaGo 213:AlphaGo 169:Website 66:AlphaGo 1676:Huawei 1656:OpenAI 1558:People 1528:MuZero 1390:Gemini 1385:Claude 1320:DALL-E 1232:Theano 860:8 June 809:27 May 788:  780:  772:  753:Nature 728:  720:  712:  693:Nature 653:27 May 623:17 May 548:  453:  445:  437:  418:Nature 333:-index 159:(2009) 157:  148:Thesis 103:Fields 91:(2019) 86:(2011) 80:Awards 1742:Mamba 1513:SARSA 1477:LLaMA 1472:BLOOM 1457:GPT-J 1447:GPT-4 1442:GPT-3 1437:GPT-2 1432:GPT-1 1395:LaMDA 1227:Keras 568:(PDF) 323:shogi 319:chess 303:Atari 297:with 211:with 57:(PhD) 1666:Mila 1467:PaLM 1400:Bard 1380:BERT 1363:Text 1342:Sora 887:2024 880:AAAI 862:2021 837:2020 811:2017 778:PMID 770:ISSN 718:PMID 710:ISSN 655:2017 625:2016 546:OCLC 493:2020 443:PMID 435:ISSN 362:and 321:and 257:MoGo 52:(BA) 38:Born 1407:NMT 1290:OCR 1285:HWR 1237:JAX 1191:VPU 1186:TPU 1181:IPU 1005:SGD 762:doi 758:529 702:doi 698:518 538:doi 427:doi 423:575 282:at 247:on 245:PhD 196:FRS 178:.uk 174:www 30:FRS 1827:: 878:. 853:. 828:. 784:. 776:. 768:. 756:. 746:; 724:. 716:. 708:. 696:. 641:. 616:. 588:; 570:. 544:. 532:. 484:. 472:^ 449:. 441:. 433:. 421:. 382:^ 339:. 253:Go 223:. 215:, 920:e 913:t 906:v 889:. 864:. 839:. 813:. 792:. 764:: 732:. 704:: 675:. 657:. 627:. 602:. 574:. 552:. 540:: 495:. 457:. 429:: 331:h

Index

FRS
University of Cambridge
University of Alberta
AlphaGo
AlphaZero
AlphaStar
Royal Society University Research Fellowship
ACM Prize in Computing
Artificial intelligence
Machine learning
Reinforcement learning
Planning
Computer Games
Google Deepmind
University College London
Elixir Studios
Thesis
Reinforcement learning and simulation-based search in computer Go
www.davidsilver.uk
Edit this at Wikidata
FRS
Google DeepMind
University College London
reinforcement learning
AlphaGo
AlphaZero
AlphaStar
Christ's College, Cambridge
Demis Hassabis
University of Alberta

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

↑