Data (computer science)

In computer science, data (treated as singular, plural, or as a mass noun) is any sequence of one or more symbols; datum is a single symbol of data. Data requires interpretation to become information. Digital data is data that is represented using the binary number system of ones (1) and zeros (0), instead of analog representation. In modern (post-1960) computer systems, all data is digital.

Data exists in three states: data at rest, data in transit and data in use. Data within a computer, in most cases, moves as parallel data. Data moving to or from a computer, in most cases, moves as serial data. Data sourced from an analog device, such as a temperature sensor, may be converted to digital using an analog-to-digital converter. Data representing quantities, characters, or symbols on which operations are performed by a computer are stored and recorded on magnetic, optical, electronic, or mechanical recording media, and transmitted in the form of digital electrical or optical signals. Data pass in and out of computers via peripheral devices.

Physical computer memory elements consist of an address and a byte/word of data storage. Digital data are often stored in relational databases, like tables or SQL databases, and can generally be represented as abstract key/value pairs. Data can be organized in many different types of data structures, including arrays, graphs, and objects. Data structures can store data of many different types, including numbers, strings and even other data structures.

Figure: Various types of data which can be visualized through a computer device.

Characteristics

Metadata helps translate data to information. Metadata is data about the data. Metadata may be implied, specified or given.

Data relating to physical events or processes will have a temporal component. This temporal component may be implied. This is the case when a device such as a temperature logger receives data from a temperature sensor. When the temperature is received it is assumed that the data has a temporal reference of now. So the device records the date, time and temperature together. When the data logger communicates temperatures, it must also report the date and time as metadata for each temperature reading.

Fundamentally, computers follow a sequence of instructions they are given in the form of data. A set of instructions to perform a given task (or tasks) is called a program. A program is data in the form of coded instructions to control the operation of a computer or other machine. In the nominal case, the program, as executed by the computer, will consist of machine code. The elements of storage manipulated by the program, but not actually executed by the central processing unit (CPU), are also data. At its most essential, a single datum is a value stored at a specific location. Therefore, it is possible for computer programs to operate on other computer programs, by manipulating their programmatic data.

To store data bytes in a file, they have to be serialized in a file format. Typically, programs are stored in special file types, different from those used for other data. Executable files contain programs; all other files are also data files. However, executable files may also contain data used by the program which is built into the program. In particular, some executable files have a data segment, which nominally contains constants and initial values for variables, both of which can be considered data.

The line between program and data can become blurry. An interpreter, for example, is a program. The input data to an interpreter is itself a program, just not one expressed in native machine language. In many cases, the interpreted program will be a human-readable text file, which is manipulated with a text editor program. Metaprogramming similarly involves programs manipulating other programs as data. Programs like compilers, linkers, debuggers, program updaters, virus scanners and such use other programs as their data.

For example, a user might first instruct the operating system to load a word processor program from one file, and then use the running program to open and edit a document stored in another file. In this example, the document would be considered data. If the word processor also features a spell checker, then the dictionary (word list) for the spell checker would also be considered data. The algorithms used by the spell checker to suggest corrections would be either machine code data or text in some interpretable programming language.

In an alternate usage, binary files (which are not human-readable) are sometimes called data as distinguished from human-readable text.

The total amount of digital data in 2007 was estimated to be 281 billion gigabytes (281 exabytes).

Data keys and values, structures and persistence

Keys in data provide the context for values. Regardless of the structure of data, there is always a key component present. Keys in data and data-structures are essential for giving meaning to data values. Without a key that is directly or indirectly associated with a value, or collection of values in a structure, the values become meaningless and cease to be data. That is to say, there has to be a key component linked to a value component in order for it to be considered data.

Data can be represented in computers in multiple ways, as per the following examples:

RAM

Random access memory (RAM) holds data that the CPU has direct access to. A CPU may only manipulate data within its processor registers or memory. This is as opposed to data storage, where the CPU must direct the transfer of data between the storage device (disk, tape...) and memory. RAM is an array of linear contiguous locations that a processor may read or write by providing an address for the read or write operation. The processor may operate on any location in memory at any time in any order. In RAM the smallest element of data is the binary bit. The capabilities and limitations of accessing RAM are processor specific. In general main memory is arranged as an array of locations beginning at address 0 (hexadecimal 0). Each location can store usually 8 or 32 bits depending on the computer architecture.

Keys

Data keys need not be a direct hardware address in memory. Indirect, abstract and logical key codes can be stored in association with values to form a data structure. Data structures have predetermined offsets (or links or paths) from the start of the structure, in which data values are stored. Therefore, the data key consists of the key to the structure plus the offset (or links or paths) into the structure. When such a structure is repeated, storing variations of the data values and the data keys within the same repeating structure, the result can be considered to resemble a table, in which each element of the repeating structure is considered to be a column and each repetition of the structure is considered as a row of the table. In such an organization of data, the data key is usually a value in one (or a composite of the values in several) of the columns.

Organised recurring data structures

The tabular view of repeating data structures is only one of many possibilities. Repeating data structures can be organised hierarchically, such that nodes are linked to each other in a cascade of parent-child relationships. Values and potentially more complex data-structures are linked to the nodes. Thus the nodal hierarchy provides the key for addressing the data structures associated with the nodes. This representation can be thought of as an inverted tree. Modern computer operating system file systems are a common example; and XML is another.

Sorted or ordered data

Data has some inherent features when it is sorted on a key. All the values for subsets of the key appear together. When passing sequentially through groups of the data with the same key, or a subset of the key changes, this is referred to in data processing circles as a break, or a control break. It particularly facilitates the aggregation of data values on subsets of a key.

Peripheral storage

Until the advent of bulk non-volatile memory like flash, persistent data storage was traditionally achieved by writing the data to external block devices like magnetic tape and disk drives. These devices typically seek to a location on the magnetic media and then read or write blocks of data of a predetermined size. In this case, the seek location on the media is the data key and the blocks are the data values. Early raw disk data file-systems or disc operating systems reserved contiguous blocks on the disc drive for data files. In those systems, the files could be filled up, running out of data space before all the data had been written to them. Thus much unused data space was reserved unproductively to ensure adequate free space for each file. Later file-systems introduced partitions. They reserved blocks of disc data space for partitions and used the allocated blocks more economically, by dynamically assigning blocks of a partition to a file as needed. To achieve this, the file system had to keep track of which blocks were used or unused by data files in a catalog or file allocation table. Though this made better use of the disc data space, it resulted in fragmentation of files across the disc, and a concomitant performance overhead due to additional seek time to read the data. Modern file systems reorganize fragmented files dynamically to optimize file access times. Further developments in file systems resulted in virtualization of disc drives, i.e. where a logical drive can be defined as partitions from a number of physical drives.

Indexed data

Retrieving a small subset of data from a much larger set may imply inefficiently searching through the data sequentially. Indexes are a way to copy out keys and location addresses from data structures in files, tables and data sets, then organize them using inverted tree structures to reduce the time taken to retrieve a subset of the original data. In order to do this, the key of the subset of data to be retrieved must be known before retrieval begins. The most popular indexes are the B-tree and the dynamic hash key indexing methods. Indexing is overhead for filing and retrieving data. There are other ways of organizing indexes, e.g. sorting the keys and using a binary search algorithm.

Abstraction and indirection

Object-oriented programming uses two basic concepts for understanding data and software: the taxonomic rank-structure of classes, which is an example of a hierarchical data structure; and, at run time, the creation of references to in-memory data-structures of objects that have been instantiated from a class library.

It is only after instantiation that an object of a specified class exists. After an object's reference is cleared, the object also ceases to exist. The memory locations where the object's data was stored are garbage and are reclassified as unused memory available for reuse.

Database data

The advent of databases introduced a further layer of abstraction for persistent data storage. Databases use metadata, and a structured query language protocol between client and server systems, communicating over a computer network, using a two phase commit logging system to ensure transactional completeness, when saving data.

Parallel distributed data processing

Modern scalable and high-performance data persistence technologies, such as Apache Hadoop, rely on massively parallel distributed data processing across many commodity computers on a high bandwidth network. In such systems, the data is distributed across multiple computers and therefore any particular computer in the system must be represented in the key of the data, either directly, or indirectly. This enables the differentiation between two identical sets of data, each being processed on a different computer at the same time.

Index

Data (disambiguation)
Datum (disambiguation)
Data

computer science
mass noun
symbols
interpretation
information
Digital data
binary number
analog
data at rest
data in transit
data in use
moves as parallel data
moves as serial data
analog-to-digital converter
quantities
computer
stored
recorded
magnetic
optical
transmitted
peripheral devices
computer memory
relational databases
tables
data structures
