Knowledge

Raw data

Source 📝

142: 314:, this raw data may indicate the particular items that each customer buys, when they buy them, and at what price; as well, an analyst or manager could calculate the average total sales per customer or the average expenditure per day of the week by hour. This processed and analyzed data provides information for the manager, that the manager could then use to help her determine, for example, how many cashiers to hire and at what times. Such 45: 360:, meaning that everyone should demand that governments and businesses share the data they collect as raw data. He points out that "data drives a huge amount of what happens in our lives… because somebody takes the data and does something with it." To Berners-Lee, it is essentially from this sharing of raw data, that advances in science will emerge. Advocates of 226:
Data has two ways of being created or made. The first is what is called 'captured data', and is found through purposeful investigation or analysis. The second is called 'exhaust data', and is gathered usually by machines or terminals as a secondary function. For example, cash registers, smartphones,
177:
which records the temperature of a chemical mixture in a test tube every minute, the list of temperature readings for every minute, as printed out on a spreadsheet or viewed on a computer screen are "raw data". Raw data have not been subjected to processing, "cleaning" by researchers to remove
306:) in a busy supermarket collects huge volumes of raw data each day about customers' purchases. However, this list of grocery items and their prices and the time and date of purchase does not yield much information until it is processed. Once processed and analyzed by a 271:, to make it easier for computers and humans to interpret during later processing. Raw data (sometimes colloquially called "sources" data or "eggy" data, the latter a reference to the data being "uncooked", that is, "unprocessed", like a raw 202:), because even once raw data have been "cleaned" and processed by one team of researchers, another team may consider these processed data to be "raw data" for another stage of research. Raw data can be inputted to a 263:. For example, a data input sheet might contain dates as raw data in many forms: "31st January 1999", "31/01/1999", "31/1/99", "31 Jan", or "today". Once captured, this raw data may be 291:
processing. Raw data that has undergone processing are sometimes referred to as "cooked" data in a colloquial sense. Although raw data has the potential to be transformed into "
194:
result). As well, raw data have not been subject to any other manipulation by a software program or a human researcher, analyst or technician. They are also referred to as
231:
serve a main function but may collect data as a secondary task. Exhaustive data is usually too large or of little use to process and becomes 'transient' or thrown away.
243:, raw data may have the following attributes: it may possibly contain human, machine, or instrument errors, it may not be validated; it might be in different area ( 364:
argue that once citizens and civil society organizations have access to data from businesses and governments, it will enable citizens and NGOs to do their
162:(e.g., numbers, instrument readings, figures, etc.) collected from a source. In the context of examinations, the raw data might be described as a 295:," extraction, organization, analysis, and formatting for presentation are required before raw data can be transformed into usable information. 368:
analysis of the data, which can empower people and civil society. For example, a government may claim that its policies are reducing the
330:, which enables the raw data to become accessible for further processing and analysis in any number of different ways. 109: 128: 81: 380:
do their own analysis of the raw data, which may lead this group to draw different conclusions about the data set.
88: 66: 432: 345: 357: 95: 353: 218:
data on electronic storage devices, such as hard disk drives (also referred to as "low-level data").
77: 62: 17: 55: 299: 182:, obvious instrument reading errors or data entry errors, or any analysis (e.g., determining 145:
The two columns to the right of the left-most column in this computerized table are raw data.
465: 27:"Primary data" redirects here. For data that has been created at the time under study, see 8: 455: 211: 102: 460: 369: 307: 203: 183: 436: 341: 333: 264: 187: 389: 337: 326:
campaign. As a result of processing, raw data sometimes ends up being put in a
28: 449: 349: 303: 215: 141: 377: 256: 34:
A collection of information which has not been fully processed or analyzed
292: 268: 228: 174: 311: 244: 207: 167: 361: 323: 240: 44: 327: 275:) are the data input to processing. A distinction is made between 260: 373: 252: 179: 191: 322:
for further processing, for example as part of a predictive
248: 199: 158: 340:) argues that sharing raw data is important for society. 251:
or unformatted; or some entries might be "suspect" (e.g.,
272: 310:
or even by a researcher using a pen and paper and a
69:. Unsourced material may be challenged and removed. 447: 206:or used in manual procedures such as analyzing 442:Tim Berners-Lee Gives the Web a New Definition 376:advocacy group may be able to have its staff 433:Give Us the Data Raw, and Give it to Us Now 267:stored as a normalized format, perhaps a 129:Learn how and when to remove this message 435:- the blog post from Rufus Pollock that 283:, to the effect that information is the 140: 413: 214:. The term "raw data" can refer to the 198:data. Raw data is a relative term (see 14: 448: 173:If a scientist sets up a computerized 409: 407: 405: 67:adding citations to reliable sources 38: 24: 426: 221: 25: 477: 418:. United States: Sage. p. 6. 402: 43: 54:needs additional citations for 302:(POS terminal, a computerized 13: 1: 395: 7: 383: 234: 10: 482: 26: 354:Open Knowledge Foundation 356:his call to action is 300:point-of-sale terminal 146: 414:Kitchin, Rob (2014). 144: 186:aspects such as the 63:improve this article 416:The Data Revolution 318:could then become 147: 370:unemployment rate 336:(inventor of the 139: 138: 131: 113: 16:(Redirected from 473: 420: 419: 411: 308:software program 204:computer program 184:central tendency 152:, also known as 134: 127: 123: 120: 114: 112: 71: 47: 39: 21: 481: 480: 476: 475: 474: 472: 471: 470: 446: 445: 439:Tim Berners-Lee 429: 427:Further reading 424: 423: 412: 403: 398: 386: 378:econometricians 334:Tim Berners-Lee 298:For example, a 237: 224: 222:Generating data 135: 124: 118: 115: 72: 70: 60: 48: 35: 32: 23: 22: 15: 12: 11: 5: 479: 469: 468: 463: 458: 444: 443: 440: 428: 425: 422: 421: 400: 399: 397: 394: 393: 392: 390:Standard score 385: 382: 358:"Raw Data Now" 338:World Wide Web 236: 233: 223: 220: 137: 136: 51: 49: 42: 33: 29:Primary source 9: 6: 4: 3: 2: 478: 467: 464: 462: 459: 457: 454: 453: 451: 441: 438: 434: 431: 430: 417: 410: 408: 406: 401: 391: 388: 387: 381: 379: 375: 371: 367: 363: 359: 355: 351: 350:Rufus Pollock 347: 343: 339: 335: 331: 329: 325: 321: 317: 313: 309: 305: 304:cash register 301: 296: 294: 290: 286: 282: 278: 274: 270: 266: 262: 258: 255:), requiring 254: 250: 246: 242: 232: 230: 219: 217: 213: 209: 205: 201: 197: 193: 189: 185: 181: 176: 171: 169: 165: 161: 160: 155: 151: 143: 133: 130: 122: 119:December 2009 111: 108: 104: 101: 97: 94: 90: 87: 83: 80: –  79: 75: 74:Find sources: 68: 64: 58: 57: 52:This article 50: 46: 41: 40: 37: 30: 19: 415: 365: 332: 319: 315: 297: 288: 284: 280: 276: 257:confirmation 238: 229:speedometers 225: 195: 172: 163: 157: 154:primary data 153: 149: 148: 125: 116: 106: 99: 92: 85: 73: 61:Please help 56:verification 53: 36: 466:Information 316:information 293:information 287:product of 281:information 269:Julian date 247:) formats; 175:thermometer 168:test scores 456:Data types 450:Categories 396:References 312:calculator 245:colloquial 208:statistics 89:newspapers 78:"Raw data" 362:open data 324:marketing 265:processed 241:computing 164:raw score 461:Research 437:inspired 384:See also 372:, but a 342:Inspired 328:database 261:citation 253:outliers 235:Examples 180:outliers 150:Raw data 18:Raw Data 374:poverty 352:of the 249:uncoded 210:from a 196:primary 188:average 166:(after 103:scholar 346:a post 216:binary 212:survey 192:median 156:, are 105:  98:  91:  84:  76:  110:JSTOR 96:books 320:data 289:data 279:and 277:data 227:and 200:data 159:data 82:news 366:own 348:by 344:by 285:end 273:egg 259:or 239:In 190:or 170:). 65:by 452:: 404:^ 132:) 126:( 121:) 117:( 107:· 100:· 93:· 86:· 59:. 31:. 20:)

Index

Raw Data
Primary source

verification
improve this article
adding citations to reliable sources
"Raw data"
news
newspapers
books
scholar
JSTOR
Learn how and when to remove this message

data
test scores
thermometer
outliers
central tendency
average
median
data
computer program
statistics
survey
binary
speedometers
computing
colloquial
uncoded

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.