Knowledge

Data cube

Source 📝

68:(OLAP) such dimensions could be the subsidiaries a company has, the products the company offers, and time; in this setup, a fact would be a sales event where a particular product has been sold in a particular subsidiary at a particular time. In satellite image timeseries dimensions would be latitude and longitude coordinates and time; a fact (sometimes called measure) would be a pixel at a given space and time as taken by the satellite (following some processing that is not of concern here). Even though it is called a 72:(and the examples provided above happen to be 3-dimensional for brevity), a data cube generally is a multi-dimensional concept which can be 1-dimensional, 2-dimensional, 3-dimensional, or higher-dimensional. In any case, every dimension divides data into groups of cells whereas each cell in the cube represents a single measure of interest. Sometimes cubes hold only a few values with the rest being 289:
Multi-dimensional arrays can meaningfully represent spatio-temporal sensor, image, and simulation data, but also statistics data where the semantics of dimensions is not necessarily of spatial or temporal nature. Generally, any kind of axis can be combined with any other into a data cube.
442:
Gray, Jim; Chaudhuri, Surajit; Bosworth, Adam; Layman, Andrew; Reichart, Don; Venkatrao, Murali; Pellow, Frank; Pirahesh, Hamid (January 1997). "Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals".
123:
introduced management of massive data cubes with high-level user functionality combined with an efficient software architecture. Datacube operations include subset extraction, processing, fusion, and in general queries in the spirit of
100:
supports n-D arrays with a rich set of operations. All these have in common that arrays must fit into the main memory and are available only while the particular program maintaining them (such as image processing software) is running.
322:) color plane. For example, the EarthServer initiative unites data centers from different continents offering 3-D x/y/t satellite image timeseries and 4-D x/y/z/t weather data for retrieval and server-side processing through the 277:(Database Management Systems) offer a data model which generically supports definition, management, retrieval, and manipulation of n-dimensional data cubes. This database category has been pioneered by the 208:
in 2008. In addition to the common data cube operations, the language knows about the semantics of space and time and supports both regular and irregular grid data cubes, based on the concept of
53:
of values. Typically, the term data cube is applied in contexts where these arrays are massively larger than the hosting computer's main memory; examples include multi-terabyte/petabyte
360:(OLAP), data cubes are a common arrangement of business data suitable for analysis from different perspectives through operations like slicing, dicing, pivoting, and aggregation. 104:
A series of data exchange formats support storage and transmission of data cube-like data, often tailored towards particular application domains. Examples include
326: 231:
Many high-level computer languages treat data cubes and other large arrays as single entities distinct from their contents. These languages, of which
314:
For a time sequence of color images, the array is generally four-dimensional, with the dimensions representing image X and Y coordinates, time, and
423:. Int. Workshop on Graphics Modeling, Visualization in Science & Technology. Darmstadt, Germany: Springer (published 1993). pp. 236–45. 336:, since a spectrally-resolved image is represented as a three-dimensional volume. Earth observation data cubes combine satellite imagery such as 96:
offers arbitrarily-indexed 1-D arrays and arrays of arrays, which allows the construction of higher-dimensional arrays, up to 15 dimensions.
154:
Around that time, a working group on Multi-Dimensional Databases ("Arbeitskreis Multi-Dimensionale Datenbanken") was established at German
76:, i.e. undefined, while sometimes most or all cube coordinates hold a cell value. In the first case such data are called 509: 120: 198:
database language was extended with data cube functionality as "SQL – Part 15: Multi-dimensional arrays (SQL/MDA)".
64:
The data cube is used to represent data (sometimes called facts) along some dimensions of interest. For example, in
384: 201: 155: 345: 136: 135:
Some years after, the data cube concept was applied to describe time-varying business data as data cubes by
389: 357: 220: 105: 65: 478:
Harinarayan, Venky; Rajaraman, Anand; Ullman, Jeffrey D. (1996). "Implementing data cubes efficiently".
323: 240: 205: 125: 97: 728: 298:
In mathematics, a one-dimensional array corresponds to a vector, a two-dimensional array resembles a
733: 551: 492: 236: 109: 569: 532: 394: 50: 487: 480:
Proceedings of the 1996 ACM SIGMOD international conference on Management of data – SIGMOD '96
689:
Kopp, Steve; Becker, Peter; Doshi, Abhijit; Wright, Dawn J.; Zhang, Kaixi; Xu, Hong (2019).
419:
Baumann, Peter (April 1992). "Language Support for Raster Image Manipulation in Databases".
333: 299: 8: 151:
which rank among the top 500 most cited computer science articles over a 25-year period.
515: 460: 264: 248: 505: 177: 169: 140: 612: 587: 464: 702: 519: 497: 452: 424: 268: 165: 428: 144: 260: 215:
An industry standard for querying business data cubes, originally developed by
183:
The EarthServer initiative has established geo data cube service requirements.
54: 637: 456: 722: 209: 161: 36: 32: 399: 92:
Multi-dimensional arrays have long been familiar in programming languages.
501: 319: 148: 58: 20: 707: 690: 483: 369: 341: 274: 28: 379: 337: 216: 24: 538: 374: 278: 259:
clips and other data en masse with simple expressions derived from
173: 232: 93: 267:
mathematics. Some languages (such as PDL) distinguish between a
303: 252: 664: 617:
DIS 9075-15 Information technology – Database languages – SQL
421:
Graphics Modeling and Visualization in Science and Technology
244: 271:
of images and a data cube, while many (such as IDL) do not.
691:"Achieving the Full Vision of Earth Observation Data Cubes" 441: 256: 113: 477: 255:
are examples, allow the programmer to manipulate complete
665:"EarthServer - Big Datacube Analytics at Your Fingertips" 315: 195: 192: 129: 84:, although there is no hard delineation between the two. 180:
in 1996, however without addressing data cubes as such.
688: 204:
is a geo data cube analytics language issued by the
306:may be represented as an n-dimensional data cube. 720: 108:for statistical (in particular, business) data, 638:"Management of Multidimensional Discrete Data" 613:"Part 15: Multi-dimensional arrays (SQL/MDA)" 570:"Datenbank Rundbrief, Ausgabe 23, Mai 1999" 552:"Datenbank Rundbrief, Ausgabe 19, Mai 1997" 309: 27:. For database support for datacubes, see 706: 491: 332:A data cube is also used in the field of 80:, and in the second case they are called 31:. For spatio-temporal geo datacubes, see 541:. 12 June 2009. Retrieved 21 March 2017. 534:500 Most Cited Computer Science Articles 351: 35:. For the Image Processing company, see 418: 329:geo data cube query language standard. 721: 41:In computer programming contexts, a 445:Data Mining and Knowledge Discovery 13: 186: 14: 745: 226: 112:for general scientific data, and 16:Multi-dimensional data structure 682: 657: 385:Australian Geoscience Data Cube 284: 202:Web Coverage Processing Service 51:multi-dimensional ("n-D") array 630: 605: 580: 562: 544: 526: 471: 435: 412: 293: 1: 405: 346:Geographic information system 429:10.1007/978-3-642-77811-7_19 390:Graph (discrete mathematics) 358:online analytical processing 221:MultiDimensional eXpressions 66:online analytical processing 7: 363: 156:Gesellschaft für Informatik 126:data manipulation languages 10: 750: 324:Open Geospatial Consortium 206:Open Geospatial Consortium 87: 18: 110:Hierarchical Data Format 588:"The DatabaseManifesto" 457:10.1023/A:1009726021843 395:Abstract semantic graph 310:Science and engineering 502:10.1145/233269.233333 352:Business intelligence 176:applications for the 334:imaging spectroscopy 302:; more generally, a 708:10.3390/data4030094 486:. pp. 205–16. 281:system since 1994. 141:Venky Harinarayan 139:, et al., and by 741: 729:Image processing 713: 712: 710: 686: 680: 679: 677: 676: 661: 655: 654: 652: 651: 642: 634: 628: 627: 625: 624: 609: 603: 602: 600: 599: 584: 578: 577: 576:. DE: Uni Trier. 566: 560: 559: 558:. DE: Uni Trier. 548: 542: 530: 524: 523: 495: 482:. Vol. 25. 475: 469: 468: 439: 433: 432: 416: 168:company selling 166:image processing 749: 748: 744: 743: 742: 740: 739: 738: 734:Database theory 719: 718: 717: 716: 687: 683: 674: 672: 663: 662: 658: 649: 647: 640: 636: 635: 631: 622: 620: 611: 610: 606: 597: 595: 586: 585: 581: 568: 567: 563: 550: 549: 545: 531: 527: 512: 476: 472: 440: 436: 417: 413: 408: 366: 354: 312: 296: 287: 229: 189: 187:Standardization 145:Anand Rajaraman 90: 61:of image data. 55:data warehouses 39: 17: 12: 11: 5: 747: 737: 736: 731: 715: 714: 681: 656: 629: 604: 579: 561: 543: 525: 511:978-0897917940 510: 493:10.1.1.41.1205 470: 434: 410: 409: 407: 404: 403: 402: 397: 392: 387: 382: 377: 372: 365: 362: 353: 350: 311: 308: 295: 292: 286: 283: 261:linear algebra 228: 227:Implementation 225: 188: 185: 89: 86: 15: 9: 6: 4: 3: 2: 746: 735: 732: 730: 727: 726: 724: 709: 704: 700: 696: 692: 685: 670: 666: 660: 646: 639: 633: 618: 614: 608: 593: 589: 583: 575: 571: 565: 557: 553: 547: 540: 536: 535: 529: 521: 517: 513: 507: 503: 499: 494: 489: 485: 481: 474: 466: 462: 458: 454: 450: 446: 438: 430: 426: 422: 415: 411: 401: 398: 396: 393: 391: 388: 386: 383: 381: 378: 376: 373: 371: 368: 367: 361: 359: 349: 347: 343: 339: 335: 330: 328: 325: 321: 317: 307: 305: 301: 291: 282: 280: 276: 272: 270: 266: 262: 258: 254: 250: 246: 242: 238: 234: 224: 222: 218: 213: 211: 210:coverage data 207: 203: 199: 197: 194: 191:In 2018, the 184: 181: 179: 175: 171: 167: 163: 162:Datacube Inc. 159: 157: 152: 150: 146: 142: 138: 133: 131: 127: 122: 121:Peter Baumann 117: 116:for imagery. 115: 111: 107: 102: 99: 95: 85: 83: 79: 75: 71: 67: 62: 60: 56: 52: 48: 44: 38: 37:Datacube Inc. 34: 33:Coverage data 30: 26: 23:concept, see 22: 698: 694: 684: 673:. Retrieved 669:Earth server 668: 659: 648:. Retrieved 644: 632: 621:. Retrieved 616: 607: 596:. Retrieved 592:Earth server 591: 582: 573: 564: 555: 546: 533: 528: 479: 473: 451:(1): 29–53. 448: 444: 437: 420: 414: 400:Apache Kylin 355: 331: 313: 297: 288: 285:Applications 273: 230: 214: 200: 190: 182: 160: 153: 134: 118: 103: 91: 81: 77: 73: 69: 63: 46: 42: 40: 537:(501–600), 348:analytics. 320:color space 294:Mathematics 275:Array DBMSs 149:Jeff Ullman 59:time series 21:data mining 723:Categories 675:2017-03-31 650:2017-09-21 623:2018-05-27 598:2017-09-21 484:ACM SIGMOD 406:References 370:Array DBMS 342:Sentinel-2 318:(or other 29:Array DBMS 701:(3): 94. 619:. ISO/IEC 488:CiteSeerX 380:OLAP cube 338:Landsat 8 217:Microsoft 178:PC market 119:In 1992, 43:data cube 25:OLAP cube 539:CiteSeer 465:12502175 375:rasdaman 364:See also 279:rasdaman 174:software 170:hardware 137:Jim Gray 47:datacube 19:For the 520:3104453 233:Fortran 164:was an 94:Fortran 88:History 49:) is a 518:  508:  490:  463:  304:tensor 300:matrix 265:vector 253:S-Lang 251:, and 78:sparse 641:(PDF) 516:S2CID 461:S2CID 344:with 245:NumPy 219:, is 128:like 82:dense 74:empty 695:Data 671:. EU 645:VLDB 594:. EU 574:dblp 556:dblp 506:ISBN 340:and 327:WCPS 269:list 263:and 257:film 172:and 147:and 114:TIFF 70:cube 57:and 45:(or 703:doi 498:doi 453:doi 425:doi 356:In 316:RGB 249:PDL 241:IDL 237:APL 196:SQL 193:ISO 130:SQL 106:MDX 98:APL 725:: 697:. 693:. 667:. 643:. 615:. 590:. 572:. 554:. 514:. 504:. 496:. 459:. 447:. 247:, 243:, 239:, 235:, 223:. 212:. 158:. 143:, 132:. 711:. 705:: 699:4 678:. 653:. 626:. 601:. 522:. 500:: 467:. 455:: 449:1 431:. 427::

Index

data mining
OLAP cube
Array DBMS
Coverage data
Datacube Inc.
multi-dimensional ("n-D") array
data warehouses
time series
online analytical processing
Fortran
APL
MDX
Hierarchical Data Format
TIFF
Peter Baumann
data manipulation languages
SQL
Jim Gray
Venky Harinarayan
Anand Rajaraman
Jeff Ullman
Gesellschaft für Informatik
Datacube Inc.
image processing
hardware
software
PC market
ISO
SQL
Web Coverage Processing Service

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.