首頁 > 軟考 > 軟考英語 > 計(jì)算機(jī)專業(yè)時(shí)文選讀之二

計(jì)算機(jī)專業(yè)時(shí)文選讀之二

軟考責(zé)任編輯：zhxlbj2 2004-12-31

添加老師微信

備考咨詢

加我微信

希賽網(wǎng)公眾號(hào) 點(diǎn)擊領(lǐng)取軟考備考資料

摘要：DataCubesDEFINITION:Adatacubeisatypeofmultidimensionalmatrixthatletsusersexploreandanalyzeacollectionofdatafrommanydifferentperspectives,usuallyconsideringthreefactors(dimensions)atatime.Whenwetrytoextractinformationfromastackofdata,weneedtoolstoh

Data Cubes

DEFINITION: A data cube is a type of multidimensional matrix that lets users explore and analyze a collection of data from many different perspectives, usually considering three factors (dimensions) at a time.

When we try to extract information from a stack of data, we need tools to help us find what's relevant and what's important and to explore different scenarios. A report, whether printed on paper or viewed on-screen, is at best a two-dimensional representation of data, a table using columns and rows. That's sufficient when we have only two factors to consider, but in the real world we need more powerful tools.

Data cubes are multidimensional extensions of 2-D tables, just as in geometry a cube is a three-dimensional extension of a square. The word cube brings to mind a 3-D object, and we can think of a 3-D data cube as being a set of similarly structured 2-D tables stacked on top of one another.

But data cubes aren't restricted to just three dimensions. Most online analytical processing (OLAP) systems can build data cubes with many more dimensions—Microsoft SQL Server 2000 Analysis Services, for example, allows up to 64 dimensions. We can think of a 4-D data cube as consisting of a series of 3-D cubes, though visualizing such higher-dimensional entities in spatial or geometric terms can be a problem.

In practice, therefore, we often construct data cubes with many dimensions, but we tend to look at just three at a time. What makes data cubes so valuable is that we can index the cube on one or more of its dimensions.

Relational or Multidimensional?

Since data cubes are such a useful interpretation tool, most OLAP products are built around a structure in which the cube is modeled as a multidimensional array. These multidimensional OLAP, or MOLAP, products typically run faster than other approaches, primarily because it's possible to index directly into the data cube's structure to collect subsets of data.

However, for very large data sets with many dimensions, MOLAP solutions aren't always so effective. As the number of dimensions increases, the cube becomes sparser—that is, many cells representing specific attribute combinations are empty, containing no aggregated data. As with other types of sparse databases, this tends to increase storage requirements, sometimes to unacceptable levels. Compression techniques can help, but using them tends to destroy MOLAP's natural indexing. ?

Data cubes can be built in other ways. Relational OLAP uses the relational database model. The ROLAP data cube is implemented as a collection of relational tables (up to twice as many as the number of dimensions) instead of as a multidimensional array. Each of these tables, called a cuboid, represents a particular view.

Because the cuboids are conventional database tables, we can process and query them using traditional RDBMS techniques, such as indexes and joins. This format is likely to be efficient for large data collections, since the tables must include only data cube cells that actually contain data.

However, ROLAP cubes lack the built-in indexing of a MOLAP implementation. Instead, each record in a given table must contain all attribute values in addition to any aggregated or summary values. This extra overhead may offset some of the space savings, and the absence of an implicit index means that we must provide one explicitly.

From a structural perspective, data cubes are made up of two elements: dimensions and measures. Dimensions are already explained; measures are simply the actual data values.

It's important to keep in mind that the data in a data cube has already been processed and aggregated into cube form. Thus we normally don't perform calculations within a data cube. This also means that we're not looking at real-time, dynamic data in a data cube.

The data contained within a cube has already been summarized to show figures such as unit sales, store sales, regional sales, net sale profits and average time for order fulfillment. With this data, an analyst can efficiently analyze any or all of those figures for any or all products, customers, sales agents and more. Thus data cubes can be extremely helpful in establishing trends and analyzing performance. In contrast, tables are best suited to reporting standardized operational scenarios.

時(shí)文選讀

數(shù)據(jù)立方體

定義：數(shù)據(jù)立方體是一類多維矩陣，讓用戶從多個(gè)角度探索和分析數(shù)據(jù)集，通常是一次同時(shí)考慮三個(gè)因素（維度）。

當(dāng)我們?cè)噲D從一堆數(shù)據(jù)中提取信息時(shí)，我們需要工具來幫助我們找到那些有關(guān)聯(lián)的和重要的信息，以及探討不同的情景。一份報(bào)告，不管是印在紙上的還是出現(xiàn)在屏幕上，都是數(shù)據(jù)的二維表示，是行和列構(gòu)成的表格。在我們只有兩個(gè)因素要考慮時(shí)，這就足矣，但在真實(shí)世界中我們需要更強(qiáng)的工具。

數(shù)據(jù)立方體是二維表格的多維擴(kuò)展，如同幾何學(xué)中立方體是正方形的三維擴(kuò)展一樣。 “立方體”這個(gè)詞讓我們想起三維的物體，我們也可以把三維的數(shù)據(jù)立方體看作是一組類似的互相疊加起來的二維表格。

但是數(shù)據(jù)立方體不局限于三個(gè)維度。大多數(shù)在線分析處理（ OLAP）系統(tǒng)能用很多個(gè)維度構(gòu)建數(shù)據(jù)立方體，例如，微軟的SQL Server 2000 Analysis Services工具允許維度數(shù)高達(dá)64個(gè)（雖然在空間或幾何范疇想像更高維度的實(shí)體還是個(gè)問題）。

在實(shí)際中，我們常常用很多個(gè)維度來構(gòu)建數(shù)據(jù)立方體，但我們傾向于一次只看三個(gè)維度。數(shù)據(jù)立方體之所以有價(jià)值，是因?yàn)槲覀兡茉谝粋€(gè)或多個(gè)維度上給立方體做索引。

關(guān)系的還是多維的？

由于數(shù)據(jù)立方體是一個(gè)非常有用的解釋工具，所以大多數(shù) OLAP產(chǎn)品都圍繞著按多維陣列建立立方模型這樣一個(gè)結(jié)構(gòu)編制。這些多維的OLAP產(chǎn)品，即MOLAP產(chǎn)品，運(yùn)行速度通常比其他方法更快，這是因?yàn)槟苤苯影阉饕鲞M(jìn)數(shù)據(jù)立方的結(jié)構(gòu)，方便收集數(shù)據(jù)子集。

然而，對(duì)于非常大的多維數(shù)據(jù)集， MOLAP方案并不總是有效的。隨著維度數(shù)目的增加，立方體變得更稀疏，即表示某些屬性組合的多個(gè)單元是空的，沒有集合的數(shù)據(jù)。相對(duì)于其他類型的稀疏數(shù)據(jù)庫，數(shù)據(jù)立方體往往會(huì)增加存儲(chǔ)需求，有時(shí)會(huì)達(dá)到不能接受的程度。壓縮技術(shù)能有些幫助，但利用這些技術(shù)往往會(huì)破壞MOLAP的自然索引。

數(shù)據(jù)立方體還可以用其他的方法構(gòu)建。關(guān)系 OLAP就利用了關(guān)系數(shù)據(jù)庫模型。ROLAP數(shù)據(jù)立方體是按關(guān)系表格的集合實(shí)現(xiàn)的（最多可達(dá)維度數(shù)目的兩倍），來代替多維陣列。其中的表格叫做立方單元，代表特定的視圖。

由于立方單元是一個(gè)常規(guī)的數(shù)據(jù)庫表格，所以我們能用傳統(tǒng)的 RDBMS技術(shù)（如索引和連接）來處理和查詢它們。這種形式對(duì)大量的數(shù)據(jù)集合可能是有效的，因?yàn)檫@些表格必須只能包含實(shí)際有數(shù)據(jù)的數(shù)據(jù)立方單元。

但是 ROLAP缺少了用MOLAP實(shí)現(xiàn)時(shí)所具有的內(nèi)在索引功能。相反，給定表格中的每個(gè)記錄必須包括所有的屬性值而任何集合的或摘要的數(shù)據(jù)。這種額外的開銷可能會(huì)抵消掉一些節(jié)省出來的空間，而隱性索引的缺少意味著我們必須提供顯性的索引。

從結(jié)構(gòu)角度看，數(shù)據(jù)立方體由兩個(gè)單元構(gòu)成：維度和測(cè)度。維度已經(jīng)解釋過了，測(cè)度就是實(shí)際的數(shù)據(jù)值。

記住這點(diǎn)是很重要的：數(shù)據(jù)立方體中的數(shù)據(jù)是已經(jīng)過處理并聚合成立方形式。因此，通常不需要在數(shù)據(jù)立方體中進(jìn)行計(jì)算。這也意味著我們看到數(shù)據(jù)立方體中的數(shù)據(jù)并不是實(shí)時(shí)的、動(dòng)態(tài)的數(shù)據(jù)。

立方體中的數(shù)據(jù)已經(jīng)過摘要，表示諸如計(jì)件銷售、店面銷售、區(qū)域銷售、銷售純利和完成訂單的平均時(shí)間等數(shù)據(jù)。有了這些數(shù)據(jù)，分析師能針對(duì)一個(gè)或全部產(chǎn)品、客戶、銷售代理等，就這些數(shù)字中的一個(gè)或全部進(jìn)行分析。這樣，在預(yù)測(cè)趨勢(shì)和分析業(yè)績(jī)時(shí)，數(shù)據(jù)立方體就非常有用，而表格最適合報(bào)告標(biāo)準(zhǔn)化的運(yùn)作情況。

溫馨提示：因考試政策、內(nèi)容不斷變化與調(diào)整，本網(wǎng)站提供的以上信息僅供參考，如有異議，請(qǐng)考生以權(quán)威部門公布的內(nèi)容為準(zhǔn)！

視頻教程【新版】軟考各科精講班視頻教程

備考學(xué)習(xí) 2025年上半年軟考信息系統(tǒng)監(jiān)理師考試備考資料匯總

備考學(xué)習(xí) 2025年上半年軟考復(fù)習(xí)計(jì)劃如何制定

備考學(xué)習(xí) 從零到一：軟考中級(jí)備考時(shí)間線及關(guān)鍵節(jié)點(diǎn)

備考資料 2025年上半年軟考各科案例簡(jiǎn)答合集

歷年真題軟考各科歷年真題全集練習(xí)

每日一練備考2025年軟考不慌，每日一練陪伴你

報(bào)考指導(dǎo) 2025年信息系統(tǒng)項(xiàng)目管理師備考指導(dǎo)課及精講試聽

延伸閱讀

更多精彩內(nèi)容請(qǐng)關(guān)注
軟考微信公眾號(hào)
掃碼加入免費(fèi)獲得

掃碼加入軟考QQ群
（群號(hào)：838864597）
+點(diǎn)擊加入

軟考備考資料免費(fèi)領(lǐng)取

去領(lǐng)取

共收錄117.93萬道題
已有25.02萬小伙伴參與做題

距離考試還有

天

備考必讀報(bào)考相關(guān) 培訓(xùn)課程

2025年全國軟考報(bào)名時(shí)間及報(bào)名通知匯總表軟考機(jī)考熱點(diǎn)問答各地軟考報(bào)名審核證明材料匯總非計(jì)算機(jī)專業(yè)可以考軟考嗎？軟考適合哪些人報(bào)考？軟考難嗎？通過率高不高？軟考可以評(píng)職稱嗎？軟考科目有哪些？3個(gè)級(jí)別27個(gè)科目，一次性搞懂！軟考和PMP?哪個(gè)含金量更高？報(bào)考人數(shù)超500萬！軟考為什么這么火？軟考是什么？軟考5個(gè)高級(jí)科目怎么選？上岸考友：這一點(diǎn)，比通過率更重要！

報(bào)名時(shí)間報(bào)名入口報(bào)名條件報(bào)考指南備考資料

軟考題庫我的題庫

專注在線職業(yè)教育24年

項(xiàng)目管理

信息系統(tǒng)項(xiàng)目管理師

考試指南

廠商認(rèn)證

信息系統(tǒng)項(xiàng)目管理師

建筑工程

信息系統(tǒng)項(xiàng)目管理師

金融財(cái)會(huì)

信息系統(tǒng)項(xiàng)目管理師

考博考研

信息系統(tǒng)項(xiàng)目管理師

學(xué)歷提升

防詐騙聲明培訓(xùn)證書查詢違法信息舉報(bào) 資質(zhì)&榮譽(yù)

客服熱線：400-111-9811

售后投訴：156-1612-8671

軟考

專注軟考培訓(xùn)24年

希賽網(wǎng)主編的教材一百余種，為全國數(shù)萬家企業(yè)、政府部門和事業(yè)單位提供了 ...

課程咨詢
學(xué)員服務(wù)
電話咨詢

客服熱線：400-111-9811

售后投訴：156-1612-8671
公眾號(hào)
掃描二維碼
關(guān)注希賽網(wǎng)站
APP下載
掃描二維碼
下載APP
返回頂部

掃描二維碼
下載APP

聯(lián)系我們

售前電話：400-111-9811（僅收市話費(fèi)）

售后投訴：156-1612-8671

在線客服

關(guān)注希賽網(wǎng)微信
下載希賽網(wǎng)APP

PMP^?，PMI-ACP^?和PMBOK^?是Project Management Institute，Inc.的注冊(cè)商標(biāo)
ITIL^?、PRINCE2^?是PeopleCert集團(tuán)的注冊(cè)商標(biāo)，經(jīng)PeopleCert授權(quán)使用，保留所有權(quán)利
湖南希賽網(wǎng)絡(luò)科技有限公司　版權(quán)所有　©2001-2025　湘ICP備10203241號(hào)-14　湘公網(wǎng)安備43019002000749號(hào)
違法和不良信息舉報(bào)電話：15673157832　舉報(bào)/反饋/投訴郵箱：ujigu@ujigu.com
出版物經(jīng)營許可證：4301042021177　高新技術(shù)企業(yè)證書：GR202143001539　廣播電視節(jié)目制作經(jīng)營許可證： (湘)字00833號(hào)

咨詢?cè)诰€老師!

計(jì)算機(jī)專業(yè)時(shí)文選讀之二

延伸閱讀

項(xiàng)目管理

信息系統(tǒng)項(xiàng)目管理師

軟考通信

信息系統(tǒng)項(xiàng)目管理師

廠商認(rèn)證

信息系統(tǒng)項(xiàng)目管理師

建筑工程

信息系統(tǒng)項(xiàng)目管理師

金融財(cái)會(huì)

信息系統(tǒng)項(xiàng)目管理師

考博考研

信息系統(tǒng)項(xiàng)目管理師

學(xué)歷提升

軟考