CS614 - Data Warehousing Solved MCQs from Quiz # 3
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
Validity
Security
Integrity
None of above
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
Introspective
Intuitive
Reminiscent
Retrospective
The technique that is used to perform these feats in data mining is called modeling, and this act of model building is something that people have been doing for a long time, certainly before the _________ of computers or data mining technology.
Access
Advent
Ascent
Avowal
Classification consists of examining the properties of a newly presented observation and assigning it to a predefined ____________.
Object
Container
Subject
Class
During business hours, most ______ systems should probably not use parallel execution.
OLAP
DSS
Data Mining
OLTP
In contrast to statistics, data mining is ______ driven.
Assumption
Knowledge
Human
Database
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
Furrow
Streak
Trough
Vein
CS614 - Data Warehousing
As opposed to the outcome of classification, estimation deal with __________ valued outcome.
Discrete
Isolated
Continuous
Distinct
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
In Parallel
Distributed
Sequentially
None of above
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
OLTP
OLAP
DSS
DWH
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The ______ the portion of the program that must be executed sequentially, the greater the scalability of the computation.
Larger
Smaller
Unambiguous
Superior
The goal of ___________ is to look at as few blocks as possible to find the matching records(s).
Indexing
Partitioning
Joining
None of above
In nested-loop join case, if there are ‘M’ rows in outer table and ‘N’ rows in inner table, time complexity is
O (M log N)
O (log MN)
O (MN)
O (M + N)
Friday, June 29, 2012
0 comments:
Post a Comment