1)  The problem of finding hidden structure in unlabeled data is called... | Data Mining Mcqs

   A.  Supervised learning

   B.  Unsupervised learning

   C.  Reinforcement learning

Ans: B

 

2)  Task of inferring a model from labeled training data is called | Data Mining Mcqs

    A.  Unsupervised learning

   B.  Supervised learning

   C.  Reinforcement learning  

Ans: B

 

3)  Some telecommunication company wants to segment their customers into distinct groups in order to send appropriate subscription offers, this is an example of  | Data Mining Mcqs

   A.  Supervised learning

   B.  Data extraction

   C.  Serration

   D.  Unsupervised learning

Ans: D

 

4)  Self-organizing maps are an example of... | Data Mining Mcqs

   A.  Unsupervised learning  

   B.  Supervised learning

   C.  Reinforcement learning

   D.  Missing data imputation

Ans: A

 

5)  You are given data about seismic activity in Japan, and you want to predict a magnitude of the next earthquake, this is in an example of... | Data Mining Mcqs

   A.  Supervised learning

   B.  Unsupervised learning

   C.  Serration

   D.  Dimensionality reduction

Ans: A

 

6)  Assume you want to perform supervised learning and to predict number of newborns according to size of storks' population (http://www.brixtonhealth.com/storksBabies.pdf), it is an example of ...  | Data Mining Mcqs

   A.  Classification

   B.  Regression

   C.  Clustering

   D.  Structural equation modeling

Ans: B

 

7)  Discriminating between spam and ham e-mails is a classification task, true or false? | Data Mining Mcqs

   A.  True

   B.  False

Ans: A

 

8)  In the example of predicting number of babies based on storks' population size, number of babies is... | Data Mining Mcqs

   A.  outcome

   B.  feature  

   C.  attribute

   D.  observation

Ans: A

 

9)  It may be better to avoid the metric of ROC curve as it can suffer from accuracy paradox. | Data Mining Mcqs

   A.  True

   B.  False  

Ans: B

 

10)  which of the following is not involve in data mining? | Data Mining Mcqs

   A.  Knowledge extraction

   B.  Data archaeology  

   C.  Data exploration

   D.  Data transformation

Ans: D

 

11)  Which is the right approach of Data Mining? | Data Mining Mcqs

 

   A.  Infrastructure, exploration, analysis, interpretation, exploitation

   B.  Infrastructure, exploration, analysis, exploitation, interpretation

   C.  Infrastructure, analysis, exploration, interpretation, exploitation  

   D.  Infrastructure, analysis, exploration, exploitation, interpretation

Ans: A

 

12)   Which of the following issue is considered before investing in Data Mining? | Data Mining Mcqs

 A.  Functionality

   B.  Vendor consideration  

   C.  Compatibility

   D.  All of the above

Ans: D

 

13.  Adaptive system management is  | Data Mining Mcqs

A. It uses machine-learning techniques. Here program can learn from past experience and adapt themselves to new situations 

B. Computational procedure that takes some value as input and produces some value as output. 

C. Science of making machines performs tasks that would require intelligence when performed by humans 

D. none of these

Ans: A

 

14. Bayesian classifiers is | Data Mining Mcqs 

A. A class of learning algorithm that tries to find an optimum classification of a set of examples using the probabilistic theory. 

B.  Any mechanism employed by a learning system to constrain the search space of a hypothesis 

C.  An approach to the design of learning algorithms that is inspired by the fact that when people encounter new situations, they often explain them by reference to familiar experiences, adapting the explanations to fit the new situation. 

D. None of these 

Ans: A

 

15. Algorithm is | Data Mining Mcqs 

A. It uses machine-learning techniques. Here program can learn from past experience and adapt themselves to new situations 

B. Computational procedure that takes some value as input and produces some value as output 

C. Science of making machines performs tasks that would require intelligence when performed by humans 

D. None of these 

Ans: B

 

16. Bias is | Data Mining Mcqs 

A.A class of learning algorithm that tries to find an optimum classification of a set of examples using the probabilistic theory 

B. Any mechanism employed by a learning system to constrain the search space of a hypothesis 

C. An approach to the design of learning algorithms that is inspired by the fact that when people encounter new situations, they often explain them by reference to familiar experiences, adapting the explanations to fit the new situation. 

D. None of these 

Ans: B

 

17. Background knowledge referred to | Data Mining Mcqs 

A.  Additional acquaintance used by a learning algorithm to facilitate the learning process 

B. A neural network that makes use of a hidden layer 

C. It is a form of automatic learning. 

D. None of these 

Ans: A

 

18. Case-based learning is | Data Mining Mcqs 

A. A class of learning algorithm that tries to find an optimum classification of a set of examples using the probabilistic theory. 

B. Any mechanism employed by a learning system to constrain the search space of a hypothesis 

c. An approach to the design of learning algorithms that is inspired by the fact that when people encounter new situations, they often explain them by reference to familiar experiences, adapting the explanations to fit the new situation. 

D. None of these 

Ans: C

 

19. Classification is | Data Mining Mcqs 

A. A subdivision of a set of examples into a number of classes 

B. A measure of the accuracy, of the classification of a concept that is given by a certain theory 

C. The task of assigning a classification to a set of examples 

D. None of these 

Ans: A

 

20. Binary attribute are | Data Mining Mcqs 

A. This takes only two values. In general, these values will be 0 and 1 and .they can be coded as one bit 

B. The natural environment of a certain species 

C. Systems that can be used without knowledge of internal operations 

D. None of these 

Ans: A

 

21. Classification accuracy is 

A. A subdivision of a set of examples into a number of classes 

B. Measure of the accuracy, of the classification of a concept that is given by a certain theory 

C. The task of assigning a classification to a set of examples 

D. None of these 

Ans: B

 

22. Biotope are 

A. This takes only two values. In general, these values will be 0 and 1 

and they can be coded as one bit. 

B. The natural environment of a certain species 

C. Systems that can be used without knowledge of internal operations 

D. None of these 

Ans: B

 

23. Cluster is 

A. Group of similar objects that differ significantly from other objects 

B. Operations on a database to transform or simplify data in order to prepare it for a machine-learning algorithm 

C. Symbolic representation of facts or ideas from which information can potentially be extracted 

D. None of these 

Ans: A

 

24. Black boxes are 

A. This takes only two values. In general, these values will be 0 and 1 

and they can be coded as one bit. 

B. The natural environment of a certain species 

C. Systems that can be used without knowledge of internal operations 

D. None of these 

Ans: C

 

25. A definition of a concept is if it recognizes all the instances of that concept 

A. Complete 

B. Consistent 

C. Constant 

D. None of these 

Ans: A

 

26. Data mining is 

A. The actual discovery phase of a knowledge discovery process 

B. The stage of selecting the right data for a KDD process 

C. A subject-oriented integrated time variant non-volatile collection of data in support of management 

D. None of these 

Ans: A

 

27. A definition or a concept is if it classifies any examples as coming within the concept 

A. Complete 

B. Consistent 

C. Constant 

D. None of these 

Ans: B

 

28. Data independence means 

A. Data is defined separately and not included in programs 

B. Programs are not dependent on the physical attributes of data. 

C. Programs are not dependent on the logical attributes of data 

D. Both (B) and (C). 

Ans: D

 

29. E-R model uses this symbol to represent weak entity set? 

A. Dotted rectangle 

B. Diamond 

C. Doubly outlined rectangle 

D. None of these 

Ans: C

 

30. SET concept is used in 

A. Network Model 

B. Hierarchical Model 

C. Relational Model 

D. None of these 

Ans: D

 

31. Relational Algebra is 

A. Data Definition Language 

B. Meta Language 

C. Procedural query Language 

D. None of the above 

Ans: C

 

32. Key to represent relationship between tables is called 

A. Primary key 

B. Secondary Key 

C. Foreign Key 

D. None of these 

Ans: C

 

33. ________ produces the relation that has attributes of Ri and R2 

A. Cartesian product 

B. Difference 

C. Intersection 

D. Product 

Ans: A

 

34. Which of the following are the properties of entities? 

A. Groups 

B. Table 

C. Attributes 

D. Switchboards 

Ans: C

 

35. In a relation 

A. Ordering of rows is immaterial 

B. No two rows are identical 

C. (A) and (B) both are true 

D. None of these 

Ans: C