Business Analytics for Decision Making
Week 1 Quiz
1.
Question 1
Which of the following is true of cluster analysis?
1 point
- It is a data analysis technique to discover trends in time-series data
- It is a data mining tool that is used to create homogeneous groups
- It is a data visualization tool in market research
- It is a model for customer behavior in the organic and natural products industry
2.
Question 2
Which of the following settings are appropriate applications of cluster analysis? (select all that apply)
1 point
- A recommender system that seeks to predict the rating or preference that a user would give to an item (e.g., a movie, a book, or a restaurant).
- A delivery scheduling system that assigns delivery trucks to customers in the same general geographical area
- A cable company seeking to identify the number and type of TV packages to offer (e.g., Basic, Sports, Entertainment, or Premium).
- An inventory management system for retail pharmacies that attempts to minimize both the probability of running out of stock and the inventory carrying cost.
3.
Question 3
Which of the following statements is true of principal component analysis (PCA) and cluster analysis?
1 point
- PCA and cluster analysis are incompatible techniques, only one of them can be applied to the same data
- PCA is a data reduction technique and cluster analysis is a dimensionality reduction technique
- Cluster analysis is a data reduction technique and PCA is a dimensionality reduction technique
- The main goal of cluster analysis is to identify redundant variables and the main goal of PCA is to create homogeneous groups of observations
4.
Question 4
Cluster analysis is considered an unsupervised learning technique because it operates on historical observations that are not labeled. That is, it is not known to which group historical observations belong and therefore it is not known how many groups there are.
1 point
- True
- False
5.
Question 5
If the Euclidean distance were to be represented in a right angle triangle, which of the following would be considered the distance between two objects of a cluster?
1 point
- Hypotenuse
- Small leg
- Long leg
- Average of the sum of both legs
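As a minimal sketch of the geometry this question refers to, the snippet below computes the Euclidean distance between two objects and compares it with the Pythagorean calculation on the two legs of the right triangle they form. The two points are hypothetical and are not taken from the course material.

```python
import math

def euclidean_distance(p, q):
    """Straight-line distance between two points with the same number of coordinates."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

# Two hypothetical objects, each described by two attributes.
a = (1.0, 2.0)
b = (4.0, 6.0)

leg_x = abs(a[0] - b[0])  # horizontal leg of the triangle: 3.0
leg_y = abs(a[1] - b[1])  # vertical leg of the triangle: 4.0

print(euclidean_distance(a, b))   # 5.0
print(math.hypot(leg_x, leg_y))   # 5.0, the same value via the Pythagorean theorem
```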
6.
Question 6
Which of the following is the definition of distance between two clusters in a complete linkage clustering?
1 point
- The average of distances between all pairs of objects, where each pair is made up of one object of each group
- The distance between the most distant pair of objects, one from each group
- The sum of squares of the distance between clusters
- The distance between the value of the shortest link between the clusters
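The linkage definitions listed in the options can be compared side by side. The sketch below computes three common between-cluster distances (single, complete, and average linkage) for two small hypothetical clusters; the function names and the points are illustrative assumptions, not part of the course files.

```python
import math
from itertools import product

def pairwise_distances(cluster_a, cluster_b):
    """Distances for every pair made of one object from each cluster."""
    return [math.dist(p, q) for p, q in product(cluster_a, cluster_b)]

def single_linkage(cluster_a, cluster_b):
    """Distance between the closest pair of objects, one from each cluster."""
    return min(pairwise_distances(cluster_a, cluster_b))

def complete_linkage(cluster_a, cluster_b):
    """Distance between the most distant pair of objects, one from each cluster."""
    return max(pairwise_distances(cluster_a, cluster_b))

def average_linkage(cluster_a, cluster_b):
    """Average distance over all cross-cluster pairs."""
    d = pairwise_distances(cluster_a, cluster_b)
    return sum(d) / len(d)

# Hypothetical clusters on a line, so the distances are easy to check by hand.
cluster_a = [(0.0, 0.0), (1.0, 0.0)]
cluster_b = [(4.0, 0.0), (6.0, 0.0)]

print(single_linkage(cluster_a, cluster_b))    # 3.0
print(complete_linkage(cluster_a, cluster_b))  # 6.0
print(average_linkage(cluster_a, cluster_b))   # 4.5
```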
7.
Question 7
Which of the following is true of hierarchical clustering?
1 point
- All clusters must have the same number of objects
- No single cluster can have all objects
- Each step of the procedure consists of merging the two closest clusters
- All clusters must have more than one object in them
8.
Question 8
Which of the following is true of clustering methods?
1 point
- The k-means method is an exact procedure that finds the optimal (i.e., the best) set of clusters
- The best clustering approach when dealing with very large data sets is to solve the optimization problem using Excel’s Solver
- The k-means method and hierarchical clustering always arrive at the same solution, that is, they always produce the same set of clusters
- Finding the best set of clusters is complicated because the number of ways of partitioning the observations into k groups is very large and this is why approximation methods such as k-means and hierarchical clustering are used.
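To get a feel for how quickly the number of possible partitions grows, the sketch below counts the ways to split n observations into exactly k non-empty groups using the Stirling numbers of the second kind. This is a standard combinatorial count added for illustration; the function name is an assumption and the calculation is not part of the course material.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def stirling2(n, k):
    """Number of ways to partition n labeled objects into k non-empty groups."""
    if n == k:
        return 1
    if k == 0 or k > n:
        return 0
    # Object n either joins one of the k existing groups or starts a group of its own.
    return k * stirling2(n - 1, k) + stirling2(n - 1, k - 1)

print(stirling2(4, 2))    # 7
print(stirling2(10, 3))   # 9330
print(stirling2(49, 4))   # 49 observations into 4 groups: a very large integer
```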
Week 1 Application Assignment – Clustering
1.
Question 1
Assignment Overview
In this assignment you will practice what we learned in video 5 of this module. In Part 1 of the assignment, which is optional, you will be provided with a set of demographic data on 49 of America’s largest cities and will have an opportunity to apply k-means clustering to group the cities for marketing purposes. In Part 2 of the assignment, you will be asked a series of questions that will prompt you to describe the demographic structure of the clusters and identify cities in which to conduct a test for a new product.
Assignment Prompt
A large consumer goods company wants to select 4 U.S. cities where to test a new product. The company wants each city to represent a particular market segment, as defined by their demographic structure. The company has collected demographic data on 49 of America’s largest cities (see the Cities Excel file below). The demographic data consist of six attributes: 1) percentage of African-American population (% Black), 2) percentage of Hispanic population (% Hispanic), 3) percentage of Asian-American population (% Asian), 4) median age, 5) unemployment rate, and 6) per capita income.
Which cluster represents cities with no particular dominant minority group, with average age, employment rate, and income?
1 point
- Cluster 1
- Cluster 2
- Cluster 3
- Cluster 4
2.
Question 2
Which cluster consists of cities with a large Asian population that is older and wealthy?
1 point
- Cluster 1
- Cluster 2
- Cluster 3
- Cluster 4
3.
Question 3
Which cluster includes cities with a large population of African-Americans?
1 point
- Cluster 1
- Cluster 2
- Cluster 3
- Cluster 4
4.
Question 4
The company would like to choose one city to represent each market in order to test the new product. As discussed in the module, a representative object for a cluster could be chosen as the one that is closest to the centroid. The worksheet KMC_Clusters generated by XLMiner contains a table with the distances from each city to the centroid of each cluster. To identify the city to represent each cluster, we just need to find the city with the minimum distance to each of the centroids. Which cities would you recommend to represent each cluster?
1 point
- Cluster 1: Seattle, Cluster 2: Memphis, Cluster 3: Las Vegas, and Cluster 4: San Antonio
- Cluster 1: San Francisco, Cluster 2: Philadelphia, Cluster 3: Toledo, and Cluster 4: Los Angeles
- Cluster 1: San Francisco, Cluster 2: Philadelphia, Cluster 3: Omaha, and Cluster 4: Los Angeles
- Cluster 1: San Jose, Cluster 2: Detroit, Cluster 3: Las Vegas, and Cluster 4: El Paso
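The "closest to the centroid" rule described in this question can be sketched as a column-wise minimum over a distance table. The city names and distances below are hypothetical placeholders, not the values in the KMC_Clusters worksheet, and the function name is an assumption.

```python
# Hypothetical distances from each city to the centroids of clusters 1 to 4.
# These numbers are made up for illustration; they are NOT the XLMiner output.
distances = {
    "City A": [0.8, 2.9, 3.1, 2.7],
    "City B": [2.5, 0.6, 2.2, 3.0],
    "City C": [3.0, 2.4, 0.9, 1.8],
    "City D": [2.8, 3.3, 1.7, 0.5],
    "City E": [1.1, 2.0, 2.6, 2.9],
}

def representatives(distance_table):
    """For each cluster, return the city with the smallest distance to its centroid."""
    n_clusters = len(next(iter(distance_table.values())))
    return {
        cluster + 1: min(distance_table, key=lambda city: distance_table[city][cluster])
        for cluster in range(n_clusters)
    }

print(representatives(distances))
# {1: 'City A', 2: 'City B', 3: 'City C', 4: 'City D'}
```

In the spreadsheet, the same result could be read off by scanning each distance column for its minimum value.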
Week 2 Quiz
1.
Question 1
Which of the following best defines Monte Carlo simulation?
1 point
- It’s a tool for building statistical models that characterize relationships among a dependent variable and one or more independent variables.
- It’s a collection of techniques that seeks to group or segment a collection of objects into subsets.
- It’s the process of selecting values of decision variables that minimizes or maximizes some quantity of interest.
- It’s the process of generating random values for uncertain inputs in a model and computing the output variables of interest.
2.
Question 2
If chance or uncertainty is present in a system then there is an element of ______ in the decision-making problem.
1 point
- danger
- security
- risk
- difficulty
3.
Question 3
Which of the following are weaknesses of manual what-if analysis? (select all that apply)
1 point
- biased sample values of performance measures
- hard to do many what-if scenarios
- does not provide distribution information
4.
Question 4
Which of the following is a parameter of the Poisson distribution?
1 point
- maximum value
- mean
- minimum value
- most likely value
5.
Question 5
In the Analytic Solver Platform, “Psi” functions are used to add uncertainty to a spreadsheet model.
1 point
- true
- false
6.
Question 6
Why would a manager be interested in analyzing risk?
1 point
- to determine a most likely outcome
- to determine a range of outcomes
- to determine a distribution of outcomes
- to determine a confidence interval on most likely outcomes
7.
Question 7
The PsiOutput function of the Analytic Solver Platform is used to collect simulation data to create an empirical distribution of an output variable.
1 point
- true
- false
8.
Question 8
Historical data is used in simulation to:
1 point
- perform a worst-case analysis
- optimize the outcomes
- estimate a probability distribution function for critical inputs to the model
- simplify the model
9.
Question 9
Distribution fitting is the process of gathering historical data.
1 point
- true
- false
10.
Question 10
Adding a correlation matrix to a simulation model is necessary when:
1 point
- the uncertain input variables in the model are independent
- the model is deterministic (i.e., it does not have any uncertain inputs)
- two or more of the uncertain input variables in the model are not independent
- an output variable is related to an uncertain input variable
11.
Question 11
Which of the following statements is false:
1 point
- correlation is a measure of the strength of the relationship between two variables
- correlation values are always positive
- the correlation between two variables can be positive or negative
- the correlation between two independent variables is zero
12.
Question 12
The Analytic Solver Platform ________ allows you to determine the influence that each uncertain input variable has on an output variable based on the correlation between the input and the output variable.
1 point
- trend chart
- overlay chart
- box-whisker chart
- sensitivity chart
13.
Question 13
The Analytic Solver Platform ________ allows you to superimpose the frequency distributions of selected output variables in order to compare them.
1 point
- trend chart
- overlay chart
- box-whisker chart
- sensitivity chart
14.
Question 14
The Flaw of Averages typically results when a single number, the average value, is used in a spreadsheet model to represent an uncertain future quantity.
1 point
- true
- false
15.
Question 15
The average value for an output cell in a deterministic spreadsheet model that uses average values for uncertain input cells is always the same as the average value for the same output cell obtained with a Monte Carlo simulation.
1 point
- true
- false
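The comparison described in Questions 14 and 15 can be tried on a toy model. The sketch below assumes a hypothetical product with a capacity of 100 units and a demand that is uniformly distributed over the integers 50 to 150 (average 100); it compares the output obtained by plugging in the average demand with the average output of a Monte Carlo simulation. The model, the numbers, and the function names are all assumptions made for illustration.

```python
import random

CAPACITY = 100  # hypothetical capacity in units

def units_sold(demand):
    """Sales cannot exceed capacity, so the output is a nonlinear function of demand."""
    return min(demand, CAPACITY)

def simulate_mean_sales(trials=10_000, seed=42):
    """Average units sold over many randomly generated demand values."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        demand = rng.randint(50, 150)  # uniform integer demand with mean 100
        total += units_sold(demand)
    return total / trials

deterministic = units_sold(100)    # output when the average demand is plugged in
simulated = simulate_mean_sales()  # average output across simulated trials

print(deterministic)
print(simulated)
```

In this toy model the two numbers differ because sales are capped at capacity: trials with high demand cannot offset trials with low demand.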
Week 2 Application Assignment – Monte Carlo Simulation
1.
Question 1
A technology company has $2 million to invest in new research and development projects. The following table summarizes the initial cost, probability of success, and revenue potential for each of the projects under consideration.
Management has built the Monte Carlo simulation model in the Excel file Project Selection and would like to use it to compare various portfolio alternatives. The probability of making at least $1 million in total profit is the criterion that management wants to use. Based on this criterion, which of the eight project portfolios should the company fund?
Portfolio Selection.xlsx
(Hint: Enter 1 in the “Select?” column to indicate that a project is included in the portfolio. Turn on the Simulation Bulb in the Solver Action group of the Analytic Solver Platform. Run the simulation by clicking on the green “play” button in the Solver Options panel. Double-click on cell K14 to display the Frequency Chart of total profit and set the right marker to 1000.)
1 point
- Projects 1, 2, 3, 6, 7, and 8
- Projects 1, 2, 3, 4, and 7
- Projects 2, 4, 5, 6, and 8
- Projects 1, 3, 4, 5, 6, and 8
Week 3 Quiz
1.
Question 1
Which of the following statements are true? (select all that apply)
1 point
- Optimization has been defined as the process of selecting the values of decision variables that minimize or maximize some quantity of interest.
- Optimization started in the area of operations management but it is now used in all areas of business.
- Optimization models are prescriptive because their outcome is a recommendation of what to do.
2.
Question 2
In an optimization model, decision variables are:
1 point
- The unknowns for which the optimization process will find the best values.
- The functions to be maximized or minimized.
- The restrictions or limitations that are either related to technical and practical considerations or they are imposed by managerial policies.
- The parameter values provided by the analyst.
3.
Question 3
In a linear programming model both the objective function and the constraints are formulated as linear functions of the decision variables.
1 point
- True
- False
4.
Question 4
What is the goal in optimization of the transportation problem?
1 point
- Find the values of the decision variables that use all supplier capacities.
- Find the decision variable values (i.e., the shipment quantities) that result in the best objective function (i.e., lowest total cost) and satisfy all constraints.
- Find the values of the decision variables that satisfy all the demand constraints.
- None of these.
5.
Question 5
What does the Excel “=SUMPRODUCT(A1:A3,B1:B3)” function do?
1 point
- Sums each range and multiplies the sums. That is, (A1+A2+A3)*(B1+B2+B3).
- Sums each pair of cells and multiplies the sums. That is, (A1+B1)*(A2+B2)*(A3+B3).
- Multiplies each range and sums the products. That is, (A1*A2*A3)+(B1*B2*B3)
- Multiplies each pair of cells and sums the products. That is, (A1*B1)+(A2*B2)+(A3*B3).
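The four formulas in the options can be evaluated on sample data to see how they differ. The sketch below uses hypothetical values for A1:A3 and B1:B3 and computes each expression in Python; the variable names are illustrative, and the last expression mirrors what Excel's SUMPRODUCT computes for two ranges.

```python
import math

# Hypothetical cell contents.
a = [1, 2, 3]  # A1:A3
b = [4, 5, 6]  # B1:B3

# (A1+A2+A3)*(B1+B2+B3)
sum_then_multiply = sum(a) * sum(b)

# (A1+B1)*(A2+B2)*(A3+B3)
pair_sums_multiplied = math.prod(x + y for x, y in zip(a, b))

# (A1*A2*A3)+(B1*B2*B3)
range_products_summed = math.prod(a) + math.prod(b)

# (A1*B1)+(A2*B2)+(A3*B3), the element-wise product summed
sumproduct = sum(x * y for x, y in zip(a, b))

print(sum_then_multiply)      # 90
print(pair_sums_multiplied)   # 315
print(range_products_summed)  # 126
print(sumproduct)             # 32
```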
6.
Question 6
What function is used to add the contents of cells A1, A2, and A3?
1 point
- =ADD(A1:A3).
- =TOTAL(A1:A3).
- =SUM(A1:A3).
- =PRODUCT(A1:A3).
7.
Question 7
Suppose that three decision variables are in cells A1, A2, and A3. To add nonnegativity constraints with the Analytic Solver Platform, you click on Constraints in the Optimization Model group, then choose Variable Type/Bound, click on “>=”, and fill out the dialogue as follows:
1 point
- True
- False
8.
Question 8
What is true about the ASP optimization model shown below?
1 point
- The model has 6 decision variables, three in cells A1 to A3 and three in cells C4 to C6.
- The model enforces the following constraint: C4+C5+C6 <= D4+D5+D6.
- The model minimizes the value of C1 by changing the nonnegative values in cells A1 to A3.
9.
Question 9
Which of the following statements are true about a Sensitivity Report?
1 point
- It provides very useful information for pricing decisions, the value of resources, and the robustness of the optimal solution.
- It’s not able to provide answers to what-if questions that involve multiple changes in the model, such as simultaneously changing the coefficient of a decision variable and the right-hand side of a constraint.
- It provides information about decision variables (reduced costs) and constraints (shadow prices).
10.
Question 10
If the shadow price for a resource constraint is 0, the allowable increase is 200 units, and 150 units of the resource are added, what happens to the objective function value?
1 point
- It increases by 150
- It increases by more than 0 but less than 150
- No change
- It increases but by an unknown amount
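The shadow-price arithmetic behind this question can be sketched as follows: within the allowable range, the predicted change in the optimal objective value is the shadow price times the change in the right-hand side. The function name is hypothetical, and the allowable decrease is not given in the question, so the value used below is an assumption.

```python
def objective_change(shadow_price, delta_rhs, allowable_increase, allowable_decrease):
    """Predicted change in the optimal objective value for a change in a constraint's
    right-hand side. Returns None when the change falls outside the allowable range,
    because the shadow price is then no longer guaranteed to hold."""
    if -allowable_decrease <= delta_rhs <= allowable_increase:
        return shadow_price * delta_rhs
    return None

# Values from the question; the allowable decrease of 0 is an assumed placeholder.
print(objective_change(shadow_price=0, delta_rhs=150,
                       allowable_increase=200, allowable_decrease=0))

# A hypothetical constraint with a nonzero shadow price, for contrast.
print(objective_change(shadow_price=2.5, delta_rhs=10,
                       allowable_increase=200, allowable_decrease=50))   # 25.0
print(objective_change(shadow_price=2.5, delta_rhs=250,
                       allowable_increase=200, allowable_decrease=50))   # None
```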
11.
Question 11
Which of the following approaches provided by the Analytic Solver Platform can automatically run multiple optimizations while varying model parameters (e.g., the right-hand side of a constraint) within a prespecified range?
1 point
- Breakdown analysis
- Parameter analysis
- Uncertainty analysis
- Sensitivity analysis
12.
Question 12
A bar chart is an effective way of visualizing the use of a resource in an optimal solution, where colors represent how the resource is used and the height represents how much of the resource is used.
1 point
- True
- False
Week 3 Application Assignment – Linear Optimization
1.
Question 1
A paper recycling company converts newspaper, mixed paper, white office paper, and cardboard into pulp for newsprint, packaging paper, and print stock quality paper. The following table summarizes the yield for each kind of pulp recovered from each ton of recycled material.
This table shows that, for instance, a ton of newspaper can produce either 0.85 tons of newsprint pulp or 0.80 tons of packaging pulp. The following table shows the processing costs per ton, the purchase cost, and the availability of the recycled material.
1 point
- Used tons (F23:F26)
- Processed tons (C23:E26)
- Pulp production(C27:E27)
- Purchase and production costs (C30:C31)
2.
Question 2
The constraints in the optimization model are:
1 point
- Pulp production >= Required pulp (C27:E27 >= C18:E18), Used tons <= Available tons (F23:F26 <= G14:G17), and Processed tons >= 0 (C23:E26 >= 0)
- Pulp production <= Required pulp (C27:E27 <= C18:E18), Used tons >= Available tons (F23:F26 >= G14:G17), and Processed tons >= 0 (C23:E26 >= 0)
- Production cost >= Purchase cost (C31 >= C30)
- There are no constraints in the problem.
3.
Question 3
The objective function in the optimization model is:
1 point
- Maximize total cost (Max C32)
- Minimize Production cost (Min C31)
- Minimize total cost (Min C32)
- Maximize pulp production (Max SUM(C27:E27))
4.
Question 4
Solve the optimization model that results from your answers to questions 1, 2, and 3. What is the total cost for the optimal solution?
1 point
- $41,841.91
- $44,067.74
- $35,692.86
- None of the above
5.
Question 5
Generate the Sensitivity Report for the optimal solution and use it to figure out how much the recycling company should be willing to pay for an additional ton of recycled newspaper. (Hint: To generate the report, go to the Analysis group of the Analytic Solver Platform tab and click on Reports -> Optimization -> Sensitivity. If the report is not there, make sure that the Standard LP Engine was chosen to solve the model.)
1 point
- No more than $3.10
- No more than $4.20
- No more than $28.99
- $0.00
Week 4 Quiz
1.
Question 1
Which of the following is not a benefit of using binary variables?
1 point
- Models are easy to solve (i.e., the solvers can find optimal solutions faster) because the variables can only be zero or one.
- Binary variables are useful in selection problems.
- Binary variables can be used to model yes/no decisions.
- Binary variables can enforce logical conditions.
2.
Question 2
An optimization model has 5 binary decision variables. How many possible integer solutions are there to this problem?
1 point
- 5
- 10
- 25
- 32
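One way to check a count like this is to enumerate every 0/1 assignment directly. The sketch below is a small illustration with a hypothetical helper name; it lists all assignments of n binary variables and compares the count with the closed-form expression.

```python
from itertools import product

def binary_assignments(n):
    """All possible 0/1 assignments for n binary decision variables."""
    return list(product((0, 1), repeat=n))

solutions = binary_assignments(5)
print(len(solutions))   # number of candidate integer solutions
print(2 ** 5)           # the same count in closed form
print(solutions[:3])    # [(0, 0, 0, 0, 0), (0, 0, 0, 0, 1), (0, 0, 0, 1, 0)]
```

The count doubles with every additional binary variable, which is why enumeration stops being practical for larger models.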
3.
Question 3
A company wants to select no more than 2 projects from a set of 4 possible projects. Which of the following constraints ensures that no more than 2 will be selected, assuming that the P variables are binary and represent whether a project is selected (value of 1) or not (value of 0)?
1 point
- P1+P2+P3+P4 = 2
- P1+P2+P3+P4 ≤ 2
- P1+P2+P3+P4 ≥ 2
- P1+P2+P3+P4 ≥ 0
4.
Question 4
A company must invest in project 1 in order to invest in project 2. P1 is a binary variable representing whether project 1 is chosen (value of 1) or not (value of 0). P2 has the same interpretation for project 2. Which of the following constraints ensures that if project 2 is chosen then project 1 must also be chosen?
1 point
- P1+P2 = 0
- P1+P2 = 1
- P1-P2 ≥ 0
- P1-P2 ≤ 0
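Because P1 and P2 are binary, each candidate constraint can be checked by listing the assignments it allows. The sketch below enumerates the four (P1, P2) combinations for every option and flags whether the option permits choosing project 2 without project 1; the dictionary keys and helper names are illustrative assumptions.

```python
from itertools import product

# The four candidate constraints from the options, written as predicates.
constraints = {
    "P1 + P2 = 0": lambda p1, p2: p1 + p2 == 0,
    "P1 + P2 = 1": lambda p1, p2: p1 + p2 == 1,
    "P1 - P2 >= 0": lambda p1, p2: p1 - p2 >= 0,
    "P1 - P2 <= 0": lambda p1, p2: p1 - p2 <= 0,
}

def feasible(constraint):
    """All (P1, P2) assignments that satisfy the given constraint."""
    return [(p1, p2) for p1, p2 in product((0, 1), repeat=2) if constraint(p1, p2)]

def allows_2_without_1(constraint):
    """True if the constraint permits selecting project 2 while skipping project 1."""
    return (0, 1) in feasible(constraint)

for name, constraint in constraints.items():
    print(name, feasible(constraint), allows_2_without_1(constraint))
```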
5.
Question 5
An optimization model for a production process must deal with the following situation. The model must decide whether or not to produce a product. If the decision is to produce the product, then the policy is that at least 100 units of this product must be produced. The following Excel cells are part of a spreadsheet model for this problem:
Cell B1 contains a binary decision variable, where 1 = produce and 0 = not produce. B4 is a decision variable indicating the amount to produce. Which of the following combinations of an Excel function for B3 and a solver constraint enforces the production policy?
1 point
- =B1*B2 and B4 >= B3
- =B1*B3 and B3 >= B4
- =B1+B2 and B4 >= B3
- =B1*B4 and B3 >= B2
6.
Question 6
Which of the following statements is not true about metaheuristic optimization?
1 point
- Metaheuristics provide great modeling flexibility.
- Metaheuristics can solve optimization models with nonlinear and/or non-smooth functions.
- The metaheuristic solver in the Analytic Solver Platform is called the Evolutionary Engine.
- Metaheuristics are exact procedures that guarantee finding an optimal solution.
7.
Question 7
In market basket analysis, the Lift Ratio tells us how much more likely it is for item Y to be purchased given that item X has been purchased.
1 point
- True
- False
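For reference, the lift ratio of a rule X -> Y is commonly computed as the confidence of the rule divided by the support of Y. The sketch below applies that definition to a handful of hypothetical transactions; the items, the data, and the function names are made up for illustration.

```python
# Hypothetical market baskets; not data from the course.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"milk"},
    {"bread", "milk", "butter"},
    {"butter"},
]

def support(itemset, baskets):
    """Fraction of baskets that contain every item in the itemset."""
    itemset = set(itemset)
    return sum(1 for basket in baskets if itemset <= basket) / len(baskets)

def lift(x, y, baskets):
    """Confidence of the rule x -> y divided by the support of y."""
    confidence = support({x, y}, baskets) / support({x}, baskets)
    return confidence / support({y}, baskets)

print(support({"bread"}, transactions))        # 0.6
print(support({"bread", "milk"}, transactions))  # 0.4
print(lift("bread", "milk", transactions))     # about 1.11
```

A lift above 1 means Y appears more often in baskets containing X than in baskets overall; a lift of 1 means no association.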
8.
Question 8
A chance constraint is a special type of constraint that is satisfied only in a fraction of the trials in a simulation.
1 point
- True
- False
9.
Question 9
An optimization model includes a chance constraint to satisfy demand of a particular product. The demand is uncertain and is modeled with an integer uniform distribution with parameter values of 0 and 4. That is, the probability that the demand is 0, 1, 2, 3, or 4 is exactly the same. A decision is made to order 2 units of the product from a supplier in order to satisfy the uncertain demand. What is the value at risk (VaR) for the demand constraint?
1 point
- 30%
- 40%
- 50%
- 60%
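The probabilities involved in this question can be worked out exactly by listing the five equally likely demand values. The sketch below assumes the demand constraint has the form "order quantity >= demand" (the question does not state the constraint explicitly) and reports the fraction of outcomes in which it is satisfied and violated; it does not decide which of the two the Analytic Solver Platform reports as VaR.

```python
from fractions import Fraction

demand_values = range(0, 5)  # integer uniform demand: 0, 1, 2, 3, 4
order_quantity = 2           # units ordered from the supplier

# Assumed form of the demand constraint: order_quantity >= demand.
satisfied_outcomes = [d for d in demand_values if order_quantity >= d]

p_satisfied = Fraction(len(satisfied_outcomes), len(demand_values))
p_violated = 1 - p_satisfied

print(satisfied_outcomes)  # [0, 1, 2]
print(p_satisfied)         # 3/5
print(p_violated)          # 2/5
```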
Week 4 Application Assignment – Simulation Optimization
1.
Question 1
A technology company has $2 million to invest in new research and development projects. The following table summarizes the initial cost, probability of success, and revenue potential for each of the projects under consideration.
Management has built the Monte Carlo simulation model in the Excel file Project Selection SO and would like to find the portfolio that maximizes the probability of making at least $1 million in profits. Questions 1, 2, and 3 guide you through the implementation of an optimization model. Add the optimization model as you answer these questions. (Hint: The three elements of the optimization model, decision variables, constraints, and the objective function, are of the “Normal” type. Also turn on the Simulation Bulb in the Solver Action group of the Analytic Solver Platform.)
Portfolio Selection SO.xlsx
The decision variables in the optimization model are:
1 point
- Select? (H5:H12)
- Success? (I5:I12)
- Revenue (J5:J12)
- Profit (K5:K12)
2.
Question 2
The constraints in the optimization model are:
1 point
- Revenue >= Profit (J5:J12 >= K5:K12)
- Select? >= Success? (H5:H12 >= I5:I12)
- Total cost <= Available funds (H14 <= H15) and binary variables
- Total profit >= Probability that the total profit is at least $1 million (K14 >= K15)
3.
Question 3
The objective function in the optimization model is:
1 point
- Minimize total cost (Min H14)
- Maximize the probability that the total profit is at least $1 million (Max K15)
- Maximize total profit (Max K14)
- Maximize available funds (Max H15)
4.
Question 4
Use the Evolutionary Engine to solve the optimization model that results from your answers to questions 1, 2, and 3. (Make sure that the number of trials is set to 10000.) Compare the solution that you found with the following solutions. Which of the following solutions is the best? (Hint: The Evolutionary Solver might not have found the best solution, so try all these solutions in your model before answering the question.)
1 point
- Projects 1, 2, 3, 6, 7, and 8
- Projects 2, 4, 5, 6, and 8
- Projects 1, 2, 3, 5, 6, and 8
- Projects 1, 2, 3, 4, and 7