Question

Why is partial least square good with missing data?

Why is partial least square good with missing data?

Homework Answers

Answer #1

Why Partial Least Squares (PLS) algorithm is good with missing data:

PLS is a successful tool in chemometrics. The PLS algorithm is used to solve the latent variables, a kind of missing data problem that cannot be observed directly. Several kinds of PLS algorithms can be widely used for estimating the value of latent variables. One approach is combining traditional linear regression type PLS algorithms with missing data handling methods. Also, it introduces quantile regression improving the performances of PLS algorithms when the relationships among manifest and latent variables are not fixed per the explored quantile of interest. PLS algorithms perform well when missing manifest variables occur. PLS algorithm is a new algorithm that is used for dealing with missing values in predictive modeling. A trimmed score regression (TSR) adaptation is proposed from PLS model exploitation with missing values. PPLS is used for building the multivariate calibration models. The PLS algorithm appraises incomplete data. The PLS algorithm is becoming popular in analyzing interactive marketing applications which may be attributed to its robustness and accuracy when data are abnormally distributed. PLS provides a data classification mechanism with missing data handling.

In the iterative replacement method:
The PLS algorithm consists of the following steps:

1) It replaces missing data by an initial value.
2) It optionally centers and scales the matrices.
3) From the resulting data matrices, it calculates the PLS model.
4) It obtains new values for the missing variables or data as the corresponding entries in the prediction matrices.
5) In case, convergence is not reached, it returns to step (2).

PLS carries out the critical step in its algorithm which is to calculate the imputations for the missing measurements.

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Maximise square root utility Consumption of good 1 is denoted x_1 and consumption of good 2...
Maximise square root utility Consumption of good 1 is denoted x_1 and consumption of good 2 by x_2. The agent has the utility function u(x_1, x_2) = √x_1 + √x_2 (the square root of x_1 plus the square root of x_2). Here, √ denotes the square root, * multiplication, + addition. Maximize u by choosing (x_1,x_2) subject to the budget constraint x_1 +3*x_2<=12 and the minimal consumption amount constraints x_1>=0 and x_2>=3.35. Write the quantity x_1 of good 1 that...
For a least-square regression question, the error vector's entries always sum up to zero. Explain why...
For a least-square regression question, the error vector's entries always sum up to zero. Explain why this is always the case for the error vector for a least-squares regression line?
List at least three reasons why ATP is a good source of energy .Draw the hydrolysis...
List at least three reasons why ATP is a good source of energy .Draw the hydrolysis reactions of ATP(ATP ?>AMP)?
The first partial (for males) has a chi-square value of 1.07, with p > .10 and...
The first partial (for males) has a chi-square value of 1.07, with p > .10 and a Cramer’s V of .07, with p > .10; the second partial (for females) has a chi-square value of 15.5, with p < .001 and a Cramer’s V of .56, p <.001. What do these statistics reveal about the relationships between the variables?
What makes a good tool for analyzing and interpreting data? And why? Give an example.
What makes a good tool for analyzing and interpreting data? And why? Give an example.
Cost-Volume-Profit Relations: Missing Data Following are data from 4 separate companies. Supply the missing data in...
Cost-Volume-Profit Relations: Missing Data Following are data from 4 separate companies. Supply the missing data in each independent case. Case 1 Case 2 Case 3 Case 4 Sales revenue $100,000 $90,000 Answer Answer Contribution margin $40,000} Answer $20,000 Answer Fixed Costs $24,000 Answer Answer Answer Net income Answer $7,000 $9,000 Answer Variable cost ratio Answer 0.50 Answer 0.20 Contribution margin ratio Answer Answer 0.40 Answer Break-even point (dollars) Answer Answer Answer $20,000 Margin of safety (dollars) Answer Answer Answer $30,000
Cost-Volume-Profit Relations: Missing Data Following are data from 4 separate companies. Supply the missing data in...
Cost-Volume-Profit Relations: Missing Data Following are data from 4 separate companies. Supply the missing data in each independent case Case A Case B Case C Case D Unit Sales 1,000 800 Answer Answer Sales revenue $20,000 $Answer $Answer $60,000 Variable cost per unit $10 $2 $14 $Answer Contribution margin $Answer $800 $Answer $Answer Fixed Costs $8,300 $Answer $100,000 $Answer Net income $Answer $600 $Answer $Answer Unit contribution margin $Answer $Answer $Answer $12 Break-even point (units) Answer Answer 4,000 2,000 Margin...
Use the data in part a (which were collect at 460°C and are equilibrium partial pressures)...
Use the data in part a (which were collect at 460°C and are equilibrium partial pressures) to determine K for the reaction: H2 (g) + I2 (g) <---> 2 HI (g), and complete the missing information for part b. PH2 (atm) PI2 (atm) PHI (atm) a 6.47e-3 0.594e-3 0.0137 b 3.84e-3 1.52e-3 ? Is K and Kp the same value for the above reaction? Please show work.
Create your's own example of a polynomial with degree 5, which include at least one missing...
Create your's own example of a polynomial with degree 5, which include at least one missing term divided by (X - 2), and provide a full solution to this problem.
. Suppose you have the following data set with missing values: 1 2 3 4 5...
. Suppose you have the following data set with missing values: 1 2 3 4 5 NA 7 6 NA 5 4 3 NA 2 6 10 14 NA 4 4 4 NA 10 13 16 19 NA (a) Give R expressions that return a vector of the data set without missing values (b) Give R expressions that return a vector of the data set after replacing missing values by the last non-missing values. For example, the last non-missing value...