|
How the "Peak Probability Contrasts" (PPC) method worksPPC is a method for classifying spectra into two or more groups (such as normal and diseased), from labelled training data.
With a list of peaks for each spectra, PPC does the following In this last step, the proportions are shrunk towards each other by as user-specified amount (estimated by cross-validation). If they are shrunk so as to be equal to one another, that common peak is eliminated from the classifier. Hence PPC has built-in feature selection. Peaks whose proportions are the most different between the two groups, are the best ones for discriminating between the groups. PPC classifies new spectra as follows. A feature vector of zeroes and
ones is created, with an entry for each common peak: 1 if the new spectra
has a common peak with intensity above the optimal cutpoint for that
common peak, and 0 otherwise. Then the feature vector is compared to
the vectors of proportions for the normal and diseased groups (computed
in step 4), and assigned to the group whose vector it is closest to,
in simple Euclidean distance. |
|