diff grant.txt @ 63:af5fd52f453f
.
author | bshanks@bshanks.dyndns.org |
---|---|
date | Sun Apr 19 15:23:53 2009 -0700 |
parents | ecf330fcfba3 |
children | 54ac7984b164 |
line diff
1.1 --- a/grant.txt Sun Apr 19 14:50:20 2009 -0700
1.2 +++ b/grant.txt Sun Apr 19 15:23:53 2009 -0700
1.3 @@ -50,6 +50,7 @@
1.4 \vspace{0.3cm}**Principle 2: Only look at combinations of small numbers of genes**
1.5 When the classifier classifies a voxel, it is only allowed to look at the expression of the genes which have been selected as features. The more data that is available to a classifier, the better it can do. For example, perhaps there are weak correlations over many genes that add up to a strong signal. So, why not include every gene as a feature? The reason is that we wish to employ the classifier in situations in which it is not feasible to gather data about every gene. For example, if we want to use the expression of marker genes as a trigger for some regionally-targeted intervention, then our intervention must contain a molecular mechanism to check the expression level of each marker gene before it triggers. It is currently infeasible to design a molecular trigger that checks the level of more than a handful of genes. Similarly, if the goal is to develop a procedure to do ISH on tissue samples in order to label their anatomy, then it is infeasible to label more than a few genes. Therefore, we must select only a few genes as features.
1.6
1.7 +The requirement to find combinations of only a small number of genes prevents us from straightforwardly applying many of the simplest techniques from the field of supervised machine learning. In the parlance of machine learning, our task combines feature selection with supervised learning.
1.8
1.9
1.10 \vspace{0.3cm}**Principle 3: Use geometry in feature selection**
1.11 @@ -289,7 +290,7 @@
1.12
1.13 Now, for each region, we created and ran a forward stepwise procedure which attempted to find pairs of gene expression boolean masks such that the conditional entropy of the target area's boolean mask, conditioned upon the pair of gene expression boolean masks, is minimized.
1.14
1.15 -This finds pairs of genes which are most informative (at least at these discretization thresholds) relative to the question, "Is this surface pixel a member of the target area?".
1.16 +This finds pairs of genes which are most informative (at least at these discretization thresholds) relative to the question, "Is this surface pixel a member of the target area?". Its advantage over linear methods such as logistic regression is that it takes account of arbitrarily nonlinear relationships; for example, if the XOR of two variables predicts the target, conditional entropy would notice, whereas linear methods would not.
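As an illustration of the conditional-entropy criterion described in this hunk, the following Python sketch estimates H(target | gene pair) from boolean masks and shows the XOR case, where the pair is fully informative while a single gene alone is not. The array names and synthetic data are illustrative assumptions, not the grant's actual pipeline.

```python
import numpy as np

def entropy(counts):
    """Shannon entropy (bits) of an empirical distribution given by counts."""
    p = counts / counts.sum()
    p = p[p > 0]
    return -(p * np.log2(p)).sum()

def conditional_entropy(target, gene_a, gene_b):
    """H(target | gene_a, gene_b) for boolean arrays, estimated from counts."""
    # Joint counts over the 2x2x2 table of (gene_a, gene_b, target).
    joint = np.zeros((2, 2, 2))
    for a, b, t in zip(gene_a, gene_b, target):
        joint[int(a), int(b), int(t)] += 1
    # H(T | A, B) = H(A, B, T) - H(A, B)
    return entropy(joint) - entropy(joint.sum(axis=2))

# XOR case: the pair predicts the target perfectly, but either gene alone does not.
rng = np.random.default_rng(0)
gene_a = rng.integers(0, 2, 1000).astype(bool)
gene_b = rng.integers(0, 2, 1000).astype(bool)
target = gene_a ^ gene_b
print(conditional_entropy(target, gene_a, gene_b))  # ~0 bits: the pair fully determines the target
print(conditional_entropy(target, gene_a, gene_a))  # ~1 bit: a single gene is uninformative here
```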
1.17
1.18
1.19 \vspace{0.3cm}**Gradient similarity**
1.20 @@ -356,18 +357,18 @@
1.21 \label{hole}\end{figure}
1.22
1.23
1.24 -
1.25 -=== Specific to Aim 1 (and Aim 3) ===
1.26 +\vspace{0.3cm}**Feature selection integrated with prediction**
1.27 +As noted earlier, in general, any predictive method can be used for feature selection by running it inside a stepwise wrapper. Also, some predictive methods integrate soft constraints on the number of features used. Examples of both will be seen in the section "Locating areas with gene expression".
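The hunk above does not name a particular method with a built-in soft constraint on feature count. As one common example (my assumption, not the grant's stated choice), L1-penalized logistic regression drives most gene coefficients to exactly zero; the sketch below shows that behavior on synthetic data with assumed shapes and names.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Assumed layout: X is an (n_pixels, n_genes) expression matrix and y is the
# 0/1 membership mask for one cortical area (synthetic stand-in, not grant data).
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 200))
y = (X[:, 3] + X[:, 17] > 0).astype(int)

# The L1 penalty acts as a soft constraint on the number of features:
# lowering C pushes more gene coefficients to exactly zero.
model = LogisticRegression(penalty="l1", solver="liblinear", C=0.05)
model.fit(X, y)
selected = np.flatnonzero(model.coef_[0])
print(f"{len(selected)} genes kept out of {X.shape[1]}: {selected}")
```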
1.28 +
1.29 +
1.30 +=== Locating areas with gene expression ===
1.31 \vspace{0.3cm}**Forward stepwise logistic regression**
1.32 -todo
1.33 +As a pilot run, for five cortical areas (SS, AUD, RSP, VIS, and MO), we performed forward stepwise logistic regression to find single genes, pairs of genes, and triplets of genes which predict areal identity. This is an example of feature selection integrated with prediction using a stepwise wrapper. Some of the single genes found were shown in previous figures, and Figure \ref{MOcombo} shows a combination of genes which was found.
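A minimal sketch of the forward stepwise wrapper described above, with logistic regression as the inner predictor. The use of 5-fold cross-validated accuracy as the selection score and the synthetic data are assumptions; the grant text does not specify the scoring criterion.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def forward_stepwise(X, y, max_genes=3):
    """Greedily add the gene that most improves CV accuracy, up to max_genes."""
    selected = []
    for _ in range(max_genes):
        best_gene, best_score = None, -np.inf
        for g in range(X.shape[1]):
            if g in selected:
                continue
            cols = selected + [g]
            score = cross_val_score(LogisticRegression(max_iter=1000),
                                    X[:, cols], y, cv=5).mean()
            if score > best_score:
                best_gene, best_score = g, score
        selected.append(best_gene)
        print(f"{len(selected)} gene(s): {selected}, CV accuracy {best_score:.3f}")
    return selected

# Illustrative synthetic data in place of the real expression matrix.
rng = np.random.default_rng(0)
X = rng.normal(size=(400, 50))
y = (X[:, 5] - X[:, 12] > 0).astype(int)
forward_stepwise(X, y, max_genes=3)
```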
1.34
1.35
1.36 \vspace{0.3cm}**SVM on all genes at once**
1.37
1.38 -In order to see how well one can do when looking at all genes at once, we ran a support vector machine to classify cortical surface pixels based on their gene expression profiles. We achieved classification accuracy of about 81%\footnote{5-fold cross-validation.}. As noted above, however, a classifier that looks at all the genes at once isn't practically useful.
1.39 -
1.40 -The requirement to find combinations of only a small number of genes limits us from straightforwardly applying many of the most simple techniques from the field of supervised machine learning. In the parlance of machine learning, our task combines feature selection with supervised learning.
1.41 -
1.42 +In order to see how well one can do when looking at all genes at once, we ran a support vector machine to classify cortical surface pixels based on their gene expression profiles. We achieved classification accuracy of about 81%\footnote{5-fold cross-validation.}. As noted above, however, a classifier that looks at all the genes at once isn't as practically useful as a classifier that uses only a few genes.
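A sketch of the kind of all-genes evaluation described in this hunk: an SVM over the full expression profile of each surface pixel, scored by 5-fold cross-validation. The RBF kernel, feature scaling, and synthetic stand-in data are assumptions; the ~81% figure comes from the grant's own run, not from this code.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Assumed layout: one row per cortical surface pixel, one column per gene,
# y holding the area label of each pixel (synthetic stand-in here).
rng = np.random.default_rng(0)
X = rng.normal(size=(600, 300))
y = rng.integers(0, 5, 600)          # e.g. five areas: SS, AUD, RSP, VIS, MO

# SVM on the full gene expression profile, scored by 5-fold cross-validation.
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
scores = cross_val_score(clf, X, y, cv=5)
print(f"mean 5-fold accuracy: {scores.mean():.2f}")
```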
1.43
1.44
1.45 \vspace{0.3cm}**Decision trees**
1.46 @@ -396,7 +397,7 @@
1.47
1.48
1.49
1.50 -=== Specific to Aim 2 (and Aim 3) ===
1.51 +=== Data-driven redrawing of the cortical map ===
1.52
1.53 \vspace{0.3cm}**Raw dimensionality reduction results**
1.54
1.55 @@ -443,6 +444,10 @@
1.56 todo amongst other things:
1.57
1.58
1.59 +layerfinding
1.60 +
1.61 +
1.62 +
1.63
1.64 \vspace{0.3cm}**Develop algorithms that find genetic markers for anatomical regions**
1.65