cg

changeset 20:c2609c6e7736
.
author: bshanks@bshanks.dyndns.org
date: Mon Apr 13 03:07:26 2009 -0700 (16 years ago)
parents: 717d4025b861
children: b9643c30e352
files: grant.html grant.odt grant.pdf grant.txt
--- a/grant.html	Sun Apr 12 15:35:00 2009 -0700
+++ b/grant.html	Mon Apr 13 03:07:26 2009 -0700
@@ -297,9 +297,28 @@
-               We are aware of one other effort to computationally analyze spatial gene
-            expression data.
+               We are aware of two existing efforts to relate spatial gene expression data to
+            anatomy through computational methods.
+               [?] describes an analysis of the anatomy of the hippocampus using the ABA
+            dataset. In addition to manual analysis, two clustering methods were employed,
+            a modified Non-negative Matrix Factorization (NNMF), and a hierarchial bifur-
+            cation clustering scheme based on correlation as the similarity score. The paper
+            yielded impressive results, proving the usefulness of such research. We have run
+            NNMF on the cortical dataset and while the results are promising (see Prelim-
+            inary Data), we think that it will be possible to find a better method2 (we also
+            think that more automation of the parts that this paper&#8217;s authors did manually
+            will be possible).
+               and [?] describes AGEA. todo
+__________________________
+   2We ran &#8220;vanilla&#8221; NNMF, whereas the paper under discussion used a modified method.
+Their main modification consisted of adding a soft spatial contiguity constraint.  However,
+on our dataset,  NNMF naturally produced spatially contiguous clusters,  so no additional
+constraint was needed. The paper under discussion mentions that they also tried a hierarchial
+variant of NNMF, but since they didn&#8217;t report its results, we assume that the result were not
+any more impressive than the non-hierarchial variant.
+                                            7
+
@@ -311,22 +330,20 @@
-                                            7
-
-            regression, gene wwc12 is the best fit single gene for predicting whether or not a
+            regression, gene wwc13 is the best fit single gene for predicting whether or not a
-               Gnee mtif23 is shown in figure the upper-right of Fig. . Mtif2 captures MO&#8217;s
+               Gnee mtif24 is shown in figure the upper-right of Fig. . Mtif2 captures MO&#8217;s
@@ -338,28 +355,9 @@
-            3 genes which most match area AUD, according to a pointwise method4.  The
-            bottom row displays the 3 genes which most match AUD according to a method
-            which considers local geometry5 The pointwise method in the top row identifies
-            genes which express more strongly in AUD than outside of it; its weakness is that
-            this includes many areas which don&#8217;t have a salient border matching the areal
-            border. The geometric method identifies genes whose salient expression border
-            seems to partially line up with the border of AUD; its weakness is that this
-            includes genes which don&#8217;t express over the entire area. Genes which have high
-            rankings using both pointwise and border criteria, such as Aph1a in the example,
-            may be particularly good markers.   None of these genes are,  individually,  a
-            perfect marker for AUD; we deliberately chose a &#8220;difficult&#8221; area in order to
-            better contrast pointwise with geometric methods.
-   2&#8220;WW, C2 and coiled-coil domain containing 1&#8221;; EntrezGene ID 211652
-    3&#8220;mitochondrial translational initiation factor 2&#8221;; EntrezGene ID 76784
-    4For each gene, a logistic regression in which the response variable was whether or not a
-surface pixel was within area AUD, and the predictor variable was the value of the expression
-of the gene underneath that pixel. The resulting scores were used to rank the genes in terms
-of how well they predict area AUD.
-    5For each gene the gradient similarity (see section ??) between (a) a map of the expression
-of each gene on the cortical surface and (b) the shape of area AUD, was calculated, and this
-was used to rank the genes.
+   3&#8220;WW, C2 and coiled-coil domain containing 1&#8221;; EntrezGene ID 211652
+    4&#8220;mitochondrial translational initiation factor 2&#8221;; EntrezGene ID 76784
@@ -372,6 +370,8 @@
+                                            9
+
@@ -379,16 +379,36 @@
-                                            9
-
+            3 genes which most match area AUD, according to a pointwise method5.  The
+            bottom row displays the 3 genes which most match AUD according to a method
+            which considers local geometry6 The pointwise method in the top row identifies
+            genes which express more strongly in AUD than outside of it; its weakness is that
+            this includes many areas which don&#8217;t have a salient border matching the areal
+            border. The geometric method identifies genes whose salient expression border
+            seems to partially line up with the border of AUD; its weakness is that this
+            includes genes which don&#8217;t express over the entire area. Genes which have high
+            rankings using both pointwise and border criteria, such as Aph1a in the example,
+            may be particularly good markers.   None of these genes are,  individually,  a
+            perfect marker for AUD; we deliberately chose a &#8220;difficult&#8221; area in order to
+            better contrast pointwise with geometric methods.
+__________________________
+   5For each gene, a logistic regression in which the response variable was whether or not a
+surface pixel was within area AUD, and the predictor variable was the value of the expression
+of the gene underneath that pixel. The resulting scores were used to rank the genes in terms
+of how well they predict area AUD.
+    6For each gene the gradient similarity (see section ??) between (a) a map of the expression
+of each gene on the cortical surface and (b) the shape of area AUD, was calculated, and this
+was used to rank the genes.
+                                            10
+
-            gene expression profiles.  We achieved classification accuracy of about 81%6.
+            gene expression profiles.  We achieved classification accuracy of about 81%7.
@@ -399,6 +419,8 @@
+               todo
+               (might want to incld nnMF since mentioned above)
@@ -417,13 +439,13 @@
-__________________________
-   65-fold cross-validation.
-                                            10
-
+__________________________
+   75-fold cross-validation.
+                                            11
+
@@ -456,27 +478,34 @@
-                                            11
-
-            _______________________________________________________________________________________________________ stuff i dunno where to put yet (there is more scattered through grant-
+______________________________________________
+    stuff  i  dunno  where  to  put  yet  (there  is  more  scattered  through  grant-
-    In anatomy, the manifold of interest is usually either defined by a combina-
-tion of two relevant anatomical axes (todo), or by the surface of the structure
-(as is the case with the cortex).  In the former case, the manifold of interest is
-a plane, but in the latter case it is curved. If the manifold is curved, there are
-various methods for mapping the manifold into a plane.
-    The method that we will develop will begin by mapping the data into a
-2-D plane.  Although the manifold that characterized cortical areas is known
-to be the cortical surface, it remains to be seen which method of mapping the
-manifold into a plane is optimal for this application. We will compare mappings
-which attempt to preserve size (such as the one used by Caret??) with mappings
-which preserve angle (conformal maps).
-    Although there is much 2-D organization in anatomy, there are also struc-
-tures whose shape is fundamentally 3-dimensional.  If possible, we would like
-the method we develop to include a statistical test that warns the user if the
-assumption of 2-D structure seems to be wrong.
-    todo: replace aim # bullet pts with #s
-
+               In anatomy, the manifold of interest is usually either defined by a combina-
+            tion of two relevant anatomical axes (todo), or by the surface of the structure
+            (as is the case with the cortex).  In the former case, the manifold of interest is
+            a plane, but in the latter case it is curved. If the manifold is curved, there are
+            various methods for mapping the manifold into a plane.
+               The method that we will develop will begin by mapping the data into a
+            2-D plane.  Although the manifold that characterized cortical areas is known
+            to be the cortical surface, it remains to be seen which method of mapping the
+            manifold into a plane is optimal for this application. We will compare mappings
+            which attempt to preserve size (such as the one used by Caret??) with mappings
+            which preserve angle (conformal maps).
+               Although there is much 2-D organization in anatomy, there are also struc-
+            tures whose shape is fundamentally 3-dimensional.  If possible, we would like
+            the method we develop to include a statistical test that warns the user if the
+            assumption of 2-D structure seems to be wrong.
+               if we need citations for aim 3 significance,  http://www.sciencedirect.
+            com/science?_ob=ArticleURL&amp;_udi=B6WSS-4V70FHY-9&amp;_user=4429&amp;_coverDate=
+            12%2F26%2F2008&amp;_rdoc=1&amp;_fmt=full&amp;_orig=na&amp;_cdi=7054&amp;_docanchor=&amp;_acct=
+            C000059602&amp;_version=1&amp;_urlVersion=0&amp;_userid=4429&amp;md5=551eccc743a2bfe6e992eee0c3194203#
+            app2 has examples of genetic targeting to specific anatomical regions
+               &#8212;
+               note:
+                                            13
+
+
--- a/grant.txt	Sun Apr 12 15:35:00 2009 -0700
+++ b/grant.txt	Mon Apr 13 03:07:26 2009 -0700
@@ -142,7 +142,12 @@
-We are aware of one other effort to computationally analyze spatial gene expression data. 
+We are aware of two existing efforts to relate spatial gene expression data to anatomy through computational methods.
+
+\cite{thompson_genomic_2008} describes an analysis of the anatomy of the hippocampus using the ABA dataset. In addition to manual analysis, two clustering methods were employed, a modified Non-negative Matrix Factorization (NNMF), and a hierarchial bifurcation clustering scheme based on correlation as the similarity score. The paper yielded impressive results, proving the usefulness of such research. We have run NNMF on the cortical dataset and while the results are promising (see Preliminary Data), we think that it will be possible to find a better method\footnote{We ran "vanilla" NNMF, whereas the paper under discussion used a modified method. Their main modification consisted of adding a soft spatial contiguity constraint. However, on our dataset, NNMF naturally produced spatially contiguous clusters, so no additional constraint was needed. The paper under discussion mentions that they also tried a hierarchial variant of NNMF, but since they didn't report its results, we assume that the result were not any more impressive than the non-hierarchial variant.} (we also think that more automation of the parts that this paper's authors did manually will be possible).
+
+
+ and \cite{ng_anatomic_2009} describes AGEA. todo
@@ -237,6 +242,9 @@
+todo
+
+(might want to incld nnMF since mentioned above)
@@ -309,4 +317,8 @@
-todo: replace aim # bullet pts with #s 
+if we need citations for aim 3 significance, http://www.sciencedirect.com/science?_ob=ArticleURL&_udi=B6WSS-4V70FHY-9&_user=4429&_coverDate=12%2F26%2F2008&_rdoc=1&_fmt=full&_orig=na&_cdi=7054&_docanchor=&_acct=C000059602&_version=1&_urlVersion=0&_userid=4429&md5=551eccc743a2bfe6e992eee0c3194203#app2 has examples of genetic targeting to specific anatomical regions
+
+---
+
+note:
author	bshanks@bshanks.dyndns.org
date	Mon Apr 13 03:07:26 2009 -0700 (16 years ago)
parents	717d4025b861
children	b9643c30e352
files	grant.html grant.odt grant.pdf grant.txt