cg

changeset 36:c1152241ab12
.
author: bshanks@bshanks.dyndns.org
date: Mon Apr 13 23:11:04 2009 -0700 (16 years ago)
parents: 99e5d268bab0
children: af3389b432e9
files: grant.doc grant.html grant.odt grant.pdf grant.txt
--- a/grant.html	Mon Apr 13 20:27:32 2009 -0700
+++ b/grant.html	Mon Apr 13 23:11:04 2009 -0700
@@ -155,12 +155,30 @@
-single agreed-upon map can be seen by contrasting the recent maps given by Swanson[?] on the one hand, and Paxinos
-and Franklin[?] on the other. While the maps are certainly very similar in their general arrangement, significant differences
+single agreed-upon map can be seen by contrasting the recent maps given by Swanson[4] on the one hand, and Paxinos
+and Franklin[3] on the other. While the maps are certainly very similar in their general arrangement, significant differences
+The Allen Mouse Brain Atlas dataset
+The Allen Mouse Brain Atlas (ABA) data was produced by doing in-situ hybridization on slices of male, 56-day-old
+C57BL/6J mouse brains.  Pictures were taken of the processed slice, and these pictures were semi-automatically analyzed
+in order to create a digital measurement of gene expression levels at each location in each slice.  Per slice, cellular spatial
+resolution is achieved. Using this method, a single physical slice can only be used to measure one single gene; many different
+mouse brains were needed in order to measure the expression of many genes.
+Next, an automated nonlinear alignment procedure located the 2D data from the various slices in a single 3D coordinate
+system.  In the final 3D coordinate system, voxels are cubes with 200 microns on a side.  There are 67x41x58 = 159,326
+voxels in the 3D coordinate system, of which 51,533 are in the brain[2].
+Mus musculus, the common house mouse, is thought to contain about 22,000 protein-coding genes[6]. The ABA contains
+data on about 20,000 genes in sagittal sections, out of which over 4,000 genes are also measured in coronal sections.  Our
+dataset is derived from only the coronal subset of the ABA, because the sagittal data does not cover the entire cortex,
+and has greater registration error[2]. Genes were selected by the Allen Institute for coronal sectioning based on, &#8220;classes of
+known neuroscientific interest... or through post hoc identification of a marked non-ubiquitous expression pattern&#8221;[2].
+_________________________________________
+   2This would seem to contradict our finding in aim 1 that some cortical areas are combinatorially coded by multiple genes.  However, it is
+possible that the currently accepted cortical maps divide the cortex into subregions which are unnatural from the point of view of gene expression;
+perhaps there is some other way to map the cortex for which each subregion can be identified by single genes.
@@ -176,10 +194,6 @@
-_________________________________________
-   2This would seem to contradict our finding in aim 1 that some cortical areas are combinatorially coded by multiple genes.  However, it is
-possible that the currently accepted cortical maps divide the cortex into subregions which are unnatural from the point of view of gene expression;
-perhaps there is some other way to map the cortex for which each subregion can be identified by single genes.
@@ -192,7 +206,7 @@
-[3 ] describes an analysis of the anatomy of the hippocampus using the ABA dataset.  In addition to manual analysis,
+[5 ] describes an analysis of the anatomy of the hippocampus using the ABA dataset.  In addition to manual analysis,
@@ -212,14 +226,7 @@
-The hierarchial clustering is different from our Aim 2 in at least three ways.  First, the clustering finds clusters cor-
-responding to layers, but no clusters corresponding to areas7  8  Our Aim 2 will not be accomplished until a clustering is
-produced which yields areas.  Second, AGEA uses perhaps the simplest possible similarity score (correlation), and does no
-dimensionality reduction before calculating similarity. While it is possible that a more complex system will not do any better
-than this, we believe further exploration of alternative methods of scoring and dimensionality reduction is warranted. Third,
-AGEA did not look at clusters of genes; in Preliminary Data we have shown that clusters of genes may identify intersting
-spatial subregions such as cortical areas.
-_______
+_________________________________________
@@ -230,30 +237,42 @@
-    7This is for the same reason as in footnote 4.
+The hierarchial clustering is different from our Aim 2 in at least three ways.  First, the clustering finds clusters cor-
+responding to layers, but no clusters corresponding to areas7  8  Our Aim 2 will not be accomplished until a clustering is
+produced which yields areas.  Second, AGEA uses perhaps the simplest possible similarity score (correlation), and does no
+dimensionality reduction before calculating similarity. While it is possible that a more complex system will not do any better
+than this, we believe further exploration of alternative methods of scoring and dimensionality reduction is warranted. Third,
+AGEA did not look at clusters of genes; in Preliminary Data we have shown that clusters of genes may identify intersting
+spatial subregions such as cortical areas.
+_______
+   7This is for the same reason as in footnote 4.
-
-                   
-
-Figure 1: Upper left: wwc1. Upper right: mtif2. Lower left: wwc1 + mtif2 (each pixel&#8217;s value on the lower left is the sum
-of the corresponding pixels in the upper row). Within each picture, the vertical axis roughly corresponds to anterior at the
-top and posterior at the bottom, and the horizontal axis roughly corresponds to medial at the left and lateral at the right.
-The red outline is the boundary of region MO. Pixels are colored approximately according to the density of expressing cells
-underneath each pixel, with red meaning a lot of expression and blue meaning little.
-We created a mask which selects only those voxels within the ABA atlas space which belong to cerebral cortex.
-todo
-Using Caret, [1]
-We manually entered the boundaries of each cortical area into Caret.
-Cortical layers are found at different depths in different parts of the cortex. We have manually demarcated the depth of
-the outer boundary of cortical layer 5 throughout the cortex.
-In preparation for extracting the layer-specific datasets, we have extended Caret with routines that allow the depth of
-the ROI for volume-to-surface projection to vary.
+We downloaded the ABA data and applied a mask to select only those voxels which belong to cerebral cortex. We divided
+the cortex into hemispheres.
+Using Caret[1], we created a mesh representation of the surface of the selected region.  For each gene, for each node of
+the mesh, we calculated an average of the gene expression of the voxels &#8220;underneath&#8221; that mesh node. Using Caret, we then
+flattened the cortex, creating a two-dimensional mesh.
+We sampled the nodes of the irregular, flat mesh in order to create a regular grid of pixel values. We converted this grid
+into a MATLAB matrix.
+We manually traced the boundaries of each cortical area from the ABA coronal reference atlas slides. We then converted
+these manual traces into Caret-format regional boundary data on the mesh surface. Using Caret, we projected the regions
+onto the 2-d mesh, and then onto the grid, and then we converted the region data into MATLAB format.
+At this point, the data is in the form of a number of 2-D matrices, each registered to each other, with the matrix entries
+representing a grid of points (pixels) over the cortical surface:
+&#x2219;A 2-D matrix whose entries represent the regional label associated with each surface pixel
+&#x2219;For each gene, a 2-D matrix whose entries represent the average expression level underneath each surface pixel
+Rather than a single average expression level for each surface pixel, we plan to create a separate matrix for each cortical
+layer to represent the average expression level within that layer.  Cortical layers are found at different depths in different
+parts of the cortex.  In preparation for extracting the layer-specific datasets, we have extended Caret with routines that
+allow the depth of the ROI for volume-to-surface projection to vary.
+In the Research Plan, we describe how we will automatically locate the layer depths. For validation, we have manually
+demarcated the depth of the outer boundary of cortical layer 5 throughout the cortex.
@@ -268,20 +287,32 @@
+To show that local geometry can provide useful information that cannot be detected via pointwise analyses, consider Fig.
+. The top row of Fig.  displays the 3 genes which most match area AUD, according to a pointwise method11. The bottom
+row displays the 3 genes which most match AUD according to a method which considers local geometry12  The pointwise
+method in the top row identifies genes which express more strongly in AUD than outside of it; its weakness is that this
+includes many areas which don&#8217;t have a salient border matching the areal border.  The geometric method identifies genes
+   11For each gene, a logistic regression in which the response variable was whether or not a surface pixel was within area AUD, and the predictor
+variable was the value of the expression of the gene underneath that pixel. The resulting scores were used to rank the genes in terms of how well
+they predict area AUD.
+   12For each gene the gradient similarity (see section ??) between (a) a map of the expression of each gene on the cortical surface and (b) the
+shape of area AUD, was calculated, and this was used to rank the genes.
+                   
+
+Figure 1: Upper left: wwc1. Upper right: mtif2. Lower left: wwc1 + mtif2 (each pixel&#8217;s value on the lower left is the sum
+of the corresponding pixels in the upper row). Within each picture, the vertical axis roughly corresponds to anterior at the
+top and posterior at the bottom, and the horizontal axis roughly corresponds to medial at the left and lateral at the right.
+The red outline is the boundary of region MO. Pixels are colored approximately according to the density of expressing cells
+underneath each pixel, with red meaning a lot of expression and blue meaning little.
-To show that local geometry can provide useful information that cannot be detected via pointwise analyses, consider Fig.
-. The top row of Fig.  displays the 3 genes which most match area AUD, according to a pointwise method11. The bottom
-row displays the 3 genes which most match AUD according to a method which considers local geometry12  The pointwise
-method in the top row identifies genes which express more strongly in AUD than outside of it; its weakness is that this
-includes many areas which don&#8217;t have a salient border matching the areal border.  The geometric method identifies genes
@@ -304,18 +335,13 @@
-_________________________________________
-  11For each gene, a logistic regression in which the response variable was whether or not a surface pixel was within area AUD, and the predictor
-variable was the value of the expression of the gene underneath that pixel. The resulting scores were used to rank the genes in terms of how well
-they predict area AUD.
-   12For each gene the gradient similarity (see section ??) between (a) a map of the expression of each gene on the cortical surface and (b) the
-shape of area AUD, was calculated, and this was used to rank the genes.
-   135-fold cross-validation.
+_________________________________________
+  135-fold cross-validation.
@@ -355,10 +381,45 @@
-[3]Carol L. Thompson, Sayan D. Pathak, Andreas Jeromin, Lydia L. Ng, Cameron R. MacPherson, Marty T. Mortrud,
+[3]George Paxinos and Keith B.J. Franklin. The Mouse Brain in Stereotaxic Coordinates. Academic Press, 2 edition, July
+2001.
+[4]Larry Swanson. Brain Maps: Structure of the Rat Brain. Academic Press, 3 edition, November 2003.
+[5]Carol L. Thompson, Sayan D. Pathak, Andreas Jeromin, Lydia L. Ng, Cameron R. MacPherson, Marty T. Mortrud,
+[6]Robert H Waterston, Kerstin Lindblad-Toh, Ewan Birney, Jane Rogers, Josep F Abril, Pankaj Agarwal, Richa Agarwala,
+Rachel Ainscough, Marina Alexandersson, Peter An, Stylianos E Antonarakis, John Attwood, Robert Baertsch, Jonathon
+Bailey, Karen Barlow, Stephan Beck, Eric Berry, Bruce Birren, Toby Bloom, Peer Bork, Marc Botcherby, Nicolas Bray,
+Michael R Brent, Daniel G Brown, Stephen D Brown, Carol Bult, John Burton, Jonathan Butler, Robert D Campbell,
+Piero Carninci, Simon Cawley, Francesca Chiaromonte, Asif T Chinwalla, Deanna M Church, Michele Clamp, Christopher
+Clee, Francis S Collins, Lisa L Cook, Richard R Copley, Alan Coulson, Olivier Couronne, James Cuff, Val Curwen, Tim
+Cutts, Mark Daly, Robert David, Joy Davies, Kimberly D Delehaunty, Justin Deri, Emmanouil T Dermitzakis, Colin
+Dewey, Nicholas J Dickens, Mark Diekhans, Sheila Dodge, Inna Dubchak, Diane M Dunn, Sean R Eddy, Laura Elnitski,
+Richard D Emes, Pallavi Eswara, Eduardo Eyras, Adam Felsenfeld, Ginger A Fewell, Paul Flicek, Karen Foley, Wayne N
+Frankel, Lucinda A Fulton, Robert S Fulton, Terrence S Furey, Diane Gage, Richard A Gibbs, Gustavo Glusman, Sante
+Gnerre, Nick Goldman, Leo Goodstadt, Darren Grafham, Tina A Graves, Eric D Green, Simon Gregory, Roderic Guig,
+Mark Guyer, Ross C Hardison, David Haussler, Yoshihide Hayashizaki, LaDeana W Hillier, Angela Hinrichs, Wratko
+Hlavina, Timothy Holzer, Fan Hsu, Axin Hua, Tim Hubbard, Adrienne Hunt, Ian Jackson, David B Jaffe, L Steven
+Johnson, Matthew Jones, Thomas A Jones, Ann Joy, Michael Kamal, Elinor K Karlsson, Donna Karolchik, Arkadiusz
+Kasprzyk, Jun Kawai, Evan Keibler, Cristyn Kells, W James Kent, Andrew Kirby, Diana L Kolbe, Ian Korf, Raju S
+Kucherlapati, Edward J Kulbokas, David Kulp, Tom Landers, J P Leger, Steven Leonard, Ivica Letunic, Rosie Levine, Jia
+Li, Ming Li, Christine Lloyd, Susan Lucas, Bin Ma, Donna R Maglott, Elaine R Mardis, Lucy Matthews, Evan Mauceli,
+John H Mayer, Megan McCarthy, W Richard McCombie, Stuart McLaren, Kirsten McLay, John D McPherson, Jim
+Meldrim, Beverley Meredith, Jill P Mesirov, Webb Miller, Tracie L Miner, Emmanuel Mongin, Kate T Montgomery,
+Michael Morgan, Richard Mott, James C Mullikin, Donna M Muzny, William E Nash, Joanne O Nelson, Michael N
+Nhan, Robert Nicol, Zemin Ning, Chad Nusbaum, Michael J O&#8217;Connor, Yasushi Okazaki, Karen Oliver, Emma Overton-
+Larty, Lior Pachter, Gens Parra, Kymberlie H Pepin, Jane Peterson, Pavel Pevzner, Robert Plumb, Craig S Pohl, Alex
+Poliakov, Tracy C Ponce, Chris P Ponting, Simon Potter, Michael Quail, Alexandre Reymond, Bruce A Roe, Krishna M
+Roskin, Edward M Rubin, Alistair G Rust, Ralph Santos, Victor Sapojnikov, Brian Schultz, Jrg Schultz, Matthias S
+Schwartz, Scott Schwartz, Carol Scott, Steven Seaman, Steve Searle, Ted Sharpe, Andrew Sheridan, Ratna Shownkeen,
+Sarah Sims, Jonathan B Singer, Guy Slater, Arian Smit, Douglas R Smith, Brian Spencer, Arne Stabenau, Nicole Stange-
+Thomann, Charles Sugnet, Mikita Suyama, Glenn Tesler, Johanna Thompson, David Torrents, Evanne Trevaskis, John
+Tromp, Catherine Ucla, Abel Ureta-Vidal, Jade P Vinson, Andrew C Von Niederhausern, Claire M Wade, Melanie Wall,
+Ryan J Weber, Robert B Weiss, Michael C Wendl, Anthony P West, Kris Wetterstrand, Raymond Wheeler, Simon
+Whelan, Jamey Wierzbowski, David Willey, Sophie Williams, Richard K Wilson, Eitan Winter, Kim C Worley, Dudley
+Wyman, Shan Yang, Shiaw-Pyng Yang, Evgeny M Zdobnov, Michael C Zody, and Eric S Lander. Initial sequencing and
+comparative analysis of the mouse genome. Nature, 420(6915):520&#8211;62, December 2002. PMID: 12466850.
@@ -376,5 +437,6 @@
+    two hemis
--- a/grant.txt	Mon Apr 13 20:27:32 2009 -0700
+++ b/grant.txt	Mon Apr 13 23:11:04 2009 -0700
@@ -119,7 +119,15 @@
-Even the questions of how many areas should be recognized in cortex, and what their arrangement is, are still not completely settled. A proposed division of the cortex into areas is called a cortical map. In the rodent, the lack of a single agreed-upon map can be seen by contrasting the recent maps given by Swanson\cite{brain_swanson_2003} on the one hand, and Paxinos and Franklin\cite{mouse_paxinos_2001} on the other. While the maps are certainly very similar in their general arrangement, significant differences remain in the details.
+Even the questions of how many areas should be recognized in cortex, and what their arrangement is, are still not completely settled. A proposed division of the cortex into areas is called a cortical map. In the rodent, the lack of a single agreed-upon map can be seen by contrasting the recent maps given by Swanson\cite{swanson_brain_2003} on the one hand, and Paxinos and Franklin\cite{paxinos_mouse_2001} on the other. While the maps are certainly very similar in their general arrangement, significant differences remain in the details.
+
+\vspace{0.3cm}**The Allen Mouse Brain Atlas dataset**
+
+The Allen Mouse Brain Atlas (ABA) data was produced by doing in-situ hybridization on slices of male, 56-day-old C57BL/6J mouse brains. Pictures were taken of the processed slice, and these pictures were semi-automatically analyzed in order to create a digital measurement of gene expression levels at each location in each slice. Per slice, cellular spatial resolution is achieved. Using this method, a single physical slice can only be used to measure one single gene; many different mouse brains were needed in order to measure the expression of many genes. 
+
+Next, an automated nonlinear alignment procedure located the 2D data from the various slices in a single 3D coordinate system. In the final 3D coordinate system, voxels are cubes with 200 microns on a side. There are 67x41x58 \= 159,326 voxels in the 3D coordinate system, of which 51,533 are in the brain\cite{ng_anatomic_2009}.
+
+Mus musculus, the common house mouse, is thought to contain about 22,000 protein-coding genes\cite{waterston_initial_2002}. The ABA contains data on about 20,000 genes in sagittal sections, out of which over 4,000 genes are also measured in coronal sections. Our dataset is derived from only the coronal subset of the ABA, because the sagittal data does not cover the entire cortex, and has greater registration error\cite{ng_anatomic_2009}. Genes were selected by the Allen Institute for coronal sectioning based on, "classes of known neuroscientific interest... or through post hoc identification of a marked non-ubiquitous expression pattern"\cite{ng_anatomic_2009}. 
@@ -179,17 +187,24 @@
-We created a mask which selects only those voxels within the ABA atlas space which belong to cerebral cortex.
-
-todo
-
-Using Caret, \cite{van_essen_integrated_2001}
-
-We manually entered the boundaries of each cortical area into Caret. 
-
-Cortical layers are found at different depths in different parts of the cortex. We have manually demarcated the depth of the outer boundary of cortical layer 5 throughout the cortex.
-
-In preparation for extracting the layer-specific datasets, we have extended Caret with routines that allow the depth of the ROI for volume-to-surface projection to vary.
+We downloaded the ABA data and applied a mask to select only those voxels which belong to cerebral cortex. We divided the cortex into hemispheres. 
+
+Using Caret\cite{van_essen_integrated_2001}, we created a mesh representation of the surface of the selected region. For each gene, for each node of the mesh, we calculated an average of the gene expression of the voxels "underneath" that mesh node. Using Caret, we then flattened the cortex, creating a two-dimensional mesh. 
+
+We sampled the nodes of the irregular, flat mesh in order to create a regular grid of pixel values. We converted this grid into a MATLAB matrix.
+
+We manually traced the boundaries of each cortical area from the ABA coronal reference atlas slides. We then converted these manual traces into Caret-format regional boundary data on the mesh surface. Using Caret, we projected the regions onto the 2-d mesh, and then onto the grid, and then we converted the region data into MATLAB format.
+
+At this point, the data is in the form of a number of 2-D matrices, each registered to each other, with the matrix entries representing a grid of points (pixels) over the cortical surface:
+
+* A 2-D matrix whose entries represent the regional label associated with each surface pixel
+* For each gene, a 2-D matrix whose entries represent the average expression level underneath each surface pixel 
+
+Rather than a single average expression level for each surface pixel, we plan to create a separate matrix for each cortical layer to represent the average expression level within that layer. Cortical layers are found at different depths in different parts of the cortex. In preparation for extracting the layer-specific datasets, we have extended Caret with routines that allow the depth of the ROI for volume-to-surface projection to vary. 
+
+In the Research Plan, we describe how we will automatically locate the layer depths. For validation, we have manually demarcated the depth of the outer boundary of cortical layer 5 throughout the cortex.
+
+
@@ -363,3 +378,4 @@
+two hemis
author	bshanks@bshanks.dyndns.org
date	Mon Apr 13 23:11:04 2009 -0700 (16 years ago)
parents	99e5d268bab0
children	af3389b432e9
files	grant.doc grant.html grant.odt grant.pdf grant.txt