cg
diff grant.txt @ 22:69aa7c47c0e5
.
author | bshanks@bshanks.dyndns.org |
---|---|
date | Mon Apr 13 03:13:10 2009 -0700 (16 years ago) |
parents | b9643c30e352 |
children | 161319ea4991 |
line diff
1.1 --- a/grant.txt Mon Apr 13 03:10:37 2009 -0700
1.2 +++ b/grant.txt Mon Apr 13 03:13:10 2009 -0700
1.3 @@ -140,7 +140,7 @@
1.4
1.5 There is a substantial body of work on the analysis of gene expression data, however, most of this concerns gene expression data which is not fundamentally spatial, for example, microarray datasets.
1.6
1.7 -As noted above, there has been much work on both supervised learning and clustering, and there are many available algorithms for each. Many of these algorithms are flexible enough to accomodate new scoring measures; and the performance of most of the algorithms is greatly affected by preprocessing and by the choice of which representation to use for feature values. We think it likely that for this application, the development of domain-specific scoring measures (such as gradient similarity, which is discussed in Preliminary Work) will be necessary in order to achieve the best results. In essence, the machine learning community has provided algorithms, but the scientist must provide a framework for representing the problem domain, and the way that this framework is set up has a large impact on performance. Creating a good framework can require creatively reconceptualizing the problem domain, and is not merely a mechanical "fine-tuning" of numerical parameters. Therefore, the completion of Aims 1 and 2 involves more than just reimplementing an existing algorithm, and more than just choosing between a set of existing algorithms, and will constitute a substantial contribution to biology.
1.8 +As noted above, there has been much work on both supervised learning and clustering, and there are many available algorithms for each. However, the completion of Aims 1 and 2 involves more than just choosing between a set of existing algorithms, and will constitute a substantial contribution to biology. The algorithms require the scientist to provide a framework for representing the problem domain, and the way that this framework is set up has a large impact on performance. Creating a good framework can require creatively reconceptualizing the problem domain, and is not merely a mechanical "fine-tuning" of numerical parameters. For example, we believe that domain-specific scoring measures (such as gradient similarity, which is discussed in Preliminary Work) may be necessary in order to achieve the best results in this application.
1.9
1.10 We are aware of two existing efforts to relate spatial gene expression data to anatomy through computational methods.
1.11