<?xml version="1.0" encoding="UTF-8"?>

<rdf:RDF
   xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
   xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
   xmlns="http://purl.org/rss/1.0/"
   xmlns:dc="http://purl.org/dc/elements/1.1/"
   xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/"
   xmlns:dcterms="http://purl.org/dc/terms/"

>
<channel rdf:about="http://www.citeulike.org/about">
<pubDate>Sat, 26 Jul 2008 03:26:01 BST</pubDate>


	<title>CiteULike: xingxu tiling</title>
	<description>CiteULike: xingxu tiling</description>


	<link>http://www.citeulike.org/user/xingxu/tag/tiling</link>
	<dc:publisher>CiteULike.org</dc:publisher>
	<dc:language>en-gb</dc:language>
	<dc:rights>Copyright &#169; 2004-2008 citeulike.org</dc:rights>
	<items>
    <rdf:Seq>
        <rdf:li rdf:resource="http://www.citeulike.org/user/xingxu/article/910292"/>
        <rdf:li rdf:resource="http://www.citeulike.org/user/xingxu/article/1646957"/>

	</rdf:Seq>
	</items>
	</channel>


<item rdf:about="http://www.citeulike.org/user/xingxu/article/910292">
    <title>A supervised hidden Markov model framework for efficiently segmenting tiling array data in transcriptional and ChIP-chip experiments: systematically incorporating validated biological knowledge.</title>
    <link>http://www.citeulike.org/user/xingxu/article/910292</link>
    <description>&lt;i&gt;Bioinformatics (12 October 2006)&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;MOTIVATION: Large-scale tiling array experiments are becoming increasingly common in genomics. In particular, the ENCODE project requires the consistent segmentation of many different tiling array data sets into &#34;active regions&#34; (e.g. finding transfrags from transcriptional data and putative binding sites from ChIP-chip experiments). Previously, such segmentation was done in an unsupervised fashion mainly based on characteristics of the signal distribution in the tiling array data itself. Here we propose a supervised framework for doing this. It has the advantage of explicitly incorporating validated biological knowledge into the model and allowing for formal training and testing. Methodology: In particular, we use a hidden Markov model (HMM) framework, which is capable of explicitly modeling the dependency between neighboring probes and whose extended version (the generalized HMM) also allows explicit description of state duration density. We introduce a formal definition of the tiling-array analysis problem, and explain how we can use this to describe sampling small genomic regions for experimental validation to build up a gold-standard set for training and testing. We then describe various ideal and practical sampling strategies (e.g. maximizing signal entropy within a selected region versus using gene annotation or known promoters as positives for transcription or ChIP-chip data, respectively). RESULTS: For the practical sampling and training strategies, we show how the size and noise in the validated training data affects the performance of an HMM applied to the ENCODE transcriptional and ChIP-chip experiments. In particular, we show that the HMM framework is able to efficiently process tiling array data as well as or better than previous approaches. For the idealized sampling strategies, we show how we can assess their performance in a simulation framework and how a maximum entropy approach, which samples sub-regions with very different signal intensities, gives the maximally performing gold-standard. This latter result has strong implications for the optimum way medium-scale validation experiments should be carried out to verify the results of the genome-scale tiling array experiments. SUPPLEMENTARY INFORMATION: The supplementary materials are available at http://tiling.gersteinlab.org/hmm/.</description>
    <dc:title>A supervised hidden Markov model framework for efficiently segmenting tiling array data in transcriptional and ChIP-chip experiments: systematically incorporating validated biological knowledge.</dc:title>

    <dc:creator>Jiang Du</dc:creator>
    <dc:creator>Joel S Rozowsky</dc:creator>
    <dc:creator>Jan O Korbel</dc:creator>
    <dc:creator>Zhengdong D Zhang</dc:creator>
    <dc:creator>Thomas E Royce</dc:creator>
    <dc:creator>Martin H Schultz</dc:creator>
    <dc:creator>Michael Snyder</dc:creator>
    <dc:creator>Mark Gerstein</dc:creator>
    <dc:identifier>doi:10.1093/bioinformatics/btl515</dc:identifier>
    <dc:source>Bioinformatics (12 October 2006)</dc:source>
    <dc:date>2006-10-23T15:32:54-00:00</dc:date>
    <prism:publicationYear>2006</prism:publicationYear>
    <prism:publicationName>Bioinformatics</prism:publicationName>
    <prism:issn>1460-2059</prism:issn>
    <prism:category>bioinformatics</prism:category>
    <prism:category>methods</prism:category>
    <prism:category>tiling</prism:category>
    <prism:category>transcriptome</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/user/xingxu/article/1646957">
    <title>Mapping the C. elegans noncoding transcriptome with a whole-genome tiling microarray</title>
    <link>http://www.citeulike.org/user/xingxu/article/1646957</link>
    <description>&lt;i&gt;Genome Res. (4 September 2007), gr.6611807.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;The number of annotated protein coding genes in the genome of Caenorhabditis elegans is similar to that of other animals, but the extent of its non-protein-coding transcriptome remains unknown. Expression profiling on whole-genome tiling microarrays applied to a mixed-stage C. elegans population verified the expression of 71% of all annotated exons. Only a small fraction (11%) of the polyadenylated transcription is non-annotated and appears to consist of [~]3200 missed or alternative exons and 7800 small transcripts of unknown function (TUFs). Almost half (44%) of the detected transcriptional output is non-polyadenylated and probably not protein coding, and of this, 70% overlaps the boundaries of protein-coding genes in a complex manner. Specific analysis of small non-polyadenylated transcripts verified 97% of all annotated small ncRNAs and suggested that the transcriptome contains [~]1200 small (&#60;500 nt) unannotated noncoding loci. After combining overlapping transcripts, we estimate that at least 70% of the total C. elegans genome is transcribed. 10.1101/gr.6611807</description>
    <dc:title>Mapping the C. elegans noncoding transcriptome with a whole-genome tiling microarray</dc:title>

    <dc:creator>Housheng He</dc:creator>
    <dc:creator>Jie Wang</dc:creator>
    <dc:creator>Tao Liu</dc:creator>
    <dc:creator>Shirley Liu</dc:creator>
    <dc:creator>Tiantian Li</dc:creator>
    <dc:creator>Yunfei Wang</dc:creator>
    <dc:creator>Zuwei Qian</dc:creator>
    <dc:creator>Haixia Zheng</dc:creator>
    <dc:creator>Xiaopeng Zhu</dc:creator>
    <dc:creator>Tao Wu</dc:creator>
    <dc:creator>Baochen Shi</dc:creator>
    <dc:creator>Wei Deng</dc:creator>
    <dc:creator>Wei Zhou</dc:creator>
    <dc:creator>Geir Skogerbo</dc:creator>
    <dc:creator>Runsheng Chen</dc:creator>
    <dc:identifier>doi:10.1101/gr.6611807</dc:identifier>
    <dc:source>Genome Res. (4 September 2007), gr.6611807.</dc:source>
    <dc:date>2007-09-12T08:47:56-00:00</dc:date>
    <prism:publicationYear>2007</prism:publicationYear>
    <prism:publicationName>Genome Res.</prism:publicationName>
    <prism:startingPage>gr.6611807</prism:startingPage>
    <prism:category>genomics</prism:category>
    <prism:category>microarray</prism:category>
    <prism:category>nematode</prism:category>
    <prism:category>tiling</prism:category>
    <prism:category>transcriptome</prism:category>
</item>



</rdf:RDF>

