Transcriptome sequencing datasets from Francis 2011

The following datasets are made available to the research community as part of a recently submitted publication. The software at the centre of this publication can be found here

Datasets

  • The levin dataset (compressed 1.1GB; uncompressed 2.6GB)
    • This dataset (used in the paper above) is from published work by Levin and colleagues and consists of ~14 million 76mer fastq reads. The total processing time for this dataset will depend on your system but will be approximately 3 hours with a local Ensembl database.

  • Subset test dataset (compressed 6.2M; uncompressed 17MB)
    • This is a smaller subset of the levin dataset consisting of ~85 thousand 76mer reads and should give results from a single fusion. The total processing time for this dataset will depend on your system but will be approximately 2 minutes with a local Ensembl database.

Reference Data


These datasets contain all transcripts of all genes annotated in the listed Ensembl version, either with or without pseudogenes

  • Latest (Ensembl 68)
    Coding and noncoding transcripts references
    • Fasta (compressed 58MB; uncompressed 275MB)
    • Bowtie Index (compressed 243MB; uncompressed 323MB)

  • Ensembl 67
    Coding and noncoding transcripts references
    • Fasta (compressed 59MB; uncompressed 284MB)
    • Bowtie Index (compressed 248MB; uncompressed 332MB)

  • Ensembl 66
    Coding and noncoding transcripts references
    • Fasta (compressed 58MB; uncompressed 281MB)
    • Bowtie Index (compressed 245MB; uncompressed 328MB)

  • Ensembl 65
    Coding and noncoding transcripts references
    • Fasta (compressed 57MB; uncompressed 272MB)
    • Bowtie Index (compressed 239MB; uncompressed 319MB)

  • Ensembl 64
    Coding and noncoding transcripts references
    • Fasta (compressed 56MB; uncompressed 268MB)
    • Bowtie Index (compressed 235MB; uncompressed 314MB)

  • Ensembl 63
    Coding and noncoding transcripts references
    • Fasta (compressed 55MB; uncompressed 264MB)
    • Bowtie Index (compressed 232MB; uncompressed 309MB)

  • Ensembl 62
    Coding and noncoding transcripts references
    • Fasta (compressed 54MB; uncompressed 260MB)
    • Bowtie Index (compressed 229MB; uncompressed 305MB)