Consolidated transcript models are defined by grouping transcripts from the various pipelines at the level of the Unique Intron Chain (UIC), which allows for variability in the 5' and 3' end definitions. BED files for the UIC models for each sample are available here and in the LRGASP Track Hub.
A barcode for each UIC indicates the type and number of pipelines where it was detected, together with general transcript properties. The fields of the UIC barcode are:
Field | Description |
---|---|
1 | Number of pipelines using PacBio reads where the UIC was detected |
2 | Number of pipelines using Nanopore reads where the UIC was detected |
3 | Number of pipelines using the freestyle category where the UIC was detected |
4 | Number of pipelines using cDNA library prep where the UIC was detected |
5 | Number of pipelines using dRNA library prep where the UIC was detected |
6 | Number of pipelines using R2C2 library prep where the UIC was detected |
7 | Number of pipelines using CapTrap library prep where the UIC was detected |
8 | Number of pipelines using only long reads where the UIC was detected |
9 | Number of pipelines using long and short reads where the UIC was detected |
10 | Number of pipelines using only short reads where the UIC was detected |
11 | Number of exons of the UIC |
12 | Median length of the transcript models in the UIC |
13 | Median Counts Per Million of the UIC in the detected pipelines |
14 | Standard deviation of the 5' end positions of the transcript models in the UIC |
15 | Standard deviation of the 3' end positions of the transcript models in the UIC |
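
As an illustration of how the 15 fields above might be read programmatically, here is a minimal Python sketch. The text above does not say how the barcode is serialized, so the separator (`sep`, defaulting to a comma), the function name `parse_uic_barcode`, the short field names in `FIELD_NAMES`, and the example values are all assumptions made for this sketch. Check them against the actual BED files before relying on it.

```python
# Hypothetical short names for the 15 barcode fields, in table order.
FIELD_NAMES = [
    "n_pacbio", "n_nanopore", "n_freestyle",
    "n_cdna", "n_drna", "n_r2c2", "n_captrap",
    "n_long_only", "n_long_short", "n_short_only",
    "n_exons", "median_length", "median_cpm",
    "sd_5p_end", "sd_3p_end",
]

# Fields 1-11 are counts (integers); fields 12-15 are treated as floats.
N_INTEGER_FIELDS = 11


def parse_uic_barcode(barcode: str, sep: str = ",") -> dict:
    """Split a UIC barcode string into a dict keyed by field name.

    The separator is an assumption; pass the one actually used in the files.
    """
    parts = barcode.split(sep)
    if len(parts) != len(FIELD_NAMES):
        raise ValueError(
            f"expected {len(FIELD_NAMES)} fields, got {len(parts)}"
        )
    parsed = {}
    for index, (name, raw) in enumerate(zip(FIELD_NAMES, parts)):
        convert = int if index < N_INTEGER_FIELDS else float
        parsed[name] = convert(raw)
    return parsed


# Made-up example values, for illustration only.
example = parse_uic_barcode("3,2,1,4,0,1,2,5,3,0,7,1520.5,12.3,4.2,10.8")
detected_by_both_platforms = (
    example["n_pacbio"] > 0 and example["n_nanopore"] > 0
)
```

A dictionary keyed by field name makes simple filters easy to write, such as keeping only UICs detected by at least one PacBio and one Nanopore pipeline, as in the last lines of the sketch. If the real files use non-numeric placeholders for missing values, the conversion step would need to handle them explicitly.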