Adjoined Transcripts#
Adjoined Transcript: a transcript (RNA molecule) composed of adjoined RNA from two or more contributing transcripts.
Syntax#
| Syntax | `five_prime_sequence_identifier ":" coord_type "." five_prime_range "::" [ linker_sequence "::" ] three_prime_sequence_identifier ":" coord_type "." three_prime_range` |
|---|---|
| Examples | `NM_152263.2:r.-115_775::NM_002609.3:r.1580_*1924`<br>`NM_152263.2:r.?_775::NM_002609.3:r.1580_?` |
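
The syntax row can be read as a pattern with two mandatory parts and an optional linker. The following is a minimal sketch of such a pattern in Python, not a reference implementation: the names `ADJOINED_RE` and `parse_adjoined` are hypothetical, and the character classes for identifiers and range bounds are deliberately loose assumptions that do not check positions in detail.

```python
import re

# Sketch of the syntax row: 5' part, "::", optional linker plus "::", 3' part.
# Assumption: identifiers contain no ":" and range bounds contain no "_" or ":".
ADJOINED_RE = re.compile(
    r"^(?P<five_id>[^:]+):(?P<five_type>[a-z])\.(?P<five_start>[^_:]+)_(?P<five_end>[^_:]+)"
    r"::(?:(?P<linker>[a-z]+)::)?"
    r"(?P<three_id>[^:]+):(?P<three_type>[a-z])\.(?P<three_start>[^_:]+)_(?P<three_end>[^_:]+)$"
)


def parse_adjoined(description):
    """Return the named parts of a description, or None if it does not match."""
    match = ADJOINED_RE.match(description)
    return match.groupdict() if match else None


parts = parse_adjoined("NM_152263.2:r.-115_775::NM_002609.3:r.1580_*1924")
print(parts["five_id"], parts["five_start"], parts["five_end"])
print(parts["three_id"], parts["three_start"], parts["three_end"])
print(parts["linker"])  # no linker in this description
```

Because each range must contain an underscore, a description that gives a single position for either partner does not match and `parse_adjoined` returns `None`.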
Notes#
- Adjoined transcripts are a product of some gene fusions.
- The adjoined transcript syntax proposed in Community Proposal SVD WG007 and adopted here has the following limitations:
    - This syntax is for two-partner adjoined transcripts only.
    - This syntax is for RNA sequence only (no use of coding (`c.`) / non-coding DNA (`n.`) reference sequences).
- Linker sequences are specified using General Recommendations for RNA sequence character codes, e.g. `aggcucccuugg`.
- This syntax REQUIRES the use of a range (not a single position) for `five_prime_range`/`three_prime_range`.
- When the adjoined transcript junction but not the entire transcript is analyzed, the outer range bounds SHOULD be specified with `?`, e.g. `NM_152263.2:r.?_775::NM_002609.3:r.1580_?`.
- All adjoined transcripts are described using the same format, irrespective of whether they derive from inter-chromosomal or intra-chromosomal DNA rearrangements (translocation, deletion, inversion) or other mechanisms (trans-splicing).
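
The limitations listed above lend themselves to a simple checker. The sketch below is illustrative only: `check_adjoined` is a hypothetical helper, and restricting linkers to `a`, `c`, `g`, `u` is a simplifying assumption (the General Recommendations define the full set of permitted RNA character codes).

```python
import re

# Assumption: only the four unambiguous RNA bases are accepted in a linker.
RNA_LINKER = re.compile(r"^[acgu]+$")


def check_adjoined(description: str) -> list[str]:
    """Return a list of rule violations; an empty list means no problem was found."""
    problems = []
    parts = description.split("::")
    # Two partners only, optionally separated by a single linker sequence.
    if len(parts) not in (2, 3):
        problems.append("expected two partners, optionally separated by one linker")
        return problems
    linker = parts[1] if len(parts) == 3 else None
    for label, part in (("5'", parts[0]), ("3'", parts[-1])):
        _identifier, sep, coords = part.partition(":")
        # RNA sequence only: the coordinate type must be "r.".
        if not sep or not coords.startswith("r."):
            problems.append(f"{label} partner must use the RNA coordinate type r.")
            continue
        # A range is required, so the coordinates must contain "_".
        if "_" not in coords[2:]:
            problems.append(f"{label} partner must be a range, not a single position")
    if linker is not None and not RNA_LINKER.match(linker):
        problems.append("linker must use RNA sequence character codes")
    return problems


print(check_adjoined("NM_152263.2:r.?_775::NM_002609.3:r.1580_?"))
print(check_adjoined("NM_152263.2:r.775::NM_002609.3:r.1580_?"))
```

The first call uses the junction-only example from the notes and reports no problems; the second gives a single position for the 5' partner and is flagged.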
Examples#
- translocation-derived adjoined transcript
    - `NM_152263.2:r.-115_775::NM_002609.3:r.1580_*1924`
      describes an adjoined transcript from a TPM3::PDGFRB gene fusion, where nucleotides `r.-115` to `r.775` (reference transcript `NM_152263.2`, TPM3 gene) are coupled to nucleotides `r.1580` to `r.*1924` (reference transcript `NM_002609.3`, PDGFRB gene).
- deletion-derived adjoined transcripts
    - `NM_002354.2:r.-358_555::NM_000251.2:r.212_*279`
      describes an adjoined transcript from an EPCAM::MSH2 gene fusion, where nucleotides `r.-358` to `r.555` (reference transcript `NM_002354.2`, EPCAM gene) are coupled to nucleotides `r.212` to `r.*279` (reference transcript `NM_000251.2`, MSH2 gene).
    - `NM_002354.2:r.?_555::guaugauuuuuuaauaa::NM_000251.2:r.212_?`
      describes an adjoined transcript from an EPCAM::MSH2 gene fusion, where only the fusion break point has been characterised, showing the insertion of a 17 nucleotide sequence (`guaugauuuuuuaauaa`) between the two adjoined transcripts.
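
Descriptions like those in the examples can also be assembled from their parts. The following is a small sketch under stated assumptions: `format_adjoined` is a hypothetical helper, each partner is passed as an `(identifier, start, end)` tuple of strings, and a bound of `None` stands for an uncharacterised outer bound that is written as `?`.

```python
def format_adjoined(five_prime, three_prime, linker=None):
    """Join two (identifier, start, end) partners, with an optional linker, using '::'."""

    def side(identifier, start, end):
        # An unknown bound (None) is written as "?".
        start_text = "?" if start is None else start
        end_text = "?" if end is None else end
        return f"{identifier}:r.{start_text}_{end_text}"

    pieces = [side(*five_prime)]
    if linker:
        pieces.append(linker)
    pieces.append(side(*three_prime))
    return "::".join(pieces)


# Fully characterised transcript (translocation-derived example).
full = format_adjoined(
    ("NM_152263.2", "-115", "775"),
    ("NM_002609.3", "1580", "*1924"),
)
# Only the junction was analyzed, so the outer bounds are unknown.
junction_only = format_adjoined(
    ("NM_152263.2", None, "775"),
    ("NM_002609.3", "1580", None),
)
print(full)
print(junction_only)
```

The helper performs no validation; it only illustrates how the identifier, `r.` prefix, range, and `::` separators fit together.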