Construct: ORF ccsbBroadEn_04770
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF007145.1_s300c1, BRDN0000392986
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- LEO1 (123169)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 123169 | LEO1 | LEO1 homolog, Paf1/RNA poly... | NM_138792.4 | 100% | 100% | |
2 | human | 123169 | LEO1 | LEO1 homolog, Paf1/RNA poly... | NM_001323904.2 | 98.1% | 95.6% | 1894_1895insAGGA;1962_1963ins32 |
3 | human | 123169 | LEO1 | LEO1 homolog, Paf1/RNA poly... | XM_017021911.1 | 95.9% | 94% | (many diffs) |
4 | human | 123169 | LEO1 | LEO1 homolog, Paf1/RNA poly... | NM_001323903.1 | 93.7% | 92.7% | 1895_1992del;2064_2065ins32 |
5 | human | 123169 | LEO1 | LEO1 homolog, Paf1/RNA poly... | NM_001286430.1 | 90.9% | 90.9% | 1158_1159ins180 |
6 | mouse | 235497 | Leo1 | Leo1, Paf1/RNA polymerase I... | NM_001039522.1 | 86.8% | 93.5% | (many diffs) |
7 | mouse | 235497 | Leo1 | Leo1, Paf1/RNA polymerase I... | XM_017313344.1 | 85.2% | 89.5% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2064
- ORF length:
- 1998
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGGC GGATATGGAG GATCTCTTCG GGAGCGACGC CGACAGCGAA GCTGAGCGTA 121 AAGATTCTGA TTCTGGATCT GACTCAGATT CTGATCAAGA GAATGCTGCC TCTGGCAGTA 181 ATGCCTCTGG AAGTGAAAGT GATCAGGATG AAAGAGGTGA TTCAGGACAA CCAAGTAATA 241 AGGAACTGTT TGGAGATGAC AGTGAGGACG AGGGAGCTTC ACATCATAGT GGTAGTGATA 301 ATCACTCTGA AAGATCAGAC AATAGATCAG AAGCTTCTGA GCGTTCTGAC CATGAGGACA 361 ATGACCCCTC AGATGTAGAT CAGCACAGTG GATCAGAAGC CCCTAATGAT GATGAAGACG 421 AAGGTCATAG ATCGGATGGA GGGAGCCATC ATTCAGAAGC AGAAGGTTCT GAAAAAGCAC 481 ATTCAGATGA TGAAAAATGG GGCAGAGAAG ATAAAAGTGA CCAGTCAGAT GATGAAAAGA 541 TACAAAATTC TGATGATGAG GAGAGGGCAC AAGGATCTGA TGAAGATAAG CTGCAGAATT 601 CTGACGATGA TGAGAAAATG CAGAACACAG ATGATGAGGA GAGGCCTCAG CTTTCCGATG 661 ATGAGAGACA ACAGCTATCT GAGGAGGAAA AGGCTAATTC TGATGATGAA CGGCCGGTAG 721 CTTCTGATAA TGATGATGAG AAACAGAATT CTGATGATGA AGAACAACCA CAGCTGTCTG 781 ATGAAGAGAA AATGCAAAAT TCTGATGATG AAAGGCCACA GGCCTCAGAT GAAGAACACA 841 GGCATTCAGA TGATGAAGAG GAACAGGATC ATAAATCAGA ATCTGCAAGA GGCAGTGATA 901 GTGAAGATGA AGTTTTACGA ATGAAACGCA AGAATGCGAT TGCATCTGAT TCAGAAGCGG 961 ATAGTGACAC TGAGGTGCCA AAAGATAATA GTGGAACCAT GGATTTATTT GGAGGTGCAG 1021 ATGATATCTC TTCAGGGAGT GATGGAGAAG ACAAACCACC TACTCCAGGA CAGCCTGTTG 1081 ATGAAAATGG ATTGCCTCAG GATCAACAGG AAGAGGAGCC AATTCCTGAG ACCAGAATAG 1141 AAGTAGAAAT ACCCAAAGTA AACACTGATT TAGGAAACGA CTTATATTTT GTTAAACTGC 1201 CCAACTTTCT CAGTGTAGAG CCCAGACCTT TTGATCCTCA GTATTATGAA GATGAATTTG 1261 AAGATGAAGA AATGCTGGAT GAAGAAGGTA GAACCAGGTT AAAATTAAAG GTAGAAAATA 1321 CTATAAGATG GAGGATACGC CGAGATGAAG AAGGAAATGA AATTAAAGAA AGCAATGCTC 1381 GGATAGTCAA GTGGTCAGAT GGAAGCATGT CCCTGCATTT AGGCAATGAA GTGTTTGATG 1441 TGTACAAAGC CCCACTGCAG GGCGACCACA ATCATCTTTT TATAAGACAA GGTACTGGTC 1501 TACAGGGACA AGCAGTCTTT AAAACGAAAC TCACCTTCAG ACCTCACTCT ACGGACAGTG 1561 CCACACATAG AAAGATGACT CTGTCACTTG CAGATAGGTG TTCAAAGACA CAGAAGATTA 1621 GAATCTTGCC AATGGCTGGT CGTGATCCTG AATGCCAACG CACAGAAATG ATTAAGAAAG 1681 AAGAAGAACG TTTGAGGGCT TCCATACGTA GGGAATCTCA GCAGCGCCGA ATGAGAGAGA 1741 AACAGCACCA GCGGGGGCTG AGCGCCAGTT ACCTGGAACC TGATCGATAC GATGAGGAGG 1801 AGGAAGGCGA GGAGTCCATC AGCTTGGCTG CCATTAAAAA CCGATATAAA GGGGGCATTC 1861 GAGAGGAACG AGCCAGAATC TATTCATCAG ACAGTGATGA GGGATCAGAA GAAGATAAAG 1921 CTCAAAGATT ACTCAAAGCA AAGAAACTTA CCAGTGATGA GGAAGGTGAA CCTTCCGGAA 1981 AGAGAAAAGC AGAAGATGAT GATAAAGCAA ATAAAAAGCA TAAGAAGTAT GTGATCAGCG 2041 ATGAAGAGGA AGAAGATGAT GATTGCCCAA CTTTCTTGTA CAAAGTtggc attataagaa 2101 agcattgctt atcaatttgt tgcaacgaac