Construct: ORF ccsbBroadEn_07082
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF016534.1_s300c1, BRDN0000387109
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- TMPRSS2 (7113)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 7113 | TMPRSS2 | transmembrane serine protea... | NM_005656.4 | 99.9% | 100% | 768T>C |
2 | human | 7113 | TMPRSS2 | transmembrane serine protea... | NM_001135099.1 | 92.9% | 93% | 1_111del;879T>C |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1542
- ORF length:
- 1476
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGGC TTTGAACTCA GGGTCACCAC CAGCTATTGG ACCTTACTAT GAAAACCATG 121 GATACCAACC GGAAAACCCC TATCCCGCAC AGCCCACTGT GGTCCCCACT GTCTACGAGG 181 TGCATCCGGC TCAGTACTAC CCGTCCCCCG TGCCCCAGTA CGCCCCGAGG GTCCTGACGC 241 AGGCTTCCAA CCCCGTCGTC TGCACGCAGC CCAAATCCCC ATCCGGGACA GTGTGCACCT 301 CAAAGACTAA GAAAGCACTG TGCATCACCT TGACCCTGGG GACCTTCCTC GTGGGAGCTG 361 CGCTGGCCGC TGGCCTACTC TGGAAGTTCA TGGGCAGCAA GTGCTCCAAC TCTGGGATAG 421 AGTGCGACTC CTCAGGTACC TGCATCAACC CCTCTAACTG GTGTGATGGC GTGTCACACT 481 GCCCCGGCGG GGAGGACGAG AATCGGTGTG TTCGCCTCTA CGGACCAAAC TTCATCCTTC 541 AGGTGTACTC ATCTCAGAGG AAGTCCTGGC ACCCTGTGTG CCAAGACGAC TGGAACGAGA 601 ACTACGGGCG GGCGGCCTGC AGGGACATGG GCTATAAGAA TAATTTTTAC TCTAGCCAAG 661 GAATAGTGGA TGACAGCGGA TCCACCAGCT TTATGAAACT GAACACAAGT GCCGGCAATG 721 TCGATATCTA TAAAAAACTG TACCACAGTG ATGCCTGTTC TTCAAAAGCA GTGGTTTCTT 781 TACGCTGTAT AGCCTGCGGG GTCAACTTGA ACTCAAGCCG CCAGAGCAGG ATCGTGGGCG 841 GCGAGAGCGC GCTCCCGGGG GCCTGGCCCT GGCAGGTCAG CCTGCACGTC CAGAACGTCC 901 ACGTGTGCGG AGGCTCCATC ATCACCCCCG AGTGGATCGT GACAGCCGCC CACTGCGTGG 961 AAAAACCTCT TAACAATCCA TGGCATTGGA CGGCATTTGC GGGGATTTTG AGACAATCTT 1021 TCATGTTCTA TGGAGCCGGA TACCAAGTAG AAAAAGTGAT TTCTCATCCA AATTATGACT 1081 CCAAGACCAA GAACAATGAC ATTGCGCTGA TGAAGCTGCA GAAGCCTCTG ACTTTCAACG 1141 ACCTAGTGAA ACCAGTGTGT CTGCCCAACC CAGGCATGAT GCTGCAGCCA GAACAGCTCT 1201 GCTGGATTTC CGGGTGGGGG GCCACCGAGG AGAAAGGGAA GACCTCAGAA GTGCTGAACG 1261 CTGCCAAGGT GCTTCTCATT GAGACACAGA GATGCAACAG CAGATATGTC TATGACAACC 1321 TGATCACACC AGCCATGATC TGTGCCGGCT TCCTGCAGGG GAACGTCGAT TCTTGCCAGG 1381 GTGACAGTGG AGGGCCTCTG GTCACTTCGA AGAACAATAT CTGGTGGCTG ATAGGGGATA 1441 CAAGCTGGGG TTCTGGCTGT GCCAAAGCTT ACAGACCAGG AGTGTACGGG AATGTGATGG 1501 TATTCACGGA CTGGATTTAT CGACAAATGA GGGCAGACGG CTACCCAACT TTCTTGTACA 1561 AAGTtggcat tataagaaag cattgcttat caatttgttg caacgaac