Construct: ORF TRCN0000471431
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF003019.1_s317c1
- Derived from:
- ccsbBroadEn_04885
- DNA Barcode:
- GCAGTGCGTTCTTTAACGATTGAA
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- UGT3A1 (133688)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000471431
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 133688 | UGT3A1 | UDP glycosyltransferase fam... | NM_152404.4 | 100% | 100% | |
2 | human | 133688 | UGT3A1 | UDP glycosyltransferase fam... | XM_011513958.2 | 93.4% | 93.4% | 93_94ins102 |
3 | human | 133688 | UGT3A1 | UDP glycosyltransferase fam... | XM_011513957.2 | 90.5% | 85.6% | (many diffs) |
4 | human | 133688 | UGT3A1 | UDP glycosyltransferase fam... | XM_005248243.4 | 89.6% | 89.6% | 0_1ins162 |
5 | human | 133688 | UGT3A1 | UDP glycosyltransferase fam... | XM_011513959.2 | 89.6% | 89.6% | 0_1ins162 |
6 | human | 167127 | UGT3A2 | UDP glycosyltransferase fam... | NM_174914.4 | 85.8% | 78.8% | (many diffs) |
7 | human | 167127 | UGT3A2 | UDP glycosyltransferase fam... | XM_011513988.2 | 82.1% | 74.9% | (many diffs) |
8 | human | 167127 | UGT3A2 | UDP glycosyltransferase fam... | NM_001168316.2 | 80.8% | 73.8% | (many diffs) |
9 | human | 167127 | UGT3A2 | UDP glycosyltransferase fam... | XM_017009150.1 | 79.1% | 72.3% | (many diffs) |
10 | human | 133688 | UGT3A1 | UDP glycosyltransferase fam... | XR_001741997.1 | 60.7% | (many diffs) | |
11 | human | 133688 | UGT3A1 | UDP glycosyltransferase fam... | XR_001741998.1 | 51.7% | (many diffs) | |
12 | human | 133688 | UGT3A1 | UDP glycosyltransferase fam... | NM_001171873.2 | 47.4% | 44.3% | (many diffs) |
13 | human | 167127 | UGT3A2 | UDP glycosyltransferase fam... | NR_031764.2 | 38.1% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1635
- ORF length:
- 1569
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcatggt tgggcagcgg gtgctgcttc tagtggcctt ccttctttct ggggtcctgc 121 tctcagaggc tgccaaaatc ctgacaatat ctacactggg tggaagccat tacctactgt 181 tggaccgggt gtctcagatt cttcaagagc atggtcataa tgtgactatg cttcatcaga 241 gtggaaagtt tttgatccca gatattaaag aggaggaaaa atcataccaa gttatcaggt 301 ggttttcacc tgaagatcat caaaaaagaa ttaagaagca ttttgatagc tacatagaaa 361 cagcattgga tggcagaaaa gaatctgaag cccttgtaaa gctaatggaa atatttggga 421 ctcaatgtag ttatttgcta agcagaaagg atataatgga ttccttaaag aatgagaact 481 atgatctggt atttgttgaa gcatttgatt tctgttcttt cctgattgct gagaagcttg 541 tgaaaccatt tgtggccatt cttcccacca cattcggctc tttggatttt gggctaccaa 601 gccccttgtc ttatgttcca gtattccctt ccttgctgac tgatcacatg gacttctggg 661 gccgagtgaa gaattttctg atgttcttta gtttctccag gagccaatgg gacatgcagt 721 ctacatttga caacaccatc aaggagcatt tcccagaagg ctctaggcca gttttgtctc 781 atcttctact gaaagcagag ttgtggtttg ttaactctga ttttgccttt gattttgccc 841 ggcccctgct tcccaacact gtttatattg gaggcttgat ggaaaaacct attaaaccag 901 taccacaaga cttggacaac ttcattgcca actttgggga tgcagggttt gtccttgtgg 961 cctttggctc catgttgaac acccatcagt cccaggaagt cctcaagaag atgcacaatg 1021 cctttgccca cctccctcaa ggagtgatat ggacatgtca gagttctcat tggcccagag 1081 atgttcattt ggccacaaat gtgaaaattg tggactggct tcctcagagt gacctcctgg 1141 ctcaccccag catccgtctt tttgtcactc atggtgggca gaacagcgta atggaggCCA 1201 TCCGTCATGG TGTGCCCATG GTGGGATTAC CAGTCAATGG AGACCAGCAT GGAAACATGG 1261 TCCGAGTAGT AGCCAAAAAT TATGGTGTCT CTATCCGGTT GAATCAGGTC ACAGCCGACA 1321 CACTGACACT TACAATGAAA CAAGTCATAG AAGACAAGAG GTACAAGTCG GCAGTGGTGG 1381 CAGCCAGTGT CATCCTGCAC TCTCAGCCCC TGAGCCCCGC ACAGCGGCTG GTGGGCTGGA 1441 TCGACCACAT CCTCCAGACT GGGGGAGCGA CGCACCTCAA GCCCTATGCC TTCCAGCAGC 1501 CTTGGCATGA GCAGTACCTC ATTGATGTCT TTGTGTTTCT GCTGGGGCTC ACTCTGGGCA 1561 CTATGTGGCT TTGTGGGAAG CTGCTGGGTG TGGTGGCCAG GTGGCTGCGT GGGGCCAGGA 1621 AGGTGAAGAA GACATGCCCA ACTTTCTTGT ACAAAGTGGT TGATATCGGT AAGCCTATCC 1681 CTAACCCTCT CCTCGGTCTC GATTCTACGT AGTAATGAAC TAGTCCGTAA CTTGAAAGTA 1741 TTTCGATTTC TTGGCTTTAT ATATCTTGTG GAAAGGACGA GCAGTGCGTT CTTTAACGAT 1801 TGAAACGCGT TAAGTCgaca atcaacctct ggattacaaa atttgtgaaa gatt