Construct: ORF ccsbBroadEn_03445
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF000789.1_s300c1, BRDN0000391450
- DNA Barcode: None
- Epitope Tag: None
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: UGT1A1 (54658)
Vector Information:
- Vector Backbone: pDONR223
- Pol II Cassette 1: n/a
- Pol II Cassette 2: n/a
- Selection Marker: n/a
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
Nuc. Match % and Prot. Match % are simple global alignment percentages (nucleotide-based and amino acid-based, respectively), each calculated as total matches divided by aligned length (including gaps). Match Diffs may contain sequence annotations in HGVS format.
# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 54658 | UGT1A1 | UDP glucuronosyltransferase... | NM_000463.3 | 100% | 100% | |
2 | human | 54579 | UGT1A5 | UDP glucuronosyltransferase... | NM_019078.1 | 77.4% | 73.1% | (many diffs) |
3 | human | 54578 | UGT1A6 | UDP glucuronosyltransferase... | NM_205862.2 | 48.5% | 48.5% | (many diffs) |
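The match percentages in the table are defined as total matches divided by aligned length, gaps included. A minimal sketch of that calculation follows, assuming the two sequences have already been globally aligned and padded with `-` for gaps. The function name `match_percent` and the gap character are illustrative choices, not part of the portal, and the aligner that produced the values above is not specified here.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the full alignment length, gaps included.

    Both inputs must be rows of the same pairwise global alignment,
    so they must have equal, non-zero length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")

    # A column counts as a match only if both residues are identical
    # (case-insensitive) and neither is a gap.
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Toy alignment: 5 matching columns out of 6 aligned columns.
    print(round(match_percent("ATGC-A", "ATGCTA"), 1))
```

Because gap columns stay in the denominator, an insertion or deletion lowers the percentage in the same way a mismatch does.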
Sequence Information:
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 69
- ORF end: 1668
- ORF length: 1599
- Sequence:
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG
61 TTGGCACCAT GGCTGTGGAG TCCCAGGGCG GACGCCCACT TGTCCTGGGC CTGCTGCTGT
121 GTGTGCTGGG CCCAGTGGTG TCCCATGCTG GGAAGATACT GTTGATCCCA GTGGATGGCA
181 GCCACTGGCT GAGCATGCTT GGGGCCATCC AGCAGCTGCA GCAGAGGGGA CATGAAATAG
241 TTGTCCTAGC ACCTGACGCC TCGTTGTACA TCAGAGACGG AGCATTTTAC ACCTTGAAGA
301 CGTACCCTGT GCCATTCCAA AGGGAGGATG TGAAAGAGTC TTTTGTTAGT CTCGGGCATA
361 ATGTTTTTGA GAATGATTCT TTCCTGCAGC GTGTGATCAA AACATACAAG AAAATAAAAA
421 AGGACTCTGC TATGCTTTTG TCTGGCTGTT CCCACTTACT GCACAACAAG GAGCTCATGG
481 CCTCCCTGGC AGAAAGCAGC TTTGATGTCA TGCTGACGGA CCCTTTCCTT CCTTGCAGCC
541 CCATCGTGGC CCAGTACCTG TCTCTGCCCA CTGTATTCTT CTTGCATGCA CTGCCATGCA
601 GCCTGGAATT TGAGGCTACC CAGTGCCCCA ACCCATTCTC CTACGTGCCC AGGCCTCTCT
661 CCTCTCATTC AGATCACATG ACCTTCCTGC AGCGGGTGAA GAACATGCTC ATTGCCTTTT
721 CACAGAACTT TCTGTGCGAC GTGGTTTATT CCCCGTATGC AACCCTTGCC TCAGAATTCC
781 TTCAGAGAGA GGTGACTGTC CAGGACCTAT TGAGCTCTGC ATCTGTCTGG CTGTTTAGAA
841 GTGACTTTGT GAAGGATTAC CCTAGGCCCA TCATGCCCAA TATGGTTTTT GTTGGTGGAA
901 TCAACTGCCT TCACCAAAAT CCACTATCCC AGGAATTTGA AGCCTACATT AATGCTTCTG
961 GAGAACATGG AATTGTGGTT TTCTCTTTGG GATCAATGGT CTCAGAAATT CCAGAGAAGA
1021 AAGCTATGGC AATTGCTGAT GCTTTGGGCA AAATCCCTCA GACAGTCCTG TGGCGGTACA
1081 CTGGAACCCG ACCATCGAAT CTTGCGAACA ACACGATACT TGTTAAGTGG CTACCCCAAA
1141 ACGATCTGCT TGGTCACCCG ATGACCCGTG CCTTTATCAC CCATGCTGGT TCCCATGGTG
1201 TTTATGAAAG CATATGCAAT GGCGTTCCCA TGGTGATGAT GCCCTTGTTT GGTGATCAGA
1261 TGGACAATGC AAAGCGCATG GAGACTAAGG GAGCTGGAGT GACCCTGAAT GTTCTGGAAA
1321 TGACTTCTGA AGATTTAGAA AATGCTCTAA AAGCAGTCAT CAATGACAAA AGTTACAAGG
1381 AGAACATCAT GCGCCTCTCC AGCCTTCACA AGGACCGCCC GGTGGAGCCG CTGGACCTGG
1441 CCGTGTTCTG GGTGGAGTTT GTGATGAGGC ACAAGGGCGC GCCACACCTG CGCCCCGCAG
1501 CCCACGACCT CACCTGGTAC CAGTACCATT CCTTGGACGT GATTGGTTTC CTCTTGGCCG
1561 TCGTGCTGAC AGTGGCCTTC ATCACCTTTA AATGTTGTGC TTATGGCTAC CGGAAATGCT
1621 TGGGGAAAAA AGGGCGAGTT AAGAAAGCCC ACAAATCCAA GACCCATTTG CCAACTTTCT
1681 TGTACAAAGT tggcattata agaaagcatt gcttatcaat ttgttgcaac gaac
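The coordinates and the case convention above can be checked programmatically. The sketch below strips the position numbers from the listing, finds the span of uppercase (empirically verified) bases, slices out the ORF, and checks it for in-frame stop codons. All function names are hypothetical. It assumes that ORF start is a 1-based position and that the ORF spans ORF length bases from there. Under that reading, start 69 and length 1599 cover positions 69 to 1667, which would make the listed ORF end of 1668 an exclusive bound. That is an inference from the arithmetic, not something the record states.

```python
import re
from typing import Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(numbered_text: str) -> str:
    """Drop position numbers and whitespace, preserving the case of each base."""
    return re.sub(r"[^A-Za-z]", "", numbered_text)


def verified_span(seq: str) -> Tuple[int, int]:
    """Return 1-based inclusive (first, last) positions of uppercase bases."""
    upper = [i for i, base in enumerate(seq, start=1) if base.isupper()]
    if not upper:
        raise ValueError("sequence has no uppercase (verified) bases")
    return upper[0], upper[-1]


def extract_orf(seq: str, start: int, length: int) -> str:
    """Slice `length` bases beginning at 1-based position `start` (assumed convention)."""
    return seq[start - 1 : start - 1 + length].upper()


def contains_stop_codon(orf: str) -> bool:
    """True if any complete in-frame codon of `orf` is a stop codon."""
    usable = len(orf) - len(orf) % 3
    return any(orf[i : i + 3] in STOP_CODONS for i in range(0, usable, 3))


if __name__ == "__main__":
    # Toy listing in the same numbered layout; substitute the full listing above.
    toy = "1 aaccATGGCT 11 AAATTGgg"
    seq = clean_sequence(toy)
    print(verified_span(seq))
    orf = extract_orf(seq, start=5, length=9)
    print(orf, contains_stop_codon(orf))
```

Applied to the full listing with `start=69` and `length=1599`, the extracted ORF should begin with ATG and, per the construct note, contain no stop codon.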