Construct: ORF ccsbBroadEn_00523
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF016535.1_s300c1, BRDN0000395709
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- EXT1 (2131)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 2131 | EXT1 | exostosin glycosyltransfera... | NM_000127.2 | 100% | 100% | |
2 | mouse | 14042 | Ext1 | exostoses (multiple) 1 | NM_010162.2 | 95.8% | 99.3% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2304
- ORF length:
- 2238
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGCA GGCCAAAAAA CGCTATTTCA TCCTGCTCTC AGCTGGCTCT TGTCTCGCCC 121 TTTTGTTTTA TTTCGGAGGC TTGCAGTTTA GGGCATCGAG GAGCCACAGC CGGAGAGAAG 181 AACACAGCGG TAGGAATGGC TTGCACCACC CCAGTCCGGA TCATTTCTGG CCCCGCTTCC 241 CGGACGCTCT GCGCCCCTTC GTTCCTTGGG ATCAATTGGA AAACGAGGAT TCCAGCGTGC 301 ACATTTCCCC CCGGCAGAAG CGAGATGCCA ACTCCAGCAT CTACAAAGGC AAGAAGTGCC 361 GCATGGAGTC CTGCTTCGAT TTCACCCTTT GCAAGAAAAA CGGCTTCAAA GTCTACGTAT 421 ACCCACAGCA AAAAGGGGAG AAAATCGCCG AAAGTTACCA AAACATTCTA GCGGCCATCG 481 AGGGCTCCAG GTTCTACACC TCGGACCCCA GCCAGGCGTG CCTCTTTGTC CTGAGTCTGG 541 ATACTTTAGA CAGAGACCAG TTGTCACCTC AGTATGTGCA CAATTTGAGA TCCAAAGTGC 601 AGAGTCTCCA CTTGTGGAAC AATGGTAGGA ATCATTTAAT TTTTAATTTA TATTCCGGCA 661 CTTGGCCTGA CTACACCGAG GACGTGGGGT TTGACATCGG CCAGGCGATG CTGGCCAAAG 721 CCAGCATCAG TACTGAAAAC TTCCGACCCA ACTTTGATGT TTCTATTCCC CTCTTTTCTA 781 AGGATCATCC CAGGACAGGA GGGGAGAGGG GGTTTTTGAA GTTCAACACC ATCCCTCCTC 841 TCAGGAAGTA CATGCTGGTA TTCAAGGGGA AGAGGTACCT GACAGGGATA GGATCAGACA 901 CCAGGAATGC CTTATATCAC GTCCATAACG GGGAGGACGT TGTGCTCCTC ACCACCTGCA 961 AGCATGGCAA AGACTGGCAA AAGCACAAGG ATTCTCGCTG TGACAGAGAC AACACCGAGT 1021 ATGAGAAGTA TGATTATCGG GAAATGCTGC ACAATGCCAC TTTCTGTCTG GTTCCTCGTG 1081 GTCGCAGGCT TGGGTCCTTC AGATTCCTGG AGGCTTTGCA GGCTGCCTGC GTCCCTGTGA 1141 TGCTCAGCAA TGGATGGGAG TTGCCATTCT CTGAAGTGAT TAATTGGAAC CAAGCTGCCG 1201 TCATAGGCGA TGAGAGATTG TTATTACAGA TTCCTTCTAC AATCAGGTCT ATTCATCAGG 1261 ATAAAATCCT AGCACTTAGA CAGCAGACAC AATTCTTGTG GGAGGCTTAT TTTTCTTCAG 1321 TTGAGAAGAT TGTATTAACT ACACTAGAGA TTATTCAGGA CAGAATATTC AAGCACATAT 1381 CACGTAACAG TTTAATATGG AACAAACATC CTGGAGGATT GTTCGTACTA CCACAGTATT 1441 CATCTTATCT GGGAGATTTT CCTTACTACT ATGCTAATTT AGGTTTAAAG CCCCCCTCCA 1501 AATTCACTGC AGTCATCCAT GCGGTGACCC CCCTGGTCTC TCAGTCCCAG CCAGTGTTGA 1561 AGCTTCTCGT GGCTGCAGCC AAGTCCCAGT ACTGTGCCCA GATCATAGTT CTATGGAATT 1621 GTGACAAGCC CCTACCAGCC AAACACCGCT GGCCTGCCAC TGCTGTGCCT GTCGTCGTCA 1681 TTGAAGGAGA GAGCAAGGTT ATGAGCAGCC GTTTTCTGCC CTACGACAAC ATCATCACAG 1741 ACGCCGTGCT CAGCCTTGAC GAGGACACGG TGCTTTCAAC AACAGAGGTG GATTTCGCCT 1801 TCACAGTGTG GCAGAGCTTC CCTGAGAGGA TTGTGGGGTA CCCCGCGCGC AGCCACTTCT 1861 GGGATAACTC TAAGGAGCGG TGGGGATACA CATCAAAGTG GACGAACGAC TACTCCATGG 1921 TGTTGACAGG AGCTGCTATT TACCACAAAT ATTATCACTA CCTATACTCC CATTACCTGC 1981 CAGCCAGCCT GAAGAACATG GTGGACCAAT TGGCCAATTG TGAGGACATT CTCATGAACT 2041 TCCTGGTGTC TGCTGTGACA AAATTGCCTC CAATCAAAGT GACCCAGAAG AAGCAGTATA 2101 AGGAGACAAT GATGGGACAG ACTTCTCGGG CTTCCCGTTG GGCTGACCCT GACCACTTTG 2161 CCCAGCGACA GAGCTGCATG AATACGTTTG CCAGCTGGTT TGGCTACATG CCGCTGATCC 2221 ACTCTCAGAT GAGGCTCGAC CCCGTCCTCT TTAAAGACCA GGTCTCTATT TTGAGGAAGA 2281 AATACCGAGA CATTGAGCGA CTTTGCCCAA CTTTCTTGTA CAAAGTtggc attataagaa 2341 agcattgctt atcaatttgt tgcaacgaac