Construct: ORF ccsbBroadEn_11312
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF001130.1_s300c1, BRDN0000388995
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- NR1I2 (8856)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 8856 | NR1I2 | nuclear receptor subfamily ... | NM_003889.3 | 87% | 87% | 1_165del;519_521delGCT |
2 | human | 8856 | NR1I2 | nuclear receptor subfamily ... | NM_022002.2 | 79.9% | 79.9% | 1_282del;636_638delGCT |
3 | human | 8856 | NR1I2 | nuclear receptor subfamily ... | NM_033013.2 | 78.9% | 78.9% | 1_165del;518_519ins108 |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1200
- ORF length:
- 1134
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGAC ATGTGAAGGA TGCAAGGGCT TTTTCAGGAG GGCCATGAAA CGCAACGCCC 121 GGCTGAGGTG CCCCTTCCGG AAGGGCGCCT GCGAGATCAC CCGGAAGACC CGGCGACAGT 181 GCCAGGCCTG CCGCCTGCGC AAGTGCCTGG AGAGCGGCAT GAAGAAGGAG ATGATCATGT 241 CCGACGAGGC CGTGGAGGAG AGGCGGGCCT TGATCAAGCG GAAGAAAAGT GAACGGACAG 301 GGACTCAGCC ACTGGGAGTG CAGGGGCTGA CAGAGGAGCA GCGGATGATG ATCAGGGAGC 361 TGATGGACGC TCAGATGAAA ACCTTTGACA CTACCTTCTC CCATTTCAAG AATTTCCGGC 421 CAGGGGTGCT TAGCAGTGGC TGCGAGTTGC CAGAGTCTCT GCAGGCCCCA TCGAGGGAAG 481 AAGCTGCCAA GTGGAGCCAG GTCCGGAAAG ATCTGTGCTC TTTGAAGGTC TCTCTGCAGC 541 TGCGGGGGGA GGATGGCAGT GTCTGGAACT ACAAACCCCC AGCCGACAGT GGCGGGAAAG 601 AGATCTTCTC CCTGCTGCCC CACATGGCTG ACATGTCAAC CTACATGTTC AAAGGCATCA 661 TCAGCTTTGC CAAAGTCATC TCCTACTTCA GGGACTTGCC CATCGAGGAC CAGATCTCCC 721 TGCTGAAGGG GGCCGCTTTC GAGCTGTGTC AACTGAGATT CAACACAGTG TTCAACGCGG 781 AGACTGGAAC CTGGGAGTGT GGCCGGCTGT CCTACTGCTT GGAAGACACT GCAGGTGGCT 841 TCCAGCAACT TCTACTGGAG CCCATGCTGA AATTCCACTA CATGCTGAAG AAGCTGCAGC 901 TGCATGAGGA GGAGTATGTG CTGATGCAGG CCATCTCCCT CTTCTCCCCA GACCGCCCAG 961 GTGTGCTGCA GCACCGCGTG GTGGACCAGC TGCAGGAGCA ATTCGCCATT ACTCTGAAGT 1021 CCTACATTGA ATGCAATCGG CCCCAGCCTG CTCATAGGTT CTTGTTCCTG AAGATCATGG 1081 CTATGCTCAC CGAGCTCCGC AGCATCAATG CTCAGCACAC CCAGCGGCTG CTGCGCATCC 1141 AGGACATACA CCCCTTTGCT ACGCCCCTCA TGCAGGAGTT GTTCGGCATC ACAGGTAGCT 1201 GCCCAACTTT CTTGTACAAA GTtggcatta taagaaagca ttgcttatca atttgttgca 1261 acgaac