Construct: ORF TRCN0000487815
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF021630.1_s317c1
- DNA Barcode:
- AATGACCCCCAGAGAGCGCTTGTC
- Epitope Tag:
- V5 (not translated due to prior stop codon)
- Notes:
- Has stop codon in insert
Originally Annotated References:
- Gene:
- NR1H4 (9971)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000487815
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Note: Nuc. Match % and Prot. Match % are simple global alignment percentages (nucleotide-based and amino acid-based, respectively), each calculated as total matches divided by aligned length (including gaps). Match Diffs may contain sequence annotations in HGVS format; for more information, refer to the HGVS Quick Reference Guide.

No. | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | NM_005123.4 | 100% | 100% | |
2 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | NM_001206977.2 | 99.1% | 99.1% | 587_598del |
3 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | NM_001206979.2 | 99.1% | 99.1% | 587_598del |
4 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | XM_011539040.2 | 99.1% | 99.1% | 587_598del |
5 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | NM_001206992.2 | 95.4% | 93.5% | (many diffs) |
6 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | NM_001206993.2 | 94.6% | 92.7% | (many diffs) |
7 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | NM_001206978.2 | 90% | 90% | 445_446ins141 |
8 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | XM_011539041.2 | 85.7% | 83.8% | (many diffs) |
9 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | XM_011539042.1 | 53.2% | 46.9% | (many diffs) |
10 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | XM_006719719.2 | 52.8% | 46.6% | (many diffs) |
11 | human | 9971 | NR1H4 | nuclear receptor subfamily ... | NR_135146.2 | 51.8% | | 1_383del;1103_1107delGTGTA;1805_2732del |
12 | mouse | 20186 | Nr1h4 | nuclear receptor subfamily ... | XM_006513393.3 | 85% | 92.1% | (many diffs) |
13 | mouse | 20186 | Nr1h4 | nuclear receptor subfamily ... | NM_001163504.1 | 84.3% | 91.4% | (many diffs) |
14 | mouse | 20186 | Nr1h4 | nuclear receptor subfamily ... | XM_006513391.3 | 84.3% | 91.4% | (many diffs) |
15 | mouse | 20186 | Nr1h4 | nuclear receptor subfamily ... | NM_009108.2 | 81.6% | 87.2% | (many diffs) |
16 | mouse | 20186 | Nr1h4 | nuclear receptor subfamily ... | NM_001163700.1 | 81% | 86.5% | (many diffs) |
17 | mouse | 20186 | Nr1h4 | nuclear receptor subfamily ... | XR_001779493.1 | 77.1% | | (many diffs) |
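
The match percentages in the table are described as total matches divided by aligned length (including gaps). As a minimal sketch of that arithmetic, the Python below computes the ratio from two already-aligned strings. The function name `match_percent`, the `-` gap character, and the toy sequences are assumptions made for illustration; the alignment method and rounding used to produce the table values are not stated in this record.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Return 100 * matches / aligned length for two pre-aligned sequences.

    Assumptions (not from the record): both inputs are rows of one global
    alignment, have equal length, and use `gap` as the gap character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")

    # A column counts as a match only if both residues are identical
    # (case-insensitive) and neither is a gap.
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != gap
    )
    # Gap columns stay in the denominator, as in the stated formula.
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Toy alignment: 9 columns, 3 of them gaps, 6 matches.
    print(f"{match_percent('ATGAAATGA', 'ATG---TGA'):.1f}%")
```

Because gap columns count toward the aligned length, an insertion or deletion lowers the percentage even when every aligned base is identical.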
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 72
- ORF end:
- 1488
- ORF length:
- 1416
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
61 caggcttcac catgggatca aaaatgaatc tcattgaaca ttcccattta cctaccacag
121 atgaattttc tttttctgaa aatttatttg gtgttttaac agaacaagtg gcaggtcctc
181 tgggacagaa cctggaagtg gaaccatact cgcaatacag caatgttcag tttccccaag
241 ttcaaccaca gatttcctcg tcatcctatt attccaacct gggtttctac ccccagcagc
301 ctgaagagtg gtactctcct ggaatatatg aactcaggcg tatgccagct gagactctct
361 accagggaga aactgaggta gcagagatgc ctgtaacaaa gaagccccgc atgggcgcgt
421 cagcagggag gatcaaaggg gatgagctgt gtgttgtttg tggagacaga gcctctggat
481 accactataa tgcactgacc tgtgaggggt gtaaaggttt cttcaggaga agcattacca
541 aaaacgctgt gtacaagtgt aaaaacgggg gcaactgtgt gatggatatg tacatgcgaa
601 gaaagtgtca agagtgtcga ctaaggaaat gcaaagagat gggaatgttg gctgaatgct
661 tgttaactga aattcagtgt aaatctaagc gactgagaaa aaatgtgaag cagcatgcag
721 atcagaccgt gaatgaagac agtgaaggtc gtgacttgcg acaagtgacc tcgacaacaa
781 agtcatgcag ggagaaaact gaactcaccc cagatcaaca gactcttcta cattttatta
841 tggattcata taacaaacag aggatgcctc aggaaataac aaataaaatt ttaaaagaag
901 aattcagtgc agaagaaaat tttctcattt tgacggaaat ggcaaccaat catgtacagg
961 ttcttgtaga attcacaaaa aagctaccag gatttcagac tttggaccat gaagaccaga
1021 ttgctttgct gaaagggtct gcggttgaag ctatgttcct tcgttcagct gagattttca
1081 ataagaaact tccgtctggg cattctgacc tattggaaga aagaattcga aatagtggta
1141 tctctgatga atatataaca cctatgttta gtttttataa aagtattggG GAACTGAAAA
1201 TGACTCAAGA GGAGTATGCT CTGCTTACAG CAATTGTTAT CCTGTCTCCA GATAGACAAT
1261 ACATAAAGGA TAGAGAGGCA GTAGAGAAGC TTCAGGAGCC ACTTCTTGAT GTGCTACAAA
1321 AGTTGTGTAA GATTCACCAG CCTGAAAATC CTCAACACTT TGCCTGTCTC CTGGGTCGCC
1381 TGACTGAATT ACGGACATTC AATCATCACC ACGCTGAGAT GCTGATGTCA TGGAGAGTAA
1441 ACGACCACAA GTTTACCCCA CTTCTCTGTG AAATCTGGGA CGTGCAGTGA GACCCAGCTT
1501 TCTTGTACAA AGTGGTTGAT ATCGGTAAGC CTATCCCTAA CCCTCTCCTC GGTCTCGATT
1561 CTACGTAGTA ATGAACTAGT CCGTAACTTG AAAGTATTTC GATTTCTTGG CTTTATATAT
1621 CTTGTGGAAA GGACGAAATG ACCCCCAGAG AGCGCTTGTC ACGCGTTAAG TCgacaatca
1681 acctctggat tacaaaattt gtgaaagatt
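
The ORF coordinates and the numbered sequence can be cross-checked with a short script. The Python below is a sketch under stated assumptions: the helper names (`parse_numbered_sequence`, `check_orf`, `find_barcode`) are hypothetical, and it reads "ORF start" as a 1-based inclusive position and "ORF end" as the 1-based position of the first base after the ORF. That reading is an interpretation chosen because it makes end minus start equal the listed length (1488 - 72 = 1416); the record does not state the convention.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_numbered_sequence(text: str) -> str:
    """Drop position numbers and whitespace from a numbered sequence dump.

    Letter case is preserved, since uppercase marks verified bases.
    """
    return "".join(re.findall(r"[A-Za-z]+", text))


def check_orf(seq: str, start: int, end: int) -> dict:
    """Summarize an ORF given 1-based `start` (inclusive) and `end` (exclusive).

    The coordinate convention is an assumption, as noted in the lead-in.
    """
    orf = seq[start - 1 : end - 1]
    next_codon = seq[end - 1 : end + 2].upper()
    return {
        "length": len(orf),
        "in_frame": len(orf) % 3 == 0,
        "starts_with_atg": orf[:3].upper() == "ATG",
        "followed_by_stop": next_codon in STOP_CODONS,
    }


def find_barcode(seq: str, barcode: str) -> int:
    """Return the 1-based position of the first barcode match, or 0 if absent."""
    return seq.upper().find(barcode.upper()) + 1


if __name__ == "__main__":
    # Toy record (not the real sequence): ORF "atgaaa" at 3..8, then "tga".
    toy = "1 ccatgaaatg 11 agg"
    seq = parse_numbered_sequence(toy)
    print(seq)
    print(check_orf(seq, start=3, end=9))
    print(find_barcode(seq, "AGG"))
```

Under that coordinate reading, the sequence listed above gives a 1416-nt slice for start 72 and end 1488 that begins with `atg` and is immediately followed by `TGA`, consistent with the note that the insert carries a stop codon and the V5 tag is not translated. The same parser can be used to locate the DNA barcode in the uppercase region downstream of the ORF.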