Construct: ORF ccsbBroadEn_03378
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF018109.1_s300c1, BRDN0000457267
- DNA Barcode: None
- Epitope Tag: None
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: ERAP1 (51752)
Vector Information:
- Vector Backbone: pDONR223
- Pol II Cassette 1: n/a
- Pol II Cassette 2: n/a
- Selection Marker: n/a
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
- Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as total nt. matches / aligned length (incl. gaps).
- Prot. Match %: a simple amino acid-based global alignment percentage, calculated as total aa. matches / aligned length (incl. gaps).
- Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.

No. | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | NM_001040458.3 | 100% | 100% | |
2 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | NM_001198541.2 | 100% | 100% | |
3 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_011543486.3 | 100% | 100% | |
4 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | NM_001349244.1 | 99.1% | 99% | 2818_2819insGT;2822_2844del |
5 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | NM_016442.4 | 99.1% | 99% | 2818_2819insGT;2822_2844del |
6 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_005272015.5 | 98.9% | 98.8% | 2820_2824delTTCTC;2828_2853delinsG |
7 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_005272016.4 | 98.9% | 98.8% | 2820_2824delTTCTC;2828_2853delinsG |
8 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_011543480.2 | 98.9% | 98.8% | 2820_2824delTTCTC;2828_2853delinsG |
9 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_011543481.2 | 98.9% | 98.8% | 2820_2824delTTCTC;2828_2853delinsG |
10 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_011543484.2 | 98.9% | 98.8% | 2820_2824delTTCTC;2828_2853delinsG |
11 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_011543485.2 | 98.9% | 98.8% | 2820_2824delTTCTC;2828_2853delinsG |
12 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_017009581.1 | 98.9% | 98.8% | 2820_2824delTTCTC;2828_2853delinsG |
13 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_024446113.1 | 98.9% | 98.8% | 2820_2824delTTCTC;2828_2853delinsG |
14 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XM_017009583.2 | 60.5% | 60.4% | 0_1ins1095;1725_1729delTTCTC;1733_1758delinsG |
15 | human | 51752 | ERAP1 | endoplasmic reticulum amino... | XR_001742119.2 | 43.9% | | (many diffs) |
16 | mouse | 80898 | Erap1 | endoplasmic reticulum amino... | NM_030711.4 | 84.9% | 84.2% | (many diffs) |
17 | mouse | 80898 | Erap1 | endoplasmic reticulum amino... | XM_006517476.3 | 84.9% | 84.2% | (many diffs) |
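The match percentages in the table are described as total matches divided by aligned length, gaps included. A minimal Python sketch of that calculation follows. It assumes the two sequences have already been globally aligned and padded with `-` for gaps; the function name `match_percent` and the toy sequences are hypothetical and are not taken from the portal's own code.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the full aligned length, gaps included.

    Both inputs must be rows of the same global alignment, so they
    have equal length. A gap column never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    # Compare case-insensitively, since sequence records may mix cases.
    a_row, b_row = aligned_a.upper(), aligned_b.upper()
    matches = sum(1 for a, b in zip(a_row, b_row) if a == b and a != gap)
    return 100.0 * matches / len(a_row)


# Toy alignment: 9 columns, one gap column and one mismatch, so 7 matches.
toy_orf = "ACGT-ACGT"
toy_transcript = "ACGTTACGA"
toy_percent = match_percent(toy_orf, toy_transcript)
```

Because the denominator is the aligned length rather than the length of either sequence, an insertion or deletion lowers the percentage in the same way a substitution does.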
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 66
- ORF end: 2889
- ORF length: 2823
- Sequence:
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGGT GTTTCTGCCC CTCAAATGGT CCCTTGCAAC CATGTCATTT CTACTTTCCT
121 CACTGTTGGC TCTCTTAACT GTGTCCACTC CTTCATGGTG TCAGAGCACT GAAGCATCTC
181 CAAAACGTAG TGATGGGACA CCATTTCCTT GGAATAAAAT ACGACTTCCT GAGTACGTCA
241 TCCCAGTTCA TTATGATCTC TTGATCCATG CAAACCTTAC CACGCTGACC TTCTGGGGAA
301 CCACGAAAGT AGAAATCACA GCCAGTCAGC CCACCAGCAC CATCATCCTG CATAGTCACC
361 ACCTGCAGAT ATCTAGGGCC ACCCTCAGGA AGGGAGCTGG AGAGAGGCTA TCGGAAGAAC
421 CCCTGCAGGT CCTGGAACAC CCCCGTCAGG AGCAAATTGC ACTGCTGGCT CCCGAGCCCC
481 TCCTTGTCGG GCTCCCGTAC ACAGTTGTCA TTCACTATGC TGGCAATCTT TCGGAGACTT
541 TCCACGGATT TTACAAAAGC ACCTACAGAA CCAAGGAAGG GGAACTGAGG ATACTAGCAT
601 CAACACAATT TGAACCCACT GCAGCTAGAA TGGCCTTTCC CTGCTTTGAT GAACCTGCCT
661 TCAAAGCAAG TTTCTCAATC AAAATTAGAA GAGAGCCAAG GCACCTAGCC ATCTCCAATA
721 TGCCATTGGT GAAATCTGTG ACTGTTGCTG AAGGACTCAT AGAAGACCAT TTTGATGTCA
781 CTGTGAAGAT GAGCACCTAT CTGGTGGCCT TCATCATTTC AGATTTTGAG TCTGTCAGCA
841 AGATAACCAA GAGTGGAGTC AAGGTTTCTG TTTATGCTGT GCCAGACAAG ATAAATCAAG
901 CAGATTATGC ACTGGATGCT GCGGTGACTC TTCTAGAATT TTATGAGGAT TATTTCAGCA
961 TACCGTATCC CCTACCCAAA CAAGATCTTG CTGCTATTCC CGACTTTCAG TCTGGTGCTA
1021 TGGAAAACTG GGGACTGACA ACATATAGAG AATCTGCTCT GTTGTTTGAT GCAGAAAAGT
1081 CTTCTGCATC AAGTAAGCTT GGCATCACAA TGACTGTGGC CCATGAACTG GCTCACCAGT
1141 GGTTTGGGAA CCTGGTCACT ATGGAATGGT GGAATGATCT TTGGCTAAAT GAAGGATTTG
1201 CCAAATTTAT GGAGTTTGTG TCTGTCAGTG TGACCCATCC TGAACTGAAA GTTGGAGATT
1261 ATTTCTTTGG CAAATGTTTT GACGCAATGG AGGTAGATGC TTTAAATTCC TCACACCCTG
1321 TGTCTACACC TGTGGAAAAT CCTGCTCAGA TCCGGGAGAT GTTTGATGAT GTTTCTTATG
1381 ATAAGGGAGC TTGTATTCTG AATATGCTAA GGGAGTATCT TAGTGCTGAC GCATTTAAAA
1441 GTGGTATTGT ACAGTATCTC CAGAAGCATA GCTATAAAAA TACAAAAAAC GAGGACCTGT
1501 GGGATAGTAT GGCAAGTATT TGCCCTACAG ATGGTGTAAA AGGGATGGAT GGCTTTTGCT
1561 CTAGAAGTCA ACATTCATCT TCATCCTCAC ATTGGCATCA GGAAGGGGTG GATGTGAAAA
1621 CCATGATGAA CACTTGGACA CTGCAGAAGG GTTTTCCCCT AATAACCATC ACAGTGAGGG
1681 GGAGGAATGT ACACATGAAG CAAGAGCACT ACATGAAGGG CTCTGACGGC GCCCCGGACA
1741 CTGGGTACCT GTGGCATGTT CCATTGACAT TCATCACCAG CAAATCCGAC ATGGTCCATC
1801 GATTTTTGCT AAAAACAAAA ACAGATGTGC TCATCCTCCC AGAAGAGGTG GAATGGATCA
1861 AATTTAATGT GGGCATGAAT GGCTATTACA TTGTGCATTA CGAGGATGAT GGATGGGACT
1921 CTTTGACTGG CCTTTTAAAA GGAACACACA CAGCAGTCAG CAGTAATGAT CGGGCGAGTC
1981 TCATTAACAA TGCATTTCAG CTCGTCAGCA TTGGGAAGCT GTCCATTGAA AAGGCCTTGG
2041 ATTTATCCCT GTACTTGAAA CATGAAACTG AAATTATGCC CGTGTTTCAA GGTTTGAATG
2101 AGCTGATTCC TATGTATAAG TTAATGGAGA AAAGAGATAT GAATGAAGTG GAAACTCAAT
2161 TCAAGGCCTT CCTCATCAGG CTGCTAAGGG ACCTCATTGA TAAGCAGACA TGGACAGACG
2221 AGGGCTCAGT CTCAGAGCGA ATGCTGCGGA GTCAACTACT ACTCCTCGCC TGTGTGCACA
2281 ACTATCAGCC GTGCGTACAG AGGGCAGAAG GCTATTTCAG AAAGTGGAAG GAATCCAATG
2341 GAAACTTGAG CCTGCCTGTC GACGTGACCT TGGCAGTGTT TGCTGTGGGG GCCCAGAGCA
2401 CAGAAGGCTG GGATTTTCTT TATAGTAAAT ATCAGTTTTC TTTGTCCAGT ACTGAGAAAA
2461 GCCAAATTGA ATTTGCCCTC TGCAGAACCC AAAATAAGGA AAAGCTTCAA TGGCTACTAG
2521 ATGAAAGCTT TAAGGGAGAT AAAATAAAAA CTCAGGAGTT TCCACAAATT CTTACACTCA
2581 TTGGCAGGAA CCCAGTAGGA TACCCACTGG CCTGGCAATT TCTGAGGAAA AACTGGAACA
2641 AACTTGTACA AAAGTTTGAA CTTGGCTCAT CTTCCATAGC CCACATGGTA ATGGGTACAA
2701 CAAATCAATT CTCCACAAGA ACACGGCTTG AAGAGGTAAA AGGATTCTTC AGCTCTTTGA
2761 AAGAAAATGG TTCTCAGCTC CGTTGTGTCC AACAGACAAT TGAAACCATT GAAGAAAACA
2821 TCGGTTGGAT GGATAAGAAT TTTGATAAAA TCAGAGTGTG GCTGCAAAGT GAAAAGCTTG
2881 AACGTATGTA CCCAACTTTC TTGTACAAAG TTGGcattat aagaaagcat tgcttatcaa
2941 tttgttgcaa cgaac
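The ORF coordinates listed above can be cross-checked against the sequence text. The Python sketch below strips the position counters from the numbered listing and looks at the bases at the ORF start. It assumes the listed start is 1-based and inclusive and the listed end is exclusive. That convention is an inference from 2889 - 66 = 2823, not something the record states. The helper name `parse_numbered_sequence` is hypothetical, and only the first two rows of the sequence are reproduced here.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Drop position counters and whitespace, keeping only base letters."""
    return "".join(re.findall(r"[A-Za-z]+", text))


# First two 60-base rows of the record's sequence, copied verbatim.
rows = """
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGGT GTTTCTGCCC CTCAAATGGT CCCTTGCAAC CATGTCATTT CTACTTTCCT
"""
seq = parse_numbered_sequence(rows)

# Values as listed in the record.
orf_start, orf_end, orf_length = 66, 2889, 2823

# Assumption: start is 1-based inclusive, so the Python index is start - 1.
first_codon = seq[orf_start - 1:orf_start + 2]

# Assumption: end is exclusive, so length is simply end - start.
length_is_consistent = (orf_end - orf_start) == orf_length

# A length divisible by 3 is a whole number of codons.
codon_count, remainder = divmod(orf_length, 3)
```

Under these assumptions the first codon is `ATG` and the ORF is 941 whole codons long. Applied to the full sequence, the same slice would be `seq[orf_start - 1:orf_end - 1]`.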