Construct: ORF ccsbBroadEn_12409
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF005389.1_s300c1, BRDN0000395036
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- SLC4A5 (57835)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 57835 | SLC4A5 | solute carrier family 4 mem... | NM_133478.3 | 90.9% | 90.8% | 80_271del;2316_2429del |
2 | human | 57835 | SLC4A5 | solute carrier family 4 mem... | NM_021196.3 | 89.6% | 89.5% | 80_271del;2316_2429del;2917_2964del |
3 | mouse | 232156 | Slc4a5 | solute carrier family 4, so... | NM_001166067.1 | 80.6% | 84.6% | (many diffs) |
4 | mouse | 232156 | Slc4a5 | solute carrier family 4, so... | XM_006505967.2 | 79.4% | 82.8% | (many diffs) |
5 | mouse | 232156 | Slc4a5 | solute carrier family 4, so... | XM_006505965.2 | 77.5% | 80.8% | (many diffs) |
6 | mouse | 232156 | Slc4a5 | solute carrier family 4, so... | XM_006505964.2 | 75.2% | 78.4% | (many diffs) |
7 | mouse | 232156 | Slc4a5 | solute carrier family 4, so... | XM_006505963.3 | 70.5% | 73.5% | (many diffs) |
8 | mouse | 232156 | Slc4a5 | solute carrier family 4, so... | XM_006505966.2 | 70.3% | 73.7% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 3126
- ORF length:
- 3057
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG 61 TTGGCACCAT GAAGGTGAAG GAGGAGAAGG CTGGGGTAGG AAAGCTGGAC CACACTAACC 121 ACAGGAGGAG ATTTCCGGAT CAGAAAGGGT CCCCAGCTGC TGAGCAGCTC CAGGACATCC 181 TGGGGGAGGA AGATGAGGCT CCCAACCCCA CCCTCTTTAC AGAGATGGAT ACTCTGCAGC 241 ATGACGGAGA CCAGATGGAG TGGAAGGAGT CAGCCAGGTG GATAAAGTTT GAAGAAAAGG 301 TAGAGGAAGG CGGCGAACGC TGGAGCAAGC CCCACGTGTC CACACTATCC CTGCACAGCC 361 TCTTCGAGCT CCGTACCTGC CTGCAGACGG GGACGGTGCT GCTGGATTTG GACAGTGGCT 421 CCTTACCACA GATCATAGAT GATGTCATTG AGAAGCAGAT TGAGGATGGT CTCCTGCGGC 481 CAGAGCTCCG GGAGAGGGTC AGTTACGTCC TCCTGAGGAG GCACCGCCAC CAAACCAAGA 541 AGCCCATCCA CCGCTCCTTA GCTGACATTG GGAAGTCAGT CTCCACCACA AATCGCAGTC 601 CTGCCCGGAG CCCTGGTGCT GGCCCGAGTC TACACCACTC CACGGAAGAC CTGCGGATGC 661 GGCAGAGTGC AAATTACGGA CGTCTGTGTC ATGCCCAGAG CAGAAGCATG AATGACATTT 721 CTCTCACCCC AAACACAGAC CAGCGGAAAA ACAAATTCAT GAAGAAGATC CCCAAGGACT 781 CAGAAGCGTC CAACGTGCTC GTGGGCGAGG TGGACTTCCT AGACCAGCCA TTCATCGCGT 841 TCGTGCGCCT CATCCAGTCG GCCATGCTGG GAGGAGTGAC CGAGGTGCCT GTCCCCACCA 901 GATTTCTGTT TATACTACTG GGACCTTCTG GGAGAGCAAA ATCCTACAAT GAAATTGGCC 961 GTGCCATTGC AACCCTCATG GTAGATGATC TCTTCAGTGA CGTGGCCTAC AAAGCCCGCA 1021 ATCGGGAAGA TCTGATCGCA GGAATTGATG AATTTCTGGA TGAGGTCATC GTCCTTCCTC 1081 CTGGAGAATG GGACCCAAAT ATCCGGATTG AGCCCCCCAA GAAGGTGCCC TCTGCTGACA 1141 AGAGGAAATC TGTGTTCTCC CTAGCAGAGC TGGGCCAGAT GAATGGCTCT GTGGGAGGAG 1201 GCGGCGGAGC TCCTGGAGGA GGCAATGGAG GTGGTGGTGG TGGTGGCAGT GGCGGCGGGG 1261 CTGGCAGTGG CGGGGCCGGC GGAACAAGCA GCGGGGATGA TGGAGAGATG CCAGCCATGC 1321 ATGAAATCGG GGAGGAACTT ATCTGGACAG GAAGGTTCTT CGGTGGCCTG TGTCTGGATA 1381 TCAAGAGGAA GTTGCCCTGG TTCCCAAGTG ACTTCTATGA TGGCTTCCAC ATTCAGTCCA 1441 TCTCTGCCAT CCTATTCATC TACCTCGGCT GTATCACCAA CGCGATCACC TTTGGTGGGC 1501 TTCTGGGGGA TGCCACCGAC AATTATCAGG GAGTGATGGA GAGCTTCCTG GGCACTGCCA 1561 TGGCTGGCTC CTTGTTCTGC CTCTTCTCGG GACAGCCTCT CATCATTCTC AGCAGCACGG 1621 GGCCCATCCT CATCTTTGAG AAGCTCCTCT TCGACTTCAG CAAAGGCAAT GGCCTGGACT 1681 ACATGGAGTT CCGCCTCTGG ATTGGCCTAC ACTCAGCTGT CCAGTGCCTT ATCCTAGTGG 1741 CCACAGATGC CAGCTTTATC ATCAAATATA TCACCCGCTT CACCGAGGAG GGCTTCTCCA 1801 CCCTTATCAG CTTCATCTTC ATCTACGATG CCATCAAGAA GATGATCGGT GCCTTCAAGT 1861 ACTACCCTAT CAATATGGAC TTCAAGCCAA ACTTCATCAC TACCTACAAG TGCGAGTGTG 1921 TCGCCCCTGA CACAGTGAAT ACAACCGTGT TCAATGCTTC AGCCCCATTG GCACCAGACA 1981 CCAACGCTTC TCTGTACAAC CTCCTTAACC TCACAGCGTT GGACTGGTCC CTGCTGAGCA 2041 AGAAGGAGTG TCTGAGCTAC GGCGGGCGCC TGCTTGGGAA TTCCTGCAAG TTTATCCCAG 2101 ACCTGGCGCT CATGTCCTTC ATCCTTTTCT TTGGGACATA CTCCATGACC CTGACCCTGA 2161 AGAAGTTCAA ATTCAGCCGC TATTTTCCTA CCAAGCCAAC GCGGCCTGAC CGAGGCTGGT 2221 TCGTGGCCCC CTTTGGGAAG AACCCGTGGT GGGTATACCC AGCAAGCATC CTGCCCGCCC 2281 TGCTGGTGAC CATCCTGATC TTCATGGACC AGCAGATCAC TGCCGTCATT GTCAACCGGA 2341 AGGAGAACAA ACTGAAGAAG GCTGCCGGCT ACCATCTGGA CCTGTTCTGG GTGGGCATCC 2401 TCATGGCTTT GTGCTCCTTT ATGGGGCTCC CCTGGTACGT GGCTGCCACG GTCATCTCCA 2461 TCGCCCACAT CGACAGCCTC AAGATGGAGA CAGAGACCAG TGCCCCTGGG GAGCAGCCCC 2521 AGTTTCTGGG AGTCAGGGAA CAGAGAGTAA CCGGCATCAT CGTCTTCATC CTGACGGGAA 2581 TCTCTGTCTT CCTGGCTCCC ATCCTAAAGT GTATCCCCCT GCCGGTGCTG TACGGAGTCT 2641 TCCTCTACAT GGGCGTGGCC TCCCTGAATG GCATCCAGTT CTGGGAACGC TGCAAGCTCT 2701 TCCTGATGCC AGCCAAGCAC CAGCCGGACC ATGCCTTCCT GCGGCACGTG CCGCTGCGCC 2761 GGATCCACCT CTTCACCCTG GTGCAGATCC TCTGCCTGGC GGTGCTCTGG ATCCTCAAAT 2821 CCACGGTGGC TGCCATCATC TTCCCGGTCA TGATCCTGGG CCTCATCATC GTTCGAAGGC 2881 TTCTGGATTT CATCTTTTCC CAGCACGACC TGGCCTGGAT TGACAACATC CTCCCAGAGA 2941 AGGAAAAAAA GGAGACAGAC AAGAAGAGGA AGAGAAAAAA AGGGGCCCAC GAGGACTGTG 3001 ATGAGGAGCC CCAGTTCCCT CCTCCCTCGG TTATAAAGAT TCCCATGGAA AGTGTCCAAT 3061 CAGATCCCCA AAACGGTATC CACTGCATTG CCAGAAAAAG ATCTTCCAGT TGGAGTTACT 3121 CACTCTTGCC AACTTTCTTG TACAAAGTtg gcattataag aaagcattgc ttatcaattt 3181 gttgcaacga ac