Construct: ORF ccsbBroadEn_14581
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF008144.1_s300c1, BRDN0000393221
- DNA Barcode: None
- Epitope Tag: None
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: CEACAM5 (1048)
Vector Information:
- Vector Backbone: pDONR223
- Pol II Cassette 1: n/a
- Pol II Cassette 2: n/a
- Selection Marker: n/a
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as (total nt. matches) / (aligned length, incl. gaps).
Prot. Match %: a simple amino acid-based global alignment percentage, calculated as (total aa. matches) / (aligned length, incl. gaps).
Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.

# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 1048 | CEACAM5 | CEA cell adhesion molecule 5 | NM_001291484.3 | 99.7% | 99.2% | (many diffs) |
2 | human | 1048 | CEACAM5 | CEA cell adhesion molecule 5 | NM_004363.6 | 99.7% | 99.2% | (many diffs) |
3 | human | 1048 | CEACAM5 | CEA cell adhesion molecule 5 | NM_001308398.2 | 99.5% | 99.1% | (many diffs) |
4 | human | 1048 | CEACAM5 | CEA cell adhesion molecule 5 | XM_017026145.2 | 99.5% | 99.1% | (many diffs) |
5 | human | 1048 | CEACAM5 | CEA cell adhesion molecule 5 | XM_017026146.2 | 87.6% | 87% | (many diffs) |
6 | human | 1048 | CEACAM5 | CEA cell adhesion molecule 5 | XM_011526322.2 | 74.5% | 74.3% | 12C>N;628A>T;682_683ins534 |
7 | human | 634 | CEACAM1 | CEA cell adhesion molecule 1 | NM_001184816.2 | 44.5% | 38.6% | (many diffs) |
8 | human | 634 | CEACAM1 | CEA cell adhesion molecule 1 | XM_011527206.2 | 43.9% | 37.8% | (many diffs) |
9 | human | 4680 | CEACAM6 | CEA cell adhesion molecule 6 | NM_002483.7 | 43.3% | 38.8% | (many diffs) |
10 | human | 4680 | CEACAM6 | CEA cell adhesion molecule 6 | XM_011526990.2 | 41.6% | 38.3% | (many diffs) |
11 | human | 1088 | CEACAM8 | CEA cell adhesion molecule 8 | XM_017026196.1 | 39.9% | 34.6% | (many diffs) |
12 | human | 1088 | CEACAM8 | CEA cell adhesion molecule 8 | XM_011526341.1 | 31.6% | 26.2% | (many diffs) |
13 | human | 1088 | CEACAM8 | CEA cell adhesion molecule 8 | XM_011526342.1 | 31.6% | 26.2% | (many diffs) |
14 | human | 1088 | CEACAM8 | CEA cell adhesion molecule 8 | XM_017026198.1 | 31.6% | 26.2% | (many diffs) |
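The match percentages in this table are described as total matches divided by aligned length including gaps. A minimal Python sketch of that ratio follows. It assumes the two sequences have already been globally aligned and padded with `-` for gaps; the function name `match_percent` and the toy inputs are hypothetical and not part of the source record, and the portal's actual alignment parameters are not stated here.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the full aligned length, gaps included.

    Assumes both inputs come from the same global alignment, so they
    have equal length and use `gap` as the padding character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    a, b = aligned_a.upper(), aligned_b.upper()
    # A column counts as a match only if both residues agree and neither is a gap.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != gap)
    return 100.0 * matches / len(a)


# Toy alignment: 4 matching columns out of 6 aligned columns.
print(round(match_percent("ACGT-A", "ACCTTA"), 1))  # 66.7
```

Because gap columns stay in the denominator, a large insertion lowers the percentage even when every aligned base agrees, which is consistent with rows such as the one reporting `682_683ins534`.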
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 66
- ORF end: 2172
- ORF length: 2106
- Sequence:
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TANAAAAAAG
61 TTGGCATGGA GTCTCCNTCG GCCCCTCCCC ACAGATGGTG CATCCCCTGG CAGAGGCTCC
121 TGCTCACAGC CTCACTTCTA ACCTTCTGGA ACCCGCCCAC CACTGCCAAG CTCACTATTG
181 AATCCACGCC GTTCAATGTC GCAGAGGGGA AGGAGGTGCT TCTACTTGTC CACAATCTGC
241 CCCAGCATCT TTTTGGCTAC AGCTGGTACA AAGGTGAAAG AGTGGATGGC AACCGTCAAA
301 TTATAGGATA TGTAATAGGA ACTCAACAAG CTACCCCAGG GCCCGCATAC AGTGGTCGAG
361 AGATAATATA CCCCAATGCA TCCCTGCTGA TCCAGAACAT CATCCAGAAT GACACAGGAT
421 TCTACACCCT ACACGTCATA AAGTCAGATC TTGTGAATGA AGAAGCAACT GGCCAGTTCC
481 GGGTATACCC GGAGCTGCCC AAGCCCTCCA TCTCCAGCAA CAACTCCAAA CCCGTGGAGG
541 ACAAGGATGC TGTGGCCTTC ACCTGTGAAC CTGAGACTCA GGACGCAACC TACCTGTGGT
601 GGGTAAACAA TCAGAGCCTC CCGGTCAGTC CCAGGCTGCA GCTGTCCAAT GGCAACAGGA
661 CCCTCACTCT ATTCAATGTC ACAAGAAATG ACTCAGCAAG CTACAAATGT GAAACCCAGA
721 ACCCAGTGAG TGCCAGGCGC AGTGATTCAG TCATCCTGAA TGTCCTCTAT GGCCCGGATG
781 CCCCCACCAT TTCCCCTCTA AACACATCTT ACAGATCAGG GGAAAATCTG AACCTCTCCT
841 GCCACGCAGC CTCTAACCCA CCTGCACAGT ACTCTTGGTT TGTCAATGGG ACTTTCCAGC
901 AATCCACCCA AGAGCTCTTT ATCCCCAACA TCACTGTGAA TAATAGTGGA TCCTATACGT
961 GCCAAGCCCA TAACTCAGAC ACTGGCCTCA ATAGGACCAC AGTCACGACG ATCACAGTCT
1021 ATGCAGAGCC ACCCAAACCC TTCATCACCA GCAACAACTC CAACCCCGTG GAGGATGAGG
1081 ATGCTGTAGC CTTAACCTGT GAACCTGAGA TTCAGAACAC AACCTACCTG TGGTGGGTAA
1141 ATAATCAGAG CCTCCCAGTC AGTCCCAGGC TGCAGCTGTC CAATGGCAAC AGGACCCTCA
1201 CTCTATTCAA TGTCACAagg aatgatgtag gaccctatga gtgtggaatc cagaacgaat
1261 taagtgttga ccacagcgac ccagtcatcc tgaatgtcct ctatggccca gacgacccca
1321 ccatttcccc ctcatacacc tattaccgtc caggggtgaa cctcagcctc tcctgccatg
1381 cagcctctaa cccacctgca cagtattctt ggctgattga tgggaacatc cagcaacaca
1441 cacaagagct ctttatctcc aacatcactg agaagaacag cggactctat acctgccagg
1501 ccaataactc agccagtggc cacagcagga ctacagtcaa gacaatcaca gtctctgCGG
1561 AGCTGCCCAA GCCCTCCATC TCCAGCAACA ACTCCAAACC CGTGGAGGAC AAGGATGCTG
1621 TGGCCTTCAC CTGTGAACCT GAGGCTCAGA ACACAACCTA CCTGTGGTGG GTAAATGGTC
1681 AGAGCCTCCC AGTCAGTCCC AGGCTGCAGC TGTCCAATGG CAACAGGACC CTCACTCTAT
1741 TCAATGTCAC AAGAAATGAC GCAAGAGCCT ATGTATGTGG AATCCAGAAC TCAGTGAGTG
1801 CAAACCGCAG TGACCCAGTC ACCCTGGATG TCCTCTATGG GCCGGACACC CCCATCATTT
1861 CCCCCCCAGA CTCGTCTTAC CTTTCGGGAG CGAACCTCAA CCTCTCCTGC CACTCGGCCT
1921 CTAACCCATC CCCGCAGTAT TCTTGGCGTA TCAATGGGAT ACCGCAGCAA CACACACAAG
1981 TTCTCTTTAT CGCCAAAATC ACGCCAAATA ATAACGGGAC CTATGCCTGT TTTGTCTCTA
2041 ACTTGGCTAC TGGCCGCAAT AATTCCATAG TCAAGAGCAT CACAGTCTCT GCATCTGGAA
2101 CTTCTCCTGG TCTCTCAGCT GGGGCCACTG TCGGCATCAT GATTGGAGTG CTGGTTGGGG
2161 TTGCTCTGAT ATACCCAACT TTCTTGTACA AAGTtggcat tataagaaag cattgcttat
2221 caatttgttg caacgaac
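The ORF coordinates can be checked against the listing with a short Python sketch. It assumes the start is 1-based and inclusive and the end is exclusive, which is an inference from the arithmetic 2172 - 66 = 2106 rather than something the record states. The helper names `parse_numbered_sequence` and `extract_orf` are hypothetical, and only the first 120 bases of the listing are embedded to keep the example short.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Drop position numbers and whitespace from a numbered sequence dump."""
    return re.sub(r"[\d\s]", "", text)


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice an ORF, ASSUMING a 1-based inclusive start and an exclusive end."""
    return seq[start - 1 : end - 1]


# First 120 bases of the sequence listing (positions 1 to 120).
head = """
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TANAAAAAAG
61 TTGGCATGGA GTCTCCNTCG GCCCCTCCCC ACAGATGGTG CATCCCCTGG CAGAGGCTCC
"""
seq = parse_numbered_sequence(head)

ORF_START, ORF_END, ORF_LENGTH = 66, 2172, 2106

# The stated length matches end minus start, and is a whole number of codons.
assert ORF_END - ORF_START == ORF_LENGTH
assert ORF_LENGTH % 3 == 0

# The first two codons of the ORF, read from position 66.
print(extract_orf(seq, ORF_START, ORF_START + 6))  # ATGGAG
```

Under the same assumption, the final codon of the full listing falls at positions 2169 to 2171 and reads ATA, which agrees with the note that the insert carries no stop codon.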