Synthetic Biology:Vectors/Barcode

From OpenWetWare

(Difference between revisions)
Jump to: navigation, search
Line 1: Line 1:
 +
 +
==Proposed barcode==
 +
 +
Each codon represents an alphanumeric character (case-insensitive).  For convenience, those letters of the alphabet which represent a single letter amino acid code, are coded by their first codon (in alphabetical order).
 +
 +
(Note this table was done by hand so please correct errors!)
 +
 +
{| border="1"
 +
|-
 +
! Codon
 +
! Character
 +
|-
 +
| GCA
 +
| A
 +
|-
 +
|
 +
| B
 +
|-
 +
| TGC
 +
| C
 +
|-
 +
| GAC
 +
| D
 +
|-
 +
| GCA
 +
| E
 +
|-
 +
| TTC
 +
| F
 +
|-
 +
| GGA
 +
| G
 +
|-
 +
| CAC
 +
| H
 +
|-
 +
| ATA
 +
| I
 +
|-
 +
|
 +
| J
 +
|-
 +
| AAA
 +
| K
 +
|-
 +
| CTA
 +
| L
 +
|-
 +
| ATG
 +
| M
 +
|-
 +
| AAC
 +
| N
 +
|-
 +
|
 +
| O
 +
|-
 +
| CCA
 +
| P
 +
|-
 +
| CAA
 +
| Q
 +
|-
 +
| CGA
 +
| R
 +
|-
 +
| TCA
 +
| S
 +
|-
 +
| ACA
 +
| T
 +
|-
 +
|
 +
| U
 +
|-
 +
| GTA
 +
| V
 +
|-
 +
| TGG
 +
| W
 +
|-
 +
|
 +
| X
 +
|-
 +
| TAC
 +
| Y
 +
|-
 +
|
 +
| Z
 +
|}
 +
==Early discussions==
==Early discussions==
Is there a plan for the barcode?
Is there a plan for the barcode?

Revision as of 13:46, 31 January 2006

Proposed barcode

Each codon represents an alphanumeric character (case-insensitive). For convenience, those letters of the alphabet which represent a single letter amino acid code, are coded by their first codon (in alphabetical order).

(Note this table was done by hand so please correct errors!)

Codon Character
GCA A
B
TGC C
GAC D
GCA E
TTC F
GGA G
CAC H
ATA I
J
AAA K
CTA L
ATG M
AAC N
O
CCA P
CAA Q
CGA R
TCA S
ACA T
U
GTA V
TGG W
X
TAC Y
Z

Early discussions

Is there a plan for the barcode?

  • Should the barcode only be readable by sequencing or is it sufficient to just look for an amplified band in a PCR reaction.
    • If PCR is sufficient we could build in a unique sequence just before the BB prefix and then design a reverse primer to that sequence to use along with VF.
  • It seems like the most likely short-mid term problem is that a researcher would be uncertain as to which BioBrick vector they had, rather than the doomsday question of trying to work out if there is a BioBrick vector somewhere in the drink that turned Drew's hair pink.
    • Given this assumption, could we choose restriction sites, each of which are found uniquely in one of our BioBrick vectors? A researcher could just prep, digest and run on a gel to tell which vector they had.--BC
      • It might be useful to be able to tell the plasmid (and resistance) by colony PCR rather than a prep. A PCR requires less starting material. -Jkm
  • There is no current plan for the barcode. The intention was just to make the identity of the plasmid obvious from a sequencing reaction but this goal is compatible with making the plasmid identifiable via a colony PCR as well. Choosing a unique restriction site for each vector would be more difficult because that would involve placing additional requirements in the BioBricks standard. i.e. Parts cannot have any of the BioBrick enzymes nor this list of restriction enzymes that are identifiers for vectors. This doesn't seem practical to me. -- RS
    • I'm not in favor of inserting restriction sites but you can probably get away without using any new enzymes under certain assumptions. First let's assume one always inserts into a new plasmid (3-way ligation, either with or without 3 antibiotic selection). Then you can just insert various combinations of BioBrick enzymes into specific locations into the plasmids and look at the pattern of bands when you cut with them. The benefit of this is let's say you cut a part with ES, run on gel, and based on the band pattern from the plasmid, you know immediately which plasmid it's in, and if it's correct, you isolate the part band and can proceed with the assembly. You have the same problem as below if one of the plasmid pieces is the same length as the part, but now you may have more potential conflicting bands. 3-antibiotic assembly without purification shouldn't really be impacted by a couple more pieces of plasmid floating around. You can also take this idea by defining another single enzyme that will be used for this purpose and you can tell plasmids apart again by the differetn lengths generated after digest. So you definitely don't need one enzyme/plasmid.

One plan that I am currently considering is actually encoding the name of the plasmid in DNA. For instance,

AAA = 0; AAC = 1; AAG = 2; AAT = 3; . . . AGC = 9; AGG = A; AGT = B; . . . GAT = Z;

So that you could literally write out pSB5AC4-P1010.I50020 in DNA. Of course, we may want to make this slightly more intelligent to space out characters, include start and stop strings and avoid key codons like ATG, TAA and TGA. Any comments? --RS

Personal tools