A New Readout Approach in DNA Computing Based on Real-Time PCR with TaqMan Probes

Size: px
Start display at page:

Download "A New Readout Approach in DNA Computing Based on Real-Time PCR with TaqMan Probes"

Transcription

1 A New Readout Approach in DNA Computing Based on Real-Time PCR with TaqMan Probes Zuwairie Ibrahim 1, John A. Rose 2, Yusei Tsuboi 3, Osamu Ono 4, and Marzuki Khalid 1 1 Center for Artificial Intelligence and Robotics, Department of Mechatronics and Robotics, Faculty of Electrical Engineering, Universiti Teknologi Malaysia, UTM Skudai, Johor Darul Takzim, Malaysia zuwairie@fke.utm.my, marzuki@utmkl.utm.my 2 Institute of Information Communication Technology, Ritsumeikan Asia Pacific University, 1-1 Jumonjibaru, Beppu-shi, Oita Japan Japan Science and Technology Agency-CREST jarose@apu.ac.jp 3 Bio-Mimetic Control Research Center, RIKEN, Anagahora, Shimoshidami, Moriyama-ku, Nagoya, , Japan tsuboi@bmc.riken.jp 4 Institute of Applied DNA Computing, Meiji University, Higashi-mita, Tama-ku, Kawasaki-shi, Kanagawa-ken, Japan ono@isc.meiji.ac.jp Abstract. A new readout approach for the Hamiltonian Path Problem (HPP) in DNA computing based on the real-time polymerase chain reaction (PCR) is investigated. Several types of fluorescent probes and detection mechanisms are currently employed in real-time PCR, including SYBR Green, molecular beacons, and hybridization probes. In this study, real-time amplification performed using the TaqMan probes is adopted, as the TaqMan detection mechanism can be exploited for the design and development of the proposed readout approach. Double-stranded DNA molecules of length 120 base-pairs are selected as the input molecules, which represent the solving path for an HPP instance. These input molecules are prepared via the self-assembly of 20-mer and 30-mer single-stranded DNAs, by parallel overlap assembly. The proposed readout approach consists of two steps: real-time amplification in vitro using TaqManbased real-time PCR, followed by information processing in silico to assess the results of real-time amplification, which in turn, enables extraction of the Hamiltonian path. The performance of the proposed approach is compared with that of conventional graduated PCR. Experimental results establish the superior performance of the proposed approach, relative to graduated PCR, in terms of implementation time. 1 Introduction Since the discovery of the polymerase chain reaction (PCR) [1], numerous applications have been explored, primarily in the life sciences and medicine, and importantly, in DNA computing as well. The subsequent innovation of real-time PCR has rapidly C. Mao and T. Yokomori (Eds.): DNA12, LNCS 4287, pp , Springer-Verlag Berlin Heidelberg 2006

2 A New Readout Approach in DNA Computing 351 gained popularity and plays a crucial role in molecular medicine and clinical diagnostics [2]. All real-time amplification instruments require a fluorescence reporter molecule for detection and quantitation, whose signal increase is proportional to the amount of amplified product. Although a number of reporter molecules currently exist, it has been found that the mechanism of the TaqMan hydrolysis probe is very suitable for the design and development of readout method for DNA computing, and is thus selected for the current study. A TaqMan DNA probe is a modified, non-extendable dual-labeled oligonucleotides. The 5 and 3 ends of the oligonucleotide are terminated with an attached reporter, such as FAM, and quencher fluorophore dyes, such as TAMRA, respectively, as shown in Fig. 1 [3]. Upon laser excitation at 488 nm, the FAM fluorophore, in isolation emits fluorescence at 518 nm. Given proximity of the TAMRA quencher, however, based on the principle of fluorescence resonance energy transfer (FRET), the excitation energy is not emitted by the FAM fluorophore, but rather is transferred along the sugar-phosphate-backbone to TAMRA. As TAMRA emits this absorbed energy at a significantly longer wavelength (580 nm), the resulting fluorescence is not observable in Channel 1 of real-time PCR instruments [4]. Fig. 1. Illustration of the structure of a TaqMan DNA probe. Here, R and Q denote the reporter and quencher fluorophores, respectively. The combination of dual-labeled TaqMan DNA probes with forward and reverse primers is a must for a successful real-time PCR. As PCR is a repeated cycle of three steps (denaturation, annealing, and polymerization), a TaqMan DNA probe will anneal to a site within the DNA template in between the forward and reverse primers during the annealing step, if a subsequence of the DNA template is complementary to the sequence of the DNA probe. During polymerization, Thermus aquaticus (Taq) DNA polymerase will extend the primers in a 5 to 3 direction. At the same time, the Taq polymerase also acts as a scissor to degrade the probe via cleavage, thus separating the reporter from the quencher, as shown in Fig. 2 [5], where R and Q denote the reporter and quencher dyes, respectively. This separation subsequently allows the reporter to emit its fluorescence [6]. This process occurs in every PCR cycle and does not interfere with the exponential accumulation of PCR product. As a result of PCR, the amount of DNA template increases exponentially, which is accompanied by a proportionate increase in the overall fluorescence intensity emitted by the reporter group of the excised TaqMan probes. Hence, the intensity of the measured fluorescence at the end of each PCR polymerization is correlated to the total amount of PCR product, which can then be detected, using a real-time PCR instrument for visualization.

3 352 Z. Ibrahim et al. In this paper, we propose a new readout method tailored specifically to HPP in DNA computing, which employs a hybrid in vitro-in silico approach. In the in vitro phase, O( V 2 ) TaqMan-based real-time PCR reactions are performed in parallel, to investigate the ordering of pairs of nodes in the Hamiltonian path of a V -node instance graph, in terms of relative distance from the DNA sequence encoding the known start node. The resulting relative orderings are then processed in silico, which efficiently returns the complete Hamiltonian path. To the best of our knowledge, the proposed approach is the first experimentally validated optical method specifically designed for the quick readout of HPP instances, in DNA computing. Previously, graduated PCR, which was originally demonstrated by Adleman [7], was employed to perform such operations. While a DNA chip based methodology, which makes use of biochip hybridization for the same purpose has been proposed [8-10], this method is more costly, and has yet to be experimentally implemented. Fig. 2. Degradation of a TaqMan probe, via cleavage by DNA polymerase 2 Notation and Basic Principle First of all, v 1(a) v 2(b) v 3(c) v 4(d) denotes a double-stranded DNA (dsdna) which contains the base-pairs subsequences, v 1, v 2, v 3, and v 4, respectively. Here, the subscripts in parenthesis (a, b, c, and d) indicate the length of each respective base-pair subsequence. For instance, v 1(20) indicates that the length of the double-stranded subsequence, v 1 is 20 base-pairs (bp). When convenient, a dsdna may also be represented without indicating segment lengths (e.g., v 1 v 2 v 3 v 4 ). A reaction denoted by TaqMan(v 0,v k,v l ) indicates that real-time PCR is performed using forward primer v 0, reverse primer v l, and TaqMan probe v k. Based on the proposed approach, there are two possible reaction conditions regarding the relative locations of the TaqMan probe and reverse primer. In particular, the first condition occurs when the TaqMan probe specifically hybridizes to the template, between the forward and reverse primers, while the second occurs when the reverse primer hybridizes between the forward primer and the TaqMan probe. As shown in Fig. 3, these two conditions would result in different amplification patterns during real-time PCR, given the same DNA template (i.e., assuming that they occurred separately, in two different PCR reactions). The higher fluorescent output of the first condition is a typical amplification plot for real-time PCR. In contrast, the relatively lower fluorescent output of the second condition, which reflects the cleavage of a lower number of TaqMan probes via DNA polymerase due to the unfavourable hybridization position

4 A New Readout Approach in DNA Computing 353 of the reverse primer, is due to linear rather than exponential amplification of the template. Thus, TaqMan(v 0,v k,v l ) = YES if an amplification plot similar to the first condition is observed, while TaqMan(v 0,v k,v l ) = NO if an amplification plot similar to the second condition is observed. Fig. 3. An example of amplification plots corresponding to TaqMan(v 0,v k,v l ) = YES (first condition) and TaqMan(v 0,v k,v l ) = NO (second condition) 3 The Proposed Readout Approach Let the output of an in vitro computation of an HPP instance of the input graph be represented by a 120-bp dsdna v 0(20) v 2(20) v 4(20) v 1(20) v 3(20) v 5(20), where the Hamiltonian path V 0 V 2 V 4 V 1 V 3 V 5, begins at node V 0, ends at node V 5, and contains intermediate nodes V 2, V 4, V 1, and V 3, respectively. Note that in practice, only the identities of the starting and ending nodes, and the presence of all intermediate nodes will be known in advance to characterize a solving path. The specific order of the intermediate nodes within such a path is unknown. The first part of the proposed approach, which is performed in vitro, consists of [( V -2) 2 -( V -2)]/2 real-time PCR reactions, each denoted by TaqMan(v 0,v k,v l ) for all k and l, such that 0 < k < V -2, 1 < l < V -1, and k < l. For this example instance, so that the DNA template is dsdna v 0 v 2 v 4 v 1 v 3 v 5, these 6 reactions, along with the expected output in terms of YES or NO are as follows: (1) TaqMan(v 0,v 1,v 2 ) = NO (2) TaqMan(v 0,v 1,v 3 ) = YES (3) TaqMan(v 0,v 1,v 4 ) = NO (4) TaqMan(v 0,v 2,v 3 ) = YES (5) TaqMan(v 0,v 2,v 4 ) = YES (6) TaqMan(v 0,v 3,v 4 ) = NO

5 354 Z. Ibrahim et al. Note that the overall process consists of a set of parallel real-time PCR reactions, and thus requires O(1) laboratory steps for in vitro amplification. The accompanying SPACE complexity, in terms of the required number of capillary tubes is O( V 2 ). Clearly, only one forward primer is required for all real-time PCR reactions, while the number of reverse primers and TaqMan probes required with respect to the size of input graph are each V -3. After all real-time PCR reactions are completed, the in vitro output is subjected to a pseudo-code for in silico information processing, producing the satisfying Hamiltonian path of the HPP instance in O(n 2 ) TIME (here, n denotes vertex number) as follows: Input: A[0 V -1]=2 // A[2, 2, 2, 2, 2, 2] A[0]=1, A[ V -1]= V // A[1, 2, 2, 2, 2, 6] for k=1 to V -3 for l=2 to V -2 while l>k if TaqMan(v 0,v k,v l ) = YES A[l] = A[l]+1 else A[k] = A[k]+1 endif endwhile endfor endfor It is assumed that a Hamiltonian path is stored in silico, in an array (e.g., A[0 V - 1]), for storage, information retrieval, and processing, such that A[i] A returns the exact location of a node, V i V, in the Hamiltonian path. Based on the proposed pseudo-code, and the example instance, the input array A is first initialized to A = {1, 2, 2, 2, 2, 6}. During the loop operations of the pseudo-code, the elements A[0] A and A[ V -1] A, are not involved, as these two elements may conveniently be initialized to the correct values, as the distinguished starting and ending nodes of the Hamiltonian path are known in advance. The loop operations are thus strictly necessary only for the remaining elements, A[1,2,3, V -2] A. Again, for the example instance, the output of the in silico information processing is A = {1, 4, 2, 5, 3, 6}, which represents the Hamiltonian path, V 0 V 2 V 4 V 1 V 3 V 5. For instance, in this case, it is indicated that V 3 is the fifth node in the Hamiltonian path, since A[3] = 5, and so on. 4 Experiments 4.1 Preparation of Input Molecules A pool of 120-bp input molecules v 0(20) v 2(20) v 4(20) v 1(20) v 3(20) v 5(20) is prepared, via standard protocol of parallel overlap assembly (POA) of single-stranded DNA strands (ssdnas). For this purpose, 11 ssdnas are required, including additional ssdnas, which act as link sequences for self-assembly. These strands are listed in Table 1. After completion, amplification via PCR was performed using the same protocol as POA. The forward primers and reverse primers used for the PCR reaction were 5 - CCTTAGTAGTCATCCAGACC-3 and 5 -CCACTGGTTCTGCATGTAAC-3, respectively.

6 A New Readout Approach in DNA Computing 355 Table 1. The required single-stranded DNAs for the generation of input molecules Name DNA Sequences (5-3 ) v 0 CCTTAGTAGTCATCCAGACC 20 v 2 CGCGCACCTTCTTAATCTAC 20 v 4 ATGCGCCAGCTTCTAACTAC 20 v 1 TGGACAACCGCAGTTACTAC 20 v 3 TCCACGCTGCACTGTAATAC 20 v 5 GTTACATGCAGAACCAGTGG 20 v 2 v 4 GCAGCGTGGAGTAGTTAGAA 20 v 4 v 1 AAGGTGCGCGGTATTACAGT 20 v 1 v 3 CGGTTGTCCAGTAGATTAAG 20 v 0 v 2 GCTGGCGCATGGTCTGGATGACTACTAAGG 30 v 3 v 5 CCACTGGTTCTGCATGTAACGTAGTAACTG 30 Length The PCR product was subjected to gel electrophoresis and the resultant gel image was captured, as shown in Fig. 4. The 120-bp band in lane 2 shows that the input molecules have been successfully generated. Afterwards, the DNA of interest is extracted. The final solution for real-time PCR was prepared via dilution of the extracted solution, by adding ddh 2 O (Maxim Biotech, Japan) into 100 µl. 4.2 Real-Time PCR Experiments The real-time PCR reaction involves primers (Proligo, Japan), TaqMan probes (Proligo, Japan), and LightCycler TaqMan Master (Roche Applied Science, Germany). The sequences for forward primers, reverse primers, and TaqMan probes are listed in Tables 2 and Table 3. In Table 2, the GC contents (GC%) and melting temperature (T m ), are also shown. The LightCycler TaqMan Master essentially contains 1 vial of enzyme, 3 vials of master mix, and 2 vials of PCR grade/water. The master mix contains FastStart Taq DNA polymerase, reaction buffer, MgCl 2, and dntp mix (with dutp instead of dttp). PCR grade/water is important to adjust the final reaction volume for real-time PCR. Subsequently, a reaction mix is prepared by pipetting 10 µl of enzyme into the master mix. For real-time PCR, as recommended by the manufacturer, the final concentration of primers should be between µm, whereas the final concentration of the DNA probes should be between µm. The final concentration for primers is set to 0.5 µm in this study, and for the TaqMan probes, the maximum final concentration, which is 0.1 µm, is chosen and prepared. In this study, real-time PCR was performed on a LightCycler 2.0 Instrument (Roche Applied Science, Germany) where amplification is carried out in a 20 µl LightCycler capillary tube (Roche Applied Science, Germany). Two solutions were prepared for each reaction: (1) 3 µl of a 10x primer/probe solution (5 µm of primers and 1 µm of probes), prepared by mixing 0.75 µl of 20 µm forward primer solution, 0.75 µl of a 20 µm reverse primer solution, and 1.5 µl of a 2 µm probe solution; and, (2) 15 µl of PCR mix, containing 4 µl of reaction mix, 9 µl PCR grade/water, and 2 µl of the previous 10x primer/probe solution. Note that even though 3 µl of 10x

7 356 Z. Ibrahim et al. Fig. 4. Gel image for the preparation of input molecules. Lane M denotes a 20-bp molecular marker, lane 1 is the product of initial pool generation based on parallel overlap assembly, and lane 2 is the amplified PCR product. Table 2. Sequences for forward primer and reverse primers employed for the real-time PCR Primer DNA Sequences (5-3 ) GC% T m (ºC) Forward primer, v 0 CCTTAGTAGTCATCCAGACC Reverse primer, v 2 GTAGATTAAGAAGGTGCGCG Reverse primer, v 4 GTAGTTAGAAGCTGGCGCAT Reverse primer, v 1 GTAGTAACTGCGGTTGTCCA Reverse primer, v 3 GTATTACAGTGCAGCGTGGA

8 A New Readout Approach in DNA Computing 357 Table 3. Sequences for TaqMan dual-labeled probes TaqMan Probes Taqw Taqx Taqy Taqz Taqw Sequences R-5 -CGCGCACCTTCTTAATCTAC-3 -Q R-5 -ATGCGCCAGCTTCTAACTAC-3 -Q R-5 -TGGACAACCGCAGTTACTAC-3 -Q R-5 -TCCACGCTGCACTGTAATAC-3 -Q R-5 -CGCGCACCTTCTTAATCTAC-3 -Q primer/probe solution was prepared, only 2 µl of that solution was used for the preparation of PCR mix. 15 µl of PCR mix was then injected via pipette into a capillary tube. Afterwards, 5 µl of the input molecule solution was injected into the same capillary tube. The capillary tube was then sealed with a stopper, and placed in an adapter. The adapter containing the capillary was placed into a microcentrifuge, and the centrifugation was performed at 3000 rpm. Seven separate real-time PCR reactions, including a negative control were performed, in order to implement the first stage of the proposed HPP readout. Note that the seventh reaction is for a negative control, which contains PCR grade/water instead of input molecules. The amplification consists of 45 cycles of denaturation, annealing, and extension, performed at 95ºC, 48ºC, and 72ºC, respectively. The annealing temperature is primer-dependent, and should be selected at 5ºC below the calculated primer melting temperatures. In the current study, the lowest primer melting temperature was estimated at 53.5ºC. Accordingly, an annealing temperature of 48ºC was selected. The resulting real-time PCR amplification plots are illustrated in Fig. 5. Fig. 5. Output of real-time PCR and grouping of output signals into three regions: amplification region (YES), non-amplification region (NO), and negative control region. The amplification phase, static phase, and error phase are also shown. The numbering 1 to 6 indicate the [( V -2) 2 - ( V -2)]/2 reactions TaqMan(v 0,v k,v l ) of input instance, while the seventh reaction is for negative control.

9 358 Z. Ibrahim et al. 6 Discussion As discussed in Section 2, in the in vitro stage of the proposed approach, each real-time PCR reaction is mapped to a binary output (i.e., either YES or NO ), based on the occurrence or absence of the typical exponential amplification. More specifically, a combination of forward primer, reverse primer, and TaqMan probe, operating on an input template molecule produces an output signal, which may then be recognized as an amplification plot corresponding to either the first or second condition, respectively. Based on the output plot of Fig. 5, the output of the negative control (seventh reaction) can be distinguished easily. As shown in Fig. 5, the real-time PCR output signals can be grouped into three regions: an amplification region, a non-amplification region, and a negative control region. Given the existence of this grouping, the subsequent in silico information processing is able to determine the Hamiltonian path of the input instance (e.g., V 0 V 2 V 4 V 1 V 3 V 5, for the example instance). In this study, each real-time PCR reaction consists of 45 cycles, and requires about 1 hour and 10 minutes. The resulting set of real-time PCR output signals may collectively be subdivided into three phases: an amplification phase, a static phase, and an error phase. Here, the amplification phase is defined as the initial time period, during which amplification-like signals are observed. The static phase, on the other hand, is defined as the subsequent time period, in which nearly all of the output signals exhibit only slight increases in fluorescence, with an increasing number of cycles. Lastly, the error phase is defined as the time period in which the negative control signal behaves as an amplification-like signal. Consideration of this effect is important, as shown by the current example instance. In particular, during the error phase (only), the output signal of reaction 3, which is properly clustered into the non-amplification region, enters the amplification region erroneously. In practice, real-time PCR may be halted, if desired, after the 11th cycle (i.e., after 25 min), as the output signals may already be easily distinguished and grouped into the above defined regions, and a set of binary output (i.e., either YES or NO ) data may be obtained, as the static and error phases are not important for this purpose. Note that at present, however, this grouping itself is not computerized. If the ability to extract molecular information is graded based on time factors, the proposed TaqMan-based real-time PCR approach is superior to conventional graduated PCR. In particular, for the case of graduated PCR, following DNA extraction, roughly 30 min is normally required to perform each PCR reaction, 30 min for gel electrophoresis, and another 30 min to stain the gel with SYBR Gold solution. Thus, graduated PCR is proven to be a very time consuming method. As discussed previously, it required about 25 min for the real-time PCR, and the in silico information processing can be done in less than 1 min. Hence, it would appear that the proposed approach is superior, in terms of implementation time compared to conventional graduated PCR. 7 Conclusion This research offers an improved and effective readout approach for DNA computing, which consists of in vitro real-time amplification and in silico information processing,

10 A New Readout Approach in DNA Computing 359 respectively. It is clear that the use of real-time PCR, as proposed in this paper differs from the conventional application of real-time PCR in the life sciences and medicine. Note that although the current approach has been developed here specifically for HPP readout, variations of this approach could also be applied to other computation models of DNA computing, in which automated and rapid visualization of the computational output are required. Acknowledgments. Zuwairie Ibrahim is very thankful to Universiti Teknologi Malaysia (UTM) for granting a study leave in Meiji University, Japan, under SLAB-JPA scholarship. This work was supported, in part, by a Grant-in-Aid for Scientific Research B ( ; J. Rose), from the Japan Society for the Promotion of Science (JSPS). References 1. Mullis, K. et al.: Specific Enzymatic Amplification of DNA in vitro: The Polymerase Chain Reaction, Cold Spring Harbor Symposium on Quantitative Biology, Vol. 51 (1986) Overbergh, L. et al.: The Use of Real-Time Reverse Transcriptase PCR for the Quantification of Cytokine Gene Expression, Journal of Biomolecular Techniques, Vol. 14 (2003) pp Walker, N.J.: A Technique Whose Time Has Come, Science, Vol. 296 (2002) Bubner, B. and Baldwin, I.T.: Use of Real-Time PCR for Determining Copy Number and Zygosity in Transgenic Plants, Plant Cell Reports, Vol. 23 (2004) Heid, C.A. et al.: Real-Time Quantitative PCR, Genome Research, Vol. 6 (1996) Holland, P.M. et al.: Detection of Specific Polymerase Chain Reaction Product by Utilizing the 5 3 Exonuclease Activity of Termus Aquaticus DNA Polymerase, Proceedings of the National Academy of Sciences of the United States of America, Vol. 88 (1991) Adleman, L.: Molecular Computation of Solutions to Combinatorial Problems, Science, Vol. 266 (1994) Rose, J.A. et al.: The Effect of Uniform Melting Temperatures on the Efficiency of DNA Computing, DIMACS Workshop on DNA Based Computers III (1997) Wood, D.H.: A DNA Computing Algorithm for Directed Hamiltonian Paths, Proceedings of the Third Annual Conference on Genetic Programming (1998) Wood, D.H. et al.: Universal Biochip Readout of Directed Hamiltonian Path Problems, Lecture Notes in Computer Science, Vol (1999)