We are going to use the LacI sequence for this analysis. The crystal structure of LacI was determined in 1996 and is shown in the image below that was pulled from that page.
![]() | Figure 5. From Lewis et al., (1996) Science 271(5253):1247-1254. (A) View of the lac repressor-DNA complex monomer where the domains have been colored separately. The DNA helix-turn-helix binding domain is colored in red, the DNA binding hinge helix is shown in yellow, the NH2-terminal subdomain of the core is light blue, the COOH-terminal subdomain of the core is dark blue, and the helix of the tetramerization domain is purple. (B) Topology diagram of the lac repressor-DNA complex monomer. Helices are depicted as circles and strands as squares. The DNA binding domain consists of helix 1 (residues 6 to 12), helix 2 (17 to 25), helix 3 (32 to 45), and the hinge helix 4 (50 to 58) (red and yellow). The NH2-terminal subdomain of the core (light blue) is comprised of strand A (63 to 68), helix 5 (74 to 90), strand B (92 to 98), helix 6 (104 to 116), strand C (121 to 124), helix 7 (131 to 137), strand D (145 to 149), strand E (158 to 161), helix 13 (293 to 309), and strand K (316 to 318). The COOH-terminal subdomain of the core (dark blue) consists of helix 8 (164 to 175), strand F (182 to 185), helix 9 (192 to 205), strand G (214 to 217), helix 10 (222 to 233), strand H (240 to 244), helix 11 (247 to 259), strand I (269 to 274), helix 12 (279 to 281), strand J (287 to 290), and strand L (322 to 324). The helix of the tetramerization domain (purple) is helix 14 (340 to 357) and residues 358 to 360 are disordered. The linkers between the NH (2-terminal) and COOH-terminal core subdomains are highlighted in green. N-terminus, NH2-terminus; C-terminus, COOH-terminus. From: Lewis: Science, Volume 271(5253).March 1, 1996.1247-1254 |
We are using this sequence becuase many of the features are already known for it. Notably there is a helix-turn-helix motif at the start of the protein. This motif is a common one in protein sequences that bind DNA, and we can predict that from some of the analyses that we will do.
The HelicalWheel program can predict whether there are helices in short regions of proteins. We can use this program with just the N-terminal region of lacI.
Load the sequence into SeqWeb, and run the HelicalWheel program. Your output will look something like this, although I have added amino acid numbers to make the results clearer.
The three residues in blue (G9, V10, S11) form the turn in the helix-turn-helix. However the distribution of the other sequences clearly demonstrates the helical wheel.
In addition to just predicting individual helices, you can also predict helix-turn-helix motifs. This is better than the HelicalWheel program because it can use the whole sequence.
sequence manager icon, and then select Add and click Database as highlighted below:

and click Add selected

and choose HTHScan:


HTHScan Results
HTHScan February 27, 2002 16:03
Weight matrix: htharac.dat Minimum score for H-T-Hs (threshold): 4.0
| Sequence | Hit # | Score | Probability |
|---|---|---|---|
| laci_ecoli.swissprot | 1 | 11.1 | 3.420E-03 |
> sequence: laci_ecoli
name: laci_ecoli.swissprot check: 1946 from: 1 to: 360
1. 6 LYDVAEYAGVSYQTVSRVVN 25
Score: 11.1
Probability: 3.420E-03
Input sequences searched: 1
Number of sequences with predicted H-T-Hs: 1
CPU time (sec): 0.22
This shows that there is a helix-turn-helix between positions one and 25, as shown in the crystal structure above
CoilScan can look for coiled-coil motifs, regions of the protein that could be involved in leucine-zipper type dimerizations.


CoilScan of: laci_ecoli
i
COILSCAN of: laci_ecoli.swissprot Check 1946 from: 1 to 360 February 27, 2002 16:37 ID LACI_ECOLI STANDARD; PRT; 360 AA. AC P03023; P71309; Q47338; O09196; DT 21-JUL-1986 (Rel. 01, Created) DT 01-NOV-1997 (Rel. 35, Last sequence update) DT 15-DEC-1998 (Rel. 37, Last annotation update) DE LACTOSE OPERON REPRESSOR. . . . Window: 14 Coiled-coil segments predicted at a minimum probability of 0.50 Weight matrix: localdatadirs:mtidkcoils.dat 32 AKTREKVEAAMAELN 46 Probability: 0.75 344 DSLMQLARQVSRLE 357 Probability: 0.55
You will also have a table that shows the probability that each amino acid is part of a coil.
Note that this protein correctly predicts the C-terminal coil that is part of the tetramerization domain of LacI and is shown in the crystal structure above