This page describes
tools and guidelines for use in assigning newly identified Hops
to existing families and subgroups. These protocols are written
for researchers whose expertise in phylogenetic analyses does
not extend beyond basic BLAST analyses. The general approach involves
alignment of sequences in a given Hop family using ClustalW followed
by use of MEGA2.1 to evaluate clustering and calculate genetic
distance.
NOTE
3-24-05: Since publication of Lindeberg et al., 2005 and
the writing of this tutorial, MEGA 3.0 has been released. However,
MEGA 2.1 is still available for downloading at the site listed
below. Instructions for calculating genetic distance using MEGA
3.0 are currently being developed.
Note that
guidelines for selecting the Hop names themselves including names
for novel Hops, Hops belonging to novel or preexisting subgroups
in previously identified families, as well as chimeric, truncated,
or non-expressed Hops can be found on the Name
structure and Selection page.
Outline
of procedures for Hop phylogenetic analysis:
(download
pdf of this protocol)
I.
Confirm similarity to one or more previously characterized Hops
using BLAST analysis
II. Sequence alignment in ClustalW
A.
Obtain
a file listing all sequences in the Hop family of interest
B. Generate alignment file
III.
Clustering analysis and genetic distance calculation in MEGA
A.
Download and install MEGA
B. Convert Clustal alignment file to
MEGA format
C. Perform clustering analysis in MEGA
D. Calculate genetic distance between
the new Hop and established subgroups
I.
Confirm similarity to one or more previously characterized Hops
using BLAST analysis.
- Conduct
BLASTP analyses to determine whether a given protein is similar
to any previously characterized Hops.
- If
there IS NO significant similarity to one or more previously
characterized Hops, and and the newly identified Hop has been
confirmed by criteria other than sequence similarity (see
Criteria for Hop name assignment),
go to Name structure and Selection:
How to name a Hop for guidelines on naming novel Hop proteins.
- If
there IS significant similarity to one or more previously
characterized Hops (roughly defined as a BLAST expect value
of less than 10-5 and with alignment
extending over 60% of the length of the protein) follow the
steps below to assign a subgroup classification, or contact
the PPI site administrator
and a subgroup classification can be generated for you.
II.
Sequence alignment in ClustalW
A.
Obtain a file listing all sequences in the Hop family of interest
- Go to
the list of assigned Hop names
and obtain the sequences for all members of the appropriate
family by clicking on the family name.
- Save
the list of sequences as a text file. Note that the sequences
are listed in FASTA
format.
- Add the
sequence of the newly identified Hop to the list (also in
FASTA format) and save the file.
B.
Generate alignment file
ClustalW
is a general purpose tool for alignment of multiple sequences.
It is available through numerous websites, including the EMBL-EBI
site described here. (The HopA family is used to illustrate
the various procedures)
- Go to
ClustalW
at EMBL-EBI. The following window will appear
 |
1.
Under OUTPUT FORMAT Select "aln w/o numbers"
(All
other settings can stay at their default value)
2.
Paste or upload list of FASTA formatted sequences
into the window
3.
Hit "Run"
|
- ClustalW
will generate the files listed in the window below. Click
on the link to the sequence alignment file (designated by
an .aln extension). Save the .aln file as a text file, renaming
it if desired

III.
Clustering analysis and genetic distance calculation in MEGA.
MEGA (Molecular Evolutionary Genetics Analysis) is a free software
package for comparative sequence analysis