Note: Descriptions are shown in the official language in which they were submitted.
DEMANDES OU BREVETS VOLUMINEUX
LA PRESENTE PARTIE DE CETTE DEMANDE OU CE BREVETS
= COMPREND PLUS D'UN TOME.
CECI EST LE TOME 1 _________________________ DE 2
NOTE: Pour les tomes additionels, veillez contacter le Bureau Canadien des
Brevets.
JUMBO APPLICATIONS / PATENTS
THIS SECTION OF THE APPLICATION / PATENT CONTAINS MORE
THAN ONE VOLUME.
THIS IS VOLUME 1 OF 2
NOTE: For additional volumes please contact the Canadian Patent Office.
= CA 2772224 2017-03-13
1 = =
COMPLETE GENOME SEQUENCE OF THE METHANOGEN
METHANOBREVIBACTER RUMINANTIUM
RELATED APPLICATIONS
This application claims the benefit of United States . Provisional Patent
Application
=61/237,296 filed August 27, 2009 and New Zealand Provisional Patent
Application
579292 filed August 27, 2009.
FIELD OF THE INVENTION
The present invention encompasses the complete genome sequence for the
methanogen, Mettranobrevibacter ruminantium. The invention encompasses
polynucleotides which encode M. ruminantium polypeptides or peptides, as well
as
polynucleotides from non-coding and intergenic regions. Also encompassed are
the
encoded M. ruminantium polypeptides and peptides, and antibodies directed to
these
peptides or polypeptides. The invention also encompasses expression vectors
and host
cells for producing these peptides, polypeptides, polynudeotides, and
antibodies. The
invention further encompasses methods and compositions for detecting,
targeting, and
inhibiting microbial cells, especially methanogen cells such as M. ruminantium
cells,
using one or more of the disclosed peptides, polypeptides, polynucleotides,
antibodies,
= - 20 expression vectors, and host cells.
BACKGROUND OF THE INVENTION
Global surface temperatures are predicted to increase between 1.1 C to 6.4 C
during
the twenty-first century primarily due to increased levels of greenhouse
gases.(GHGs)
in the atmosphere (Solomon et al., 2007). Methane (CH4) is a particularly
potent GHG,
having a global warming potential 21 times that of carbon dioxide (CO2) (IPCC,
2007)
and accounts for 16% of total global GHG emissions (Scheehle & Kruger, 2006).
Methane from agriculture represents around 40% of the emissions produced by
human-
related activities; the single largest source of which is from enteric
fermentation in
livestock, mainly from ruminant animals (Steinfeld et al., 2006). The
worldwide demand
for meat and milk is predicted to double by 2050 (Food and Agriculture
Organization of
the United Nations (FAO), 2008) and ruminant-based agricultural activities are
expected
to continue to be an important contributor to global CH4 emissions. Therefore,
reducing
CH4 emissions from ruminants will be important in meeting international
commitments
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
2 .
under the Kyoto Protocol and also in ensuring the long-term sustainability of
ruminant-
based agriculture. Moreover, as CH4 production in the rumen accounts for 2-12%
of the
ingested energy (Johnson & Johnson, 1995), it is predicted that reducing CH4
emissions from ruminants will also make more energy available to the animal
and
therefore enhance their productivity. Ruminant animals are particularly
important to
agriculture in New Zealand (NZ), producing a third of NZ's commodity exports
(Statistics
New Zealand, 2008) and making up a large proportion of the internationally
traded lamb
and milk products (Leslie et al., 2008). Consequently, NZ has an unusual GHG
emission profile, with ruminant CH4 emissions accounting for 31% of NZ's total
GHG
emissions (Ministry for the Environment, 2007).
Methane is formed in the fore-stomach (reticulorumen, or more commonly known
as the
rumen) by methanogens, a subgroup of the Archaea. During normal rumen
function,
plant material is broken down by fibre-degrading microorganisms and fermented
mainly
to volatile fatty acids (VFAs), ammonia, hydrogen (H2) and CO2. Rumen
methanogens
principally use H2 to reduce CO2 to CH4 in a series of reactions that are
coupled to ATP
synthesis. The rumen harbours a variety of different methanogen species, but
analyses
of archaeal small subunit ribosomal RNA genes from rumen samples of ruminants
on
differing diets around the world suggest the majority fall into three main
groups:
Methanobrevibacter, Methanomicrobium and a large, as yet uncultured, group of
rumen
archaea referred to as rumen cluster C (Janssen & Kirs, 2008). Sequences
affiliated
with Methanobrevibacter dominate, on average accounting for 61.6% of rumen
archaea,
with sequences associated with M. gottschalkii (33.6%) and M. ruminant/urn
(27.3%)
being most prominent.
Attempts have been made to inhibit the action of methanogens in the rumen
using a
variety of interventions but most have failed, or met with only limited
success, due to
low efficacy, poor selectivity, toxicity of compounds against the host, or
build up of
resistance to anti-methanogen compounds (McAllister & Newbold, 2008).
Currently
there are few practical methane reduction technologies available for housed
ruminant
animals, and no effective technologies for grazing animals. Methane
interventions
should ideally target features that are conserved across all rumen
methanogens, so that
no unaffected methanogens can fill the vacated niche. Interventions should
also be
specific for methanogens only, such that other rumen microbes continue their
normal
.. digestive functions. Whole genome sequencing allows the definition of gene
targets that
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
3
are both conserved and specific to rumen methanogens. It is not yet possible
to obtain
genome sequences of all methanogen groups present in the rumen as some are yet
to
be cultivated, and a rumen methanogen "metagenome" is prevented by the
inability of
current sequencing technologies to reassemble complete genomes from complex
microbial ecosystems. Therefore, sequencing the genomes of individual rumen
methanogens currently in culture is a critical step in developing CH4
mitigation
technologies for ruminant animals.
SUMMARY OF THE INVENTION
Here we report the genome sequence of M. ruminantium M1T (DSM 1093), the first
rumen methanogen genome to be completely sequenced. We have included a
particular emphasis on identifying targets for enteric methane mitigation
technologies
focusing on vaccine development and anti-methanogen drug leads.
The invention thus features the complete genome sequence for the methanogen,
Methanobrevibacter ruminantium. The invention features, in particular,
isolated
peptides, polypeptides, and polynucleotides of M. ruminantium, as well as
expression
vectors, host cells, and antibodies, and methods of use thereof, as described
in detail
herein.
The invention specifically features an isolated peptide comprising, for
example, at least
a fragment of one amino acid sequence selected from the group consisting of
SEQ ID
NO: 5867-7584. In a particular aspect, the peptide comprises at least a
fragment of an
amino acid sequence of any one of SEQ ID NO: 5867-7584. In a further aspect,
the
peptide comprises at least a fragment of an amino acid sequence of any one of
SEQ ID
NO: 5867-7584. In another aspect, the peptide is a fragment, for example,
comprising
at least one amino acid sequence encompassing an extracellular domain of any
one of
SEQ ID NO: 5867-7584.
The invention specifically features an isolated polypeptide comprising, for
example, at
= 30 least one amino acid sequence selected from the group consisting of
SEQ ID NO:
5867-7584. In a particular aspect, the polypeptide comprises the amino acid
sequence
of any one of SEQ ID NO: 5867-7584. In a further aspect, the polypeptide
comprises
the amino acid sequence of any one of SEQ ID NO: 5867-7584. In another aspect,
the
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
4
polypeptide is a fragment, for example, comprising at least one amino acid
sequence
encompassing an extracellular domain of any one of SEQ ID NO: 5867-7584.
The invention additionally features an isolated polynucleotide comprising a
coding
sequence for at least one peptide. In one aspect, the polynucleotide comprises
a coding
sequence for at least a fragment of an amino acid sequence selected from the
group
consisting of SEQ ID NO: 5867-7584. In a particular aspect, the polynucleotide
comprises a coding sequence for at least a fragment of any one of SEQ ID NO:
5867-
7584. In a further aspect, the polynucleotide comprises a coding sequence for
at least a
fragment of any one of SEQ ID NO: 5867-7584. In another aspect, the
polynucleotide
comprises a fragment of a coding sequence, for example, least one amino acid
sequence encompassing an extracellular domain of any one of SEQ ID NO: 5867-
7584.
The invention additionally features an isolated polynucleotide comprising a
coding
sequence for at least one polypeptide. In one aspect, the polynucleotide
comprises a
coding sequence for at least one amino acid sequence selected from the group
consisting of SEQ ID NO: 5867-7584. In a particular aspect, the polynucleotide
comprises a coding sequence for any one of SEQ ID NO: 5867-7584. In a further
aspect, the polynucleotide comprises a coding sequence for any one of SEQ ID
NO:
5867-7584. In another aspect, the polynucleotide comprises a fragment of a
coding
sequence, for example, least one amino acid sequence encompassing an
extracellular
domain of any one of SEQ ID NO: 5867-7584.
In an additional aspect, the invention features an isolated polynucleotide
comprising, for
example, a nucleic acid sequence selected from the group consisting of SEQ ID
NO: 1-
1718. In a particular aspect, the polynucleotide comprises the nucleic acid
sequence of
SEQ ID NO: 1-1718. In another aspect, the polynucleotide is a fragment or an
oligonucleotide comprising, for example, the nucleic acid sequence
encompassing an
extracellular domain as encoded by any one of SEQ ID NO: 1-1718. In addition,
the
invention encompasses an isolated polynucleotide, or fragment thereof, which
hybridizes to any one of the nucleic acid sequences of SEQ ID NO: 1-1718. The
invention further encompasses an isolated polynucleotide comprising the
complement,
reverse complement, reverse sequence, or fragments thereof, of any one of the
nucleic
acid sequences.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
The invention features an expression vector comprising a polynucleotide of the
invention. In one aspect, the expression vector comprises a coding sequence
for at
least a fragment of an amino acid sequence selected from the group consisting
of SEQ
ID NO: 5867-7584. In a particular aspect, the expression vector comprises a
coding
5 sequence for at least a fragment of at least one of SEQ ID NO: 5867-7584.
In a further
aspect, the expression vector comprises a coding sequence for at least one
amino acid
sequence of at least one of SEQ ID NO: 5867-7584. In another aspect, the
expression
vector comprises a coding sequence for at least one amino acid sequence
encompassing an extracellular domain of any one of SEQ ID NO: 5867-7584.
The invention also features a host cell, for example, a microbial host cell,
comprising at
least one expression vector.
The invention specifically features an antibody directed to a peptide,
polypeptide, or
polynucleotide as disclosed herein. In certain aspects, the antibody is
directed to an
amino acid sequence selected from the group consisting of SEQ ID NO: 5867-
7584. In
alternate aspects, the antibody is directed to at least a fragment of a
polypeptide
sequence selected from the group consisting of SEQ ID NO: 5867-7584. In a
particular
aspect, the antibody binds to at least a fragment of the peptide sequence of
any one of
SEQ ID NO: 5867-7584. In a further aspect, the antibody binds to at least a
fragment of
the polypeptide sequence of any one of SEQ ID NO: 5867-7584. In an alternate
aspect,
the antibody binds to at least a fragment of a peptide or polypeptide
encompassing an
extracellular domain of any one of SEQ ID NO: 5867-7584. In another aspect,
the
antibody includes one or more fusions or conjugates with at least one cell
inhibitor, for
example, anti-methanogenesis compounds (e.g., bromoethanesulphonic acid),
antibodies and antibody fragments,, lytic enzymes, peptide nucleic acids,
antimicrobial
peptides, and other antibiotics as described in detail herein.
The invention additionally features modified peptides or polypeptides, e.g.,
for at least
one of SEQ ID NO: 5867-7584, including biologically active alterations,
fragments,
variants, and derivatives, described herein. Also featured are polynucleotides
encoding
these modified peptides or polypeptides, as well as alterations, fragments,
variants, and
derivatives of the disclosed polynucleotides; antibodies raised using these
modified
peptides, polypeptides, or polynucleotides; expression vectors comprising
these
polynucleotides; and host cells comprising these vectors. Further featured are
modified
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
6
antibodies, including biologically active alterations, fragments, variants,
and derivatives,
described herein. In specific aspects, the compositions and methods of the
invention
employ these modified peptides, polypeptides, polynucleotides, antibodies, or
corresponding expression vectors or host cells.
The invention features a composition comprising an isolated peptide or
polypeptide,
e.g., at least one of SEQ ID NO: 5867-7584. Also featured is a composition
comprising
an isolated polynucleotide, e.g., at least one of SEQ ID NO: 1-1718. The
invention
additionally features a composition comprising an antibody, e.g., directed to
a peptide,
polypeptide, or polynucleotide sequence disclosed herein. Further featured is
a
composition that includes an expression vector, or host cell comprising an
expression
vector, in accordance with the invention. The composition can include any one
of the
biologically active alterations, fragments, variants, and derivatives
described herein.
The compositions can include at least one cell inhibitor (e.g., as a fusion or
conjugate),
and can be formulated, for example, as pharmaceutical compositions, in
particular,
vaccine compositions.
The invention also features a composition of the invention as part of a kit
for targeting
and/or inhibiting microbial cells, especially methanogen cells, in accordance
with the
disclosed methods. The kits comprise: a) at least one composition as set out
herein;
and b) optionally, instructions for use, for example, in targeting cells or
inhibiting cell
growth or replication for methanogens or other microbes.
The invention also features a method for producing a peptide or polypeptide,
e.g., at
least a fragment of any one of SEQ ID NO: 5867-7584, the method comprising: a)
culturing an expression vector or host cell comprising an expression vector,
which
comprises at least part of a coding sequence for at. least one peptide or
polypeptide
under conditions suitable for the expression of the peptide or polypeptide;
and b)
recovering the peptide or polypeptide from the culture. In particular aspects,
the peptide
or polypeptide comprises at least one amino acid sequence selected from the
group
consisting of SEQ ID NO: 5867-7584, or modified sequences thereof.
The invention also features a method for producing an antibody, e.g., directed
to at
least a fragment of any one of SEQ ID NO: 5867-7584, the method comprising: a)
culturing an expression vector or host cell comprising an expression vector,
which
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
7
comprises at least part of a coding sequence for at least one antibody or
antibody
fragment under conditions suitable for the expression of the antibody or
antibody
fragment; and b) recovering the amino acid sequence from the culture. In
particular
aspects, the antibody or antibody fragment is directed to at least one amino
acid
sequence selected from the group consisting of SEQ ID NO: 5867-7584, or
modified
sequences thereof. In an alternate aspect, the antibody is produced by
administration to
a host animal, as described in detail herein.
The invention additionally features a method for producing an antibody, e.g.,
directed to
at least a fragment of any one of SEQ ID NO: 5867-7584, which comprises a
fusion or
conjugate with at least one cell inhibitor. Such method comprises: a)
culturing an
expression vector or host cell comprising an expression vector, which
comprises a
coding sequence for at least one antibody or antibody fragment under
conditions
suitable for the expression of the antibody or antibody fragment; b) forming a
fusion or
conjugate to the antibody or antibody fragment (e.g., by expression of the
fused
sequence or chemical conjugation to the cell inhibitor); and c) recovering the
fusion or
conjugate.
In particular aspects, the antibody is directed to at least a fragment of any
one of SEQ
ID NO: 5867-7584, or modified sequences thereof. In further aspects, the
inhibitor is
selected from anti-methanogenesis compounds (e.g., bromoethanesulphonic acid),
antibodies and antibody fragments, lytic enzymes, peptide nucleic acids,
antimicrobial
peptides, and other antibiotics as described in detail herein. In an alternate
aspect, the
antibody is produced by administration to a host animal and then conjugated,
as
described in detail herein.
In addition, the invention features a method of inhibiting (e.g., inhibiting
growth or
replication) of a microbial cell, in particular, a methanogen cell,
comprising: contacting
= the cell with antibody or antibody fragment, e.g., directed to at least a
fragment of any
one of SEQ ID NO: 5867-7584, or an antibody fusion or conjugate, or any
modified
antibody. As another method, the cell is inhibited by administration of a
vaccine
composition as described in detail herein.
The invention further features a method of inhibiting (e.g., inhibiting growth
or
replication) of a microbial cell, in particular, a methanogen cell,
comprising: a)
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
8
=
optionally, producing or isolating at least one antibody as disclosed herein;
and b)
contacting the cell with the antibody. In a particular aspect, the antibody is
directed to at
least a fragment of any one of SEQ ID NO: 5867-7584, or a modified sequence
thereof.
= In certain aspects, the antibody further comprises at least one cell
inhibitor, attached,
for example, as a fusion or conjugate. In other aspects, the antibody is
administered to
a subject as a composition, e.g., a vaccine composition.
Additionally, the invention features a method of inhibiting (e.g., inhibiting
growth or
replication) of a microbial cell, in particular, a methanogen cell,
comprising: a)
optionally, producing or isolating at least one peptide or polypeptide as
disclosed
herein; and b) administering the peptide or polypeptide to a subject to induce
an
immune response thereto. In a particular aspect, the peptide or polypeptide
comprises
at least one amino acid sequence selected from the group consisting of SEQ ID
NO:
5867-7584, or a modified sequence thereof. In other aspects, the peptide or
polypeptide
is administered to a subject as a composition, e.g., a vaccine composition.
The invention furthermore features a method of detecting and/or measuring the
levels of
a polypeptide, in particular, a cell surface polypeptide, or corresponding
peptides or
polynucleotides, comprising: 1) contacting a sample from a subject with an
antibody
directed to the polypeptide (e.g., at least a fragment of any one of SEQ ID
NO: 5867-
7584, or a modified sequence thereof), or a corresponding peptide or
polynucleotide
(e.g., at least a fragment of one of SEQ ID NO: 1-1718, or a modified sequence
thereof); and 2) determining the presence or levels of the antibody complex
formed with
the corresponding polypeptide, peptide, or polynucleotide in the sample. Such
methods
can also be used for detecting and/or measuring the levels of a microbial
cell, in
particular, a methanogen cell.
The invention also features a method of detecting and/or measuring the levels
of a
polynucleotide, in particular, a polynucleotide encoding a cell surface
component,
30. comprising: 1) contacting a sample from a subject with a complementary
polynucleotide (e.g., a sequence complementary to at least a fragment of any
one of
SEQ ID NO: 1-1718, or a modified sequence thereof); and 2) determining the
presence
or levels of the hybridization complex formed with the polynucleotide in the
sample.
Such methods can also be used for detecting and/or measuring the levels of a
microbial
cell, in particular, a methanogen cell.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
9
In particular aspects, the methods of the invention utilize in vivo or in
vitro expression
components. In other aspects, the methods employ peptides, polypeptides,
polynucleotides, or antibodies produced by recombinant, synthetic, or semi-
synthetic
.. means, or by endogenous means. .
Other aspects and embodiments of the invention are described herein below.
BRIEF DESCRIPTION OF THE DRAWINGS
This invention is described with reference to specific embodiments thereof and
with
reference to the figures.
FIG. 1: Chemogenomic and vaccine gene targets within M1. The number of genes
identified by each analysis is shown in the Venn diagram and a selection of
the gene
.. targets are summarized in the tables grouped by functional category (a)
Chemogenomic
gene targets were defined by identification 'of genes that occurred across
three separate
analyses; the Functional Genome Distribution (FGD), Differential BLAST
analysis (DBA)
and metabolic profiling. (b) Vaccine target genes were defined as described
and
discussed below. TMH, transmembrane helices, SP, signal peptide.
FIG. 2: GC analysis. Base-pair scale (outer circle), G+C content (middle
circle) and GC
skew (inner circle, (G-C/G+C), darker shade indicates values >1, lighter shade
<1.
Genonnes of the Methanobacteriales order display a DNA skew similar to
bacterial
chromosomes. In M. ruminantium M1 the origin of replication (oriC) was
identified as
being immediately upstream of the Cdc6-1 gene (mru0001) based on GC skew
analysis
and homology to the origin of replication experimentally verified for M.
thennoautotrophicus (Reymond et al., 2004). As with other Methanobacteriales
genomes, M. ruminantium M1 contains a second Cdc6 homolog (mru0423). It also
contains a truncated third Cdc6 homolog within the prophage sequence.
FIG. 3: PROmer alignments (De!cher et al., 2003) of M. ruminantium M1 against
complete Methanobacteriales genomes. Whenever the two sequences agree, a
shaded
line or dot is plotted. The forward matches are displayed in the lighter
shade, while the
reverse matches are plotted in the darker shade. Where the two sequences were
perfectly identical, a single lighter line would extend from the bottom left
to the top right.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
=
An X-shape pattern is visible is all three synteny plots indicating moderately
diverged
Methanobacteriales genomes. It has been proposed that the X-pattem is
generated by
symmetric chromosomal inversions around the origin of replication
(Emanuelsson,
2007). Units displayed in base-pairs.
5
FIG. 4: Consensus sequence of forty-four C-terminal regions (200 amino acids)
from
= adhesin-like proteins of M. ruminantium M1 (A). LogoBar display of this
consensus. In
both figures region of homology to Big_1 domain (Pfam02369) is highlighted in
grey
(B).
FIG. 5: Proposed biosynthetic pathway for pseudomurein in M1 (Kandler and
KOnig H,
1998; Konig et al., 1994). The disaccharide backbone of M. ruminantium M1
pseudomurein consists of N-acetylgalactosamine (GaINAc) and N-
acetyltalosanninuronic acid (TaINAc) in a (3(1-3) linkage. TalNac has not been
detected
as a monomer and it is believed to be formed during the synthesis of the
disaccharide
probably by epimerization and oxidation of UDP-GaINAc (Step 1). Synthesis of
the
pentapeptide involved, in crosslinking is believed to start with UDP-glutamic
acid
followed by stepwise addition of L-amino acids (Step 2). The amino acids found
in the
pentapeptide are usually alanine, lysine (Lys) and glutamic acid (Glu), but M.
ruminantium M1 is reported to contain threonine (Thr) instead of alanine
(Kandler and
KOnig, 1978). The UDP activated pentapetide is linked to the disaccharide to
give a
UDP-disaccharide pentapeptide (Step 3) which is subsequently translocated to
the
membrane via covalent bond formation with a membrane embedded undecaprenyl
phosphate (Step 4). Following their intracellular biosynthesis the
pseudomurein
repeating units must be exported and assembled. Homologs of the E. coil
peptidoglycan
lipid II flippase (MurJ) have been reported for pseudomurein producing
methanogens
(Ruiz, 2008; Step 5), but there are no genes similar to the penicillin binding
proteins that
carry out the transglycosylation (Step 6) and transpeptidation reactions in
bacterial
peptidoglycan assembly. Peptide crosslinking of pseudomurein requires removal
of a
terminal residue of one peptide and linkage from a glutamic acid to the lysine
of an
adjacent peptide (Step 7), and is probably carried out by transglutaminases.
None of
the enzymes involved in pseudomurein biosynthesis have been characterized, but
analysis of the genome sequence has suggested candidates to carry out several
of the
steps. Several of these have homologs only in those methanogens with
pseudomurein-
=
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
11
containing cell walls. Two other transmembrane proteins of unknown function
(mru1585
and mru1635) are also only found in pseudomurein-containing species.
FIG. 6: (a) PFGE of genomic DNA from Ml. Lane 1, A ladder (New England
Biolabs);
Lane 2, ApallBssHII double digest; Lane 3, Apal digest; Lane 4, MU digest;
Lane 5,
Sizes of Miul fragments. The bands in the A ladder are multiples of 48.5 kb.
(b) In silk
restriction map of the M1 chromosome showing the position and fragment size of
the
M/u1 digest.
FIG. 7: Gene organisation of three clusters proposed to be involved in
secondary
metabolite metabolism in Ml. Clusterl. Mru0068 is predicted to encode two non-
ribosomal peptide synthetase (NRPS) modules, each containing condensation,
adenylation and thiolation domains. The presence of a condensation domain in
the first
module is often associated with NRPSs that make N-acylated peptides (Steller
et al.,
1999). The second module is followed by a terminal thioester reductase domain
which
is thought to release the peptide from the final thiolation domain. Mru0068 is
surrounded by genes that encode two serine phosphatases (mru0066, mru0071), an
anti-sigma factor antagonist (mru0067) and a MatE efflux family protein
(mru0069)
which are likely to be involved in environment sensing, regulating NRPS
expression and
export of the NRP, respectively. Cluster2. The second NRPS gene (mru0351)
contains
4 modules and a C-terminal thioester reductase domain. Immediately downstream
of
mru0351 is another MatE efflux family protein (mru0352), presumably involved
in the
efflux of the NRP. Cluster3. A small cluster of genes elsewhere in the genome
(mru0513-0516) appears to encode NRPS-associated functions. The cluster
includes a
4'-phosphopantetheinyl transferase (mru0514) which primes NRPSs by adding a
phosphopantetheinyl group to a conserved serine within the thiolation domain,
an
acyltransferase (mru0512) possibly involved in NRP acylation, a serine
phosphatase
(mru0515), an anti-sigma factor antagonist (mru0513), and an anti-sigma
regulatory
factor serine/threonine protein kinase (mru0516) that may function in sensing
the
environment and NRPS regulation. Although the products of each NRPS are
unknown,
an analysis of adenylation domain amino acid sequences by NRPSpredictor
(Rausch et
al., 2005) predicts 10 residues which are important for substrate binding and
catalysis.
FIG. 8: Effect of the lytic enzyme PeiR on M1 growth in vitro. (A) Addition of
PeiR to
growing cultures at 73 h resulted in a dramatic drop in culture density,
indicative of cell
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
12
lysis. At a low concentration of PeiR (final concentration of 2.5 mg per
litre), the cultures
were able to recover, indicated by the increase in culture density after 100
h, and (B) by
production of methane at levels similar to that of cultures receiving no PeiR.
Addition of
higher concentrations of PeiR (7.5 and 22.5 mg per litre) resulted in a
lasting effect, with
(A) no subsequent recovery of growth and (B) a reduced methane yield.
Chloroform, a
known potent inhibitor of methanogens, resulted in a similarly reduced methane
yield
(B), but the decrease in culture density was less (A), as expected since it
halts
metabolism rather than lysing cells. PeiR was added to 10 ml cultures in 0.1
ml of
beer. The buffer alone had no effect. The symbols (A) and solid bars (B) are
means of
3 replicates, and the thin vertical bars represent one standard error on
either side of the
mean.
FIG. 9: Observation of interspecies interactions between M. ruminantium M1 and
B.
proteoclasticus B316-r. Graph displays growth rate of M1 in co-culture with
B316.
= 15 Microscopic images taken at 2, 8 and 12 h post inoculation of
B316 (lighter, rod-shaped
organism) into BY+(+0.2% xylan) media containing a mid-exponential M1 culture
(darker, short ovoid rod-shaped organism). Growth as determined by Thoma slide
enumeration, is shown along with sampling time.
FIG. 10: Genome atlas of M. ruminantium Ml. The circle was created using
Genewiz
(Jensen et al., 1999) and in house developed software. The right-hand legend
describes the single circles in the top-down- outermost-innermost direction.
Outermost
1st ring: Differential Blast Analysis between the non-redundant (nr) database
(Ring 3)
and a custom methanogen database (Ring 2). Regions in medium-dark shading
indicate protein sequences highly conserved between M. ruminantium and at
methanogens but not found in the nr database. Regions in darker shading
indicate
protein sequences conserved between M. ruminantium and the nr database but not
present in other methanogens genomes. 2nd ring: gapped BlastP results using a
custom
methanogen database consisting of publicly accessible genome project sequences
(Table 10), Td ring: gapped BlastP results using the non-redundant database
minus
published methanogen genome project sequences. In both rings, regions in
medium-
dark shading represent unique proteins in M. ruminantium, whereas highly
conserved
features are shown in darker shading. The degree of colour saturation
corresponds to
the level of similarity. 4th ring: G+C content deviation: medium shading
highlights low-
GC regions, light shading high-GC islands. Annotation rings 5 and 6 indicate
absolute
CA 02772224 2012-02-24
= WO 2011/025394
PCT/NZ2010/000169
13
position of functional features as indicated. 7th ring: ORF orientation. ORFs
in sense
orientation (ORF+) are shown in dark shading; ORFs oriented in antisense
direction
(ORF-) are shown in medium-dark shading. 8th ring: prediction of membrane
bound and
cell surface proteins. White: no transmembrane helices (TMH) were identified,
Black:
ORFs with at least one TMH, Medium-dark shading: ORFs predicted to encompass a
Signal Peptide sequence and Medium shading: ORFs predicted to incorporate both
TMH and SignalP domains. 9th ring: COG classification. COG families were
assembled
into 5 major groups: information storage and processing (light shading);
cellular
processes and signalling (medium shading); metabolism (light-medium shading);
poorly
characterized (dark shading); and ORFs with uncharacterized COGs or no COG
assignment (grey). 10th ring: GC-skew. Innermost ring: genome size (Mb).
Selected
features representing single ORFs are shown outside of circle 1 with bars
indicating
their absolute size. Origin and terminus of DNA replication are identified in
light-medium
shading and dark-medium shading, respectively.
FIG. 11: Methanogenesis pathway. The predicted pathway of methane formation in
M1
based on the scheme of Thauer et al. (Thauer et al., 2008) for methanogens
without
cytochromes. The pathway is divided into three partitions; capture of
reductant (left
side, medium shaded background), reduction of CO2 (centre, lighter shaded
background) and energy conservation (right side, darker shaded background).
The
main reactions are indicated by thick arrows and enzymes catalysing each step
are
coloured green. Cofactor participation is indicated with thin arrows. Membrane
located
proteins are coloured light brown and potential vaccine and chemogenomic
targets are
labelled with a circled V or C respectively. Small upwards arrows signify up-
regulated
genes during co-culture with Butyrivibrio proteoclasticus. Full gene names and
corresponding locus tag numbers can be found in Table 9, below. H4MPT;
tetrahydromethanopterin; MF, methanofuran; F420, co-factor F420 oxidised;
F420F12, co-
factor F420 reduced; Fed(dx)?, unknown oxidised ferredoxin; Fed(1ed)? ,
unknown reduced
ferredoxin; HSCoM, reduced coenzyme M; HSCoB, reduced coenzyme B, CoBS-
SCoM, coenzyme B-coenzyme M heterodisulphide; NADP+, nicotinamide adenosine
dinucleotide phosphate non-reduced; NADPH, nicotinamide adenosine dinucleotide
phosphate reduced.
FIG. 12: Cell envelope topography of Ml. Ultrastructural studies of M1 (Zeikus
&
Bowen, 1975; Miller, 2001) show that the cell wall is composed of three layers
and is
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
14
comparable to the organization seen in Gram positive bacteria (Graham &
Beveridge,
1994). The three layers can be described as: (1) a thin electron-dense inner
layer
composed of compacted newly synthesised pseudomurein, (2) a thicker less-
electron-
dense middle layer which is also composed of pseudomurein, and (3) a rough
irregular
outer layer that is distal to the pseudomurein layers and assumed to be
composed of
cell wall glycopolymers, wall. associated proteins and possibly other
components.
Representative adhesin-like proteins with different cell-anchoring domains are
shown.
The number of these proteins predicted in the M1 genome is shown in brackets.
OT,
oligosaccharyl transferase; Sec, Sec protein secretion pathway; PMBR,
pseudomurein
binding repeat (PF09373); M1-C, M1 adhesin-like protein conserved C-terminal
domain.
FIG. 13: Functional Genome Distribution (FGD) of 26 methanogen genomes.
Publicly
available complete genomes were downloaded in GenBank format were possible.
Publicly available draft phase genomes were downloaded in FASTA format,
concatenated using a universal spacer-stop-spacer sequence (TTAGTTAGTTAG; SEQ
ID NO: 7585) and automatically annotated using GAMOLA. Predicted ORFeonnes of
all
genomes were subjected to an FGD analysis and the resulting distance matrix
was
imported into MEGA4 (Samuel et al., 2007). The functional distribution was
visualised
using the UPGMA method (Boekhorst et al., 2005). The optimal tree with the sum
of
branch length = 49.7 is shown. The tree is drawn to scale, with branch lengths
in the
same units as those of the functional distances used to infer the distribution
tree.
Accession numbers for individual genomes can be found in Table 10, below.
=
FIG. 14: Stimulation of growth of M1 by alcohols. The inclusion of (A) 20 mM
methanol
or (B) 5 or 10 mM ethanol when M1 was grown on H2 resulted in an increase in
culture
density (measured as 00600 nm) compared to cultures grown on H2 alone. H2 was
added once only, at the time of inoculation, by gassing the cultures with H2 +
CO2 (4:1)
to 180 kPa overpressure. Higher concentrations of ethanol (20 mM) resulted in
some
inhibition of growth (not shown), and there was no stimulation by isopropanol
(5 to 20
mM; not shown). No growth occurred when cultures were supplemented with
methanol
(A), ethanol (B), or isopropanol (not shown) when no H2 was added, and no
methane
was formed by those cultures. The symbols in panel are means of 4 replicates,
and the
thin vertical bars in panel (A) represent one standard error on either side of
the mean.
Error bars are omitted from panel (B) for the sake of clarity.
15
FIG. 15: Distribution of genes in the predicted ORFeomes of members of the
Methanobacteriales classified according to functional categories in the
archaeal COG
database (Makarova et al., 2007).
FIG. 16: BLAST Heat Map depicting BLAST-result distribution across the M.
ruminantium M1 ORFeome. In both figures, the X-axis (horizontal axis) shows
all
genera with at least 500 and 250 Blast hits throughout the ORFeome,
respectively.
Genera are phylogenetically sorted based on a semi-dynamically re-parsed
phylogenetic tree obtained from the Ribosomal Database Project II (RDP II),
selecting NCB' taxonomy,
level 10 genera display list and set to include archaeal sequences. Bacterial
or archaeal
genera not covered within the RDPII data were entered and parsed from a
separate
data file, where appropriate. Phylogenetic distribution and grouping of genera
is
indicated using an ASCII based tree-abstraction. The Y-axis indicates e-value
ranges,
and the Z-axis (colour coded) represents the frequency of hits for each genus
in each e-
value range in log-scale. Respective Log-colour-scales of frequencies are
indicated in
each figure, whereby warmer colours indicate higher frequencies. Figure (a)
allows all
BLAST hits per genus per ORF, accepting multiple genus hits per ORF. Figure
(b)
employs a frequency cutoff of one hit per genus per ORF, effectively limiting
the hit rate
to the best Blast hit found in each given ORF and genus..
FIG. 17: Sheep antibody responses to vaccination with peptides designed
against M.
ruminantium H4MPT methyltransferase subunits (MtrCDE) and surface proteins and
binding of antibodies to immobilised M. ruminantium cells. (a) Vaccination
with peptides
designed against M1 genes. (b) Binding of antibodies to immobilised M1 cells.
In the
antibody-binding experiment a negative control (NC) serum from a sheep which
had not
had colostrum as a lamb was included, as was a sample without added serum
which
served as a blank, B.
FIG. 18: NRPS alignment. ClustalW (Larkin et al., 2007) alignment of non-
ribosomal
peptide synthetases from M. ruminantium M1 (mru00068) and Syntrophomonas
wolfei
subsp. wolfei str. Goettingen (swo11094). Alignment was visualised using
Jalview
(Waterhouse et al., 2009). Conserved residues are shown in medium shading.
Domain
organisation of M. ruminantium M1 is displayed via boxes (box marked with
rounded
brackets = condensation domain; box marked with pointed brackets = adenylation
CA 2772224 2019-03-25
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
16
domain; box marked with triangle = phosphopantetheine attachment site; box
marked
with double arrow = thioester reductase domain).
FIG. 19: Mbb. ruminantium Ai& ATP synthase PCR cloning and introduction of a
hexa-
histidine tag at the N-terminal of subunit A by PCR overlap expression. (A)
PCR overlap
extension; (B) digested insert; (C) ligation.
FIG. 20: M. smith!! Al& ATP synthase PCR cloning and introduction of a hexa-
histidine
tag by PCR overlap expression. (A) Cloning; (B) PCR overlap extension; (C)
digested
insert and ligation.
FIG. 21: (A) pTrMbrA1HIS clone which contains the genes encoding for the M.
, ruminantium Al-ATPase in the E. coli expression vector pTrc99a, and
includes a Hexa-
Histidine tag on the N-terminal of Subunit A. (B) pTrMbrA1A0HIS9 clone.
FIG. 22: Western blot analysis of pTrMbrA1HIS expression. Lane 1: Pre-stain
Protein
Marker; Lane 2: Soluble Material 10 pg; Lane 3: Unbound Material 10 pg; Lane
4: Wash
1, 10 pg; Lane 5: Wash 2, 10 pg; Lane 6: Elutant 1, 10 pg (150 mM lmidazole);
Lane 7:
Elutant 2, 10 pg (150 mM Imidazole); Lane 8: Elutant 2, 20 pg; Lane 9: Elutant
2, 20 pg,
Denatured 95 C 5 min; Lane 10: Soluble Material 10 pg, Denatured 95 C 5 min.
FIG. 23: Western blot analysis of pTrMbsA1HIS expression. Lane 1: Pre-stain
Protein
Marker; Lane 2: C41-pLysRARE/pTrMbsA1HIS starting material; Lane 3:
pTrMbsA1HIS s
unbound material; Lane 4: pIrMbsA1HIS Wash 1, 40 mM imidazole; Lane .5:
pTrMbsA1HIS Wash 2, 40 mM imidazole; Lane 6: pTrMbsA1HIS Elutant 1, 150 mM
imidazole; Lane 7: pTrMbsA1HIS Elutant 2, 150 mM imidazole; Lane 8:
pIrMbsA1HIS
Elutant 3, 400 mM imidazole.
FIG. 24: Growth of E. coil strains 8L21 (A), DK8 (B) and C41 (C) harbouring
pTRMbbr-
AlAo at 37 C, showing the effect of induced expression of the Mbb.
ruminantium A1A0-
ATP synthase. (D) Localization of the Mbb. ruminantium AiAo-ATPase in 0K8 by
SDS-
PAGE and western analysis. Samples were resolved on a 14% polyacrylamide gel
in
the presence of 0.1% sodium dodecyl sulfate (SDS) and either stained with
Coomassie
Brilliant blue or transferred for western analysis to a polyvinylidene
difluoride membrane
in the presence of 0.02% SDS and then immunoblotted using a penta-His antibody
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
17
conjugate to detect the hexa-histidine tag on the A-subunit. Lane 1, protein
molecular
weight ladder; Lane 2, His-tagged TA2.A1 F1F0 ATPase purification (positive
control);
Lane 3, cell lysate; Lane 4, cytoplasmic fraction; Lane 5, membrane fraction.
FIG. 25: Purification (A and B) of the Mbb. ruminantium AlAo-ATPase.
FIG. 26: Subunits of the Mbb. ruminantium Aik-ATPase.
FIG. 27: Extraction of the AlAcrATPase.K-subunit monomer.
FIG. 28: Nat binding motif in the A1A0-ATPase K-subunit.
FIG. 29: Mbb. ruminantium purified AlA, ATP synthase activity. (A) Kinetics of
ATP
hydrolysis by purified Mbb. ruminantium AlA, ATP synthase. Background ATPase
activity generated by thermal hydrolysis of ATP or contaminant ATP in buffer
or enzyme
has been subtracted (these totalled <5 % of the final value shown). (B)
Influence of
Mg2t on the kinetics of ATP hydrolysis by purified recombinant Mbb.
ruminantium AiA,õ
ATP synthase. Background ATPase activity generated by thermal hydrolysis of
ATP or
contaminant ATP in buffer or enzyme has been subtracted (these totalled <5 %
of the
final value shown). (C) ATPase activity over a pH range in the presence and
absence of
Nat. Background ATPase activity generated by thermal hydrolysis of ATP or
contaminant ATP in buffer or enzyme has been subtracted from the activities
displayed
(these totalled <5 % of the final value shown). (D) Stability of the purified
and
membrane-bound (in DK8 membranes) Mbb. ruminantium A1A0 ATP synthase.
.. Background ATPase activity generated by thermal hydrolysis of ATP or
contaminant
ATP in buffer or enzyme has been subtracted (these totalled <5 % of the final
value
shown).
FIG. 30: Mbb. ruminantium soluble and membrane-bound AiA, ATP synthase
activity.
(A and B) Effects of TBT and DCCD on ATPase activity. E. coli DK8 (Aatp)
inverted
membranes containing the recombinant AlAo-ATPase were used to determine the
effects of the inhibitors TBT (200 pM) and DCCD (250 pM) at different pH
values.
ATPase activity was measured in presence of 130 mM Nat (A) and in absence of
Nat
(B). Background ATPase activity generated by thermal hydrolysis of ATP or
contaminant ATP in buffer or enzyme has been subtracted (these totalled <5 %
of the
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
18
final value shown). (C) Tributylin Inhibition of ATP Hydrolysis by Purified
Recombinant
Mbb. ruminantium AlA. ATP synthase. Background ATPase activity generated by
thermal hydrolysis of ATP or contaminant ATP in buffer or enzyme has been
subtracted
(these totalled <5 % of the final value shown). (D) Amiloride Inhibition of
ATP Hydrolysis
of the Mbb. ruminantium AiA, ATP synthase. Background ATPase activity
generated by
thermal hydrolysis of ATP or contaminant ATP in buffer or enzyme has been
subtracted
(these totalled <5 % of the final value shown).
FIG. 31: Mbb. ruminantium membrane-bound AlA, ATP synthase activity. (A)
Tributylin
Inhibition of ATP Hydrolysis by the Mbb. ruminantium Aik ATP synthase in DK8
and
Native Membranes. Background ATPase activity generated by thermal hydrolysis
of
ATP or contaminant ATP in buffer or enzyme has been subtracted (these totalled
<5 %
of the final value shown). (B) Amiloride Inhibition of ATP Hydrolysis of
Purified
Recombinant Mbb. ruminantium Alk ATP synthase in DK8 and Native Membranes.
Background ATPase activity generated by thermal hydrolysis of ATP or
contaminant
ATP in buffer or enzyme has been subtracted (these totalled <5 % of the final
value
shown).
FIG. 32: ATP synthesis in E. coil DK8 inverted membrane vesicles. Closed
squares with
no DCCD; closed triangles, a 20 min preincubation with 250 tiM TBT.
FIG. 33: Model of the.membrane-associated sodium ion-translocating
methyltransferase
complex from methanogenic archaea.
FIG. 34: SDS-PAGE analysis of mtrF-His, mtrG-His, mtrH-His and mtrEDCBAFGH-His
expression in E. coil C41 (IPTG induction, at 37 C for 5 hr).
FIG. 35: SDS-PAGE analysis of mtrA-His, nntre-His, mtrD-His, mtrE-His, mtrH-
His and
mtrEDCBAFGH-His expression in E. coil C43 (IPTG induction, at 37 C for 4 hr).
FIG 36: Western Blot analysis of M. ruminantium surface protein antisera
against M1
cell fraction. Pre: sera obtained before immunisation; Post: sera obtained
after
. immunisation.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
19
FIG. 37: (A) The a282y2 subunit structure of MCR which contains two nickel
porphinoid
F430 rings and two molecules each of methylcoenzyme M (CoM) and coenzyme B
(CoB). (B) The final reaction of the energy conserving pathway of methanogenic
archaea in which CoM and CoB are converted to methane and the heterodisulfide
= 5 product CoM-S-S-CoB.
FIG. 38: Optical cell density was measured in pure culture over time to assess
the
effectiveness of potential inhibitors.
- DETAILED DESCRIPTION OF THE INVENTION
Definitions
The term "antibody" should be understood in the broadest possible sense and is
intended to include intact monoclonal antibodies and polyclonal antibodies. It
is also
intended to cover fragments and derivatives of antibodies so long as they
exhibit the
desired biological activity. Antibodies encompass immunoglobulin molecules and
immunologically active portions of immunoglobulin (Ig) molecules, i.e.,
molecules that
contain an antigen binding site that specifically binds (immunoreacts with) an
antigen.
These include, but are not limited to, polyclonal, monoclonal, chimeric,
single chain, Fc,
Fab, Fab', and Fab2fragments, and a Fab expression library.
Antibody molecules relate to any of the classes IgG, IgM, IgA, IgE, and IgD,
which differ
from one another by the nature of heavy chain present in the molecule. These
include
subclasses as well, such as IgG1, IgG2, and others. The light chain may be a
kappa
chain or a lambda chain. Reference herein to antibodies includes a reference
to all
classes, subclasses, and types. Also included are chimeric antibodies, for
example,
monoclonal antibodies or fragments thereof that are specific to more than one
source,
e.g., one or more mouse, human, or ruminant sequences. Further included are
camelid
antibodies or nanobodies. It will be understood that each reference to
"antibodies" or
any like term, herein includes intact antibodies, as well as any fragments,
alterations,
derivatives, or variants thereof.
"Altered" polynucleotides encoding peptides, polypeptides, or antibodies, as
used
herein, include those with deletions, insertions, or substitutions of
different nucleotides
resulting in a polynucleotide that encodes the same or functionally equivalent
sequence.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
=
The encoded peptide, polypeptide, or antibody may also be "altered" and
contain
deletions, insertions, or substitutions of amino acid residues which produce a
silent
change and result in a functionally equivalent sequence. Deliberate amino acid
substitutions may be made on the basis of similarity in polarity, charge,
solubility,
5 hydrophobicity, hydrophilicity, and/or the amphipathic nature of the
residues as long as
the biological activity (e.g., cell association, membrane association) or
immunogenic/immunological activity is retained. For example, negatively
charged amino
acids may include aspartic acid and glutamic acid; positively charged amino
acids may
include lysine and arginine; and amino acids with uncharged polar head groups
having
10 similar hydrophilicity values may include leucine, isoleucine, and
valine, glycine and
alanine, asparagine and glutamine, serine and threonine, and phenylalanine and
tyrosine.
The amino acid molecules as noted herein, refer to oligopeptides, peptides,
15 polypeptides, proteins or antibodies, and any fragments thereof, and to
any naturally
occurring, recombinant, synthetic, or semi-synthetic molecules. These
molecules of the
invention comprise at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100, 150,
200, 250
amino acids, preferably at least 5 to 10, 10 to 20, 20 to 30, 30 to 40, 40 to
50, 50 to 100,
100 to 150, 150 to 200, 200 to 250, or at least 300, 350, 400, 450, or.500
amino acids.
20 Such amino acid sequences preferably retain the biological activity
(e.g., effect on cell
growth) or the immunogenicity/immunological activity of the molecule. The
amino acid
molecules noted herein are not limited to the complete, native sequence
associated
with the full-length molecule, but include also any fragments, alterations,
derivatives,
and variants thereof.
"Amplification", as used herein, refers to the production of additional copies
of a nucleic
acid sequence and is generally carried out using polymerase chain reaction
(PCR)
technologies well known in the art (Dieffenbach, C. W. and G. S. Dveksler
(1995) PCR
Primer,. a Laboratory Manual, Cold Spring Harbor Press, Plainview, NY).
The terms "biologically active" or "functional," as used herein, refer to a
peptide or
polypeptide retaining one or more structural, immunogenic, or biochemical
functions
(e.g., cell association, membrane association) of a naturally occurring
sequence.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
21
The terms "cell inhibitor" or "inhibitor," as used herein, refer to agents
that decrease or
= block the growth or replication of microbial cells, especially methanogen
cells. A cell
inhibitor can act to decrease or block, for example, cellular division. An
inhibitor can
= decrease or block, for example, DNA synthesis, RNA synthesis, protein
synthesis, or
post-translational modifications. An inhibitor can also decrease or block the
activity of
enzymes involved in the methanogenesis pathway. An inhibitor can also target a
cell for
recognition by immune system components. Inhibition of a cell also includes
cell killing
and cell death, for example, from lysis, apoptosis, necrosis, etc. Useful
inhibitors
include, but are not limited to, anti-methanogenesis compounds (e.g.,
bronnoethanesulphonic acid), antibodies and antibody fragments, lytic enzymes,
peptide
nucleic acids, antimicrobial peptides, and other antibiotics as described in
detail herein.
The terms "complementary" or "complementarity," as used herein, refer to the
natural
binding of polynucleotides under permissive salt and temperature conditions by
base-
pairing. For the sequence A-G-T, the complementary sequence is T-C-A, the
reverse
complement is A-C-T and the reverse sequence is T-G-A. Complementarity between
two single stranded molecules may be partial, in which only some of the
nucleic acids
bind, or it may be complete when total complementarity exists between the
single
stranded molecules. The degree of complementarity between nucleic acid strands
has
significant effects on the efficiency and strength of hybridization between
nucleic acid
strands. This is of particular importance in amplification reactions, which
depend upon
binding between nucleic acids strands and in the design and use of PNA
molecules.
As used herein, "computer readable media" refers to any medium which can be
read
and accessed directly by a computer. A "computer-based system" refers to the
hardware means, software means, and data storage means used to analyze the
sequence information of the present invention.
The term "derivative", as used herein, refers to the chemical modification of
a nucleic
acid encoding a peptide, polypeptide, or antibody, or a nucleic acid
complementary
thereto. Such modifications include, for example, replacement of hydrogen by
an alkyl,
acyl, or amino group. In preferred aspects, a nucleic acid derivative encodes
a peptide,
polypeptide, or antibody which retains a biological or
immunogenicity/immunological
activity of the natural molecule. A derivative peptide, polypeptide, or
antibody is one
which is modified by glycosylation, pegylation, or any similar process which
retains one
22
or more biological function (e.g., cell association, membrane association) or
immunogenicity/immunological activity of the sequence from which it was
derived.
The term "homology", as used herein, refers to a degree of complementarity.
There may
be partial homology (i.e., less than 100% identity) or complete homology
(i.e., 100%
identity). A partially complementary sequence that at least partially inhibits
an identical
sequence from hybridizing to a target nucleic acid is referred to using the
functional
term "substantially homologous." The inhibition of hybridization of the
completely
complementary sequence to the target sequence may be examined using a
hybridization assay (e.g., Southern or northern blot, solution hybridization
and the like)
under conditions of low stringency. A substantially homologous sequence or
hybridization probe will compete for and inhibit the binding of a completely
homologous
sequence to the target sequence under conditions of low stringency. This is
not to say
that conditions of low stringency are such that non-specific binding is
permitted; low
stringency conditions require that the binding of two sequences to one another
be a
specific (i.e., selective) interaction.
The term "hybridization", as used herein, refers to any process by which a
strand of
nucleic acid binds with a complementary strand through base pairing.
An "immunogenic epitope" is defined as a part of a protein that elicits an
antibody
response when the whole protein is the immunogen. On the other hand, a region
of a
protein molecule to which an antibody can bind is defined as an "antigenic
epitope."
An "insertion" or "addition", as used herein, refers to a change in an amino
acid or
nucleotide sequence resulting in the addition of one or more amino acid
residues or
nucleotides, respectively, as compared to the naturally occurring molecule.
A "methanogen," as used herein, refers to microbes that produce methane gas,
which
include Methanobrovibactor, Methanothermobacter, Methanomicrobium,
Methanobacterium, and Methanosarcina. Specific methanogens include, but are
not
limited to, Methanobrevibacter ruminantiurn (i.e., the M1 strain (also called
"Ml"), or
strain DSM1093,
Methanobrevibacter smithii, Methanobrevibacter acididurans, Methanobrevibacter
thaueri, Methanobacterium bryantii, Methanobacterium formicicum,
=
CA 2772224 2019-03-25
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
23
Methanothermobacter marburgensis, Methanothermobacter wolfeii, Methanosphaera
stadtmanae, Methanomicrobium mobile, Methanosarcina barked, Methanosarcina
mazei, Methanococcoides burtonii, and Methanolobus taylorii. All methanogen
genera
and species are encompassed by this term.
"Microbial" cells as used herein, refers to naturally-occurring or genetically
modified
microbial cells including archaebacteria such as methanogens, halophiles, and
thermoacidophiles, and eubacteria, such as cyanobacteria, spirochetes,
proteobacteria,
as well as Gram positive and Gram negative bacteria.
The term "modified" refers to altered sequences and to sequence fragments,
variants,
and derivatives, as described herein.
The nucleic acid molecules as noted herein refer to polynucleotides,
oligonucleotides,
or fragments thereof, and to DNAs or RNAs of natural, recombinant, synthetic
or semi-
synthetic, origin which may be single or double stranded, and can represent
sense or
antisense strands, or coding or non-coding regions, or intergenic regions.
These
molecules of the invention preferably comprise at least 12, 15, 30, 45, 60,
75, 90, 105,
120, 135, 150, 300, 450, 600, 750 nucleotides, preferably at least 15 to 30,
30 to 60, 60
to 90, 90 to 120, 120 to 150, 150 to 300, 300W. to 450, 450 to 600, or 600 to
750
nucleotides, or at least 800, 850, 900, 950, 1000, 1200, 1300, 1400, or 1500
, nucleotides. It will be understood that a nucleic acid molecule as noted
herein, will
include the native, full length sequence, as well as any complements,
fragments,
alterations, derivatives, or variants, thereof.
The term "oligonucleotide" refers to a nucleic acid sequence of at least 6, 8,
10, 12, 15,
18, 21, 25, 27, 30, or 36 nucleotides, or at least 12 to 36 nucleotides, or at
least 15 to
nucleotides, which can be used in PCR amplification, sequencing, or
hybridization
assays. As used herein, oligonucleotide is substantially equivalent to the
terms
30 "amplimers," "primers," "oligomers," and "probes," as commonly defined
in the art.
The term "polynucleotide," when used in the singular or plural, generally
refers to any
nucleic acid sequence, e.g., any polyribonucleotide or polydeoxribonucleotide,
which
may be unmodified RNA or DNA or modified RNA or DNA. This includes, without
limitation, single and double stranded DNA, DNA including single and double
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
24
stranded regions, single and double stranded RNA, and RNA including single and
double stranded regions, hybrid molecules comprising DNA and RNA that may be
single stranded or, more typically, double stranded or include single and
double
stranded regions. Also included are triple-stranded regions comprising RNA or
DNA
or both RNA and DNA. Specifically included are mRNAs, cDNAs, and genomic
DNAs, and any fragments thereof. The term includes DNAs and RNAs that contain
one or more modified bases, such as tritiated bases, or unusual bases, such as
inosine. The polynucleotides of the invention can encompass coding (e.g., SEQ
ID
NO: 1-1718) or non-coding sequences (e.g., SEQ ID NO: 1719-3102 or SEQ ID
NO:7607-7684), or intergenic sequences (e.g., SEQ ID NO: 3103-5866), or sense
or
antisense sequences, or iRNAs such as siRNAs. It will be understood that each
reference to a "polynucleotide" or like term, herein, will include the full
length
sequences as well as any complements, fragments, alterations, derivatives, or
=
variants thereof.
A "peptide" and "polypeptide," as used herein, refer to the isolated peptides
or
polypeptides of the invention obtained from any species, preferably microbial,
from any
source whether natural, synthetic, semi-synthetic, or recombinant.
Specifically, a
peptide or polypeptide of the invention can be obtained from methanogen cells,
such as
Methanobrevibacter cells, in parkular, M. ruminantium, or M. smithii cells.
For
recombinant production, a peptide or polypeptide of the invention can be
obtained from
microbial or eukaryotic cells, for example, Escherichia, Streptomyces,
Bacillus,
Salmonella, yeast, insect cells such as Drosophila, animal cells such as COS
and CHO
cells, or plant cells. It will be understood that each reference to a
"peptide' or
"polypeptide," herein, will include the full-length sequence, as well as any
fragments,
alterations, derivatives, or variants, thereof.
"Peptide nucleic acid" or "PNA" as used herein, refers to an antisense
molecule or anti-
gene agent which comprises bases linked via a peptide backbone.
The term "ruminant," as used herein, refers to animals that have a rumen as a
special
type of digestive organ. Ruminants include, but are not limited to, cattle,
sheep, goats,
buffalo, moose, antelope, caribou, and deer.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
The term "SEQ ID NO:" refers to a specifically numbered sequence as disclosed
herein.
The format of "SEQ ID NO: #--Ir refers to each sequence taken individually,
and any
combination thereof.
5 The terms "stringent conditions" or "stringency," as used herein, refer
to the conditions
for hybridization as defined by the nucleic acid, salt, and temperature. These
conditions
are well known in the art and may be altered in order to identify or detect
identical or
related polynucleotide 'sequences. See, e.g., Sambrook, J. et al. (1989)
Molecular
Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, NY, and
Ausubel,
10 .. F. M. et al. (1989) Current Protocols in Molecular Biology, John Wiley &
Sons, New
York, NY. Numerous equivalent conditions comprising either low or high
stringency
depend on factors such as the length and nature of the sequence (DNA, RNA,
base
composition), nature of the target (DNA, RNA, base composition), milieu (in
solution or
immobilized on a solid substrate), concentration of salts and other components
(e.g.,
15 formamide, dextran sulfate and/or polyethylene glycol), and temperature
of the
reactions (e.g., within a range from about 5 C below the melting temperature
of the
probe to about 20 C to 25 C below the melting temperature). One or more
factors may
be varied to generate conditions of either low or high stringency different
from, but
equivalent to, the above listed conditions.
The term "subject" includes human and non-human animals. Non-human animals
include, but are not limited to, birds and mammals, such as ruminants, and in
particular,
mice, rabbits, cats, dogs, pigs, sheep, goats, cows, and horses.
The terms "substantially purified" or "isolated" as used herein, refer to
polypeptides,
peptides, or polynucleotides, that are removed from their cellular,
recombinant, or
synthetic environment, and are at least 60% free, preferably 75% free, and
most
preferably at least 90% free or at least 99% free from other components with
which they
are associated in their environment. "Isolated" polynucleotides and
polypeptides have
.. been identified and separated from at least one contaminant molecule with
which they
are associated in their natural state. Accordingly, it will be understood that
isolated
polynucleotides and polypeptides are in a form which differs from the form or
setting in
which they are found in nature. It will further be appreciated that "isolated"
does not
necessarily reflect the exact extent (e.g., a specific percentage) to which
the sequence
has been purified.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
26
=
"Transformation," as defined herein, describes a process by which exogenous
DNA
enters and changes a recipient cell. It may occur under natural or artificial
conditions
using various methods well known in the art. Transformation may rely on any
known
method for the insertion of foreign polynucleotides into a prokaryotic or
eukanjotic host
cell. The method is selected based on the type of host cell being transformed
and may
include, but is not limited to, viral infection, electroporation, heat shock,
lipofection, and
particle bombardment. Such "transformed" cells include stably transformed
cells in
which the inserted DNA is capable of replication either as an autonomously
replicating
plasmid or as part of the host chromosome. They also include cells which
transiently
express the inserted DNA or RNA for limited periods of time. ,
"Vaccines" as used herein include all components and compositions for
stimulating the
immune response in a subject. Particularly useful in this regard are subunit
vaccines,
including peptide vaccines, and also vectored vaccines, nucleic acid vaccines,
and
edible vaccines. Vaccines can be used to establish or strengthen an immune
response
to an antigen, particularly a microbial antigen. In particular aspects,
vaccines comprise
antigens that evoke host-protective reactions, e.g., antibody formation, T
helper, and T
cell responses. Vaccines can also comprise antibodies, for example, for
passive
immunization.
A "variant" of a peptide, polypeptide, or antibody, as used herein, refers to
an amino
acid molecule that is altered by one or more amino acids. A variant
polynucleotide is
altered by one or more nucleotides. A variant may result in "conservative"
changes,
wherein a substituted amino acid has similar structural or chemical
properties, e.g.,
replacement of leucine with isoleucine. More rarely, a variant may result in
"nonconservative" changes, e.g., replacement of a glycine with a tryptophan.
Analogous
minor variations may also include amino acid deletions or insertions, or both.
Guidance
in determining which amino acid residues may be substituted, inserted, or
deleted
without abolishing biological or immunogenic/immunological activity may be
found using
computer programs well known in the art, for example, LASERGENE software
(DNASTAR).
The invention also encompasses nucleic acid and amino acid variants which
retain at
least one biological activity (e.g., cell association, membrane association)
or
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
27
immunogenicity/imrnunological activity. A preferred variant is one having
substantially
the same or a functionally equivalent sequence, for example, having at least
70%, at
least 75%, at least 80%, at least 85%, and more preferably at least 90%,
sequence
identity to a disclosed sequence. A most preferred variant is one having at
least 95%, at
.. least 97%, at least 98%, or at least 99%, at least 99.5%, at least 99.8%,
or at least
99.9% sequence identity to a sequence disclosed herein. The percentage
identity is
determined by aligning the two sequences to be compared as described below,
determining the number of identical residues/nucleotides in the aligned
portion, dividing
that number by the total number of residues/nucleotides in the= inventive
(queried)
sequence, and multiplying the result by 100. A useful alignment program is
AlignX
(Vector NTI).
Description of the invention
Methane is produced in the foregut of ruminants by methanogens which act as
terminal
.. reducers of carbon in the rumen system. The multi-step rnethanogenesis
pathway is
well- elucidated, mainly from the study of non-rumen methanogens. However, the
adaptations that allow methanogens to grow and persist in the rumen are not
well
Understood. Methanobrevibacter ruminantium (formerly Methanobacterium
ruminantium) is the so-called type species of the Methanobrevibacter genus and
was
isolated from the bovine rumen (Smith, 1958). M. ruminantium is a dominant
methanogen worldwide which is found in ruminants fed a wide variety of diets
(Janssen,
2008). As such, M. ruminantium represents an important target for anti-methane
technology.
We have embarked on a programme to sequence the genomes of cultured
representatives of the main rumen methanogen groups. Defining gene targets
within
rumen methanogens for CH4 mitigation technologies is somewhat akin to
developing a
therapeutic intervention for a microbial pathogen. Therefore, our analysis of
the M.
ruminantium genome is presented with an emphasis on identifying conserved
methanogen surface proteins suitable for vaccine development via reverse
vaccinology
techniques (Rappuoli, 2001) and enzyme targets susceptible to small molecule
inhibitors through a chemogenomics approach (Caron et al., 2001).
CA 02772224 2012-02-24
. WO 2011/025394 PCT/NZ2010/000169
28
In view of this, we have elucidated the full genome sequence of M. ruminantium
and
have identified all of the components of the methanogenesis pathway therein.
Comparison of these gene, sequences with those from Methanobacterium
the rmoautotrophicum and Methanosphaera stadtmanae indicates methanogenesis
gene organisation is conserved within the Methanobacteriales (Figures 1, 10,
13, and
15). The M. ruminantium genome also includes many large surface proteins which
may
mediate association with other rumen microbes. Based on the role of M.
ruminantium in
the rumen environment, the identified polynucleotides and polypeptides can be
used as
a means for inhibiting methanogens and/or methanogenesis, and to further
elucidate
the role of M. ruminantium in methane formation. Particularly useful are the
disclosed
polynucleotides and polypeptides identified as components involved in
methanogenesis
(Tables 2, 4, 5, and 9, below), as cell surface components (Tables 3, 5, 6,
and 9,
below), as components involved in exopolysaccharide biosynthesis (Tables 2, 4,
and 9,
below), as components with membrane spanning domains (Tables 3 and 9, below),
as
components involved in non-ribosomal peptide synthesis (Tables 7 and 9,
below), as
well as the polynucleotides and polypeptides for antibody production (Tables 2
and 9,
below). The specific M. ruminantium sequences are disclosed herewith include
identified ORFs (Table 11), non-coding features (Table 12), and intergenic
regions
(Table 13).
The M1 genome was sequenced, annotated and subjected to comparative genomic
and
metabolic pathway analyses. Conserved and methanogen-specific gene sets
suitable
as targets for vaccine development or chemogenomic-based inhibition of rumen
methanogens were identified. The feasibility of using a synthetic peptide-
directed
vaccinology approach to target epitopes of methanogen surface proteins was
demonstrated. A prophage genome was described and its lytic enzyme,
endoisopeptidase PeiR, was shown to lyse M1 cells in pure culture. A predicted
stimulation of M1 growth by alcohols was demonstrated and microarray analyses
indicated up-regulation of methanogenesis genes during co-culture with a
hydrogen
(H2) producing rumen bacterium. We also report the discovery of non-ribosomal
peptide
synthetases in M. ruminantium Ml, the first reported in archaeal species. The
M1
genome sequence provides new insights into the lifestyle and cellular
processes of this
important rumen methanogen. It also defines vaccine and chemogenomic targets
for
broad inhibition of rumen methanogens and represents a significant
contribution to
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
29
worldwide efforts to mitigate ruminant methane emissions and reduce production
Of
anthropogenic greenhouse gases.
Peptides, polypeptides, and polynucleotides
The invention encompasses peptides and polypeptides, including those
comprising at
least one of SEQ ID NO: 5867-7584, and fragments, variants, and derivatives
thereof. -
The peptides and polypeptides of the present invention may be expressed and
used in
various assays to determine their biological activity. The peptides and
polypeptides may
be used for large-scale synthesis and isolation protocols, for example, for
commercial
production. Such peptides and polypeptides may be used to raise antibodies, to
isolate
corresponding amino acid sequences, and to quantitatively determine levels of
the
e
amino acid sequences. The peptides and polypeptides can be used for vaccines
for
targeting and inhibiting microbial cells, especially methanogen cells. The
peptides and
polypeptides can also be used for preparing antibodies to inhibit the growth
or
replication of such cells. The peptides and polypeptides of the present
invention may
also be used as compositions, for example, pharmaceutical compositions,
especially
vaccine compositions. In particular aspects, slow-release ruminal devices can
be used
in conjunction with the peptides, polypeptides, antibodies, and compositions
(e.g.,
pharmaceutical compositions, especially vaccine compositions) of the
invention.
The peptides of the present invention comprise at least one sequence selected
from the
group consisting of: (a) peptides comprising at least a fragment of an one
amino acid
sequence selected from the group consisting of SEQ ID NO: 5867-7584, or
fragments,
variants, or derivatives thereof; (b) peptides comprising a functional domain
of at least
one amino acid sequence selected from the group consisting of SEQ ID NO: 5867-
7584, and fragments and variants thereof; and (c) peptides comprising at least
a
specified number of contiguous residues (see exemplary lengths hereinabove) of
at
least one amino acid sequence selected from the group consisting of SEQ ID NO:
5867-7584, or variants or derivatives thereof. In one embodiment, the
invention
encompasses an isolated peptide comprising the amino acid sequence of at least
one
of SEQ ID NO: 5867-7584. All of these sequences are collectively, referred to
herein as
peptides of the invention.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
The polypeptides of the present invention comprise at least one sequence
selected
from the group consisting of: (a) polypeptides comprising at least one amino
acid
sequence selected from the group consisting of SEQ ID NO: 5867-7584, or
fragments,
variants, or derivatives thereof; (b) polypeptides comprising a functional
domain of at
5 least one amino acid sequence selected from the group consisting of SEQ ID
NO:
5867-7584, and fragments and variants thereof; and (c) polypeptides comprising
at
least a specified number of .contiguous residues (see exemplary lengths
hereinabove)
of at least one amino acid sequence selected from the group consisting of SEQ
ID NO:
5867-7584, or variants or derivatives thereof. In one embodiment, the
invention
10 encompasses an isolated polypeptide comprising the amino acid sequence
of at least
one of SEQ ID NO: 5867-7584. All of these sequences are collectively referred
to
herein as polypeptides of the invention.
The invention also encompasses an isolated polynucleotide That encodes a
peptide or
15 polypeptide of SEQ ID NO: 58677584. The isolated polynucleotides of the
present
invention have utility in genome mapping, in physical mapping, and in cloning
of genes
of more or less related cell surface components. Probes designed using the
polynucleotides of the present invention may be used to detect the presence
and
examine the expression patterns of genes in any organism having sufficiently
20 homologous DNA and RNA sequences in their cells, using techniques that are
well
known in the art, such as slot blot techniques or microarray analysis. Primers
designed
using the polynucleotides of the present invention may be used for sequencing
and
PCR amplifications. The polynucleotides of the invention can be used for
preparing
expression vectors and host cells for vaccines to target and inhibit microbial
cells,
25 especially methanogen cells. The invention further encompasses the use of
the
polynucleotides for the production of antibodies to inhibit the growth or
replication of
such cells. The polynucleotides of the present invention may also be used as
compositions, for example, pharmaceutical compositions, especially vaccine
compositions. In particular aspects, slow-release ruminal devices can be used
in
30 conjunction with the polynucleotides, vectors, host cells, and
compositions (e.g.,
pharmaceutical compositions, especially vaccine compositions) of the
invention.
The polynucleotides of the present invention comprise at least one sequence
selected
from the group consisting of: (a) sequences comprising a coding sequence for
at least
one amino acid sequence selected from the group consisting of SEQ ID NO: 5867-
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
31
7584, or fragments or variants thereof; (b) complements, reverse sequences,
and
reverse complements of a coding sequence for at least one amino acid sequence
selected from the group consisting of SEQ ID NO: 5867-7584, or fragments or
variants
thereof; (c) open reading frames contained in the coding sequence for at least
one
amino acid sequence selected from the group consisting of SEQ ID NO: 5867-
7584,
and their fragments and variants; (d) functional domains of a coding sequence
for at
least one amino acid sequence selected from the group consisting of SEQ ID NO:
5867-7584, and fragments and variants thereof; and (e) sequences comprising at
least
a specified number of contiguous residues (see exemplary lengths hereinabove)
of a
coding sequence for at least one amino acid sequence selected from the group
consisting of SEQ ID NO: 5867-7584, or variants thereof; and (f) sequences
comprising
at least a specified number of contiguous nucleotides (see exemplary lengths -
hereinabove) of any one of SEQ ID NO: 1-1718. Oligonucleotide probes and
primers
(e.g., SEQ ID NO: 7586-7607) and their variants are also provided. All of
these
polynucleotides and oligonucleotide probes and primers are collectively
referred to
herein, as polynucleotides of the invention.
It will be appreciated by those skilled in the art that as a result of the
degeneracy of the
genetic code, a multitude of nucleotide sequences encoding the peptides or
polypeptides of the invention, some bearing minimal homology to the nucleotide
sequences of any known and naturally occurring gene, may be produced. Thus,
the
invention contemplates each and every possible variation of nucleotide
sequence that
could be made by selecting combinations based on possible codon choices. These
combinations are made in accordance with the standard triplet genetic code as
applied
to naturally occurring amino acid sequences, and all such variations are to be
considered as being specifically disclosed.
Nucleotide sequences which encode the peptides or polypeptides, or their
fragments or
variants, are preferably capable of hybridizing to the nucleotide sequence of
the
naturally occurring sequence under appropriately selected conditions of
peptide or
stringency. However, it may be advantageous to produce nucleotide sequences
encoding a peptide or polypeptide, or its fragment or derivative, possessing a
substantially different codon usage. Codons may be selected to increase the
rate at
which expression of the peptide or polypeptide occurs in a particular
prokaryotic or
eukaryotic host in accordance with the frequency with which particular codons
are
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
32
utilized by the host. Other reasons for substantially, altering the nucleotide
sequence
encoding peptides or polypeptides and its derivatives without altering the
encoded
amino acid sequences include the production of RNA transcripts having more
desirable
properties, such as a greater half-life, than transcripts produced from the
naturally
occurring sequence.
The invention also encompasses production of DNA sequences, or fragments
thereof,
which encode the peptides or polypeptides, or their fragments or variants,
entirely by
synthetic chemistry. After production, the synthetic sequence may be inserted
into any
of the many available expression vectors and cell systems using reagents that
are well
known in the art. Moreover, synthetic chemistry may be used to introduce
mutations into
a sequence encoding a peptide or polypeptide, or any variants or fragment
thereof. Also
encompassed by the invention are polynucleotide sequences that are capable of
hybridizing to the claimed nucleotide sequences, and in particular, those
shown in SEQ
ID NO: 1-1718, under various conditions of stringency as taught in Wahl, G. M.
and S.
L. Berger (1987; Methods Enzymol. 152:399-407) and Kimmel, A. R. (1987;
Methods
Enzymol. 152:507-511).
Methods for DNA sequencing which are well known and generally available in the
art
and may be used to practice any of the embodiments of the invention. The
methods
may employ such enzymes as the Klenow fragment of DNA polymerase I,
SEQUENASE (U.S. Biochemical Corp, Cleveland, OH), Taq polymerase (Perkin
Elmer), thermostable T7 polymerase Amersham Pharmacia Biotech (Piscataway,
NJ),
or combinations of polymerases and proofreading exonucleases such as those
found in
the ELONGASE Amplification System marketed by Life Technologies (Gaithersburg,
MD). Preferably, the process is automated with machines such as the Hamilton
Micro
Lab 2200 (Hamilton, Reno, NV), Peltier Thermal Cycler (PTC200; MJ Research,
Watertown, MA) the ABI Catalyst and 373 and 377 DNA Sequencers (Perkin Elmer),
or
the Genome Sequencer 2011" (Roche Diagnostics).
The polynucleotides encoding the peptides or polypeptides may be extended
utilizing a
partial nucleotide sequence and employing various methods known in the art to
detect
upstream sequences such as promoters and regulatory elements. For example, one
method which may be employed, "restriction-site" PCR, uses universal primers
to
retrieve unknown sequence adjacent to a known locus (Sarkar, G. (1993) PCR
Methods
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
33
Applic. 2:318-322). In particular, genomic DNA is first amplified in the
presence of
primer to a linker sequence and a primer specific to the known region. The
amplified
sequences are then subjected to a second round of PCR with the same linker
primer
and another specific primer internal to the first one. Products of each round
of PCR are
transcribed with an appropriate RNA polymerase and sequenced using reverse
transcriptase.
Capillary electrophoresis systems which are commercially available may be used
to
analyze the size or confirm the nucleotide sequence of sequencing or PCR
products. In
particular, capillary sequencing may employ flowable polymers far
electrophoretic
separation, four different fluorescent dyes (one for each nucleotide) which
are laser
activated, and detection of the emitted wavelengths by a charge coupled device
camera. Output/light intensity may be converted to electrical signal using
appropriate
software (e.g. GENOTYPER and Sequence NAVIGATOR, Perkin Elmer) and the entire
process from loading of samples to computer analysis and electronic data
display may
be computer controlled. Capillary electrophoresis is especially preferable for
the
sequencing of small pieces of DNA which might be present in limited amounts in
a
particular sample.
In another embodiment of the invention, polynucleotides or fragments thereof
which
encode peptides or polypeptides may be used in recombinant DNA molecules to
direct
expression of the peptides or polypeptides, or fragments or variants thereof,
in
appropriate host cells. Due to the inherent degeneracy of the genetic code,
other DNA
sequences which encode substantially the same or a functionally equivalent
amino acid
sequence may be produced, and these sequences may be used to clone and express
peptides or polypeptides. The nucleotide sequences of the present invention
can be
engineered using methods generally known in the art in order to alter amino
acid-
encoding sequences for a variety of reasons, including but not limited to,
alterations
which modify the cloning, processing, and/or expression of the gene product.
DNA
shuffling by random fragmentation and PCR reassembly of gene fragments and
synthetic oligonucleotides may be used to engineer the nucleotide sequences.
For
example, site-directed mutagenesis may be used to insert new restriction
sites, alter
glycosylation patterns, change codon preference, introduce mutations, and so
forth.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
34
In another embodiment of the invention, natural, modified, or recombinant
nucleic acid
sequences encoding peptides or polypeptides may be ligated to a heterologous '
sequence to encode a fusion protein. For example, it may be useful to encode a
chimeric sequence that can be recognized by a commercially available antibody.
A
fusion protein may also be engineered to contain a cleavage site located
between the
peptide or polypeptide of the invention and the heterologous protein sequence,
so that
the peptide or polypeptide may be cleaved and purified away from the
heterologous
moiety.
In another embodiment, sequences encoding peptides or polypeptides may be
synthesized, in whole or in part, using chemical methods well known in the art
(see
- Caruthers, M. H. et al. (1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn,
T. et al.
(1980) Nucl. Acids Res. Symp. Ser. 225-232). Alternatively, the polypeptide
itself may
be produced using chemical methods to synthesize the amino acid molecule, or a
fragment thereof. For example, polypeptide synthesis can be performed using
various
solid-phase techniques (Roberge, J. Y. et al. (1995) Science 269:202-204;
Merrifield J.
(1963) J. Am. Chem. Soc. 85:2149-2154) and automated synthesis may be
achieved, -
for example, using the ABI 431A Peptide Synthesizer (Perkin Elmer). Various
fragments
of peptides or polypeptides may be chemically synthesized separately and
combined
using chemical methods to produce the full length molecule.
The newly synthesized peptide or polypeptide may be isolated by preparative
high
performance liquid chromatography (e.g., Creighton, T. (1983) Proteins
Structures and
Molecular Principles, VVH Freeman and Co., New York, NY). The composition of
the
synthetic peptides or polypeptides may be confirmed by amino acid analysis or
sequencing (e.g., the Edman degradation procedure; Creighton, supra).
Additionally,
the amino acid sequence of the peptide or polypeptide, or any part thereof,
may be
altered during direct synthesis and/or combined using chemical methods with
sequences from other proteins, or any part thereof, to produce a variant
molecule.
In order to express a biologically active peptides or polypeptides, the
nucleotide
sequences encoding the sequences or functional equivalents, may be inserted
into an
appropriate expression vector, i.e., a vector which contains the necessary
elements for
the transcription and translation of the inserted coding sequence. Methods
which are
well known to those skilled in the art may be used to construct expression
vectors
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
containing sequences encoding the peptide or polypeptide and appropriate
transcriptional and translational control elements. These methods include in
vitro
recombinant DNA techniques, synthetic techniques, and in vivo genetic
recombination.
Such techniques are described in Sambrook, J. et al_ (1989) Molecular Cloning,
A
5 Laboratory Manual, Cold Spring Harbor Press, Plainview, NY, and Ausubel,
F. M. et al.
(1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York,
NY.
A variety of expression vector/host systems may be utilized to contain and
express
sequences encoding the peptides or polypeptides of the invention. These
include, but
10 are not limited to, microorganisms such as bacteria transformed with
recombinant
phage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast
expression vectors; insect cell systems infected with virus expression vectors
(e.g.,
baculovirus); plant cell systems transformed with virus expression vectors
(e.g.,
cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with bacterial
15 expression vectors (e.g., Ti or pBR322 plasmids); or animal cell
systems. For bacteria,
useful plasmids include pET, pRSET, pTrcHis2, and pBAD plasmids from
Invitrogen,
pET and pCDF plasmids from Novagen, and Director114 plasmids from Sigma-
Aldrich.
For methanogens, useful plasmids include, but are not limited to pME2001,
pMV15, and
pMP1. The invention is not limited by the expression vector or host cell
employed.
The "control elements" or "regulatory sequences" are those non-translated
regions of
the vector¨enhancers, promoters, 5' and 3' untranslated regions--which
interact with
host cellular proteins to carry out transcription and translation. Such
elements may vary
in their strength and specificity. Depending on the vector system and host
utilized, any
number of suitable transcription and translation elements, including
constitutive and
inducible promoters, may be used. For example, when cloning in bacterial
systems,
inducible promoters such as the hybrid lacZ promoter of the BLUESCRIPT
phagemid
(Stratagene, LaJolla, CA) or pSPORT1 plasmid (Life Technologies) and the like
may be
used. The baculovirus polyhedrin promoter may be used in insect cells.
Promoters or
enhancers derived from the genomes of plant cells (e.g., heat shock, RUBISCO,
and
storage protein genes) or from plant viruses (e.g., viral promoters or leader
sequences)
may be cloned into the vector.
In bacterial systems, a number of expression vectors may be selected depending
upon
the use intended for the peptide or polypeptide. For example, when large
quantities of
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
36
=
peptide or polypeptide are needed, vectors which direct high level expression
of fusion
proteins that are readily purified may be used. Such vectors include, but are
not limited
to, the multifunctional E. coli cloning and expression vectors such as
BLUESCRIPT
(Stratagene), in which the sequence encoding a polypeptide may be ligated into
the
vector in frame with sequences for the amino-terminal Met and the subsequent 7
residues of j3-galactosidase so that a hybrid protein is produced; pIN vectors
(Van
Heeke, G. and S. M. Schuster (1989) J. Biol. Chem. 264:5503-5509); and the
like.
pGEX vectors (Promega, Madison, WI) may also be used to express peptides or
polypeptides as fusion proteins with 'glutathione S-transferase (GST). In
general, such
fusion proteins are soluble and can easily be purified from lysed cells by
adsorption to
glutathione-agarose beads followed by elution in the presence of free
glutathione.
Proteins made in such systems may be designed to include heparin, thrombin, or
factor
Xa protease cleavage sites-so that the cloned peptide or polypeptide of
interest can be
released from the GST moiety at will. In the yeast, Saccharomyces cerevisiae,
a
number of vectors containing constitutive or inducible promoters such as alpha
factor,
alcohol oxidase, and PGH may be used. For reviews, see Ausubel et al. (supra)
and
Grant et al. (1987) Methods Enzymol. 153:516-544.
Specific initiation signals may also be used to achieve more efficient
translation of
sequences encoding the peptides or polypeptides of the invention. Such signals
include
the ATG initiation codon and adjacent sequences. In cases where sequences
encoding
a peptide or polypeptide, its initiation codon, and upstream sequences are
inserted into
the appropriate expression vector, no additional transcriptional or
translational control
signals may be needed. However, in cases where only coding sequence, or a
fragment
thereof, is inserted, exogenous translational control signals including the
ATG initiation
codon should be provided. Furthermore, the initiation codon should be in the
correct
reading frame to ensure translation of the entire insert. Exogenous
translational
elements and initiation codons may be of various origins, both natural and
synthetic.
The efficiency of expression may be enhanced by the inclusion of enhancers
which are
appropriate for the particular cell system which is used, such as those
described in the
literature (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162).
In addition, a host cell strain may be chosen for its ability to modulate the
expression of
the inserted sequences or to process the expressed peptide or polypeptide in
the
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
37
desired fashion. Such modifications of the sequence include, but are not
limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and
acylation.
Post-translational processing which cleaves a "prepro" form of the peptide or
polypeptide may also be used to facilitate correct insertion, folding, and/or
function.
Different host cells which have specific cellular machinery and characteristic
mechanisms for post-translational activities are available from the American
Type
Culture Collection (ATCC; Bethesda, MD) and may be chosen to ensure the
correct
modification and processing of the sequence. Specific host cells include, but
are not
limited to, methanogen cells, such as Methanobrevibacter cells, in particular,
M.
ruminantium, or M. smithii cells. Host cells of interest include, for example,
Rhodotorula,
Aureobasidium, Saccharomyces, Sporobolomyces, Pseudomonas, Erwinia and
Flavobacterium; or such other organisms as Escherichia, Lactobacillus,
Bacillus,
Streptomyces, and the like. Specific host cells include Escherichia coli,
which is
particularly suited for use with the present invention, Saa charomyces
cerevisiae,
Bacillus thuringiensis, Bacillus subtilis, Streptomyces lividans, and the
like.
There are several techniques for introducing polynucleotides into eukaryotic
cells
_ cultured in vitro. These include chemical methods (Feigner et al., Proc.
Natl. Acad. Sci.,
USA, 847413 7417 (1987); Bothwell et al., Methods for Cloning and Analysis of
Eukaryotic Genes, Eds., Jones and Bartlett Publishers Inc., Boston, Mass.
(1990), _
Ausubel et al., Short Protocols in Molecular Biology, John Wiley and Sons, New
York,
NY (1992); and Farhood, Annal. NY Acad. Sci., 716:23 34 (1994)), use of
protoplasts
(Bothwell, supra) or electrical pulses (Vatteroni et al., Mutn. Res., 291:163
169 (1993);
Sabelnikov, Prog. Biophys. Mol. Biol., 62: 119 152 (1994); Bothwell et al.,
supra; and
Ausubel et al., supra), use of attenuated viruses (Davis et al., J. Viral.
1996, 70(6), 3781
3787; Brinster et al. J. Gen. Virol. 2002, 83(Pt 2), 369 381; Moss, Dev. Biol.
Stan.,
82:55 63 (1994); and Bothwell et al., supra), as well as physical methods
(Fynan et al.,
Int J lmmunopharmacol. 1995 Feb;17(2):79-83; Johnston et al., Meth. Cell
Biol., 43(Pt
A):353 365 (1994); Bothwell et al., supra; and Ausubel et al., supra).
Successful delivery of polynucleotides to animal tissue can be achieved by
cationic
liposomes (Watanabe et al., Mol. Reprod. Dev., 38:268 274 (1994)), direct
injection of
naked DNA or RNA into animal muscle tissue (Robinson et al., Vacc., 11:957 960
(1993); Hoffman et al., Vacc. 12:1529 1533; (1994); Xiang et al., Viral.,
199:132 140
(1994); Webster et al., Vacc., 12:1495 1498 (1994); Davis et al., Vacc.,
12:1503 1509
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
38
(1994); Davis et al., Hum. Molec. Gen., 2:1847 1851 (1993); Dalemans et al.
Ann NY
Acad. Sci. 1995, 772, 255 256. Conry, et at. Cancer Res. 1995, 55(7), 1397-
1400), and
embryos (Naito et al., Mol. Reprod. Dev., 39:153 161 (1994); and Burdon et
al., Mol.
Reprod. Dev., 33:436 442 (1992)), intramuscular injection of self replicating
RNA
vaccines (Davis et al., J Virol 1996, 70(6), 3781 3787; Balasuriya et at.
Vaccine 2002,
20(11 12), 1609 1617) or intradermal injection of DNA using "gene gun"
technology
(Johnston et al., supra).
A variety of protocols for detecting and measuring the expression of the
peptides or
.. polypeptides of the invention, using either polyclonal or monoclonal
antibodies specific
for the protein are known in the art. Examples include enzyme-linked
immunosorbent
assay (ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting
(FACS). A two-site, monoclonal-based immunoassay can be used with monoclonal
antibodies reactive to two non-interfering epitopes on the peptide or
polypeptide, but a
.. competitive binding assay can also be used. These and other assays are
described,
among other places, in Hampton, R. et al. (1990; Serological Methods, a
laboratory -
Manual, APS Press, St Paul, MN) and Maddox, D. E. et al. (1983; J. Exp. Med.
158:1211-1216).
A wide variety of labels and conjugation techniques are known by those skilled
in the art
and may be used in various nucleic acid and amino acid assays. Means for
producing
labeled hybridization or PCR probes for detecting sequences related to
polynucleotides
= include oligolabeling, nick translation, end-labeling or PCR
amplification using a labeled
nucleotide. Alternatively, the sequences encoding the peptides or
polypeptides, or any
fragments or variants thereof, may be cloned into a vector for the production
of an
mRNA probe. Such vectors are known in the art, are commercially available, and
may
be used to synthesize RNA probes in vitro by addition of an appropriate RNA
= polymerase such as T7, T3, or SP6 and labeled nucleotides. These
procedures may be
conducted using a variety of commercially available kits Amersham Phamnacia
Biotech,
.. Promega, and US Biochemical. Suitable reporter molecules or labels, which
may be
used for ease of detection, include radionuclides, enzymes, fluorescent,
chemiluminescent, or chromogenic agents as well as substrates, cofactors,
inhibitors,
magnetic particles, and the like.
= CA 2772224 2017-03-13
39
=
Expression vectors or host cells transformed with expression vectors may be
cultured
under conditions suitable for the expression and recovery of the peptide or
polypeptide
from culture. The culture can comprise components for in vitro or in vivo
expression. In
vitro expression components include those for rabbit reticulocyte lysates, E.
coil lysates,
and wheat germ extracts, for example, ExpresswayTm or RiPs systems from
InvitrOgen,
GenelatorTm systems from iNtRON Biotechnology, EcoProTm or STP311" systems
from
Novagen, TNT Quick Coupled systems from Promega; and EasyXpress systems. from
QIAGEN. The peptide or polypeptide produced from culture may be secreted or
= contained intracellularly depending on the sequence and/or the vector
used. ' In
- particular aspects, expression vectors which encode a peptide or polypeptide
can be
designed to contain signal sequences which direct secretion of the peptide or
polypeptide through a prokaryotic or eukaryotic cell membrane. Specific signal
peptides
for use herein have been disclosed in detail in US 60/975,104 filed 25
September 2007,
= and in PCT/2008/000247 filed 25 September 2008,
Other constructions may include an amino add domain which will facilitate
purification
of the peptide or polypeptide. Such domains include, but are not limited to,
metal -
chelating domains such as histidine-tryptophan (e.g., 6X-HIS) modules that
allow
purification on immobilized metals, protein A domains that allow purification
on
immobilized immunoglobulin, and the domain utilized in the FLAG
extension/affinity
purification system (Immunex Corp., Seattle, WA). Useful epitope tags include
3XFLAGO, HA, VSV-G, V5, HSV, GST, GFP, MBP, GAL4, and B-galactosidase. Useful
plasmids include those comprising a biotin tag (e.g., PinPointn, plasmids from
Promega), calmodulin binding protein (e.g., pCAL plasmids from Stratagene),
streptavidin binding peptide (e_g_, InterPlayTm plasmids from Stratagene), a c-
myc or
FLAG tag (e.g., Immunoprecipitation plasmids from Sigma-Aldrich), or a
histidine tag
(e.g., QIAExpress plasmids from QIAGEN).
To facilitate purification, expression vectors can include cleavable linker
sequences -
such as those specific for Factor Xa or enterokinase (lnvitrogen, San Diego,
CA). For
= example, the vector can include one or more linkers between the
purification domain
and the peptide or polypeptide. One such expression vector provides for
expression of. -
a fusion protein comprising a peptide or polypeptide of the invention and a
polynucleotide encoding 6 histidine residues preceding a thioredoxin or an
enterokinase
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
cleavage site. The histidine residues facilitate purification on IMAC
(immobilized metal
ion affinity chromatography as described in Porath, J. et at. (1992) Prot.
Exp. Purif. 3:
263-281) while the enterokinase cleavage site provides a means for purifying
the
peptide or polypeptide from the fusion protein. A discussion of vectors which
contain
5 fusion proteins is provided in Kroll, D. J. et al. (1993; DNA Cell Biol.
12:441-453).
In another aspect, the invention provides a peptide or polypeptide comprising
an
epitope-bearing portion of a polypeptide of the invention. The epitope of this
polypeptide
portion can be an immunogenic (eliciting an immune response) or antigenic
(antibody-
10 binding) epitope. These immunogenic epitopes can, be confined to a few
loci on the
molecule. It is understood that the number of immunogenic epitopes of a
protein ,
generally is less than the number of antigenic epitopes. See, for instance,
Geysen et al.,
Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983).
15 As to the selection of peptides or polypeptides bearing an antigenic
epitope (i.e., that
contain a region of a protein molecule to which an antibody can bind), it is
well known in
that art that relatively short synthetic peptides that mimic part of a protein
sequence are
routinely capable of eliciting an antiserum that reacts with the partially
mimicked protein.
See, for instance, Sutcliffe, J. G., Shinnick, T. M., Green, N. and Learner,
R. A. (1983).
20 Antibodies that react with predetermined sites on proteins are described in
Science
219:660-666. Peptides capable of eliciting protein-reactive sera are
frequently
represented in the primary sequence of a protein, can be characterized by a
set of
simple chemical rules, and are confined neither to immunodominant regions of
intact
proteins (i.e., immunogenic epitopes) nor to the amino or carboxyl .terminals.
Peptides
25 that are extremely hydrophobic and those of six or fewer residues generally
are
ineffective at inducing antibodies that bind to the mimicked protein; longer,
peptides,
especially those containing proline residues, usually are effective. Sutcliffe
et al., p. 661.
Antigenic epitope-bearing peptides and polypeptides of the invention are
therefore
30 useful to raise antibodies, including monoclonal antibodies, that bind
specifically to a
polypeptide of the invention. Thus, a high proportion of hybridomas obtained
by fusion
of spleen cells from donors immunized with an antigen epitope-bearing peptide
= generally secrete antibody reactive with the native protein. Sutcliffe et
al., p. 663. The
antibodies raised by antigenic epitope-bearing peptides or polypeptides are
useful to
35 detect the mimicked protein, and antibodies to different peptides may be
used for
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
41
tracking the fate of various regions of a protein precursor which undergoes
post-
translational processing. The peptides andanti-peptide antibodies may be used
in a
variety of qualitative or quantitative assays for the mimicked protein, for
instance in
competition assays since it has been shown that even short peptides (e.g.,
about 9
amino acids) can bind and displace the larger peptides in immunoprecipitation
assays.
See, for instance, Wilson et al., Cell 37:767-778 (1984) at 777. The anti-
peptide
antibodies of the invention also are useful for purification of the mimicked
protein, for
instance, by adsorption chromatography using methods well known in the art.
Antigenic epitope-bearing peptides and polypeptides of the invention designed
according to the above guidelines preferably contain a sequence of at least
seven,
more preferably at least nine and most preferably between about 15 to about 30
amino
acids contained within the amino acid sequence of a polypeptide of the
invention.
However, peptides or polypeptides comprising a larger portion of an amino acid
sequence of a polypeptide of the invention, containing about 30 to about 50
amino
acids, or any length up to and including the entire amino acid sequence of a
polypeptide
of the invention, also are considered epitope-bearing peptides or polypeptides
of the
invention and also are useful for inducing antibodies that react with the
mimicked
protein. Preferably, the amino acid sequence of the epitope-bearing peptide is
selected
to provide substantial solubility in aqueous solvents (i.e., the sequence
includes
relatively hydrophilic residues and highly hydrophobic sequences are
preferably
avoided); and sequences containing proline residues are particularly
preferred.
The epitope-bearing peptides and polypeptides of the invention may be produced
by _
any conventional means for making peptides or polypeptides including
recombinant
means using polynucleotides of the invention. For instance, a short ,epitope-
bearing
amino acid sequence may be fused to a larger polypeptide which acts as a
carrier
during recombinant production and purification, as well as during immunization
to
produce anti-peptide antibodies. Epitope-bearing peptides also may be
synthesized=
using known methods of chemical synthesis. See, e.g., Houghten, R. A. (1985)
General
method for the rapid solid-phase synthesis of large numbers of peptides:
specificity of
antigen-antibody interaction at the level of individual amino acids. Proc.
Natl. Acad. Sci.
USA 82:5131-5135. This process is further described in U.S. Pat. No. 4,631,211
to
Houghten et al. (1986). In this procedure the individual resins for the solid-
phase
synthesis of various peptides are contained in separate solvent-permeable
packets,
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
42
=
enabling the optimal use of the many identical repetitive steps involved in
solid-phase
methods. A completely manual procedure allows 500-1000 or more syntheses to be
conducted simultaneously. Houghten et al., p. 5134
Epitope-bearing peptides and polypeptides of the invention are used to induce
antibodies according to methods well known in the art. See, for instance,
Sutcliffe et al.,
supra; Wilson et al., supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA
82:910-914; and
Bittle, F. J. et al., J. Gen Virol. 66:2347-2354 (1985). Generally, animals
may be
immunized with free peptide; however, anti-peptide antibody titer may be
boosted by
coupling of the peptide to a nnacromolecular carrier, such as keyhole limpet
hemacyanin
(KLH) or tetanus toxoid. For instance, peptides containing cysteine may be
coupled to
carrier using a linker such as m-maleimidobenzoyl-N-hydroxysuccinimide ester
(MBS),
while other peptides may be coupled to carrier using a more general linking
agent such
as glutaraldehyde. Animals such as rabbits, rats and mice are immunized with
either
free or carrier-coupled peptides, for instance, by intraperitoneal -and/or
intradermal
injection of emulsions containing about 100 g peptide or carrier protein and
Freund's
adjuvant. Several booster injections may be needed, for instance, at intervals
of about
two weeks, to provide a useful titer of, anti-peptide antibody which can be
detected, for
example, by ELISA assay using free peptide adsorbed to a solid surface. The
titer of
anti-peptide antibodies in serum from an immunized animal may be increased by
= selection of anti-peptide antibodies, for instance, by adsorption to the
peptide on a solid
support and elution of the selected antibodies according to methods well known
in the
art.
Immunogenic epitope-bearing peptides of the invention, i.e., those parts of a
protein
that elicit an antibody response when the whole protein is the immunogen, are
identified
according to methods known in the art. For instance, Geysen et al., supra,
discloses a
procedure for rapid concurrent synthesis on solid supports of hundreds of
peptides of
sufficient purity to react in an enzyme-linked immunosorbent assay.
Interaction of
=_synthesized peptides with antibodies is then easily detected without
removing them
from the support. In this manner a peptide bearing an immunogenic epitope of a
desired
protein may be identified routinely by one of ordinary skill in the art For
instance, the
immunologically important epitope in the coat protein of foot-and-mouth
disease .virus
was located by Geysen et al. with a resolution of seven amino acids by
synthesis of an
.. overlapping set of all 208 possible hexapeptides covering the entire 213
amino acid
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
43
sequence of the protein. Then, a complete replacement set of peptides in which
all 20
amino acids were substituted in turn at every position within the epitope were
synthesized, and the particular amino acids conferring. specificity for the
reaction with
antibody were determined. Thus, peptide analogs of the epitope-bearing
peptides of the
invention can be made routinely by this method. U.S. Pat. No. 4,708,781 to
Geysen
(1987) further describes this method of identifying a peptide bearing an
immunogenic
epitope of a desired protein.
Further still, U.S. Pat. No. 5,194,392 to Geysen (1990) describes a general
method of
detecting or determining the sequence of monomers (amino acids or other
compounds)
which is a topological equivalent of the epitope (i.e., a "mimotopen) which is
complementary to a ,particular paratope (antigen binding site) of an antibody
of interest.
More -generally, U.S. Pat. No. 4,433,092 to Geysen (1989) describes a method
of
detecting or determining a sequence of monomer which is a topographical
equivalent of
a ligand which is complementary to the ligand binding site of a particular
receptor of
interest. Similarly, U.S. Pat. No. 5,480,971 to Houghten, R. A. et al. (1996)
on
Peralkylated Oligopeptide Mixtures discloses linear C1-C7-alkyl peralkylated
oligopeptides and sets and libraries of such peptides, as well as methods for
using such
oligopeptide sets and libraries for determining the sequence of a peralkylated
oligopeptide that preferentially binds to an acceptor molecule of interest.
Thus, non-
peptide analogs of the epitope-bearing peptides of the invention also can be
made
routinely by these methods.
As one of skill in the art will appreciate, the polypeptides of the present
invention and
the epitope-bearing fragments thereof described above can be combined with
parts of
the constant domain of irnnnunoglobulins (IgG), resulting in chimeric
polypeptides.
These fusion proteins facilitate purification and show an increased half-life
in vivo. This
has been demonstrated, e.g., for chimeric proteins consisting of the first two
domains of
the human CD4-polypeptide and various domains of the constant regions of the
heavy
or light chains of mammalian immunoglobulins (EPA 394,827; Traunecker et al.,
Nature
331:84-86 (1988)). Fusion proteins that have a disulfide-linked dimeric
structure due to
the IgG part can also be more efficient in binding and neutralizing other
molecules than
the monomeric protein or protein fragment alone (Fountoulakis et al., J
Biochem
270:3958-3964 (1995)).
=
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
44
Non-ribosomal peptide synthetases
As particular examples, the polypeptides of the invention may include non-
ribosomal
peptide synthetases. Non-ribosomal peptides are synthesized on enzymatic
thiotemplates termed non-ribosomal peptide synthetases (NRPS). The non-
ribosomal
peptides encompass a wide range of compounds having diverse activities
including, but
not limited to, immunosupressive (such as cyclosporin), surfactant (such as
surfactin),
siderophores (such as enterobactin), virulence factors (such as
yersinabactin),
antibacterial (such as penicillin and vancomycin), and anti-cancer (such as
actinomycin
and bleomycin) activities (Weber et al., Current Genomics 1994; 26:120-25;
Ehmann et
al., Proc. Nat. Acad. Sci. 2000; 97:2509-14; Gehring et al., Biochemistry
1998;
37:11637; Ka!low et al., Biochemistry 1998; 37:5947-52; Trauger et al., Proc.
Nat. Acad.
Sci. 2000; 97:3112-17; Schauweker et al., J. Bacteriology 1999; 27:2468-74;
and Shen
et al., Bioorganic Chem 1999; 27:155-71). As to particular NRPS products, see
Felnagle et al., Nonribosomal peptide synthetases involved in the production
of
medically relevant natural products, Mol. Pharm. 2008 Mar-Apr;5(2):191-211.
Non-ribosomal peptides typically range in size from 1-11 amino acids and are
produced
by a variety of microbes including cyanobacteria, actinomycetes, and fungi. In
many
cases the non-ribosomal peptides contain non-proteogenic amino acids such as
norleucine, p-alanine, omithine, etc., for which biogenesis pathways, which
are
secondary to primary metabolism, are required and are post--synthetically
modified (e.g.,
hydroxylated, methylated or acylated) by tailoring enzymes. The term
proteogenic
indicates that the amino acid can be incorporated into a protein in a cell
through well-
known metabolic pathways. The choice of including a (D)- or (L)-amino add into
a
peptide of the present invention depends, in part, on the desired
characteristics of the
peptide. For example, the incorporation of one or more (D)-amino acids can
confer
increasing stability on the peptide in vitro or in vivo. The term amino acid
equivalent
refers to compounds which depart from the structure of the naturally occurring
amino
acids, but which have substantially the structure of an amino acid, such that
they can be
substituted within a peptide that retains biological activity. Thus, for
example, amino
acid equivalents can include amino acids having side chain modifications
and/or
substitutions, and also include related organic acids, amides or the like.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
The polynucleotide sequences required to make a NRPS and the necessary
tailoring
enzymes have been shown in all cases to be localized to the chromosome of the
producing microbe. NRPS are modular in nature, where a module may be defined
as a
segment of the NRPS necessary to catalyze the activation of a specific amino
acid and -
5 result in the incorporation of that amino acid into a non-ribosomal
peptide. A minimal
module contains three domains: (1) adenylation domains (about 60 kDa),
responsible
for selecting and activating an amino acid and transferring the aminoacyl
adenylate to a
peptidyl carrying centre; (2) thiolation domains, also referred to as peptidyl
carrier
proteins (8-10 kDa), containing a serine residue which is post-translationally
modified
10 with a 4-phosphopantetheine group (Ppant) which acts as an acceptor for
the aminoacyl
adenylate; and (3) condensation domains (50-60 kDa) which catalyze peptide
bond-
forming chain-translocating steps between an upstream peptidyl-s-Ppant and the
downstream aminoacyl-Ppant of the adjacent module (Doekel, S. and Marahiel, M.
A.
2000; Chem. Biol. 7:373-384). This minimal module for chain extension is
typically
15 repeated within a synthetase and a co-linear relationship exists between
the number of
modules present and the number of amino acids in the final product with the
order of
the modules in the synthetase determining the order of the amino acids in the
peptide.
The adenylation domain is typically about 60 kDa. The main function of this
domain is to
20 select and activate a specific amino acid as an aminoacyl adenylate. Based
on its
function, the adenylation domain regulates the sequence of the peptide being
produced.
Once charged (as an amino acyl adenylate moiety), the amino add is transferred
to a
thiolation domain (peptidyl carrying centre). The thiolation domain is also
referred to as
a peptidyl carrier protein. This domain is typically 8-10 kDa and contains a
serine
25 residue that is post-translationally modified with a 4-
phosphopantetheine group. This
group acts as an acceptor for the aminoacyl adenylate moiety on the amino
acid. A
nucleophilic reaction leads to the release of the aminoacyl adenylate and
conjugation of
the amino acid to thiolation domain via a thioester bond. The condensation
domain is
typically about 50-60 kDa in size. The main function of this domain is to
catalyze the
30 formation of a peptide bond between two amino acids. In this reaction an
upstream
tethered peptidyl group is translocated to the downstream aminoacyl-s-Ppant
and linked
to the amino acid by peptide bond formation.
This minimal module for chain extension is typically repeated within a
synthetase.
35 Additionally, and typically, a co-linear relationship exists between the
number of
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
46
modules present and the number of amino acids in the final product with the
order of
the modules in the synthetase determining the order of the amino acids in the
peptide.
This 1:1 relationship, with every amino acid in the product having one module
within the
enzyme, is referred to as the co-linearity rule. Examples have been found that
violate
this rule, and in such cases, the NRPS contains more modules than one would
expect
based on the number of amino acids incorporated in the peptide product
(Challis et al.,
Chem. Biol. 2000; 7:211-24). In some cases the minimal module also is
supplemented
with additional domains (epimerization, N- or C-methylation, or cyclization
domain), with
their position in the synthetase determining the substrate upon which they can
act. In
addition, it has been observed that NRPS contain inter-domain spacers or
linker
regions. It has been proposed that these spacers may play a critical role in
communication between domains, modules, and even entire synthetases.
There are highly conserved motifs in the Catalytic domains of peptide
synthetases
including: 10 conserved motifs in the adenylation domain; 1 conserved motif in
the
thiolation domain; 7 conserved motifs in the condensation domain; 1 conserved
motif in
the thioesterase domain; 7 conserved motifs in the epimerization domains; and
3
conserved motifs in the N-methylation domains. These are detailed in Marahiel
et al.,
Chemical Rev. 1997; 97:2651-73. In addition to modifications such as
epimerization,
methylation and cyclization during peptide synthesis, post-translational
modifications
including methylation, hydroxylation, oxidative cross-linking and
glycosylation can occur
(Walsh et al., Curr. Opin. Chem. Biol. 2001; 5:525-34).
In the present invention, the polynucleotide and polypeptide sequences for
NRPS from
M. ruminantium have been characterized (see below). For use with the present
invention, the enzymes may be tailored as needed. For example, after
production of the
core of the peptide product, the sequence may then be modified by additional
enzymes
which are herein termed tailoring enzymes. These enzymes alter the amino acids
in the
compound without altering the number or the specific amino acids present
within the
compound. Such tailoring enzymes may include, but are not limited to, arginine
cyclase,
an 0-mannosyltransferase, a phenylalanine C-methyltransferase, a first
isovaleryl
transferase, and a second isovaleryl transferase. The present invention
permits specific
changes to be made, either by site directed mutagenesis or replacement, to
genetically
modify the peptide core. The modifications may be made in a rational manner to
improve the biological activity of the peptide produced by the strain or to
direct
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
47
synthesis of compounds that are structurally related to the peptide. The
invention also
allows for the tailoring enzymes to be used for biotransformation experiments
to
produce enzymes to modify and possibly improve other useful compounds. As to
modified NRPS products, see Velkov et al., Non-ribosomal peptide synthetases
as
technological platforms for the synthesis of highly modified peptide
bioeffectors--,
Cyclosporin synthetase as a complex example, Biotechnol Annu Rev. 2003;9:151-
97.
The determination of the NRPS from M. ruminantium also enables one of ordinary
skill
in the art to clone and express the pathway into a heterologous organism. Any
organism
may be used, although preferably a bacterial strain is used. The choice of
organism is
dependent upon the needs of the skilled artisan. For example, a strain that is
amenable
to genetic manipulation may be used in order to facilitate modification and
production of
the peptide product. The present invention advantageously permits specific
changes to
be made to individual modules of NRPS, either by site directed mutagenesis or
replacement, to genetically modify the peptide core. Additionally, the NRPS
modules
can be used to modify other NRPS that direct the synthesis of other useful
peptides
through module swapping. For example, the module in NRPS that incorporates
tyrosine
into the peptide core of the product may be modified so as to incorporate a
serine in its
place.
The activity of any NRPS disclosed herein may be evaluated using any method
known
in the art. For example, specific modifications to the polypeptide sequence
may be
produced to alter the final product. Other non-limiting examples of studies
that may be
conducted with these proteins include (i) evaluation of the biological
activity of a protein
and (ii) manipulation of a synthetic pathway to alter the final product from
microbes.
Genetic manipulations and expression of the polypeptides discussed herein may
be
conducted by any method known in the art. For example, the effect of point
mutations
may be evaluated. The mutations may be produced by any method known in the
art. In
one specific method, the manipulations and protein expression may be conducted
using
a vector that comprises at least one origin of replication. The origins of
replication allow
for replication of the polynucleotide in the vector in the desired cells.
Additionally, the
vector may comprise a multiple cloning site that allows for the insertion of a
heterologous nucleic acid that may be replicated and transcribed by a host
cell. In one
particular aspect, conjugation can be used for the direct transfer of nucleic
acid from
one prokaryotic cell to another via direct contact of cells. The origin of
transfer is
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
48.
determined by a vector, so that both donor and recipient cells obtain copies
of the
vector. Transmissibility by conjugation is controlled by a set of genes in the
tra region,
which also has the ability to mobilize the transfer of chromosomes when the
origin of
transfer is integrated into them (Pansegrau et at., J. Mol. Biol., 239:623-
663, 1994; Fong
and Stanisich, J. Bact., 175:448-456, 1993).
The vector described previously may be used to assess the biological activity
of the
NRPS. The vector may be used to alter a polypeptide, either by partial or
complete
removal of the polynucleotide sequence encoding the protein, or by disruption
of that
sequence. Evaluation of the products produced when the altered polypeptide is
present
is useful in determining the functionality of the polypeptide. As discussed
above,
specific polypeptides within the biochemical pathway may be modified to assess
the
activity of the compounds produced by these altered polypeptides and to
determine
which sections of the product are important for activity and function. The
present
invention contemplates any method of altering any of the NRPS of the present
invention. More specifically, the invention contemplates any method that would
insert
amino acids, delete amino acids or replace amino acids in the polypeptides of
the
invention. Additionally, a whole domain in a module in a NRPS may be replaced.
Therefore, for example, the acylation domain that incorporates tyrosine into
the final
product may be replaced with a domain that incorporates serine. The
modifications may
be performed at the nucleic acid level. These modifications may be performed
by
standard techniques and are well known within the art. Upon production of the
polynucleotide encoding the modified polypeptide, the amino acid sequence can
be
expressed in a host cell. Then the host cell can be cultured under conditions
that permit
production of a product of the altered pathway. Once the product is isolated,
the activity
of the product may be assessed using any method known in the art. The activity
can be
compared to the product of the non-modified biosynthetic pathway and to
products
produced by other modifications. Correlations may be drawn between specific
alterations and activity. For example, it may be determined that an active
residue at a
specific position may increase activity. These types of correlations will
allow one of
ordinary skill to determine the most preferred product structure for specified
activity. As
to modification techniques for NRPS, see Durfahrt et al., Functional and
structural basis
for targeted modification of non-ribosomal peptide synthetases, Ernst Schering
Res
Found Workshop. 2005;(51):79-106.
=
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
49
The present invention also contemplates a method for using an intergeneric
vector to
manipulate, modify, or isolate a protein involved in the synthesis of a
specific product.
For example, the vector of the present invention may be used to alter an
enzyme which
is involved in incorporation of an alanine residue into a peptide, so that a
tyrosine
residue is incorporated instead. The effect of this modification on peptide
function may
be then be evaluated for biological efficacy. In the above example,
modifications to the
enzyme may include, but are not limited to, removal of amino acids and/or
sequences
that specifically recognize alanine and/or incorporation of amino acids and/or
sequences that specifically recognize tyrosine. Therefore, in general terms,
the vector
of the present invention may be used to alter a gene sequence by insertion of
nucleic
acid sequences, deletion of nucleic acid sequences, or alteration of specific
bases
within a nucleic acid sequence to alter the sequence of a polypeptide of
interest;
. thereby producing a modified protein of interest. Preferably, the
polypeptide of interest
is involved in the synthesis of a compound of interest. The method of
modifying a
protein may comprise (i) transfecting a first microbial cell with the vector
of the present
invention, (ii) culturing the first microbial cell under conditions that allow
for replication of
the vector, (iii) conjugating the first microbial cell with a second microbial
cell under
conditions that allow for the direct transfer of the vector from the first
microbial cell to
the second microbial cell, and (iv) isolating the second microbial cell
transformed with
the vector. Other method of vector transfer are also contemplated and
disclosed herein.
=
Antibodies and vaccines
The antibodies of the invention may be produced using methods which are
generally
known in the art. In particular, purified peptides, polypeptides, or
polynucleotides may
be used to produce antibodies in accordance with known methods. Such
antibodies
may include, but are not limited to, polyclonal, monoclonal, chimeric, and
single chain
antibodies, Fab fragments, and fragments produced by a Fab expression library.
Neutralizing antibodies, (i.e., those which inhibit function) are especially
preferred for
use with vaccines.
For the production of antibodies, various hosts including goats, rabbits,
rats, mice,
humans, and others, may be immunized by injection with a peptide, polypeptide,
polynucleotide, or any fragment thereof which has immunogenic properties.
Depending
on the host species, various adjuvants may be used to increase immunological
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
response. Such adjuvants include, but are not limited to, Freund's, mineral
gels such as
aluminium hydroxide, and surface active substances such as lysolecithin,
pluronic
polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and
dinitrophenol. Among adjuvants used in humans, BCG (bacilli Calmette-Guerin)
and
5 Corynebacterium parvum are especially preferable.
,
It is preferred that the peptides, polypeptides, or fragments used to induce
antibodies
have an amino acid sequence comprising at least five amino acids and more
preferably
at least 10 amino acids. It is also preferable that they are identical to a
portion of the
10 amino acid sequence of the natural protein, and they may contain the
entire amino acid
sequence of a small, naturally occurring molecule. Short stretches of amino
acids may
be fused with those of another protein such as keyhole limpet hemocyanin and
antibody
produced against the chimeric molecule.
15 Monoclonal antibodies may be prepared using any technique which provides
for the
production of antibody molecules by continuous cell lines in culture. These
include, but
are not limited to, the hybridoma technique, the human B-cell hybridoma
technique, and
the EBV-hybridoma technique (Kohler, G. et al. (1975) Nature 256A95-497;
Kozbor, D.
et at. (1985) J. lmmunol. Methods 81:31-42; Cote, R. J. et at. (1983) Proc.
Natl. Acad.
20 Sci. 80:2026-2030; Cole, S. P. et al. (1984) Mol. Cell Biol. 62:109-
120).
In addition, techniques developed for the production of "chimeric antibodies",
e.g., the
combining of mouse antibody genes and human antibody genes to obtain a
molecule
with appropriate antigen specificity and biological activity can be used
(Morrison, S. L.
25 et at. (1984) Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M.S. et
al. (1984) Nature
312:604-608; Takeda, S. et al. (1985) Nature 314:452-454). Alternatively,
techniques
described for the production of single chain antibodies may be adapted, using
methods
known in the art, to produce specific single chain antibodies. Antibodies with
related
specificity, but of distinct idiotypic composition, may be generated by chain
shuffling
30 from random combinatorial immunoglobin libraries (Burton D. R. (1991)
Proc. Natl.
Acad. Sci. 88:11120-3).
Those of skill in the art to which the invention relates will appreciate the
terms
"diabodies" and "triabodies". These are molecules which comprise a heavy chain
35 variable domain (VH) connected to a light chain variable domain (VL) by
a short peptide
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
51
linker that is too short to allow pairing between the two domains on the same
chain.
This promotes pairing with the complementary domains of one or more other
chains
and encourages the formation of dimeric or trimeric molecules with two or more
functional antigen binding sites. The resulting antibody molecules may be
monospecific
or multispecific (e.g., bispecific in the case of diabodies). Such antibody
molecules may
be created from two or more antibodies using methodology standard in the art
to which
the invention relates; for example, as described by Todorovska et al. (Design
and
application of diabodies, triabodies and tetrabodies for cancer targeting. J.
lmmunol.
Methods. 2001 Feb 1;248(1-2):47-66).
Antibodies may also be produced by inducing in vivo production in the
lymphocyte
population or by screening immunoglobulin libraries or panels of highly
specific binding
reagents as disclosed in the literature (Orlandi, R. et al. (1989) Proc. Natl.
Acad. Sci.
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299).
Antibody fragments which contain specific binding sites may also be generated.
For
example, such fragments include, but are not limited to, the F(alp')2
fragments which can
be produced by pepsin digestion of the antibody molecule and the Fab fragments
which
can be generated by reducing the disulfide bridges of the F(a131)2 fragments.
Alternatively, Fab expression libraries may 6e constructed to allow rapid and
easy
identification of monoclonal Fab fragments with the desired specificity (Huse,
W. D. et
al. (1989) Science 254:1275-1281).
Various immunoassays may be used for screening to identify antibodies having
binding
specificity. Numerous protocols for competitive binding or immunoradiometric
assays
using either polyclonal or monoclonal antibodies with established
specificities are well
known in the art. Such immunoassays typically involve the measurement of
complex
formation between a peptide, polypeptide, or polynucleotide and its specific
antibody. A
two-site, monoclonal-based immunoassay utilizing monoclonal antibodies
reactive to
two non-interfering epitopes is preferred, but a competitive binding assay may
also be
employed (Maddox, supra).
The antibodies described herein have the ability to target and/or inhibit
cells and are
also useful as carrier molecules for the delivery of additional inhibitory
molecules into
microbial cells. The chemistry for coupling compounds to amino acids is well
developed
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
52
and a number of different molecule types could be linked to the antibodies.
The most
common coupling methods rely on the presence of free amino (alpha-amino or
Lys),
sufhydryl (Cys), or carboxylic acid groups (Asp, Glu, or alpha-carboxyl).
Coupling
methods can be used to link the antibody to the cell inhibitor via the carboxy-
or amino-
terminal residue. In some cases, a sequence includes multiple residues that
may react
with the chosen chemistry. This can be used to produce multimers, comprising
more
than one cell inhibitor. Alternatively, the antibody can be shortened or
chosen so that
reactive residues are localized at either the amino or the carboxyl terminus
of the
sequence.
For example, a reporter molecule such as fluorescein can be specifically
incorporated at
a lysine residue (Ono et al., 1997) using N-a-Fmoc-Ne-1-(4,4-dimethy1-2,6
dioxocyclohex-1-ylidene-3-methylbuty1)-L-lysine during polypeptide synthesis.
Following
synthesis, 5- and 6-carboxyfluorescein succinimidyl esters can be coupled
after 4,4-
dimethy1-2,6 dioxocyclohex-1-ylidene is removed by treatment with hydrazine.
Therefore coupling of an inhibitory molecule to the antibody can be
accomplished by
inclusion of a lysine residue to the polypeptide sequence, then reaction with
a suitably
derivatised cell inhibitor.
EDC (1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride) or the
carbodiimide coupling method can also be used. Carbodiimides can activate the
side
chain carboxylic groups of aspartic and glutamic acid as well as the carboxyl-
terminal
group to make them reactive sites for coupling with primary amines. The
activated
antibody is mixed with the cell inhibitor to produce the final conjugate. If
the cell inhibitor
is activated first, the EDC method will couple the cell inhibitor through the
N-terminal
alpha amine and possibly through the amine in the side-chain of Lys, if
'present in the
sequence.
m-Maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) is a heterobifunctional
reagent
that can be used to link an antibody to cell inhibitors via cysteines. The
coupling takes
place with the thiol group of cysteine residues. If the chosen sequence does
not contain
Cys it is common to place a Cys residue at the N- or C-terminus to obtain
highly
controlled linking of the antibody to the cell inhibitor. For synthesis
purposes, it may be
helpful for the cysteine to be placed at the N-terminus of the antibody. MBS
is
particularly suited for use with the present invention.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
53
Glutaraldehyde can be used as a bifunctional coupling reagent that links two
compounds through their amino groups. Glutaraldehyde provides a highly
flexible
spacer between the antibody and cell inhibitor for favorable presentation.
Glutaraldehyde is a very reactive compound and will react with Cys, Tyr, and
His to a
limited extent. The glutaraldehyde coupling method is particularly useful when
a
polypeptide contains only "a single free amino group at its amino terminus. If
the
antibody contains more than one free amino group, large multimeric complexes
can be
formed.
In one aspect, the antibodies of the invention can be fused (e.g., by in-frame
cloning) or
linked (e.g., by chemical coupling) to cell inhibitors such as antimicrobial
agents.
Included among these are antimicrobial peptides, for example,
bactericidal/permeability-
increasing protein, cationic antimicrobial proteins, lysozymes, lactoferrins,
and
cathelicidins (e.g., from neutrophils; see, e.g., Hancock and Chapple, 1999,
Antinnicrob.
Agents Chemother.43:1317-1323; Ganz and Lehrer, 1997, Curr. Opin. Hematol.
4:53-
58; Hancock et al., 1995, Adv. Microb. Physiol. 37:135-175). Antimicrobial
peptides
further include defensins (e.g., from epithelial cells or neutrophils) and
platelet
microbiocidal proteins (see, e.g., Hancock and Chapple, 1999, Antimicrob.
Agents
Chemother.43:1317-1323). Additional antimicrobial peptides include, but are
not limited
to, gramicidin S, bacitracin, polymyxin B, tachyplesin, bactenecin (e.g.,
cattle
bactenecin), ranalexin, cecropin A, indolicidin (e.g., cattle indolicidin),
and nisin (e.g.,
bacterial nisin).
Also included as antimicrobial agents are ionophores, which facilitate
transmission of an
ion, (such as sodium), across a lipid barrier such as a cell membrane. Two
ionophore
compounds particularly suited to this invention are the RUMENSINTm (Eli Lilly)
and
Lasalocid (Hoffman LaRoche). Other ionophores include, but are not limited to,
salinomycin, avoparcin, aridcin, and actaplanin. Other antimicrobial agents
include
MonensinTm and azithromycin, metronidazole, streptomycin, kanamycin, and
penicillin,
as well as, generally, 11-lactams, aminoglycosides, macrolides,
chloramphenicol,
novobiocin, rifampin, and fluoroquinolones (see, e.g., Horn et al., 2003,
Applied
Environ. Microbiol. 69:74-83; Eckburg et al., 2003, Infection Immunity 71:591-
596;
Gijzen et al., 1991, Applied Environ. Microbiol. 57:1630-1634; Bonelo et al.,
1984,
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
54
FEMS Microbiol. Lett. 21:341-345; Huser et al., 1982, Arch. Microbiol. 132:1-
9; Hilpert
et al., 1981, Zentbl. Bakteriol. Mikrobiol. Hyg. 1 Abt Orig. C 2:21-31).
Particularly useful inhibitors are compounds that block or interfere with
methanogenesis, including bromoethanesulphonic acid, e.g., 2-
bromoethanesulphonic
acid (BES) or a salt thereof, for example, a sodium salt. Sodium molybdate
(Mo) is an
inhibitor of sulfate reduction, and can be used with bromoethanesulphonic
acid. Other
anti-methanogenesis compounds include, but are not limited to, nitrate,
formate, methyl
fluoride, chloroform, chloral hydrate, sodium sulphite, ethylene and
unsaturated
hydrocarbons, acetylene, fatty acids such as linoleic and cis-oleic acid,
saturated fatty
acids such as behenic and stearic acid, and, also ,Iumazine (e.g., 2,4-
pteridinedione).
Additional compounds include 3-bromopropanesulphonate (BPS), propynoic acid,
and
ethyl 2-butynoate.
Further included as antimicrobial agents are lytic enzymes, including phage
lysozyme,
endolysin, lysozyme, lysin, phage lysin, muralysin, murannidase, and
virolysin. Useful
enzymes exhibit the ability to hydrolyse specific bonds in the bacterial cell
wall.
Particular lytic enzymes include, but are not limited to, glucosaminidases,
which
hydrolyse the glycosidic bonds between the amino sugars (e.g., N-acetylmuramic
acid
and N-acetylglucosannine) of the peptidoglycan, amidases, which cleave the N-
acetylmuramoyl-L-alanine amide linkage between the glycan strand and the cross-
linking peptide, and endopeptidases, which hydrolyse the interpeptide linkage
(e.g.,
cysteine endopeptidases) and endoisopeptidases that attack pseudomurein of
methanogens from the family Methanobacteriaceae.
Additionally, PNAs are included as antimicrobial agents. PNAs are peptide-
nucleic acid
hybrids in which the phosphate backbone has been replaced by an achiral and
neutral
backbone made from N-(2-aminoethyl)-glycine units (see, e.g., Eurekah
Bioscience
Collection. PNA and Oligonucleotide Inhibitors of Human Telomerase. G. Gavory
and S.
Balasubramanian, Landes Bioscience, 2003). The bases A, G, T, C are attached
to the
amino nitrogen on the backbone via methylenecarbonyl linkages (P.E. Nielsen et
al.,
Science 1991. 254: 1497-1500; M. Egholm et al., Nature 1993. 365: 566-568).
PNAs
bind complementary sequences with high specificity, and higher affinity
relative to
analogous DNA or RNA (M. Egholm et al., supra). PNA/DNA or PNA/RNA hybrids
also
exhibit higher thermal stability compared to the corresponding DNA/DNA or
DNA/RNA
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
duplexes (M. Egholm et al., supra). PNAs also possess high chemical and
biological
stability, due to the unnatural amide backbone that is not recognized by
nucleases Or
proteases (V. Demidov et al., Biochem Pharmacol 1994. 48: 1310-1313).
Typically,
PNAs are at least 5 bases in length, and include a terminal lysine. PNAs may
be
= 5 pegylated to further extend their lifespan (Nielsen, P. E. et al.
(1993) Anticancer Drug
Des. 8:53-63).
In one particular aspect, the antibodies of the invention can be fused or
linked to other
antibodies or fragments thereof. The added antibodies or antibody fragments
can be
10 directed to microbial cells, or particularly methanogen cells, or one or
more cell
components. For example, cell surface proteins, e.g., extracellular receptors,
can be
targeted. In certain aspects, the antibodies or antibody fragments can be
engineered
with sequences that are specifically expressed in subjects, for example, human
or
ruminant .sequences. Also included are chimeric antibodies, for example,
monoclonal
15 antibodies or fragments thereof that are specific to more than one
source, e.g., one or
more mouse, human, or ruminant sequences. Further included are camelid
antibodies
or nanobodies.
The antibodies of the invention find particular use in targeting a microbial
cell, in
20 particular, a methanogen cell. In certain aspects, the antibodies can be
used to
associate with or bind to the cell wall or membrane and/or inhibit growth or
replication of
the cell. As such, the antibodies can be used for transient or extended
attachment to the
cell, or to mediate sequestration or engulfment of the cell, and/or lysis. To
effect
targeting, the microbial cell can be contacted with an antibody as isolated
from a host
25 organism, or -produced by expression vectors and/or host cells, or
synthetic or semi-
synthetic chemistry as described in detail herein. Alternately, the antibodies
can be
produced by the host organism itself in response to the administration or the
peptides,
polypeptides, or polynucleotides disclosed herein. It is understood that the
antibodies of
the invention, as well as the corresponding polynucleotides, expression
vectors, host
30 .. cells, peptides, and polypeptides, can be used to target various
microbes, for example,
Methanobrevibacter ruminantium, which is the primary methanogen in ruminants,
and
Methanobrevibacter smithii, which is the primary methanogen in humans. In
particular
aspects, the antibodies, or corresponding polynucleotides, expression vectors,
host
cells, peptides, or polypeptides, are delivered to subjects as a composition
described in
35 detail herein, for example, through use of a slow-release ruminal
device.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
56
In various aspects, the agents of the invention (e.g., one or more peptides,
polypeptides, polynucleotides, and antibodies) can be included in a
composition, for
example, a pharmaceutical composition, and especially a vaccine composition.
The
composition comprises, for example: a) an isolated peptide or alteration,
fragment,
variant, or derivative thereof; b) an isolated polypeptide, or an alteration,
fragment,
variant, or derivative thereof; c) an isolated polynucleotide, or an
alteration, fragment,
variant, or derivative thereof; d) an expression vector comprising this
polynucleotide; e)
a host cell comprising this expression vector; or (f) an antibody, or an
alteration,
10= fragment, variant, or derivative thereof. The compositions of the
invention can be
specifically packaged as part of kits for targeting, and/or. inhibiting
microbial cells,
especially methanogen cells, in accordance with the disclosed methods. The
kits
comprise at least one composition as set out herein and instructions for use
in targeting
cells or inhibiting cell growth or replication, for methanogens or other
microbes.
For vaccines, a number of approaches can be used to increase antigen
immunogenicity, for example, by use of antigen particles; antigen polymers and
polymerization; emulsifying agents; microencapsulation of antigens; killed
bacteria and
bacterial products; chemical adjuvants and cytokines; and agents for targeting
antigens
to antigen presenting cells (reviewed in Paul, Fundamental Immunology, 1999,
Lippincott-Raven Publishers, New York, NY, p. 1392-1405).
To render antigens particulate, alum precipitation can be used. With the use
of
aluminium hydroxide or aluminium phosphate, the antigen in question becomes
incorporated into an insoluble, gel-like precipitate or else is bound to
preformed gel by
electrostatic interactions. Antigens can be subjected to mild heat
aggregation. Antigens
exhibiting self-assembly can also be used. Liposomes, virosomes, and
immunostaining
complexes (ISCOMs) are also useful for forming particulates.
To promote polymerization, nonionic block copolymers can be used as additives
to
adjuvants, e.g., polymers or polyoxypropylene and polyoxyethylene, with which
antigen
can be associated. These are found as components of complex adjuvant
formulations
by both Syntex (SAF-1, Syntex Adjuvant Formulation-1) and Ribi Chemical Co.
Carbohydrate polymers of mannose (e.g., mannan) or of 131-3 glucose (e.g.,
glucan)
can be used in similar fashion (Okawa Y, Howard CR, Steward MW. Production of
anti-
57
peptide antibody in mice following immunization of mice with peptides
conjugated to
merman. J Immunol Methods 1992;142:127-131; Ohta M, Kido N, Hasegawa T, et al.
Contribution of the mannan side chains to the adjuvant action of
lipopolysaccharides.
Immunology 1987;60:503-507).
Various agents can be used for emulsification, including water-in-oil
emulsions, such as
Freund's adjuvants (e.g., Freund's incomplete adjuvant), or other mixtures
comprising
tiny droplets of water stabilized by a surfactant such as mannide monooleate
in a
continuous phase of mineral oil or other oils, such as squalane. An
alternative approach
is to use oil-in-water emulsions, such as MF5963 (Chiron), or other mixtures
comprising
TM
oil droplets of squalene and a mixture of emulsifying agents TVVEEN80 and
SPAN85,
and chemical innmunomodulators such as derivatives or muramyl dipeptide, e.g.,
muramyl tripeptide-phosphatidyl ethanolamine (MTP-PE) (Valensi J-PM, Carlson
JR,
Van Nest GA. Systemic cytokine profiles in Balb/c mice immunized with
trivalent
influenza vaccine containing MF59 oil emulsion and other advanced adjuvants. J
Immunol 1994;153:4029-4039). Small amounts of polysorbate 80 and sorbitan
trioleate
can also be used in the mixtures. As another example, SAF-165 (Syntex) can be
used,
or other oil-in-water mixtures comprising Pluronic L121, squalene, and
TVVEEN80.
Microcapsules, in particular, biodegradable microcapsules, can be used to
prepare
controlled-release vaccines (Chang TMS. Biodegradable, semi-permeable
microcapsules containing enzymes hormones, vaccines and other biologicals. J
Bioeng
1976;1:25-32; Langer R. Polymers for the sustained release of macromolecules:
their
use in a single step method of immunization. Methods Enzymol 1981;73:57-75).
Cyanoacrylates are another form of biodegradable polymer. For example,
poly(buty1-2-
cyanoacrylate) can be used as an adjuvant for oral immunization (O'Hagan DT,
Palin
KJ, Davis SS. Poly (butyl-2-oyanoacrylate) particles as adjuvants for oral
immunization.
Vaccine 1989;7:213-216). Microcapsules are useful for the mucosal
administration of
vaccines. Particles of very small size (nanoparticles) are particularly
suitable. Digestion
in the stomach can be countered by enteric coated polymers, and coating with
substances that increase intestinal absorption, as needed.
Various bacteria, other than killed M. tuberculosis, can be used as adjuvants.
Where the
killed bacterial preparation is itself highly antigenic, the adjuvant
properties extend to
the co-administered antigen. Useful organisms include Bordetella pertussis,
CA 2772224 2019-03-25
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
58
Corynebacterium parvum, and Nippostrongylus brasiliensis. Peptide and lipid
components of bacteria can also be used. Exemplary components include
acetylmuramyl-L-alanyl-D-isoglutamine, or murannyl dipeptide (MDP) (Ellouz F,
Adam
A, Ciorbaru R, Lederer E. Minimal structural requirements for adjuvant
activity of
bacterial peptidoglycans. Biochem Biophys Res Commun 1974;59:1317-1325), MDP
(murabutide) (Chedid L, Parant MA,. Audibert FM, et al. Biological activity of
a new
synthetic murannyl dipeptide devoid of pyrogenicity. Infect Immun 1982;35:417-
424),
threonyl MDP (Allison AC, Byars NE. An adjuvant formulation that selectively
elicits the
formation of antibodies of protective isotypes and cell-mediated immunity. J
Immunol
Methods 1986;95157-168), and MTP-PE. Lipid adjuvants can comprise LPS
endotoxins of gram-negative bacteria, such as Escherichia, Salmonella, and
Pseudomonas. In certain approaches, the lipid A structure can be chemically
modified
to lower toxicity but retain adjuvanticity, e.g., as for monophosphoryl lipid
A (MPL)
(Johnson AG, Tomai M, Solem L, Beck L, Ribi E. Characterization of non-toxic
monophosphoryl lipid. Rev Infect Dis 1987;9:S512).
Various chemicals can be used as adjuvants, including polynucleotides, such as
poly-
I:C and poly-A:U, vitamin 03, dextran sulphate, inulin, dimethyl dioctadecyl
ammonium
bromide (DDA), avridine, carbohydrate polymers similar to mannan, and
trehalose
dimycolate (Morein B, Lovgren-Bengtsson K, Cox J. Modern adjuvants: functional
aspects. In: Kaufmann SHE, ed. Concepts in vaccine development. Berlin: Walter
de
Gruyter, 1996:243-263). Also included are polyphosphazines (initially
introduced as
slow release-promoting agents) and a Leishmania protein, LelF. Cytokines can
also be
used as adjuvants, for example, IL-2, IL-4, IL-6, IL-10, GM-CSF, and IFN-g.
For targeting antigen presenting cells, C3d .domains, Fc domains, and CTB
domains
can be used (Dempsey PW, Allison MED, Akkaraju S, Goodnow CC, Fearon DT. C3d
of complement as a molecular adjuvant: bridging innate and acquired immunity.
Science 1996;271:348-350; Sun J-B, Holmgren J, Czerkinsky C. Cholera toxin B
subunit: an efficient transmucosal carrier-delivery system for induction of
peripheral
immunological tolerance. Proc Natl Acad Sci USA 1994;91:10795-10799; Sun J-B,
Rask C, Olsson T, Holmgren J, Czerkinsky C. Treatment of experimental
autoinnmune
encephalomyelitis by feeding myelin basic protein conjugated to cholera toxin
B subunit.
Proc Natl Acad Sci USA 1996;93:7196-7201).
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
59
Specific adjuvants for mucosal delivery, e.g., CT, LT, and Fragment C of
tetanus toxin,
can also be used (Elson CJ, Ea!ding W. Generalized systemic and mucosal
immunity in
mice after mucosal stimulation with cholera toxin. J Immunol 1984;132:2736-
2743;
Holmgren J, Lycke N, Czerkinsky C. Cholera toxin and cholera B subunit as oral-
mucosal adjuvant and antigen vector systems. Vaccine 1993;11:1179-1184;
Clements
JD, Hartzog NM, Lyon FL. Adjuvant activity of Escherichia colt heat-labile
enterotoxin
and effect on the induction of oral tolerance in mice to unrelated protein
antigens.
Vaccine 1988;6:269-277 Gomez-Duarte OG, Galen J, Chatfield SN, Rappuoli R,
Eidels L, Levine MM. Expression of fragment C of tetanus toxin fused to a
carboxyl-
terminal fragment of diphtheria toxin in Salmonella typhi CVD 908 vaccine
strain.
Vaccine 1995;13:1596-1602).
Therapeutics and diagnostics
The peptides, polypeptides, polynucleotides, and antibodies of the present
invention are
considered to have health benefits. In particular aspects, vaccines that
target
methanogens can be used to restore energy to the subject that is normally lost
as
methane. The invention therefore relates to a pharmaceutical composition
(especially a
vaccine composition) in conjunction with a pharmaceutically acceptable
carrier, for use
with any of the methods discussed above. Such pharmaceutical compositions may
comprise a peptide, polypeptide, or antibody in combination with a cell
inhibitor.
Alternatively, the pharmaceutical compositions may comprise a polynucleotide,
expression vector, or host cell as described in detail herein. The
compositions may be
administered alone or in combination with at least one other agent, such as
stabilizing
compound, which may be administered in any sterile, biocompatible
pharmaceutical
carrier, including, but not limited to, saline, buffered saline, dextrose, and
water. The
compositions may be administered to a subject alone, or in combination with
other
agents, drugs (e.g., antimicrobial drugs), or hormones.
In addition to the active ingredients, these pharmaceutical compositions may
contain
suitable pharmaceutically acceptable carriers comprising excipients and
auxiliaries
which facilitate processing of the active compounds into preparations which
can be
=used pharmaceutically. Further details on techniques for formulation and
administration
may be found in the latest edition of Remington's Pharmaceutical Sciences
(Maack
Publishing Co., Easton, PA). The pharmaceutical compositions utilized in this
invention
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
may be administered by any number of routes including, but not limited to,
oral,
intravenous, intramuscular, intra-arterial, intramedullary, intrathecal,
intraventricular,
_
transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical,
sublingual, or
rectal means.
5
Pharmaceutical compositions for oral administration can be formulated using
pharmaceutically acceptable carriers well known in the art in dosages suitable
for oral
administration. Such carriers enable the pharmaceutical compositions to be
formulated
as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries,
suspensions, and the
10 like, for ingestion by the subject. Pharmaceutical preparations for
oral use can be
obtained through combination of active compounds with solid excipient,
optionally
grinding a resulting mixture, and processing the mixture of granules, after
adding
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable
excipients are
carbohydrate or protein fillers, such as sugars, including lactose, sucrose,
mannitol, or
15 sorbitol; starch from corn, wheat, rice, potato, or other plants;
cellulose, such as methyl
cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose;
gums
including arabic and tragacanth; and proteins such as gelatin and collagen. If
desired,
disintegrating or solubilising agents may be added, such as the crosslinked
polyvinyl
pyrrolidone, agar, alginic acid, or a salt thereof, such as sodium alginate.
Pharmaceutical preparations which can be used orally include push-fit capsules
made
of gelatin, as well as soft, sealed capsules made of gelatin and a coating,
such as
glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed
with a filler or
binders, such as lactose or starches, lubricants, such as talc or magnesium
stearate,
and, optionally, stabilizers. In soft capsules, the active compounds may be
dissolved or
suspended in suitable liquids, such as fatty oils, liquid, or liquid
polyethylene glycol with
or without stabilizers. Dragee cores may be used in conjunction with suitable
coatings,
such as concentrated sugar solutions, which may also contain gum arabic, talc,
polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium
dioxide, lacquer
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or
pigments may
be added to the tablets or dragee coatings for product identification or to
characterize
the quantity of active compound, i.e., dosage.
Pharmaceutical formulations suitable for parenteral administration may be
formulated in
aqueous solutions, preferably in physiologically compatible buffers such as
Hanks'
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
61
solution, Ringer's solution, or physiologically buffered saline. Aqueous
injection
suspensions may contain substances which increase the viscosity of the
suspension,
such as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally,
suspensions
of the active compounds may be prepared as appropriate oily injection
suspensions.
Suitable lipophilic solvents or vehicles include fatty oils such as sesame
oil, or synthetic
fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Non-
lipid
polycationic amino polymers may also be used for delivery. Optionally, the
suspension
may also contain suitable stabilizers or agents which increase the solubility
of the
compounds to allow for the preparation of highly concentrated solutions. For
topical or
nasal administration, penetrants appropriate to the particular barrier to be
permeated
are used in the formulation. Such penetrants are generally known in the art.
The pharmaceutical compositions of the present invention may be manufactured
in a
manner that is known in the art, e.g., by means of conventional mixing,
dissolving,
granulating, dragee-making, levigating, emulsifying, encapsulating,
entrapping, or
lyophilizing processes. The pharmaceutical composition may be provided as a
salt and
can be formed with many acids, including but not limited to, hydrochloric,
sulfuric,
acetic, lactic, tartaric, malic, succinic, etc. Salts tend to be more soluble
in aqueous or
other protonic solvents than are the corresponding free base forms. In other
cases, the
preferred preparation may be a lyophilized powder which may contain any or all
of the
following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% mannitol, at a pH
range of
4.5 to 5.5, that is combined with buffer prior to use. After pharmaceutical
compositions
have been prepared, they can be placed in an appropriate container and labeled
for
treatment of an indicated condition. For administration of a composition of
the invention,
such labeling would include amount, frequency, and method of administration.
Pharmaceutical compositions suitable for use in the invention include
compositions
wherein the active ingredients are contained in an effective amount to achieve
the
intended purpose. For any compound, the therapeutically effective dose can be
estimated initially either in cell assays, e.g., in microbial cells, or in
particular, in
methanogen cells, or in animal models, usually mice, rabbits, dogs, or pigs,
or in
ruminant species such as sheep, cattle, deer, and goats. The animal model may
also be
used to determine the appropriate concentration range and route of
administration.
Such information can then be used to determine useful doses and routes for
administration. Normal dosage amounts may vary from 0.1 to 100,000 micrograms,
up
CA 2772224 2017-03-13
62 =
to a total dose of about 1 g, depending upon the route of administration.
Guidance as to
particular dosages and methods of delivery is provided in the literature and
generally
available to practitioners in the art. Those skilled in the art will employ
different
formulations for polynucleotides than for polypeptides. Similarly, delivery of
peptides, or
polypeptides, polynudeotides, or = antibodies will be specific to particular
cells,
conditions, locations, etc.
The exact dosage will be determined by the practitioner, in light of factors
related to the
subject that requires treatment. Dosage and administration are adjusted to
provide
sufficient levels of the active agent or to maintain the desired effect.
Factors which may
be taken into account include the severity of the disease state, general
health of the
subject, age, weight, and gender, diet, time, and frequency of administration,
drug
= combination(s), reaction sensitivities, and tolerance/response to
therapy. Long-acting
pharmaceutical compositions may be administered every 3 to 4 days, every week,
or
. 15 once every two weeks depending on half-life and clearance rate of the
particular
formulation. The compositions can be co-administered with one or more
additional anti-
microbial agents, .including anti-methanogenesis
compounds (e.g.,
bromoethanesulphonic acid), antibodies and antibody fragments, lytic enzymes,
peptide
nucleic acids, antimicrobial peptides, and other antibiotics as described in
detail herein.
Co-administration can be simultaneous or sequential, or can alternate with
repeated =
= administration.
Particularly useful for the compositions of the invention (e.g.,
pharmaceutical =
compositions) are= slow release formulas or mechanisms. For example, intra-
ruminal
devices include, but are not limited to, Time Capsulew Bolus range by Agri-
Feeds Ltd.,
New Zealand, originally developed within AgResearch Ltd., New Zealand, as
disclosed
in WO 95/19763 and NZ 278977, and CAPTEC by Nufarm Health & Sciences, a
division of Nufarm Ltd., Auckland, New Zealand, as disclosed in AU 35908178,
PCT/AU81/100082, and Laby et al., 1984, Can. J. Anim. ScL 64 (Suppl.), 337-8.
As a particular example, the device can
include a spring and plunger which force the composition against a hole in the
end of a
barrel.
As a further embodiment, the invention relates to a composition for a water
supplement,
e.g., drenching composition, or food supplement, e.g., ruminant feed
component, for
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
63
use with any of the methods discussed above. In particular aspects, the food
supplement comprises at least one vegetable material that is edible, and a
peptide or
polypeptide of the invention. Alternatively, the food supplement comprises at
least one
vegetable material that is edible, and a polypeptide or peptide, or a
polynucleotide
encoding a peptide or polypeptide disclosed herein, for example, as an
expression
vector or host cell comprising the expression vector. In particular, the
composition
further includes a cell inhibitor, as fused or linked to the resultant
sequence. The
preferred vegetable material include any one of hay, grass, grain, or meal,
for example,
legume hay, grass hay, corn silage, grass silage, legume silage, corn grain,
oats,
barley, distillers grain, brewers grain, soy bean meal, and cotton seed meal.
In
particular, grass silage is useful as a food composition for ruminants. The
plant material
can be genetically modified to contain one or more components of the
invention, e.g.,
one or more polypeptides or peptides, polynucleotides, or vectors.
In another embodiment, antibodies which specifically bind the peptides,
polypeptides, or
polynucleotides of the invention may be used to determine the presence of
microbes,
especially methanogens, or in assays to monitor levels of such microbes. The
antibodies useful for diagnostic purposes may be prepared in the same manner
as
those described above. Diagnostic assays include methods which utilize the
antibody
and a label to detect a peptide or polypeptide in human body fluids or
extracts of cells or
tissues. The antibodies may be used with or without modification, and may be
labeled
by joining them, either covalently or non-covalently, with a reporter
molecule. A wide
variety of reporter molecules which are known in the art may be used, several
of which
are described above.
A variety of protocols for measuring levels of a peptide, polypeptide, or
polynucleotide
are known in the art (e.g., ELISA, RIA, and FACS), and provide a basis for
diagnosing
the presence or levels of a microbe, especially a methanogen. Normal or
standard
levels established by combining body fluids or cell extracts taken from normal
subjects,
e.g., normal humans or ruminants, with the antibody under conditions suitable
for
complex formation. The amount of standard complex formation may be quantified
by
various methods, but preferably by photometric means. Quantities of peptide,
polypeptide, or polynucleotide expressed in subject, control, and treated
samples (e.g.,
samples from vaccinated subjects) are compared with the standard values.
Deviation
=
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
64
between standard and subject values establishes the parameters for determining
the
presence or levels of the microbe.
In another embodiment of the invention, the polynucleotides may be used for
diagnostic
purposes using particular hybridization and/or amplification techniques. The
polynucleotides which may be used include oligonucleotides, complementary RNA
and
DNA molecules, and PNAs. The polynucleotides may be used to detect and
quantitate
gene expression in samples in which expression may be correlated with the
presence
or levels of a microbe: The diagnostic assay may be used to distinguish
between the
absence, presence, and alteration of microbe levels, and to monitor levels
during
therapeutic intervention.
In one aspect, hybridization with PCR probes may be used to identify nucleic
acid
sequences, especially genomic sequences, which encode the peptides or
polypeptides
of the invention. The specificity of the probe, whether it is made from a
highly specific
region, e.g., 10 unique nucleotides in the 5' regulatory region, or a less
specific region,
e.g., in the 3' coding region, and the stringency of the hybridization or
amplification
(maximal, high, intermediate, or low) will determine whether the probe
identifies only
naturally occurring sequences, alleles, or related sequences. Probes may also
be used
for the detection of related sequences, and should preferably contain at least
50% of
the nucleotides from any of the coding sequences. The hybridization probes of
the
subject invention may be DNA or RNA and derived from the nucleotide sequence
of
SEQ ID NO: 1-1718, or complements, or modified sequences thereof, or from
genomic
sequences including promoter and enhancer elements of the naturally occurring
sequence.
Means for producing specific hybridization probes for DNAs include the cloning
of
polynucleotides into vectors for the production of mRNA probes. Such vectors
are
known in the art, commercially available, and may be used to synthesize RNA
probes in
vitro by means of the addition of the appropriate RNA polymerases and the
appropriate
labeled nucleotides. Hybridization probes may be labeled by a variety of
reporter
groups, for example, radionuclides such as 32P or 33S, or enzymatic labels,
such as
alkaline phosphatase coupled to the probe via avidin/biotin coupling systems,
and the
like. The polynucleotides may be used in Southern or northern analysis, dot
blot, or
other membrane-based technologies; in PCR technologies; or in dipstick, pin,
ELISA
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
assays, or microarrays utilizing fluids or tissues from subject biopsies to
detect the
presence or levels of a microbe. Such qualitative or quantitative methods are
well
known in the art.
5 In a particular aspect, the polynucleotides may be useful in various
assays labelled by
standard methods, and added to a fluid or tissue sample from a subject under
conditions suitable for hybridization and/or amplification. After a suitable
incubation
period, the sample is washed and the signal is quantitated and compared with a
standard value. If the amount of signal in the test sample is significantly
altered from
10 that of a comparable control sample, the presence of altered levels of
nucleotide
sequences in the sample indicates the presence or levels of the microbe. Such
assays
may also be used to evaluate the efficacy of a particular vaccination regimen
in animal
studies, in clinical trials, or in monitoring the treatment of a subject.
15 In order to provide a basis for the diagnosis of the presence or levels
of a microbe, a
normal or standard profile for expression is established. This may be
accomplished by
combining body fluids or cell extracts taken from normal subjects, with a
polynucleotide
or a fragment thereof, under conditions suitable for hybridization and/or
amplification.
Standard levels may be quantified by comparing the values obtained from normal
20 subjects with those from an experiment where a known amount of a
substantially
purified polynucleotide is used. Standard values obtained from normal samples
may be
compared with values obtained from samples from subjects treated for microbial
growth. Deviation between standard and subject values is used to establish the
presence or levels of the microbe.
Once the microbe is identified and a vaccination protocol is initiated,
hybridization
and/or amplification assays may be repeated on a regular basis to evaluate
whether the
level of expression in the subject begins to decrease relative to that which
is observed
in the normal subject. The results obtained from successive assays may be used
to
show the efficacy of vaccination over a period ranging from several days to
months.
Particular diagnostic uses for oligonucleotides designed from the
polynucleotides may
involve the use of PCR. Such oligomers may be chemically synthesized,
generated
enzymatically, or produced in vitro. Oligomers will preferably consist of two
nucleotide
sequences, one with sense orientation (51.fwdarw.3') and another with
antisense
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
66
orientation (3'.fwdarw.5'), employed under optimized conditions for
identification of a
specific gene or condition. The same two oligomers, nested sets of oligomers,
or even a
degenerate pool of oligomers may be employed under less stringent conditions
for
detection and/or quantitation of closely related DNA or RNA sequences.
Methods which may also be used to quantitate expression include radiolabeling
or
biotinylating nucleotides, coamplification of a control nucleic acid, and
standard curves
onto which the experimental results are interpolated (Melby, P. C. et al.
(1993) J.
Immunol. Methods, 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 229-
236). The
speed of quantitation of multiple samples may be accelerated by running the
assay in
an ELISA format where the oligomer of interest is presented in various
dilutions and a
spectrophotometric or colorimetric response gives rapid quantitation.
In further embodiments, oligonucleotides or longer fragments derived from any
of the
polynucleotides described herein may be used as targets in a microarray. The
microarray can be used to monitor the expression level of large numbers of
genes
simultaneously (to produce a transcript image), and to identify genetic
variants,
mutations and polymorphisms. This information may be used to determine gene
function, to understand the genetic basis of disease, to diagnose disease, and
to
develop and monitor the activities of therapeutic agents. In one embodiment,
the
microarray is prepared and used according to methods known in the art such as
those
described in PCT application WO 95/11995 (Chee et al.), Lockhart, D. J. et al.
(1996;
Nat. Biotech. 14: 1675-1680) and Schena, M. et al. (1996; Proc. Natl. Acad.
Sci. 93:
10614-10619).
In one aspect, the oligonucleotides may be synthesized on the surface of the
microarray
using a chemical coupling procedure and an ink jet application apparatus, such
as that
described in PCT application WO 95/251116 (Baldeschweiler et al.). In another
aspect,
a "gridded" array analogous to a dot or slot blot (HYBRIDOT apparatus, Life
Technologies) may be used to arrange and link cDNA fragments or
oligonucleotides to
the surface of a substrate using a vacuum system, thermal, UV, mechanical or
chemical
bonding procedures. In yet another aspect, an array may be produced by hand or
by
using available devices, materials, and machines (including multichannel
pipettors or .
robotic instruments; Brinkmann, Westbury, N.Y.) and may include, for example,
24, 48,
96, 384, 1024, 1536, or 6144 spots or wells (e.g., as a multiwell plate), or
more, or any
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
67
other multiple from 2 to 1,000,000 which lends itself to the efficient use of
commercially
available instrumentation.
In order to conduct sample analysis using the microarrays, polynucleotides are
extracted from a biological sample. The biological samples may be obtained
from any
bodily fluid (blood, urine, saliva, phlegm, gastric juices, etc.), cultured
cells, biopsies, or
other tissue preparations. To produce probes, the polynucleotides extracted
from the
sample are used to produce polynucleotides which are complementary to the
nucleic
acids on the microarray. If the microarray consists of cDNAs, antisense RNAs
are
appropriate probes. Therefore, in one aspect, mRNA is used to produce cDNA
which, in
turn and in the presence of fluorescent nucleotides, is used to produce
fragments or
antisense RNA probes. These fluorescently labeled probes are incubated with
the
microarray so that the probe sequences hybridize to the cDNA oligonucleotides
of the
microarray. In another aspect, polynucleotides used as probes can include
polynucleotides, fragments, and complementary or antisense sequences produced
using restriction enzymes, PCR technologies, and oligolabeling kits (Amersham
= Pharmacia Biotech) well known in the area of hybridization technology.
In another embodiment of the invention, the peptides or polypeptides of the
invention or
functional or immunogenic fragments or oligopeptides thereof, can be used for
screening libraries of compounds in any of a variety of drug screening
techniques. The
fragment employed in such screening may be free in solution, affixed to a
solid support,
borne on a cell surface, or located intracellularly. The formation of binding
complexes, -
between the peptide or polypeptide and the agent being tested, may be
measured.
One technique for drug screening which may be used provides for high
throughput
screening of compounds having suitable binding affinity to the peptide or
polypeptide of
interest as described in published PCT application WO 84/03564. In this
method, large
numbers of different small test compounds are synthesized on a solid
substrate, such
as plastic pins or some other surface. The test compounds are reacted with the
peptide
or polypeptide, or fragments thereof, and washed. Bound peptide or polypeptide
is then
detected by methods well known in the art. Purified peptide or polypeptide can
also be
coated directly onto plates for use in the aforementioned drug screening
techniques.
Alternatively, non-neutralizing antibodies can be used to capture the peptide
and
immobilize it on a solid support.
=
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
68
In another technique, one may use competitive drug screening assays in which
neutralizing antibodies capable of binding the peptide or polypeptide
specifically
compete with a test compound for binding to the peptide or polypeptide. In
this manner,
the antibodies can be used to detect the presence of a test compound which
shares
one or more antigen binding sites with the antibody.
Plant constructs and plant transformants
Of particular interest is the use of the polynucleotides of this invention for
plant
transformation or transfection. Exogenous genetic material may be transferred
into a
plant cell and the plant cell regenerated into a whole, fertile, or sterile
plant. Exogenous
genetic material is any genetic material, whether naturally occurring or
otherwise, from
any source that is capable of being inserted into any organism. Such genetic
material
may be transferred into either monocotyledons or dicotyledons including but
not limited
to the plants used for animal feed, e.g., feed for sheep, cows, etc.
A variety of methods can be used to generate stable transgenic plants. These
include
particle gun bombardment (Fromm et al., Biorrechnology 8:833-839 (1990)),
electroporation of protoplasts (Rhodes et al., Science 240:204-207 (1989);
Shimamoto
et al., Nature 338:274-276 (1989)), treatment of protoplasts with polyethylene
glycol
(Datta et al., Biorrechnology, 8:736-740 (1990)), microinjection (Neuhaus et
al.,
Theoretical and Applied Genetics, 75:30-36 (1987)), immersion of seeds in a
DNA
solution (Ledoux et al., Nature, 249:17-21 (1974)), and transformation with T-
DNA of
Agrobacterium (Valvekens et al., PNAS, 85:5536-5540 (1988); Komari, Plant
Science,
60:223-229 (1989)). In most, perhaps all plant species, Agrobacterium-mediated
transformation is the most efficient and easiest of these methods to use. T-
DNA transfer
generally produces the greatest number of transformed plants with the fewest
multi-
copy insertions, rearrangements, and other undesirable events.
Many different methods for generating transgenic plants using Agrobacterium
have
been described. In general, these methods rely on a "disarmed" Agrobacterium
strain
that is incapable of inducing tumours, and a binary plasmid transfer system.
The
disarmed strain has the oncogenic genes of the T-DNA deleted. A Binary plasmid
transfer system consists of one plasmid with the 23-base pair T-DNA left and
right
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
69
border sequences, between which a gene for a selectable marker (e.g. an
herbicide
resistance gene) and other desired genetic elements are cloned. Another
plasmid
encodes the Agrobacterium genes necessary for effecting the transfer of the
DNA
between the border sequences in the first plasmid. Plant tissue is exposed to
=Agrobacterium carrying the two plasmids, the DNA between the left and right
border
repeats is transferred into the plant cells, transformed cells are identified
using the
selectable marker, and whole plants are regenerated from the transformed
tissue. Plant
tissue types that have been reported to be transformed using variations of
this method
include: cultured protoplasts (Komari, Plant Science, 60:223-229 (1989)), leaf
disks
(Lloyd et al., Science 234:464-466 (1986)), shoot apices (Gould et al., Plant
Physiology,
95:426-434 (1991)), root segments (Valvekens et at., PNAS, 85:5536-5540
(1988)),
tuber disks (Jin et al., Journal of Bacteriology, 169: 4417-4425 (1987)), and
embryos
(Gordon-Kamm et at., Plant Cell, 2:603-618 (1990)).
In the case of Arabidopsis thaliana it is possible to perform in plant
germline
transformation (Katavic et at., Molecular and General Genetics, 245:363-370
(1994);
Clough et al., Plant Journal, 16:735-743 (1998)). In the simplest of these
methods,
flowering Arabidopsis plants are dipped into a culture of Agrobacterium such
as that .
described in the previous paragraph. Among the seeds produced from these
plants, 1%
or more have integration of T-DNA into the genome.
Monocot plants have generally been more difficult to transform with
Agrobacterium than _
dicot plants. However, "supervirulent" strains of Agrobacterium with increased
expression of the virB and virG genes have been reported to transform monocot
plants
with increased efficiency (Komari et al., Journal of Bacteriology, 166:88-94
(1986); Jin
et al., Journal of Bacteriology, 169:417-425 (1987
Most T-DNA insertion events are due to illegitimate recombination events and
are
targeted to random sites in the genome. However, given sufficient homology
between
= 30 the transferred DNA and genomic sequence, it has been reported that
integration of T-
DNA by homologous recombination may be obtained at a very low frequency. Even
with
long stretches of DNA homology, the frequency of integration by homologous
recombination relative to integration= by illegitimate recombination is
roughly 1:1000
= (Miao et at., Plant Journal, 7:359-365 (1995); Kempin et at., 389:802-803
(1997)).
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
Exogenous genetic material may be transferred into a plant cell by the use of
a DNA
vector or construct designed for such a purpose. Vectors have been engineered
for
transformation of large DNA inserts into plant genomes. Binary bacterial
artificial
chromosomes have been designed to replicate in both E. coil and Agrobacterium
and
5, have all of the features required for transferring large inserts of DNA
into plant
chromosomes. BAC vectors, e.g. a pBACwich, have been developed to achieve site-
directed integration of DNA into a genome.
A construct or vector may also include a plant promoter to express the gene or
gene
10 fragment of choice. A number of promoters that are active in plant cells
have been
described in the literature. These include the nopaline synthase (NOS)
promoter, the
octopine synthase (OCS) promoter, a caulimovirus promoter such as the CaMV 19S
promoter and the CaMV 35S promoter, the figwort mosaic virus 35S promoter, the
light-
inducible promoter from the small subunit of ribulose-1,5-bis-phosphate
carboxylase
15 (ssRUBISCO), the Adh promoter, the sucrose synthase promoter, the R gene
complex
promoter, and the chlorophyll a/b binding protein gene promoter.
For the purpose of expression in source tissues of the plant, such as the
leaf, seed,
root, or stem, it is preferred that the promoters utilized in the present
invention have
20 _ relatively high expression in these specific tissues. For this purpose,
one may choose
from a number of promoters for genes with tissue- or cell-specific or -
enhanced
expression. Examples of such promoters reported in the literature include the
chloroplast glutamine synthetase GS2 promoter from pea, the chloroplast
fructose-1,6-
biphosphatase (FBPase) promoter from wheat, the nuclear photosynthetic ST-LS1
25 promoter from potato, the phenylalanine ammonia-lyase (PAL) promoter and
the
chalcone synthase (CHS) promoter from Arabidopsis thaliana.
Also reported to be active in photosynthetically active tissues are the
ribulose-1,5-
bisphosphate carboxylase (RbcS) promoter from eastern larch (Larix laricina),
the
30 promoter for the cab gene, cab6, from pine, the promoter for the Cab-1
gene from
wheat, the promoter for the CAB-1 gene from spinach, the promoter for the
cab1R gene
from rice, the pyruvate, orthophosphate dikinase (PPDK) promoter from Zea
mays, the
promoter for the tobacco LhcbI*2 gene, the Arabidopsis thaliana SUC2 sucrose-
H+
symporter promoter, and the promoter for the thylacoid membrane proteins from
35 spinach (psaD, psaF, psaE, PC, FNR, atpC, atpD, cab, rbcS). Other
promoters for the
CA 2772224 2017-03-13
71
¨ chlorophyll a/b-binding proteins may also be utilized in the present
invention, such as
the promoters for LhcB gene and PsbP gene from white mustard (Sinapis alba).
Additional promoters that may be utilized are described, for example, in U.S.
Pat. Nos.
5,378,619; 5,391,725; 5,428,147; 5,447,858; 5,608,144; 5,608,144; 5,614,399;
5,633,441; 5,633,435 and 4,633,436.
Constructs or vectors may also include, with the coding region of interest, a
nucleic acid
sequence that acts, in whole or in part, to terminate transcription of that
region. For
example, such sequences have been isolated including the Tr7 3' sequence and
the
nos 3' sequence or the like. It is understood that one or more sequences of
the present
invention that act to terminate transcription may be used.
A vector or construct may also include otherregulatory elements or selectable
markers.
Selectable markers may also be used to select for plants or plant cells that
contain the
exogenous genetic material. Examples of such include, but are not limited to,
a neo
gene which codes for kanamycin resistance and can be selected for using
kanamycin,
G418, etc.; a bar gene which codes for bialaphos resistance; a mutant EPSP
synthase
gene which encodes glyphosate resistance; a nitrilase gene which confers
resistance to
bromoxynil, a mutant acetolactate synthase gene (ALS) which confers
imidazolinone or
sulphonylurea resistance; and a methotrexate resistant DHFR gene.
A vector or construct may also include a screenable marker to monitor
expression.
Exemplary screenable markers include a .beta.-glucuronidase or uidA gene
(GUS), an
' R-locus gene, which encodes a product that regulates the production of
anthocyanin
pigments= (red colour) in plant tissues; a beta-lactamase gene, a gene which
encodes
an enzyme for which various chromogenic substrates are known (e.g., PADAC, a
chromogenic cephalosporin); a luciferase gene, a xylE gene which encodes a
catechol
dioxygenase that can convert chromogenic catechols; an alpha-amylase gene, a
tyrosinase gene which encodes an enzyme capable of oxidizing tyrosine to DOPA
and
dopaquinone which in turn condenses to melanin; an alpha-galactosidase, which
will
turn a chromogenic alpha-galactose substrate.
Included within the terms "selectable or screenable marker genes" are also
genes which
encode a secretable marker whose secretion can be detected as a means of
identifying
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
72
or selecting for transformed cells. Examples include markers which encode a
secretable
antigen that can be identified by antibody interaction, or even secretable
enzymes
which can be detected catalytically. Secretable proteins fall into a number of
classes,
including small, diffusible proteins detectable, e.g., by ELISA, small active
enzymes
detectable in extracellular solution (e.g., alpha-amylase, beta-lactamase,
phosphinothricin transferase), or proteins which are inserted or trapped in
the cell wall
(such as proteins which include a leader sequence such as that found in the
expression
unit of extension or tobacco PR-S). Other possible selectable and/or
screenable marker
genes will be apparent to those of skill in the art.
Thus, any of the polynucleotides of the present invention may be introduced
into a plant
cell in a permanent or transient manner in combination with other genetic
elements
such as vectors, promoters enhancers etc. Further any of the polynucleotides
encoding
a protein or fragment thereof or homologs of the present invention may be
introduced
into a plant cell in a manner that allows for expression (e.g.,
overexpression) of the
protein or fragment thereof encoded by the polynucleotide.
Computer related uses
In one embodiment, a nucleotide or amino acid sequence of the present
invention can
be recorded on computer readable media. This takes into account any medium
which
can be read and accessed directly by a computer. Such media include, but are
not
limited to: magnetic storage media, such as floppy discs, hard disc storage
medium,
and magnetic tape; optical storage media such as CD-ROM; electrical storage
media
such as RAM and ROM; and hybrids of these categories such as magnetic/optical
storage media. A skilled artisan can readily appreciate how any of the
presently known
computer readable mediums can be used to create a manufacture comprising
computer
readable medium having recorded thereon a sequence of the present invention.
A skilled artisan can readily adopt any of the presently know methods for
recording
information on computer readable medium to generate manufactures comprising
the
nucleotide sequence information of the present invention. A variety of data
storage
structures are available to a skilled artisan for creating a computer readable
medium
having recorded thereon a nucleotide sequence of the present invention. The
choice of
the data storage structure will generally be based on the means chosen to
access the
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
73
stored information. In addition, a variety of data processor programs and
formats can be
used to store the nucleotide sequence information of the present invention on
computer
readable medium.
As non limiting examples, the sequence information can be represented in a
word
processing text file, formatted in commercially-available software such as
WordPerfect
and Microsoft Word, or represented in the form of an ASCII file, stored in a
database
application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can
readily adapt
any number of data processor structuring formats (e.g. text file or database)
in order to
obtain computer readable medium having recorded thereon the nucleotide
sequence
information of the present invention.
By providing the sequence of any SEQ ID NO: herein, or a representative
fragment
thereof, or any variant thereof, in computer readable form, a skilled artisan
can routinely
access the sequence information for a variety of purposes. Computer software
is
publicly available which allows a skilled artisan to access sequence
information
provided in a computer readable medium. For example, software can be used to
implement the BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and
BLAZE
(Brutlag et al., Comp. Chem. 17:203-207 (1993)) search algorithms, e.g., on a
Sybase
system to identify ,open reading frames (ORFs) which contain homology to ORFs
or
proteins from other organisms. Such ORFs may be protein encoding sequences
which
are useful in producing commercially important proteins such as enzymes.
The present invention further provides systems, particularly computer-based
systems,
which contain the sequence information described herein. Such systems are
designed
to identify commercially important sequences of the M. ruminantium genome.
This
includes the hardware means, software means, and data storage means used to
analyze the nucleotide sequence information of the present invention. The
minimum
hardware means of the computer-based systems of the present invention
comprises a
central processing unit (CPU), input means, output means, and data storage
means. A
skilled artisan can readily appreciate that any one of the currently available
computer-
based system are suitable for use in the present invention.
As stated above, the computer-based systems of the present invention comprise
a data
storage means having stored therein a nucleotide sequence of the present
invention
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
74
and the necessary hardware means and software- means for supporting and
implementing a search means. This refers to memory which can store nucleotide
sequence information of the present invention, or a memory access means which
can
access manufactures having recorded thereon the nucleotide sequence
information of
the present invention. Searching can include one or more programs which are
implemented on the computer-based system to compare a target sequence or
target
structural motif with the sequence information stored within the data storage
means.
Search means are used to identify fragments or regions of the M. ruminantium
genome
which match a particular target sequence or target motif, e.g., antibody
targets. A
variety of known algorithms are disclosed publicly and a variety of
commercially
available software for conducting search means are and can be used in the
computer-
based systems of the present invention. Examples of such software include, but
are not
limited to, MacPattem (EMBL), BLASTN and BLASTX (NCBIA). A skilled artisan can
readily recognize that any one of the available algorithms or implementing
software
packages for conducting homology searches can be adapted for use in the
present
computer-based systems.
The target sequence can be any DNA or amino acid sequence of six or more
nucleotides or two or more amino acids. A skilled artisan can readily
recognize that the
longer a target sequence is, the less likely a target sequence will be present
as a
random occurrence in the database. The most preferred sequence length of a
target
sequence is from about 10 to 100 amino acids or from about 30 to 300
nucleotide
residues. However, it is well recognized that searches for commercially
important
fragments of the M. ruminantium genome, such as sequence fragments involved in
gene expression and protein processing, may be of shorter length.
A target structural motif, or target motif includes any rationally selected
sequence or
combination of sequences in which the sequence(s) are chosen based on a three-
dimensional configuration which is formed upon the folding of the target
motif. There are
= a variety of target motifs known in the art. Protein target motifs
include, but are not
limited to, enzyme active sites and signal sequences. Nucleic acid target
motifs include,
but are not limited to, promoter sequences, hairpin structures and inducible
expression
elements (protein binding sequences).
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
A variety of structural formats for the input and output means can be used to
input and
output the information in the computer-based systems of the present invention.
A
preferred format for an output means ranks fragments of the M. ruminantium
genome
possessing varying degrees of homology to the target sequence or target motif.
Such
5 presentation provides a skilled artisan with a ranking of sequences which
contain
various amounts of the target sequence or target motif and identifies the
degree of
homology contained in the identified fragment.
A variety of comparing means can be used to compare a target sequence or
target
10 motif with the data storage means to identify sequence fragments of the
M. ruminantium
genome. In particular aspects, software can be used implement the BLAST and
BLAZE
algorithms (Altschul et al., J. Mol. Biol. 215:403-410. (1990)) and to
identify open
reading frames. A skilled artisan can readily recognize that any one of the
publicly
available homology search programs can be used as the search means for the
15 computer-based systems of the present invention
The computer system may include a processor connected to a bus. Also connected
to
the bus may be a main memory (preferably implemented as random access memory,
RAM) and a variety of secondary storage devices, such as a hard drive and a
20 removable medium storage device. The removable medium storage device may
represent, for example, a floppy disk drive, a CD-ROM drive, a magnetic tape
drive, etc.
A removable storage medium (such as a floppy disk, a compact disk, a magnetic
tape,
etc.) containing control logic and/or data recorded therein may be inserted
into the
removable medium storage device. The computer system may include appropriate
25 software for reading the control logic and/or the data from the removable
medium
storage device once inserted in the removable medium storage device.
A sequence of the present invention may be stored in a well known manner in
the main
memory, any of the secondary storage devices, and/or a removable storage
medium.
30 Software for accessing and processing the genomic sequence (such as
search tools,
comparing tools, etc.) reside in main memory during execution.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
76
EXAMPLES
The examples described herein are for purposes of illustrating embodiments of
the
invention. Other embodiments, methods, and types of analyses are within the
scope of
persons of ordinary skill in the molecular diagnostic arts and need not be
described in
detail hereon. Other embodiments within the scope of the art are considered to
be part
of this invention.
EXAMPLE 1: Materials and methods
Strain information and growth conditions
Methanobrevibacter ruminantium MIT (DSM1093) was obtained from the German
Collection of Microorganisms and Cell Cultures (DSMZ), Braunschweig, Germany.
The
original description of Methanobacterium ruminantium was made by Smith and
Hungate
(Smith & Hungate, 1958) and the genus designation later changed to
Methanobrevibacter (Balch et al., 1979). Methanobrevibacter ruminantium MIT
=
(DSM1093) was isolated from bovine rumen contents by Bryant (Bryant, 1965). It
is
designated the neotype strain for this species because the original strain of
Smith and
Hungate was not maintained. M. ruminantium strain M1T was routinely grown in
basal
medium (Joblin et al., 1990) with added trace elements (Balch et al., 1979),
(Br
medium), with H2 plus CO2 (4:1) at 180 kPa overpressure. The culture tubes
were
incubated on their sides, at 39 C in the dark, on a platform shaken at 200
rpm.
Co-culture of M. ruminantium and Butyrivibrio proteoclasticus
M1 was grown in co-culture with Butrivibrio proteoclasticus B316T (DSM14932)
to
examine gene expression under rumen-like conditions. Eighteen pure cultures of
M1
were grown in Br medium with H2 plus CO2 (4:1) at 180 kPa overpressure in 100
ml
volumes in 125 ml serum bottles sealed with blue butyl septum stoppers and
aluminium
seals (Bellco Glass, Vineland, NJ, USA). When the cultures reached mid-
exponential
phase, as measured by optical density at 600 nm (Ultrospec 1100 pro UVNis
spectrophotometer, Amersham Biosciences, Little Chalfont, Buckinghamshire, UK)
they
were flushed with 02-free 100% CO2 gas until H2 was not detectible by gas
chromatography. All 18 cultures were supplemented with oat spelt )(Irian
(Sigma-Aldrich,
St. Louis, MO, USA) to 0.2% (w1v) final concentration, then nine of the
cultures were
inoculated with 0.5 ml of a late-exponential phase culture of B.
proteoclasticus. The
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
77
other nine were re-pressurized to H2 plus CO2 (4:1) at 180 kPa overpressure.
Three
further serum bottles of BY medium supplemented with 0.2% (w/v) xylan were
also
inoculated with an equivalent inoculum of B. proteoclasticus. Growth in the co-
culture
was monitored periodically by Thoma slide enumeration (Webber Scientific
International
Ltd., Teddington, England). Mid-exponential phase co-cultures and monocultures
were
harvested by centrifugation (10,000 X g, 5 min at 4 C), and the cell pellets
resuspended
in 10 ml of BY medium (4- 0.2% [w/v] xylan) and 20 ml of RNAprotect (Qiagen,
Hilden,
Germany). These were incubated for 5 min at room temperature, and were
immediately
processed for RNA extraction
Microarray analyses
RNA isolation, cDNA synthesis and labelling. Cells of M1 and B.
proteoclasticus from
mono- or co-cultures prepared as described above, were pelleted by.
centrifugation
(5,000 x g, 10 min room temperature), air-dried and frozen .under liquid N2.
Frozen
pellets were ground in a sterile pre-chilled (-20 C) mortar and pestle under
liquid N2,
and the ground samples resuspended in excess TRIzol (Invitrogen, Carlsbad, CA,
USA). The mixtures were incubated at 20 C for 5 min. Chloroform (200 pl) was
then
added, mixed vigorously, and incubated for a further 3 min. The samples were
centrifuged (12,000 x g, 15 min, 4 C) and the aqueous phases transferred to
fresh
tubes, mixed with 0.5 volumes isopropanol and incubated at 20 C for 10 min to
precipitate the RNAs. Precipitated RNAs were pelleted by centrifugation
(12,000 x g,
10 min, 4 C), the supernatants removed and the RNAs washed with 5 ml of 75%
(v/v)
ethanol before being re-pelleted by centrifugation. Ethanol was removed, the
pellets air
dried on ice and finally each resuspended in 1 ml of diethyl pyrocarbonate
(DEPC)
treated Milli-Q water. The RNAs were further purified using an RNeasy Midi kit
(Qiagen, Hilden, Germany) and quantified using an Agilent 2100 Bioanalyzer
(Agilent
Technologies, Santa Clara, CA, USA) following the respective manufacturer's
instructions. cDNA synthesis, labeling and purification were carried out using
the
Invitrogen cDNA labelling purification kit, while the Cy3 and Cy5 dyes were
from GE
Healthcare (Uppsala, Sweden).
Quantification of co-culture mRNA. The relative quantities of RNAs contributed
by each
organism to the co-culture samples were determined by quantitative PCR of the
B. proteoclasticus butyryl-CoA dehydrogenase (bcd) gene (using primers bcdqfp:
tgagaagggaacacctggat; SEQ ID NO: 7586, and bcdqrp: ttgctcttccgaactgctt; SEQ ID
NO:
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
78
7587), and the M1 gene encoding N51N10-methenyl-H4MPT cyclohydrolase (mch)
(using
primers mchqfp: gtattgcctggtgaagatgt; SEQ ID NO: 7588 and mchqrp:
gtcgatttggtagaagtca; SEQ ID NO: 7589). Homologs of both genes have previously
been
shown to be. constitutively expressed in closely related species (Reeve et
al., 1997;
Asanuma et al., 2005). The mono-culture RNAs were then combined in equal
proportions to normalise mRNA abundance with their co-culture replicates.
Probe synthesis and slide printing. Oligonucleotide 70mer probes were designed
against the draft genomes of M1 and Butyrivibrio proteoclasticus B316T using
ROSO
software (Reymond et al., 2004) and synthesised by IIlumina (San Diego, CA,
USA).
Oligonucleotides were spotted onto epoxy-coated slides (Corning, Lowell, MA,
USA)
using an ESI robot (Engineer Service Inc., Toronto, Ontario, Canada).
Microarray hybridization and scanning. Microarrays were replicated 6 times (3
biological replicates" per treatment, each with a dye swap) and each gene was
represented on the array 3 to 7 times. Microarray slides were pre-warmed in
microarray
prehybridization buffer (50 C for 30 min), and transferred into hybridization
chambers
(Corning, Lowell, MA, USA) and lifter cover slips (Erie Scientific,
Portsmouth, NH, USA)
were laid over the probe areas. Samples of RNA to be compared (e.g., Cy3 co-
culture
versus combined Cy5 individual mono-cultures) were combined, denatured at 95 C
for
10 min, and mixed with 60 pl of pre-warmed (68 C) Slide Hyb buffer #1 (Ambion,
Austin, TX, USA). The mixture was loaded onto the slide, the hybridization
chamber
sealed, and incubated in a water bath at 50 C for 24 h. Following
hybridization, the
slides were washed by vigorous shaking by hand in pre-warmed (50 C) wash
solutions
1 to 3 (wash solution 1: 10%SDS, 2 x SSC; wash solution 2: 1 x SSC; wash
solution 3:
0.1 x SSC), 7 min per wash in aluminium foil-covered Falcon tubes (Becton,
Dickinson
and Co. Sparks, MD, USA). Following the third wash, the slides were dried by
low
speed centrifugation (1,500 x g, 4 min) followed by incubation for 20 min in a
37 C
vacuum oven (Contherm, Wellington, NZ) in the dark. Microarray slides were
scanned
using a GenePixe Professional 4200 scanner and GenePix Pro 6.0 software
(Molecular
Devices, Sunnyvale, CA, USA) and analysed using the Limma package in
Bioconductor
(Smyth, 2005). Genes with an up- or down-regulation of 2 fold or greater and
an FOR
value < 0.05 were deemed statistically significant.
Growth experiments to test effects of PeiR and alcohols
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
79
M1 was grown in medium RM02 in anaerobic culture tubes (16 mm internal
diameter,
18 mm outer diamater, 150 mm long; BeIlco Glass, Vineland, NJ, USA),
essentially as
described by Balch and Wolfe, 1976. The mineral salts base of RM02 contained
(per
litre of medium): 1.4 g of KH2PO4, 0.6 g of (NH4)2SO4, 1.5 g of KCI, 1 ml
trace element
solution SL10 (Widdel et al., 1983), 1 ml of selenite/tungstate solution
(Tschech and
Pfennig, 1984) and 4 drops of 0.1% (w/v) resazurin solution. This solution was
mixed
and then boiled under 02-free 100% CO2, before being cooled in an ice bath
while it
was bubbled with 100% 002. Once the medium was cool, 4.2 g of NaHCO3 and 0.5 g
of
L-cysteine-HCI-H20 was added per litre. The medium was dispensed into the
culture
tubes while being gassed with 100% CO2, at 9.5 ml of medium per tube, and the
tubes
sealed with blue butyl septum stoppers and aluminium seals (BelIco), with a
headspace
of 100% CO2. These tubes were sterilised by autoclaving for 20 min at 121 C.
Before
use, the tubes were stored in the dark for at least 24 h. Sodium acetate (20
mM final
conc.), sodium formate (60 mM final conc.), coenzyme M (10 pM final conc.),
and
vitamin-supplemented clarified rumen fluid were added to sterile media, before
inoculation with 0.5 ml of an actively growing culture of M. ruminantium, then
gassed
with H2 plus CO2 (4:1) to 180 kPa overpressure. In some experiments, the
formate was
omitted, and alcohols were added, as noted in, the experimental descriptions
accompanying the results. The culture tubes were incubated on their sides, at
39 C in
the dark, on a platform shaken at 200 rpm.
To prepare the clarified rumen fluid, rumen contents were collected from a
ruminally-
fistulated cow that had been fed hay for 48 h after being on a rye-grass
clover pasture.
Feed was withheld from the animal overnight and rumen contents collected the
next
morning. The material was filtered through a single layer of cheesecloth and
then fine
particulate material removed by centrifugation at 10,000 x g for 20 min. The
supernatant
was stored at -20 C. Before further use, it was thawed, and any precipitates
removed
by centrifugation at 12,000 x g for 15 min. The supernatant was bubbled for 10
min with
100% N2 gas, before being autoclaved under 100% nitrogen for 15 min to remove
viruses. The following was then added per 100 ml of rumen fluid while stirring
under air:
1.63 g of MgC12=6H20 and 1.18 g of CaCl2-2H20. The resulting heavy precipitate
was
removed by centrifuging at 30,000 X g and 4 C for 60 min. The supernatant was
designated the clarified rumen fluid. Two grams of yeast extract powder was
added,
and the mixture then bubbled with N2 gas for 15 min, before being transferred
to a N2-
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
flushed sterile serum vial through a 0.2-pm pore size sterile filter using a
syringe and
needle.
Two ml of Vitamin 10 concentrate was then added per 100 ml of preparation by
syringe
5 and needle. Vitamin 10 concentrate contained 1000 ml of distilled water,
40 mg of
4-aminobenzoate, 10 mg of D-(+)-biotin, 100 mg of nicotinic acid, 50 mg of
hemicalcium
D-(+)-pantothenate, 150 mg of pyridoxamine hydrochloride, 100 mg of thiamine
chloride
hydrochloride, 50 mg of cyanocobalamin, 30 mg of D,L-6,8-thioctic acid, 30 mg
of
riboflavin and 10 mg of folic acid. After preparation, the solution was well
mixed and
10 then bubbled with N2 gas for 15 min, before being transferred to a N2-
flushed sterile
= serum vial through a 0.2 pm pore size sterile filter using a syringe and
needle.
Growth of M1 was followed by measuring the culture density at 600 nm by
inserting the
tubes 'directly into an Ultrospec 1100 pro UVNis spectrophotometer (Amersham
15 Biosciences, Little Chalfont, Buckinghamshire, UK). Tubes containing 10
ml of medium
RM02 were inoculated with 0.5 ml of an actively growing culture of Ml, then
gassed
with H2 plus CO2 (4:1) to 180 kPa overpressure. Additions of PeiR in 0.1 ml of
buffer (20
mM 3[N-morpholino]propane sulfonic acid: 1 mM dithiothreitol: 0.3 M NaCl, 20%
glycerol [v/v], pH 7.0 with NaOH), 0.1 ml of buffer only, or 0.1 ml of
chloroform were
20 made when the cultures had grown to mid-exponential phase (optical
density at 600 nm
[OD600] ¨0.1, 16 mm path length). In the experiments testing the effects of
PeiR
addition, the culture densities were mathematically normalised to an 0D600 of
0.1 at the
time the additions were made, and all other readings corrected by the same
ratio. This
was done to remove the effect of lack of absolute synchronicity between
cultures, a
25 common phenomenon when culturing methanogens. This normalisation was not
done
for experiments testing the utilisation of alcohols. Methane was measured by
gas
chromatography, taking a 0.3 ml sample from the culture headspace, at the
pressure in
the culture tube, and injecting it into an Aerograph 660 (Varian Associates,
Palo Alto,
CA, USA) fitted with a Porapak Q 80/100 mesh column (Waters Corporation,
Milford,
30 MA, USA) and a thermal conductivity detector operated at 100 C. The column
was
operated at room temperature with N2 as the carrier gas at 12 cm3/min.
DNA extraction
Genomic DNA was extracted from M1 grown on BY medium with H2 plus CO2 (4:1),
35 using the liquid N2 freezing and grinding method (Jarrell et al., 1992).
Briefly, M1
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
81
cultures were harvested by centrifugation at 27,000 x g for 20 min at 4 C and
cell
pellets combined and placed into 40 ml Oakridge centrifuge tubes (Thermo
Fisher
Scientific, Inc.). The cells were frozen at -20 C and kept frozen for at least
4 days. The
frozen cell pellets were placed in a sterile, pre-cooled (-85 C) mortar and
liquid N2
poured over the pellet. After the N2 had evaporated, the pellet was ground to
a powder
with a sterile glass rod. Immediately, 0.5 ml of TES buffer (10 mM Tris-HCI :
1 mM
EDTA:0.25 M sucrose, pH 7.5) was added to the powdered cell pellet and mixed
gently
into a slurry. Sodium dodecyl sulfate was added to a final concentration of 1%
(w/v)
and Proteinase = K (Roche Diagnostics, Mannheim, Germany) added to a final
concentration of 50 ig/ml. The mixture was incubated at 60 C for 30 min. NaCI
was
added to a final concentration of 0.5 M and the lysate was placed on ice for 1
h. The
lysate was centrifuged at 25,000 x g for 15 min at 4 C and the supematant
recovered
carefully. An equal volume of cold (4 C) isopropanol was added to the
supernatant, and
the precipitated DNA was collected by centrifugation at 12,000 X g for 10 min
at room
temperature and re-dissolved in TE buffer (10 mM Tris-HCI: 1 mM EDTA, pH 7.5).
The
DNA was treated with RNase (10 pg/m1), (Sigma-Aldrich) for 30 min at 37 C, and
extracted twice with an equal volume of phenol/chloroform/isoamyl alcohol
(25:24:1)
and twice with an equal volume of chloroform alone. NaCI was added to a final
concentration of 0.5 M and the DNA precipitated by adding 2.5 volumes of cold
(4 C)
ethanol. The precipitated DNA was collected by centrifugation at 14,000 X g
for 10 min
at 4 C and re-dissolved in TE buffer.
Pulsed-field gel electrophoresis (PFGE)
Standard PFGE protocol involves embedding cells in agarose and lysis with
lysozyme
and/or proteases, but this was not possible with M1 because its pseudomurein-
containing cell wall was resistant to lysis by commercially available enzymes.
In order to
overcome this, the cell pellet from a centrifuged 50 ml culture was frozen
with liquid N2
and very gently ground in a pestle and mortar to damage the cell wall. The
ground
material was allowed to thaw, 2 ml of 1 M NaCI plus 10 mM Tris (pH 7.6) was
added
and 300 1F aliquots were mixed with an equal volume of 2% (w/v) low melt
agarose
(Bio-Rad Laboratories, Hercules, CA, USA). Embedded cells were treated with
0.1 mg
m1-1 Proteinase K in lysis buffer (50 mM Tris-HCI : 50 nriM EDTA: 1% [w/v]
sarkosyl, pH
=
8.0) at 50 C for up to 24 h. The agarose plugs were washed twice with sterile
water and
three times with TE buffer (10 mM Tris-HCI : 1 mM EDTA, pH 8.0) before storage
in 10
mM Tris-HCI: 100 mM EDTA (pH 8.0) at 4 C. DNA embedded in agarose was digested
82
for 16 h with 1.0 U of Apal, Bssi-111 or M/u1 (New England Biolabs, Beverly,
MA, USA) in
100 pl of restriction enzyme buffer, loaded into wells of 1% (w/v) agarose
gels (SeaKem
= Gold agarose, Cambrex Bio Science, Rockland, ME, USA), and run at 200 V
for 20 h at
14 C in 0.5X Tris-borate buffer using a CHEF DR Ill PFGE apparatus and model
1000
mini chiller (Bio-Rad). Double-digest combinations of these enzymes were
digested and
run in the same way. DNA was visualized by staining with ethidium bromide and
the
image captured using a Gel Doc 1000 system (Kodak Gel Logic 200 Imaging
System,
Eastman Kodak, Rochester, NY, USA).
Genome sequencing, assembly and validation
The genome sequence of M1 was determined using a whole genome shotgun strategy
(Agencourt Biosciences, USA) and a pyrosequencing approach (Macrogen, USA). A
hybrid assembly (Goldberg et al., 2006) was performed utilising the Staden
package
(Sladen & Bonfield, 2000), Phred (Ewing et al., 1998), Phrap, Paracel (Paracel
Inc.)
and Repeatnnasker resulting in a 27
cortig assembly. Gaps were closed using additional sequencing by PCR-based
techniques. Quality _improvement of the genome sequence was performed using
standard PCR to ensure correct assembly and the resolution of any remaining-
base- .
conflicts. Assembly validation was confirmed by pulsed-field gel
electrophoresis (see
above).
=
Genome analysis and annotation
A GAMOLA (Alterrnann and Klaenhammer, 2003) Artemis (Rutherford et al., mop)
software suite was used to manage genome annotation. Protein-encoding open
reading
frames (ORES) were identified using the ORF-prediction program Glimmer
(Delcher et
al., 1999) and BLASTX (Gish & States, 1993). A manual inspection was performed
to
verify or, if necessary, redefine the start and stop of each ORF. Assignment
of protein
function to ORFs was performed manually using results from the following
sources;
BLASTP (Altschul et al., 1990) to both a non-redundant protein database
provided by
the National Centre for Biotechnology Information (NCB!) (Sayers et al., 2009)
and
clusters of orthologous groups (COG) (Tatusov et al., 2000) database. HMMER
(Eddy,
1998) was used to identify protein motifs to both the PFAM (Finn et al., 2008)
and
TIGRFAM (Haft et al., 2003) libraries. TMHMM (Krogh et al., 2001)
was used to predict transmembrane
CA 2772224 2018-05-04
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
83
sequences, and SignalP (Bendtsen et al., 2004) was used for the prediction of
signal =
peptides.
Ribosomal RNA genes were detected on the basis of BLASTN searches to a custom
GAMOLA ribosomal database. Transfer RNA genes were identified using tRNAscan-
SE (Lowe & Eddy, 1997). Miscellaneous-coding RNAs were identified using the
Rfam
database utilizing the INFERNAL software package (Eddy, 2002). Insertion
sequence
elements were identified using Repeaffinder (Volfovsky et al., 2001) and BLAST
and
annotated manually. Genome atlas visualisations were constructed using GENEWIZ
(Jensen et al., 1999). Horizontal gene transfer studies were performed using
Darkhorse
(Podell & Gaasterland, 2007), GC% content (Rice et al., 2000) and the Codon
Adaptation Index (Sharp & Li, 1987). A BLAST analysis was performed against
the
arCOG (Makarova et al., 2007) database. Analysis of non-ribosomal peptide
synthetases (NRPSs) was performed using NRPSpredictor (Rausch et al., 2005).
An
LPxTG-HMM (Boekhorst et al., 2005) was used for the identification of LPxTG
motifs.
Metabolic pathway reconstructions were performed using Pathway Voyager
(Altermann
& Klaenhammer, 2005) and the KEGG (Kyoto Encyclopedia of Genes and Genomes)
database (Kanehisa & Goto, 2000) combined with an extensive review of the
literature.
Identification of open reading frames (ORFs) as vaccine and drug targets was
performed as described. Genome sequences used in comparative studies were
downloaded from the National Centre for Biotechnology Information (NCB!) FTP
website and are listed in Table 10, below.
Genome sequence was prepared for NCBI submission using Sequin (Benson et al.,
2009). The adenine residue of the start codon of the Cdc6-1 (mru0001) gene was
chosen as the first base for the M1 genome. For GC skew and synteny analysis,
the
sequences of genonnes of other members of the order Methanobacteriales were
rotated
to begin at the same location. GC skew analysis was performed by circular
diagram.pl
(Rutherford, K, Sanger Centre software) and synteny plots were generated using
MUMmer3.0 (Delcher et al., 2003).
Vaccine target ORF identification
Vaccine targets are likely to be surface exposed or membrane associated and
conserved among methanogens or archaeal species. Methanogens are the only
known
resident archaea in the rumen and therefore archaeal specific candidates were
not
84
omitted from target lists, likewise when present, sequence homology to
proteins
associated with known vaccine or drug design remained a strong element for
target
selection regardless of other criteria. To identify the surface-exposed or
membrane-
associated ORFs of M1 a combination of methods was utilized. To date, there is
no
. 5 signal peptide model for archaea. There are simply too few
experimentally verified
secretory proteins available for Archaea to train a specific model. For this
reason ORF
sequences were analysed for the presence of signal peptides using SignalP
Version 3.0
(Bendtsen et al., 2004) trained against the Gram-positive, Grain-negative 'and
Eukaryotic models and the results combined. SignaIP-HMM (hidden markov models)
was used to discriminate between signal peptide and non-signal peptide ORFs
whereas
SignaIP-NN (neural networks) was utilized for the prediction of cleavage sites
as
described by Emanuelsson et al., 2007 (Emanuelsson et al., 2007). TMHMM (Krogh
et
al., 2001) was used for the prediction of
transmembrane domains and (Nakai & Horton, 1999) PSORT trained on a Gram-
positive model was used to predict a protein's subcellular localization.
BLASTP results .
were analyzed to identify methanogen specific ORFs.
-Chemogenomics target ORF identification
Three different approaches were utilized to identify candidate chemogenomics
targets.
Metabolic profiling analyses. Several factors were taken into consideration
when
performing this analysis. Utilizing the metabolic reconstruction of M1 and an
extensive
review of the literature, archaeal- or methanogen-specific enzymes, or enzymes
with -
sufficient structural or biochemical differences compared to their bacterial
or eukaryl
counterparts were identified. Some methanogen enzymes or pathways that have
been
previously targeted by researchers for inhibition demonstrating the
essentiality of certain
enzymes/pathways were also taken into consideration. In addition, a few
enzymes
which represent key enzymes to several pathways or are well known validated
targets
in pathogenic bacteria or parasites, whilst still retaining sufficient
sequence divergence
to potentially be able to be targeted effectively were also included. Most of
the cell wall
enzymes are listed as the majority of successful antibiotics that have been
developed
=against pathogenic bacteria target cell wall biosynthesis. Methanobacterial
cell wall
synthesis, despite apparently sharing some common enzymes (e.g., mur ligases)
is
widely divergent in biochemical terms -from bacterial cell wall synthesis and
the
homologous enzymes share only limited sequence homology. The degree to which
CA 2772224 2018-05-04
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
strain Ml, or other rumen methanogens, are able to utilize amino acids,
vitamins, or
purine or pyrimidine compounds in rumen fluid is under investigation.
Functional genome distribution (FGD). A FGD analysis (Altermann 2009,
manuscript in
5 preparation) was performed using 26 publicly available draft and complete
methanogen
genome sequences (dbMethano, Table 10). In contrast to an evolutionary
phylogeny,
FGD analyzes the functional relationship between microbes based on their
predicted
ORFeomes. FGD is a comparative genomics approach to genome-genome
comparisons, emphasizing functional relationships rather than ancestral
lineages.
10 .. Briefly, pooled ORFeomes are subjected to all-vs-all analyses,
evaluating the level and
quality of amino-acid similarities between individual ORFs pairings.
Individual results for
each genome-genome combination are then combined into a symmetrical distance
matrix and can be visualized using the Unweighted Pair Group Method with
Arithmetic
mean (UPGMA) method (Sneath and Sokal, 1973) Numerical Taxonomy. Freeman, San
15 Francisco. Strain and cluster conserved and specific gene sets were
mined based on
respective BLAST e-values, using custom developed software.
Differential Blast Analysis (DBA). The reference genome of M1 was subjected to
analysis against two BLASTP databases using GAMOLA (Altermann and
20 Klaenhammer, 2003). The first amino-acid database= employed all methanogen
ORFeomes used in the FGD analysis (dbMethano), while the second database was
comprised of the non-redundant database (nr) as provided by NCBI, excluding
hits to
genera used in dbMethano. E-values of best BLASTP hits for both database sets
were
consolidated into an empirically determined e-value trust level range (fT
e-valueD and their
25 respective
differential calculated as follows: A = (Tnr TdbMethano)= Results were
visualized on a genome atlas using Genewiz (Jensen et al., 1999) and software
developed in-house.
Peptide vaccine methods
30 The use of synthetic peptides to raise antibodies against predicted M1
surface proteins
was investigated. The M1 proteins encoding the membrane-spanning subunits of
tetrahydromethanopterin S-methyltransferase (MtrCDE, mru1921, 1922 and 1923),
adhesin-like proteins (mru2049, 0842, 0143 and 2048) and a magnesium chelatase
subunit H (BchH, mru2047) containing N-terminal and C-terminal TMHs, were
analysed
35 to identify extracellular peptide sequences which might serve as
potential antibody
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
86
binding sites. Nine suitable peptide sequences from extracellular regions of
these eight
proteins were identified and used to guide the manufacture of the
corresponding
synthetic peptides. Each peptide was coupled to the Keyhole Limpet hemocyanin
(KLH) protein via an additional N- or C-terminal cysteine residue and a
maleimidocaproyl-N-hydroxysuccinimide linker and used to raise antibodies in
sheep
(lnvitrogen, USA). The conjugated peptides (200 pg) were injected
intradermally (ID)
into sheep (1-3 yr age) in Complete Freund's Adjuvant (CFA) at 10-15 sites on
day 0,
and secondary boosters in CFA were given on day 14. Six ID injections of 200
pg KLH-
peptide in Incomplete Freund's Adjuvant at 10-15 sites were given at days 28,
56, 70,
84, 98 and 112. Test bleeds (2-5 ml) were taken on days 42, 56, 84, and 112
for ELISA
analyses.
Antibody titer was determined with an ELISA with Peptide-GGG (goat gamma
globulin)
bound in solid phase (0.1 pg/100 p1/well) on high binding 96 well plates. The
serum was
first diluted 50-fold and then further diluted in 2-fold serial dilutions. The
ELISA titer is
the estimated dilution factor that resulted in an OD405nm of 0.2 and is
derived from
nonlinear regression analysis of the serial dilution curve. Detection was
obtained using
an HRP (horseradish peroxidase)-conjugated secondary antibody and ABTS (2,2'-
azino-bis(3-ethylbenzthiazoline-6-sulphonic acid). In the antibody-binding
experiment
M1 cells (40 pl of cells in 2 ml of sodium carbonate buffer) were immobilised
on
Maxisorp ELISA plates and antibody binding was determined by ELISA. Serum
samples were diluted 1/20 in diluent (1% w/v casein in PBS Tween 20 (g/I NaCI,
8.0;
= KCI, 0.2; Na2HPO4, 1.15; KH2PO4, 0.2; pH 7.2-7.4; Tween 20, 0.5 ml) and
incubated
at room temperature for 1 hr. Plates were washed 6 times with PBS Tween 20 and
conjugate (donkey anti-sheep/goat IgG HRP, 50 p1/well of a 1/5000 diluted
solution) and
substrate (3, 3',5, 5' tetramethyl benzidine, 50 pl/well) were added. After
incubation at
room temperature in the dark for 15 min, stop solution (50 p1/well of 0.05 M
H2SO4)
was added and 0D450 nm readings were taken.
Accession numbers and sequence listing
The nucleotide sequence of the M. ruminantium M1 chromosome has been deposited
in
GenBank under Accession Number CP001719. Microarray data has been submitted to
the Gene Expression Omnibus (GEO) in accordance with MIAME standards under GEO
Accession Number GSE18716. The specific M. ruminantium sequences are disclosed
CA 2772224 2017-03-13
87
herewith as a teit file (.txt) for the sequence listing.
EXAMPLE 2: Results
General genome characteristics
The genome sequence of M1 consists of a single 2.93 megabase (Mb) circular
chromosome, the assembly of which has been verified by pulsed-field gel
electrophoresis (Figure 6). The general features of the Ml genome compared to
other
genomes of species within the order Methanobacteriales are summarized in Table
1,
below, and Figure 10. M1 has the largest genome of the Methanobacteriales
sequenced to date. This increased genome size is due in part to a lower
overall coding
density, but also to a large number of genes encoding surface adhesin-like
proteins, thern
presence of a prophage, and a variety of genes unique to the M1 genome. M1
encodes _
2217 open reading frames (ORFs) and a functional classification of each ORE is
. presented in Table 9, below, and Figure 15. Genomes of the
Methanobacteriales
display a GC skew similar to bacterial chromosomes (Lobry, 1996) (Figure 2)
and an X-
shaped synteny pattern that is characteristic of moderately diverged genomes
(Figure
3). Analysis of potential horizontal gene transfer (HGT) events in M1
identified a number
of genes which show high sequence similarity to non-methanogens, typically
from
members of the bacterial phylum Firmicutes (Table 4, below, and Figure 16).
These
potential HOT events can be visualized .in a BLAST heat map analysis (Figure
16).
Several approaches were used to define potential gene targets for. CH4
mitigation from
Ml. An integral part of this process was the analysis of genes Which are
conserved
across methanogen genomes (Figure 13 and Table 8, below). Coupled with this,
chemogenomic targets (Figure la) were selected on the basis of detailed
metabolic
analysis (Table 2, below), while potential vaccine targets (Figure 1 b) were
chosen from
proteins predicted to be associated with the M1 cell surface. Examination of
the M1
genome has revealed a number of features and targets that could lead to an
effective
enteric methane mitigation technology, and these are discussed below.
= 30 Growth and methanogenesis
Many of the enzymes involved in the methanogenesis pathway are strongly
conserved
and found only among' methanogens, and therefore present good targets for CH4
mitigation technologies. Although this pathway has been well studied in
methanogens =
= =
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
88
from a range of other environments (Thauer et al., 2008), the M1 genome shows
for the
first time details of this pathway in a rumen methanogen. M1 can grow with
H2+CO2 and
formate (Smith & Hungate, 1958) and encodes the enzymes, and most of the
cofactors,
required for conversion of these substrates through to methane according to
the
metabolic scheme presented in Figure 11. Consistent with this hydrogenotrophic
lifestyle, M1 lacks the nnethanophenazine-reducing [Ni-Fe] hydrogenase
(VhoACG) and
methanophenazine-dependant heterodisulphide reductase (HdrDE) found in
methanophenazine-containing species within the order Methanosarcinales (Abken
et
al., 1998).
Surprisingly, M1 has two NADPH-dependent F420 dehydrogenase (npdG1, 2) genes
and
three NADP-dependent alcohol dehydrogenase (adh1, 2 and 3) genes. In some
methanogens, these enzymes allow growth on ethanol or isopropanol via NADP+-
dependent oxidation of the alcohol coupled to F420 reduction of methenyl-H4MPT
to
methyl-H4MPT (Berk & Thauer, 1997) (Figure 11). M1 is reported as not being
able to
grow on ethanol or methanol (Smith & Hungate, 1958), although a ciliate-
associated M.
ruminantium-like isolate was able to use isopropanol to a limited degree but
data were
not presented (Tokura et al., 1999). Our attempts to grow M1 on alcohols
indicate that
ethanol and methanol stimulate growth in the presence of limiting amounts of
H2+CO2,
but they do not support growth when H2 is absent (Figure 14). M1 does not
contain
homologues of the mta genes known to be required for methanol utilization in
other
methanogens (Fricke et al., 2006). The adh genes may play a role in alcohol
metabolism but the mechanism is under assessment.
Hydrogenotrophic methanogens usually encode a methyl coenzyme reductase II
(mall
or mrt), an isoenzyme of the methyl CoM reductase I (mcrl) enzyme which is
differentially regulated during growth (Reeve et al., 1997) to mediate methane
formation
at high partial pressures of H2. Interestingly, M1 does not encode a mail
system. In the
rumen, methanogens depend on fermentative microbes to supply H2, usually at
very
low concentrations, and M1 appears to have adapted its lifestyle for growth at
low levels
of H2 using the mcrl system only.
To examine the expression of genes involved in methanogenesis, in the presence
of a
H2-forming rumen bacterium, M1 was grown in co-culture with Butyrivibrio
proteoclasticus B316 (Moon et al., 2008) in a medium containing xylan as the
sole
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
89
carbon source, and gene expression was analysed by microarrays.
Formylmethanofuran dehydrogenase (fwdA), methyl CoM reductase (mcrBCDG),
methyl viologen-reducing hydrogenase (mvhG), and H4MPT methyltransferase
(mtrABCH) were significantly up-regulated (>2 fold) in the co-culture compared
to the
monoculture of M1 grown with H2+CO2 (Table 5, below). .Interestingly, formate
utilisation (fdhAB) genes were also up-regulated, suggesting that formate
formed by B.
proteoclasticus was an important methanogenic substrate transferred during
this
syntrophic interaction.
Genes encoding enzymes in the methanogenesis pathway that are potential
targets are
highlighted in Figure 11. Several methanogenesis marker proteins found in
methanogen
genomes, with hypothetical function, were also included in the target list.
Many of the
enzyme subunit targets are predicted to be within the cell cytoplasm, and
therefore best
pursued via a chemogenomics approach (Figure 11). However, several, subunits
including those of the Aha, Eha, Ehb and Mtr enzyme complexes, are membrane-
located and may be suitable as vaccine targets. Mtr catalyses the transfer of
the methyl
, group from methyl-H4MPT to CoM and couples this to the efflux of Na +
ions (Reeve et
al., 1997). Three of the Mtr subunits (MtrEDC) are predicted to be membrane-
spanning
in M1 and in each of the membrane-spanning regions the transmembrane helices
have
peptide loops located outside the cell membrane. These loops are potential
antibody
binding sites. Synthetic peptides corresponding to the loop regions of lintif,
Mtti) and
MttC have been coupled to a carrier protein and used as antigens to vaccinate
sheep.
The resulting immune sera were shown to bind specifically to immobilized M1
_cells
(Figure 17), demonstrating the feasibility of such a peptide-directed reverse
vaccinology
approach.
Analysis of the M1 genome has helped explain the growth requirements of M1 for
acetate, 2-methylbutyrate and co-enzyme M (CoM) (Bryant et al., 1971). Acetate
is
required for cell carbon biosynthesis after activation to acetyl CoA (acs,
acsA), followed
by reductive carboxylation to pyruvate (porABCDEF, Table 9, below). Reductive
carboxylation of 2-methylbutyrate is probably the route for isoleucine
biosynthesis
(Robinson & Allison, 1969), as M1 lacks a gene encoding a homoserine kinase
needed ,
for the usual pathway from threonine (Table 9, below). Exogenously supplied
CoM is
essential for M1 growth as two genes needed in the CoM biosynthetic pathway,
=
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
phosphosulfolactate synthase (comA) and sulfopyruvate decarboxylase (comDE)
(Graham et al., 2002), are missing in M1 (Figure 11).
=
5 Cell envelope
The methanogen cell envelope serves as the interface between the organism and
its
rumen environment, and as such represents a key area for the identification of
vaccine
and drug targets. The main structural component of the cell envelope of M1
(Figure 12),
as with other Gram-positive methanogens, is pseudomurein. This is structurally
10 analogous, but chemically different, to peptidoglycan, which performs
the comparable
function in bacteria (K6nig et al., 1994). Bacterial peptidoglycan
biosynthesis has long
been a major target of antimicrobials but these compounds are largely
ineffective
against pseudomurein-containing cells (Kandler & Konig, 1998). The pathway for
pseudomurein biosynthesis and its primary structure have been proposed
(Kandler &
15 KOnig, 1998), but the enzymes involved have not been characterized. Our
genomic
analysis has identified several genes encoding enzymes likely to be involved
both in the
intracellular biosynthesis of the pseudomurein precursors and the processes
involved in
exporting and assembling these into the cell wall (Figure 5).
20 The original description of M. ruminantium reported the existence of a
capsule
surrounding the cells, and chemical analysis of the cell walls showed that
galactose and _
rhamnose together with lower amounts of glucose and rpannose were present in
addition to pseudomurein (Kandler & Konig, 1978; Kandler & Konig, 1985). The
cell
walls are also reported to contain high levels of phosphate, comparable to
that found in
25 bacterial cell walls containing teichoic acid (Kandler & KOnig, 1978).
M1 contains
homologs of genes involved in teichoic acid production in Gram positive
bacteria
(Bhaysar & Brown, 2006; Weidenmaier & Peschel, 2008) (Table 9, below),
suggesting
the presence of as-yet unidentified cell wall glycopolymers. Additionally,
several genes
are predicted to be involved in exopolysaccharide production, sialic acid
biosynthesis
30 and protein glycosylation (Table 9, below). The genome contains a
homolog of the
eukaryal oligosaccharyl transferase (mru0391), a membrane protein believed to
be
involved in glycosylating proteins translocated via the Sec pathway (Yurist-
Doutsch,
2008) (Figure 12). Glycoproteins derived from the cell wall of M1 have been
shown to be
highly immunogenic in sheep. The resulting antisera agglutinated M1 cells and
35 significantly reduced their ability to grow and produce methane in vitro
(Wedlock et al.,
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
91
in preparation). Overall, polysaccharides and glycosylated molecules are a
major
component of the M1 cell envelope, and their accessibility at the cell surface
make
these polymers viable methane mitigation targets.
Genomes of human gut methanogens encode large surface proteins that have
features
similar to bacterial (Fricke et al., 2006; Samuel et al., 2007) adhesins.
Similarly, M1 has
an array of large adhesin-like proteins, much greater in number than those
reported
from other gut methanogens (Table 1, below). In the co-culturing experiments
described
above, six M1 adhesin-like proteins were upregulated (Table 5, below), and
microscopic
examination showed co-aggregation of M1 and B. proteoclasticus cells (Figure
9). In
addition, immune sera produced by small peptides synthesized to correspond to
four
M1 adhesin-like proteins were shown to bind specifically to immobilized M1
cells (Figure
17). Identifying highly conserved methanogen-specific features of these
adhesin-like
proteins may present a pathway to vaccine development Sixty two adhesin-like
proteins are predicted to be extracellular and contain a cell-anchoring domain
(Figure
12). These proteins represent a significant component of the M1 cell envelope
(Table
3). The largest group of these (44) contain a conserved C-terminal domain (M1-
C,
Figure 4) with weak homology to a Big_1 domain (Pfam accession number PF02369)
which may be involved in attachment to the cell wall, possibly by interaction
with
pseudomurein or cell wall glycopolymers. Several of these proteins also
contain a
papain family cysteine protease domain (PF00112), and their role may be in the
turnover of pseudomurein cell walls.
A second group of 14 proteins is predicted to contain a C-terminal
transmembrane
domain, suggesting they are anchored in the cell membrane. Curiously, the
genome
contains one adhesin-like protein (mru2147) with a cell wall LPxTG-like
sorting motif
and three copies of a cell wall binding repeat (PF01473), both of which are
commonly
found in Gram-positive bacteria. There has only been one other report of a
LPxTG-
containing protein in a methanogen, the pseudomurein containing Methanopyrus
kandleri (Boekhorst et al., 2005). Our analysis of the M. smithii PS genome
revealed
the presence of two LPxTG containing proteins (msm0173 and msm0411). Such
proteins are covalently attached to the cell wall by membrane associated
transpeptidases, known as sortases. Sortase activity has been recognised as a
target
for anti-infective therapy in bacteria (Maresso & Schneewind, 2008) and a
sortase
(rnru1832) has been identified in the M1 genome.
CA 2772224 2017-03-13
92
=
Prophage
Phage exert a significant ecological impact on microbial populations in the
rumen, and
have been suggested as biocontrol agents for rumen methanogens (Klieve &
Hegarty,
1999). Ml has 70 ORFs (m6.10256-0325) over a 62 Kb GC-rich (39% G-FC content)
region of the genome that encode a prophage genome, designated cpmru. Based on
a
functional annotation, cpmn., is partitioned into, distinct modules encoding
integration,
DNA replication, DNA packaging, phage capsid, lysis and lysogenic functions
,(Attwood
et al., 2008). Within the lysis module, a gene encoding a putative lytic
enzyme,
endoisopeptidase PeiR (mru0320), was identified. Recombinant phage lytic
enzymes
have been used for controlling antibiotic-resistant bacterial pathogens
(Hermoso et al.,
2007), and a methanogen phage lytic enzyme may be a viable biocontrol option.
We
have confirmed the ability of recombinant PeiR to lyse MI cells in pure
culture (WO
09041831A1) (Figure 8). PeiR represents a novel enzyme, as it does not show
significant homology to any sequence currently in public databases. The
variety of
methanogen cell wall types means a combination of different lytic enzymes will
be
required for effective methanogen lysis in the rumen. However, the expression
of Pelf?
and demonstration of its' effectiveness against a major rumen methanogen is an
important step towards this goal. PeiR will also be useful in increasing the
permeability
of pseudomurein-containing cell walls of methanogens to aid development of
genetic
_ systems for performing gene knockouts to validate targets, while the
cpmru phage itself
might be useful as a genetic tool for M. ruminantium. The prophage has also
been
disclosed in detail in US 60/989,840 filed 22 November 2007, and in
PCT/2008/000248
filed 25 September 2008.
=
Non-ribosomal peptide synthetases
= An unforeseen and novel feature of M1 is the presence of two large
proteins (mru0068
and mru0351) showing the distinctive domain architecture of non-ribosomal
peptide
synthetases (NRPS) (Figure 7). To our knowledge, this is the first report of
NRPS genes
identified in an archaeal genome. HGT studies indicate that these genes may be
bacterial in origin (Table 4, below). NRPSs produce a wide variety of small
molecule
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
93
natural products that have biotechnological applications as peptide
antibiotics,
siderophores, immunosupressants or antitumor drugs (Amoutzias et al., 2008).
The
NRPS encoded by mru0068 is predicted to encode two modules, each containing
condensation, adenylation and thiolation domains. The presence of a
condensation
domain in the first module is often associated with NRPSs that make N-acylated
peptides (Fischbach and Walsh, 2006). The second module is followed by a
terminal
thioesterase domain which is thought to release the peptide from the final
thiolation
domain. Mru0068 is surrounded by genes that encode two serine phosphatases
(mru0066, mru0071), an anti-sigma factor antagonist (mru0067), and a MatE
efflux
family protein (mru0069), which are likely to be involved in environment
sensing,
= regulating NRPS expression and export of the NRP, respectively. Mru0068
displays full
= length protein alignment with a putative NRPS from Syntrophomonas wolfei
subsp.
wolfei strain Gottingen (Figure 18), a Gram-positive bacterium known to
participate in
syntrophic interactions with methanogens (McInerney et al., 1979).
The second NRPS gene (mru0351) contains 4 modules and a thioesterase domain.
Downstream of mru0351 is another MatE efflux family protein (mru0352),
presumably
involved in NRP export. A third, smaller cluster of genes located elsewhere in
the
genome (mru0513-0516) appear to encode NRPS-associated functions. This cluster
.includes a 4'-phosphopantetheinyl transferase (mru0514) which primes NRPSs by
adding a phosphopantetheinyl group to a conserved serine within the thiolation
domain,
an acyltransferase (rnru0512) possibly involved in NRP acylation, a serine
phosphatase
(mru0515), an anti-sigma factor antagonist (mru0513), and an anti-sigma
regulatory
factor serine/threonine protein kinase (mru0516) that may function in sensing
the
environment and NRPS regulation. Although the products of each NRPS are
unknown,
s an analysis of adenylation domain amino acid sequences predicts 10 residues
(boxed,
Figure 7) which are important for substrate binding and catalysis. HGT studies
indicate
that these genes may be bacterial in origin (Table 4).
Thus, several genes in M1 are possibly involved in sensing the environment,
and in the
regulation and transport of the NRP (Figure 7) and such genes are also present
in the
S. wolfei genome. Although the actual roles of these genes have not been
defined
conclusively, NRPs are known to contribute to microbial growth and ecological
interactions, thus may provide a means to manage methane emissions from
livestock.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
94
Comparative genomics analysis
To functionally compare M. ruminantium to other methanogens, 25 publicly
accessible
complete and draft phase genome sequences were subjected to a Functional
Genome
Distribution (FGD) analysis (Altermann, submitted) (Figure 13). Together with
Methanobrevibater smitbii, Methonsphaera stadtmanae, and Methanothennobacter
thermoautotrophicus, M. ruminantium formed a functional cluster. Within this
cluster, a
large number of predicted genes were found to be conserved (low e-value cutoff
le-
100, Table 8, below). The majority of these conserved genes were classified
into core
biological categories, such as amino acid biosynthesis, cell cycle, central
carbon
metabolism, nucleic acid metabolism, protein fate and synthesis and purine and
pyrimidine biosynthesis. Similarly, genes involved in energy metabolism,
especially in =
the methanogenesis pathway were commonly shared within this functional
cluster.
Most notably, only one fourth of this gene set was found to be conserved when
compared with other functional clusters. A similar observation was found for
conserved
genes involved in the biosynthesis of vitamins and co-factors. 23 genes were
found to
be highly conserved within the M. ruminantium functional cluster, whereas only
two
genes involved in cobalamin and thimaine biosynthesis were shared with
methanogens
outside this cluster, respectively. It is also interesting to note that, apart
from
Methanopyrus kandleri, pseudomurein containing methanogens have been
functionally
grouped together in the M. ruminantium cluster. This is clearly reflected in
the 14 highly
conserved genes involved in pseudomurein and exopolysaccharide biosynthesis.
Outside this functional cluster Only one gene, a glucosamine-fructose-p-
phosphate
aminotransferase, GlmS2, was found to be conserved at this level.
While conserved gene sets cause functional clustering, strain or cluster-
specific genes
are responsible for differentiation. When compared to all other members of its
own
cluster, M. ruminantium was found to harbour 468 strain specific genes (high e-
value
cutoff le-10). While the vast majority (341 genes) was assigned to genes with
hypothetical or unknown functional categories, a significant number of genes
were
identified assigned to relevant biological functions. Mobile elements such a
transposases and prophage elements were identified as strain specific to M.
ruminantium. Strain-specific genes involved in the production of
exopolysaccharides
and cell surface proteins are likely to endow M. ruminantium with surface and
adhesion
properties distinctly different from other members of the M. ruminantium
functional
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
cluster. Similarly, ten unique transport systems were identified ranging from
generic
multidrug transporter to predicted pH homeostasis systems. Also 22 regulatory
proteins
(16 involved in protein interaction and five transcriptional regulators) were
detected.
When extending the search for strain specific genes outside its own functional
cluster,
5 the number of identified ORFs dropped down significantly. Only three
genes coding for
adhesion-like proteins and two genes involved in the production of
exopolysaccharides
(a sialyltransferase and a glycosyl transferase) were found to be M.
ruminantium
specific.
10 Comparing pseudomurein producers (PMP) to all other methanogens using a
relaxed
parameter set (low e-value cutoff le-60; high e-value cutoff le-10; mismatch
tolerance
2) revealed an interesting set of genes functionally specific to this subset.
A number of
genes involved in pseudomurein biosynthesis were identified to be functionally
specific
to PMPs. In addition, two genes coding for adhesion-like proteins, four genes
predicted
15 to be involved in exopolysaccharide production and one gene coding for a
poly-gamma-
glutamate biosynthesis protein, likely to be involved in capsule synthesis,
were found to
be cluster specific. These genes coding for cell surface structures represent
prime
targets for vaccination based methane mitigation strategies, as their gene
products are
likely to reflect unique structures with the potential to induce specific
antibodies.
20 Interestingly, a gene coding for the energy-converting hydrogenase A
subunit R was
found to be PMP specific (including M. thermoautotrophicus and M. kandlen).
This gene
product is involved in electron transfer by reducing a ferredoxin. The Eha/Ehb
complex
is already being targeted for the development of a methanogen vaccine (Figure
11).
25 Currently a bias exists for pseudomurein producing methanogen genomes. With
the
exception of Methanothermobacter thennoautotrophicus (isolated from sewage
sludge)
and Methanopyrus kandleti (isolated from a submarine hydrothermal vent) all
PMP
methanogens were isolated either from rumen, the gastrointestinal tract or
from faeces.
It stands to reason that these closely related ecological niches facilitate
common
30 .. lifestyle adaption events. Such an event was detected in the presence of
a highly
conserved bile salt hydrolase which is predicted to hydrolyse the amide
linkage
between the bile acid carboxyl group and the glycine or taurine amino group.
The
presence of a bile salt hydrolase explicitly implies that M. ruminantium (and
other rumen
methanogens) is well adapted for a passage from the rumen environment through
to the
35 gastrointestinal tract. Functionally conserved oxidative stress response
genes such as
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
96
rubredoxin rub1 further suggest a certain tolerance of methanogens to aerobic
environments and a life-cycle from oral intake, biological activity in rumen
and
gastrointestinal tract ecologies, excretion into the aerobic environment and a
subsequent ¨ potentially time limited ¨ waiting period for the next oral
intake event may
be proposed. How strictly anaerobic organisms such as methanogens survive
exposure
to high stress levels remains subject to further evaluation.
A recently published phylogenetic tree based on seven core enzymes (Miller,
2001)
deviates in part significantly from the functional clustering shown in Figure
13. Anderson
et al. propose three classes of methanogens, based on the resulting
phylogenetic
approximation. In contrast, the FGD analysis revealed the presence of only two
major
functional clusters which can each be split into further sub-clusters (Figure
13). Most
notably, Methanocorpusculum labreanum, Methanococcoides budonii and
Methanosaeta thennophila form a distinct functional cluster under the FGD
analysis
(Figure 13, sub-cluster 1.3) but are separated into Class II and Class III,
respectively,
based on the phylogenetic analysis. These differences clearly highlight the
importance
of whole genome analyses to address lifestyle adaptation processes (as
described
above) and genomic plasticity within a biotechnological context.
To investigate the broader phylogenetic placement of M. ruminantium and assess
the
potential level of horizontal gene transfer between archaea and archaea and
eubacteria, a Blast Heat Map analysis was performed based on the non-redundant
amino-acid database (Figure 16). Significant heat flares were detected within
archaea
for the genera of Methanococcus, and, to lesser degrees for Methanosarcina and
Thermococcus (Figure 16A). Surprisingly, considerable levels of high sequence
similarities were detected for the eubacterial genera of Clostridium and
Bacillus,
commonly found in ecological niches inhabited by M. ruminantium. These heat
flares
infer a profound level of shared and functionally similar gene sets and
fortify the notion
that genetic exchange between microbial domains is a common event, likely
driven by
lifestyle adaption forces. Analysing the relative quality of sequence
similarity levels
revealed a core set of genes commonly shared among phylogenetically diverse
methanogenic archaea (Figure 16B). Interestingly, individual heat flares of
other
archaea such as Halobacteria sharing highly conserved gene sets to M.
ruminantium
appear similar in shape than flares observed for Clostridium and Bacillus,
further
strengthening the model of genetic exchange between domains. Interestingly, a
more
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
97
detailed analysis of the M. ruminantium M1 ORFeome to bacterial sequences
highlighted a series of gene sets not commonly found in other
Methanobacteriales such
as genes involved in biotin and cobalamin biosynthesis (Table 8, below).
Based on these observations we conducted a differential Blast Analysis (DBA)
using the
non-redundant database and the 26 methanogen genomes as references. While a
DBA
analysis is more liberal than FGD, it is able to highlight gene products
present in at least
one methanogen genome comprising dbMethano but not present in any other
organism
(nr database) and vice versa. Therefore, gene sets found to be present, in
methanogens
but not in other microbes might represent prime targets for methane mitigation
strategies. Using a minimum DBA value of four, 117 gene targets were
identified to be
conserved between M. ruminantium and methanogens but not present in other
organisms. Notably, 21 genes predicted to be associated to the cell envelope
were
identified, including cell surface proteins and genes involved in
exopolysaccharide and
pseudomurein biosynthesis. One gene coding for a polysaccharide biosynthesis
protein
and two adhesion-like proteins were among the most prominent targets (DBA
value of -
6). All three genes are involved in cell-cell interaction and could represent
targets for
methanogen modulation. Interestingly, a single adhesion like protein was
identified near
the origin of DNA replication, conserved between M. ruminantium and other
organisms
but absent in methanogens (Figure 10).
Closest hits were found to Coprococcus eutactus (Acc: ZP_02205388.1, isolated
from
human faeces), Streptococcus sanguinis (Acc: YP_001036038.1, isolated from
human
dental plaque), Peptostreptococcus micros. (renamed as: Patvimonas micra, Acc:
ZP_02093886.1, isolated from human gut), Arcanobacterium pyogenes (Acc:
AA043108.1, commonly found on mucosal surfaces of livestock) and Bacillus
weihenstephanensis (Acc: YP_001647931.1, human pathogen). All of those
organisms
are able to interact with either animal or human hosts and it is tempting to
speculate
that this adhesion reflects a specific lifestyle adaptation of M. ruminantium
to the rumen
and, possibly, to the gastrointestinal tract environment. This clearly
strengthens the
previous hypothesis modelled on the Blast Heat Map proposing a significant
level of
genetic exchange between M. ruminantium and certain genera of Eubacteria.
Similarly,
two adjacent glycosylhydrolases of the GT2 family were found to be shared in a
similar
way. For both proteins, top Blast hits link to Clostridia and Bacilli while
their function
was associated to the transfer of sugar units to teichoic acids which in turn
are
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
98
covalently linked to the cell wall thus providing M. ruminantium with a
methanogen-
unique outer cell-surface structure.
Genes predicted to be Involved in hydrogen metabolism and methanogenesis such
as
the energy-converting hydrogenase B (Ehb) and the tungsten formylmethanofuran
dehydrogenase (Fwd) were found to be methanogen specific. Also, another gene
involved core functions, a Fibrillarin-like archaeal protein, commonly thought
to
participate in processing pre-ribosomal rRNA, was found to be methanogen
specific. =
This is interesting, as only methanogen genera were excluded from the non-
redundant
database, but not other archaea. Therefore, this gene might present an
opportunity to
target methanogens at an essential function. An in-depth analysis of the
presence/absence of this gene in the methanogen genomes revealed an
interesting
distribution pattern. With the exception of Methanospirillum hungatei members
of
functional cluster 4 (Figure 13) do not harbours this gene, while cluster 3
predominantly
features this archaeal gene, with the notable exception of Methanococcoides
burtonii,
Methanosaeta thermophila and Methanosphaera stadtmanaa It might be noteworthy,
that within methanogens this gene is either absent or present, creating a
highly
conserved target with low genetic drift. Although not universally conserved,
this gene
might offer the opportunity to specifically target the majority of cluster 3
methanogens,
thus including those of rumen origin, for methane mitigation strategies.
Identification of targets for methane mitigation
Several approaches were used to define potential gene targets from M1 for CH4
mitigation via chemogenomic and vaccine approaches (Figure 1). Genes suitable
as
chemogenomics targets were identified using a combination of metabolic
profiling,
review of the literature pertaining to the biochemistry and physiology of
methanogens,
and comparative genomics. The 33 candidate genes commonly identified by these
approaches are shown in Figure 1A. The full list of ORFs identified as
chemogenomic
targets by metabolic profiling of M1 and literature can be found in Table 5.
Comparative
studies were based on M1 and 26 complete and draft phase methanogen genome
sequences, using a functional genome distribution (FGD) analysis (Table 3,
Figure 13).
This analysis of whole genome gene conservation among methanogens showed that
M1 and other members of the Methanobacteriales formed a functional cluster
that
shared a large number of conserved genes predicted to be involved in core
biological
functions (low e-value cut-off 1e-100, Table 3). In addition, a differential
blast analysis
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
99
(DBA) was conducted using the non-redundant (nr) database and a methanogen
genome sequence database (dbMethano). The DBA analysis highlighted genes
present
in at least one methanogen genome within dbMethano but not present in any
other
organism within the nr database and vice versa (Figure 10), thus identifying
methanogen-specific genes. The majority of the 33 selected conserved and
methanogen-specific genes encode enzymes that fall within the energy
metabolism
category, mainly within the methanogenesis pathway (Table 9). This also
included
several methanogenesis marker proteins found in methanogen genomes, but
currently
without defined function. Most of these methanogenesis enzymes are located
within the
cell cytoplasm, and therefore have been tagged as key targets for inhibitor
discovery via
a chemogenomics approach (Figure 11).
The alternative approach of inducing the ruminant immune system to produce
salivary
antibodies against conserved features of rumen methanogens is an attractive
methane
mitigation strategy. The rumen epithelium is not immunologically active, and
rumen
contents do not contain complement proteins, therefore specific immune
responses in
the rumen do not occur. The effectiveness of a vaccination approach relies on
the
binding of salivary antibodies to methanogen surface features, which results
in their
=
inactivation or clearance from the rumen. Vaccines are typically composed of
proteins
=or polysaccharides derived from killed or attenuated whole cells or
components
presented on the outside of the cell such as flagella, capsules, cell walls,
fimbrae, or
secreted toxins. In the case of rumen methanogens, the primary vaccine targets
are
likely to be surface-exposed or membrane-associated proteins that are
conserved
among methanogens or archaeal species and which encode functions vital to
methanogen growth and survival in the rumen. In silk analysis of the M1
ORFeome
(all ORFs) identified an initial pool of 572.0RFs containing one or more
transmembrane
helices (TMH) or signal peptide (SP) indicating a cell membrane or cell
surface location
and therefore potential vaccine targets. Those ORFs with a top BLAST hit to a
non-
methanogen or with no homology to the nr database were removed from the
analysis,
as were transposase sequences (which are unlikely to represent good vaccine
targets),
while adhesin-like ORFs are dealt with separately above. This gave a new total
of 337
ORFs. Examination of the remaining 337 ORFs, assessing their predicted
function,
degree of conservation among methanogens and the nature of their transmembrane
structures, refined the list to 71 ORFs (Figure 1B). Heterologous expression
of
membrane proteins with more than 4 TMHs has been difficult in RV studies of
other
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
100
microbes (Vivona et al., 2008). Therefore, a cut-off of 4 THMs was applied to
define
two final groups: Group A with 47 ORFs with 4 or fewer TMHs suitable for
cloning and
heterologous expression studies; and Group B composed of 24 ORFs with more
than 4
TMHs more suitable for a synthetic peptide-directed vaccine approach.
The majority of vaccine targets identified above correspond to hypothetical
proteins of
unknown function. While these ORFs are presumed to be of value to M1, their
importance to M1 growth and survival in the rumen is not evident, and
therefore they
are of lower priority as vaccine candidates. Of the remaining ORFs, those
involved in.
energy metabolism are again prime vaccine candidates (Figure 11). Of
particular
interest is the Mtr enzyme complex, which catalyses the essential methanogen
function
= of transferring the methyl group from methyl-H4MPT to CoM, coupled to the
efflux of
Na + ions (Lienard et al., 1996). Three of the Mtr subunits (MtrEDC) are each
predicted
to have >4 membrane-spanning regions and, in each of the membrane-spanning
regions, the transmembrane helices have peptide loops located outside the cell
membrane. These loops are potential antibody binding sites. We synthesised
peptides
corresponding to the loop regions of MtrE, MtrD and Mtr C which were coupled
to a
carrier protein and then used as antigens to vaccinate sheep. The resulting
immune .
sera bound specifically to immobilized M1 cells (Figure 17), demonstrating the
feasibility
of such a peptide-directed RV approach.
Vaccine target identification results
TMHMM predicted 542 ORFs to contain one or more transmembrane (TM) domains,
243 of which are also predicted to contain a signal peptide (SP). A further 30
ORFs
were predicted to contain a signal peptide but no TM domain. This gave a total
pool of
572 ORFs as potential vaccine candidates for the M. ruminantium Ml genome.
Blast
analyses revealed ORFs that had a top blast hit to a non-methanogen or had no
homology to the NR database. These ORFs were removed from the analysis at this
point as were transposase sequences (which are unlikely to represent a good
vaccine
target) and adhesin-like ORFs which were dealt with separately. This gave a
new total
of 339 ORFs. Those genes which are presently only found in M. ruminantium M1
and
provide important information about rumen methanogens.
Methanogens are strict anaerobes and are thus present challenges for culturing
in vitro
and genetic techniques for validating both drug and vaccine targets
particularly the
=
CA 2772224 2017-03-13
101
=
Methanobacteria with their thick cell walls are still being established. As
such, a further
manual analysis of the 339 ORFs functions was undertaken re-examining their
functional annotation, their conservation among methanogens and the prediction
of their
transmembrane structures and subcellular location which refined the target
list to 71
ORFs which are presently specific to methanogens or archaea. Because-
expression
can be challenging for membrane proteins, a cut-off of four TM domains was
applied to
= expression studies of M. ruminantium M1 (multiple TMs can be indicative
of a surface
exposed protein, but conversely, they are known to impair heterologous
expression in
Escherichia coh) This resulted in a final two groups; Group A ¨47 vaccine
targets (less
than or equal to 4 TM) suitable for further cloning and expression studies and
Group B ¨
24 -vaccine targets (greater than 4 TM) more suitable for a synthetic peptide
directed
vaccine approach. The adhesinTlike ORFs were treated as a separate vaccine
target
list Vaccine targets have also be disclosed in detail in US 60/989,841 filed
22
November 2007, and in PCT/2008/000249 filed 25 September 2008.
EXAMPLE 3: Overview
Genome sequencing
Methanobrevibacter ruminantium was chosen for genome sequencing because of its
prevalence in the rumen under a variety of dietary conditions (based on
cultivation and
molecular detection data), the availability of cultures, its amenity to
routine growth in the
laboratory, and the relatively large amount of previous studies and background
literature
available for this organism. A significant number of the genes within the M.
ruminantium
have been assigned a function, and have thereby allowed a detailed picture of
this
organism's lifestyle within the rumen. M. ruminantium's dependence on simple
substrates (H2 + CO2, formate) and its interaction with the rumen environment
via
surface proteins and exopolysaccharides are important targets for inhibition.
Similarly,
the SPIDRs hold promise for both specific targeting of M. ruminantium and for
future
genetic manipulations to assist in determining gene function. The sequence
data
elucidates the metabolism of this organism and how it interacts with other
microbes,
and points to conserved systems and components among methanogens that can be
inactivated to prevent or reduce methane formation in the rumen.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
102
Drug discovery for reducing rumen methane emissions
The completion of the M. ruminantium genome now enables the full power of `-
omic'
approaches to be used for developing novel drugs for reducing methane
emissions. A
rational drug discovery approach to the development of effective methane
mitigation
agents, analogous to the established approaches utilised for developing novel
antibiotics against human pathogens, should proceed through several stages
including:
target identification; target/pathway validation; lead identification;
verification of
effectiveness of lead in rumen-like conditions; and ultimately, testing in
animal trials
(Gerdes et al. 2006; Pucci 2006; Galperin 2007). A priori, it is important to
know the
breadth of methanogen diversity in the rumen, and this has recently been
summarised
in a meta-analysis incorporating data from 14 phylogenetic studies (Janssen
and Kirs,
2008). The study has shown that the vast majority of rumen archaea (92.3%)
fall into
three clades, the genera Methanobrevibacter (61.6%) and Methanomicrobium
(14.9%)
and an uncultured group termed rumen cluster C (15.8%) of as yet unknown
physiology
(Janssen and Kirs, 2008). Methanogens typically only account for approximately
1-4%
of the total microbial community (Janssen and Kirs 2008).
Inter-genome comparisons of representative organisms, including relevant gut
methanogens (e.g., Methanobrevibacter smith!!, Methanospirillim hungatei and
Methanosphaera stadtmanae), other rumen microorganisms (bacterial, fungal and
prozoal) and mammals can aid in the identification of suitable targets (Samuel
et al.
2007; Fricke et al. 2006). Overall, rumen methanogens should be somewhat
easier to
inhibit than bacteria, fungi or protozoa given that they are the only resident
archaea in
the rumen and are well known to possess several unique traits including
distinctive
cofactors, cell wall chemistries and lipids. Furthermore, they tend to have
smaller
genomes with less metabolic capability, tend to be less adaptable with fewer
regulatory
systems and are probably less able to develop resistance to drugs.
Given the diversity of methanogens in the rumen, the aim of developing a full
spectrum ,
anti-methanogen 'magic bullet' for complete mitigation of emissions would
necessitate
the targeting of enzymes that are essential to all rumen methanogens under
normal
rumen growth conditions. However, partial inhibition may-also be desirable for
extended
periods of time, due to the potential decrease in the efficiency of feed
conversion
resulting from feedback inhibition of ruminal fermentation (Hegarty 1999;
Russell and
Rychlik, 2005). Significant partial reductions in methane emissions which
might be more
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
103
sustainable could also possibly be achievable by limiting the targeting to
specific
phylogenetic groups, such as the dominant Methanobrevibacteria. Methanogens
grow
slowly compared to rumen bacteria and are prone to being flushed out of the
rumen
(Janssen and Kirs 2008).
Analysis of the M. ruminantium genome, combined with the genome comparison and
a
consideration of the literature that has identified archaeal/methanogen-
specific enzymes
or has demonstrated the essentiality of enzymes/pathways has allowed us to
generate
a list of targets of interest (Tables 2 and 3, below). General targets areas
include the
methanogenesis pathway, energy metabolism, transcription, protein synthesis,
cell wall
synthesis, lipid synthesis, cofactor synthesis, and some key central carbon
metabolic
enzymes that are important links between essential pathways. Target
prioritisation for
introducing enzymes into the work stream is based on the ultimate aim of
obtaining
enzymes for high-throughput screening and crystal structure determinations for
in silico
lead identification. Targets are spread over multiple susceptible pathways to
minimise
risk. Prioritisation takes into consideration the presumed essentiality of
target, the
expected ease of expression of the proteins (e.g., size, number of subunits
and
presence of transmembrane domains), availability of assays, expected
`druggability'
and availability crystal structures of homologous enzymes (Pucci, 2006;
Hopkins and
Groom, 2002).
There are several factors relevant to the development of future small molecule
inhibitors
of methanogens. These include that they should have minimal toxicity to the
host
animals, minimal accumulation or toxicity in any downstream products for human
consumption, minimal deleterious effects on beneficial microorganisms
responsible for
normal fermentation in the rumen, and minimal downstream environmental impact.
Ideally, they should also be inexpensive, given the current cost of carbon, be
impervious to the large hydrolytic capacity of the rumen and should have
reduced
potential for allowing resistance to develop amongst the rumen nnethanogens.
In the
best circumstances, the concentration of inhibitor required should be low
enough to
prevent any rumen microbes from utilising the inhibitor as a substrate for
growth and
minimise overall costs (Weimer, 1998). It would be very beneficial if future
anti-
methanogen compounds that satisfy the above criteria can also help to reduce
methane
emissions emanating from other sources such as rice paddies. The goal of
providing
long term reductions in rumen methane emissions over decades will likely
require a
104
suite of non over-lapping anti-methanogen compounds. Compounds that are shown
to
. inhibit an enzyme in an in vitro high-throughput assay should sequentially
be verified for
their effectiveness in pure culture experiments, followed by in vitro rumen
simulations,
and finally in short term and long term animal trials.
In the last decade there has been growing concern, as evidenced by recent EU
legislation banning their use (McAllister and Newbold, 2008), about the use of
antibiotics and other feed additives as growth promoters largely due to the
development
of antibiotic resistance. Consequently, even though methanogens are usually
not
thought to be directly implicated as a causative agent in pathogenic disease
(Macario
and Macario, 2008), it may be wise to avoid over-reliance on anti-methanogen
agents
that seek to inhibit enzymes that are common between them such as those in
pseudomurein and peptidoglycan synthesis. Significantly, approximately two-
thirds of
antibiotics in use today against pathogenic microorganisms target cell wall
synthesis,
many of these the transpeptidation reaction catalysing the closure of
bacterial
peptidoglycan, but enzymes that catalyse earlier steps are being increasingly
investigated as alternatives (Hammes et al. 1979). One of the main reasons for
this is
that the final steps of cell wall synthesis occurs extracellularly and are
therefore the
drugs are less prone to inactivation.
The recent development of an anti-Mycobacterium tuberculosis drug that targets
the E
= subunit of the FoFi-ATPase that is also present in humans highlights the
fact that even
relatively small differences in structure could possibly be exploited to
discover novel
anti-methanogen compounds (Andries et at. 2005). The implication of this
finding is that
many essential methanogen enzymes that are not methanogen-specific. Targets
that
share <30-40% identity with their bacterial or eukaryal counterparts could
ultimately be
exploited for designing effective inhibitors.
=
, The number of enzymes or pathways in methanogens demonstrated to be
essential to
help guide prioritisation of targets is relatively limited. Historically, the
methanogenesis
pathway itself has been targeted in numerous experiments using halogenated
compounds such as chloroform that inhibit the terminal step of methanogenesis
catalysed by methyl coenzyme M reductase, but these are not sustainable due to
=
toxicity and environmental concerns.
=
CA 2772224 2018-05-04
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
105
In only a couple of other cases has a methanogen enzyme inhibitor been used to
check
whether rumen methanogen isolates hare also inhibited in pure culture, for
example the
targeting of the RFA-P synthase (Dumitru et al. 2003) and HMG CoA reductase
with
statins (Miller and Wolin, 2001). In addition, some additional understanding
of growth
characteristics of methanogens in the rumen may be needed. For example,
researchers
are still determining the extent to which rumen methanogens utilise amino
acids in the
rumen, which vitamins are used or enhance growth or the degree, if any, to
utilise
purines and pyrimidines. Thus at the moment, it may less desirable to target
enzymes
dedicated to the synthesis of amino acids, purines or pyrimidines.
The analysis of the M1 genome has provided new perspectives on the lifestyle
and
cellular processes of this prominent rumen methanogen. The genome sequence
confirms the hydrogenotrophic lifestyle of M1 and gene expression data
indicate that
= formate may be an important substrate for methanogenesis during
syntrophic
interaction with B. proteoclasticus. The ability of short chain alcohols to
stimulate growth
on H2 but not support growth themselves is intriguing. We speculate that
methanol or
ethanol are oxidised by the NADP-dependent alcohol dehydrogenases and the
reducing
= potential used to form F4201-12 using NADPH-dependent F420 dehydrogenase,
thus
augmenting the cellular pool of F420H2. This metabolism of alcohols could
spare some of
the H2 or formate normally used to produce F420112 and would explain the
stimulation of
- growth by alcohols in the presence of H2. The lack of a means of reducing
ferredoxins
with electrons from alcohols would explain why growth is not possible on
alcohols
alone. Further work will be required to test this hypothesis.
The abundance of genes encoding adhesin-like proteins in M1 indicates a
significant
ability to modulate cell surface topology. While the exact role of these
proteins is
currently unknown, initial observations from co-culture experiments indicate
that at least
some are involved in mediating close associations with hydrogen-producing
bacteria in
the rumen and others may be concerned with similar interactions with rumen
protozoa
and fungi.
The cpmru prophage sequence within the M1 genome yielded the PeiR enzyme which
is
able to lyse methanogen cells. The variety of methanogen cell wall types means
a
combination of different lytic enzymes would be required for effective
methanogen lysis
in the rumen. However, the expression of PeiR and demonstration of its
effectiveness
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
106
against a major rumen methanogen is an important step towards this goal. The
PeiR
enzyme and the pmru phage may also be useful in increasing the permeability of
M1
and other pseudomurein-containing methanogens to facilitate DNA entry and for
developing tools for genetic manipulation of Ml.
Methanogens are not known as producers of secondary metabolites, so the
discovery
of two NRPS genes was surprising, and to our knowledge, they are the first
reported in
an archaeal genome. Non-ribosomal peptides (NRPs) are known to contribute to
microbial growth and ecological interactions and therefore their function is
of interest as
they could lead to a means of modulating methanogen growth.
The metabolic profiling and comparative genomics carried out in this study
identified
several sets of conserved, methanogen-specific genes that are currently being
investigated further in our laboratory. Chemogenomic targets are being
investigated via
heterologous expression of genes in Escherichia colt coupled with the
development of
bioassays for screening these enzymes against libraries of chemical compounds
to find
specific inhibitors with efficacy at low concentrations. Vaccine candidate
proteins with
<4 TMHs are being investigated via heterologous expression in E. colt and
vaccination
of sheep. We have also shown the use of synthetic peptides in a reverse
vaccinology
approach to elicit specific antibody responses against M1 proteins with >4
TMHs. This
demonstrates that membrane-embedded M1 proteins, that are unlikely to be
amenable
to expression in a heterologous host, are viable targets as vaccine antigens.
A wider representation of rumen methanogen genomes will be essential to verify
that
the selected vaccine and chemogenomics targets are conserved among other rumen
methanogens, and ensure a successful, long-term CH4 mitigation technology for
rumen. The wealth of biological information 'provided by the M1 genome
represents a
significant advancement for ruminant methane mitigation efforts, aimed at
identifying
anti-methanogen technologies with broad efficacy.
EXAMPLE 4: Aik ATP synthase cloning and characterisation
We designed and developed an over-expression system for the Alk-ATPase from
Methanobrevibacter ruminantium and Methanobrevibacter smithii to allow the
subsequent purification and characterisation of the AiAo-ATPase.
=
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
107
Methanobrevibacter ruminantium and Methanobrevibacter smithii PCR cloning
and introduction of Hexa-His tag by PCR overlap extension
To study the biochemical properties of the M. ruminantium Al-ATPase, we
prepared
inverted membrane vesicles and tested for ATP hydrolysis activity. We were
unable to
detect any significant levels of ATP hydrolysis activity from inverted
membrane vesicles
of M. ruminantium. Due to the limited amount of cells that can be prepared at
any given
time for M. ruminant/urn, We undertook a heterologous over-expression approach
to
produce the ArATPase. For this, we cloned the M. ruminantium ArATPase genes as
a
6.3 kb BamH1-Xbal PCR product into the expression vector pTrc99a, generating
plasmid pTrMbrA1. To insert the HIS-tag at the N-terminal of Subunit-A PCR
overlap
extension was conducted. Using this approach, we generated a clone named
pTrMbrA1HIS which contains the genes encoding for the M. ruminantium ArATPase
in
the E. coil expression vector pTrc99a, and introduced a Hexa-Histidine tag
onto the N-
terminal of Subunit A. We also cloned the A. genes with the Al genes to
construct a full-
length AlA. ATP synthase expression plasmid. This was named pTrMbbrA1AoHis9.
In
addition, we cloned the ArATPase of Methanobrevibacter smithfi as a 6.3 kb PCR
product into the E. coil expression vector pTrc99a generating the plasmid
pTrMbsA1.
To facilitate purification, we introduced a Hexa-Histidine tag onto the N-
terminal of the
M. smithfi Subunit-A by PCR overlap extension. This was named pTrMbsA1HIS. The
lists of primers and plasmids from this study are shown below.
Primers
Primer Name Primer
Modifications
MbrA1FWD AAATTTGGATCCGGAATCTTAGGTTAGGAGGTCAAT
(BamHI)
SEQ ID NO: 7590
MbrA1REV AAATITTCTAGATAACAAGCAAAATATGAATTGC
(Xbal)
SEQ ID NO: 7591
MbrA1HisFWD ATGCATCATCATCATCATCATAGAGGAACTCAAATGTATGAA HEXA-HIS TAG
MbrA1HisREV ATGATGATGATGATGATGCATCCCATCTGCGACGATAACAGG HEXA-H IS TAG
MbrA1His_MID TTAGACAAGTTCTTAGTCGACTCTG (Sal)
MbrAO REV AGAGACAATTTTATCTGCCCCAGAGCTCAT (Sac)
= MbrA1FWD
ATTTAATTACCATGGTGATTTATTATGGCA (Nco)
MbrA1ASeq 1 ATTGCAGGTCCTGTTATCGTC
MbrA1ASeq2 GGACATTCCACTTATTACCGC
MbrA1ASeq3 ACTTATCCGAACCGGTTACTC
CA 02772224 2012-02-24
WO 2011/025394 .PCT/NZ2010/000169
108
SEQ ID NO: 7592-7599
MbsA1FWD AAATTTTAAGGATCCAATCTGTATGAGCTCAG BamHI
MbsA1REV AAATTTGTCGACCAATTACACAAAAAGATGAGCCGTTAC Sall
MbsA1HisFWD ATGATTCATCATCATCATCATCATATCGAAGGAAAAATTATTA HEXA-HIS TAG
MbsA1HisREV AA HEXA-HIS
TAG
MbsA1_AatIIRE ATGATGATGATGATGATGAATCATTTAACCATCTCTACCCCAA (Aat11)
V TA
MbsA1Seq1 ATTTATCCACATATGGACGTCCTTTCCTTA
MbsA1Seq2 CCTCTGAAGGATCATCTGAT
MbsA1Seq3 AGCATIGCTICTGAAGGTGAA
GAGTAAACACTATTGGTACTA
SEQ ID NO: 7600-7607
Plasmids
Plasmid Name Details Features
pIrMbrA1 Methanobrevibacter
ruminantium ArATPase cloned into
expression vector pTrc99a as a 6.3 kb Bam/Xba fragment
pTrMbrA1HIS
Methanobrevibacter ruminantium ArATPase, cloned into Hexa-His Tag
expression vector pTrc99a as a 6.3 kb Bann/Xba
= fragment. N-terminal Hexa-Histidine tag on Subunit A
pTrMbrA1A0
Methanobrevibacter ruminantium AiAo-ATPase cloned Hexa-His Tag
into expression vector pTrc99a as a Nco/Xba fragment.
N-terminal Hexa-Histidine tag on Subunit A
pTrMbsA1 Methanobrevibacter
smithii Al-ATPase cloned into
expression vector pTrc99a as a 6.3 kb Bam/Sal fragment
pTrMbsA1HIS _ Methanobrevibacter ArATPase cloned into
expression vector pTrc99a as a 6.3 kb Barn/Sal fragment. Hexa-His Tag
N-terminal Hexa-Histidine tag on Subunit A
Over-expression and purification of pTrMbrAl HIS, and pTrMbsAl HIS as analysed
by Western Blotting
As noted above, we generated a clone named pTriVlbrAl HIS which contains the
genes
encoding for the M. ruminantium Al-ATPase in the E. coil expression vector
pTrc99a,
and introduced a Hexa-Histidine tag onto the N-terminal of Subunit A. We
expressed
the plasmid pTrMbrA1HIS in the E. coil strain DK8, and purified the expressed
protein
complex by Ni-affinity chromatography. We were able to detect Subunit-A via
both
Western and MALDI-TOF/TOF analysis. Subunit-A was found to be running at an
incorrect molecular mass of approximately 24 kDa compared to the 65 kDa
predicted
molecular mass. We found that this discrepancy was caused by a mutation within
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
109 =
SubunitLA. This mutation has now been corrected, and we have repeated the over-
expression and purification with the new construct. Furthermore, we have also
created
a clone to express the full length A1A0-ATPase in E. coli. The purification
and
characterisation of the Alk-ATPase from M. ruminantium has proceeded
accordingly.
- From Western analysis of the M. smithii over-expression, two dominant bands
can be
observed. One protein band runs above the 72 kDa marker, and the other runs at
approximately 33 kDa. The expected size of the M. smithii Subunit-A is 64.8
kDa,
therefore the protein band running at approximately 72 kDa is likely the Mbb.
smithii
Subunit-A, and the lower band running at 33 kDa is likely a breakdown product,
or
unassembled Subunit-A which still retains the HIS-tag. We are proceeding to
further
purify the Ni-affinity eluted fractions, through either PEG-fractionation or
gel filtration
= with the aim to remove the lower contaminating band, while still
retaining Al-ATPase
activity. We have assayed the ATPase activity of the eluted M. smithii Al-
ATPase and
found the sample to hydrolyse with a specific activity of approximately 0.4
Units/mg.
Mbb. ruminantium AIA, ATP synthase expression in a foreign host
The Mbb. ruminantium Aik-ATPase was expressed in E. coil strains DK8 (Aatp),
BL21
and C41 (Figure 24A-D). All strains showed a decrease in growth after
induction of
expression with 1 nnM IPTG. BL21 showed the best growth (Figure 24A) and
expression
of the Alk-ATP synthase and therefore was chosen as a suitable expression host
for
subsequent purification. Growth of the induced cultures (BL21, C41 or DK8/
pIrMbbrA1A0His9) is at a reduced rate compared to the non-induced control
culture in
all 3 strains (see Figure 24A-C), a phenomenon that is a good indication of
foreign
protein over-expression in E co/i. To examine the localization of the
recombinant AlAo-
ATP synthase in E coil the cell debris, cytoplasm and membranes were examined
by
SDS-PAGE and immunoblotting. Purified FiFo-ATPase (his-tag on p-subunit) from
TA2.A1 enzyme was used as a positive control (Figure 24D). The enzyme was
localized
in the membranes and not in the cytoplasm indicating the presence of a
properly
assembled enzyme. pTrMbbrA1AõHis9 was able to be expressed in E. coli strains
BL21,
C41 or DK8, with the best growth and overexpression in BL21 (data not shown).
These
results also show the AlA0-ATP synthase is specifically localizing to the
membrane
preparation in both BL21 and DK8 cells.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
110
Extraction of the AlAo-ATP synthase from cell membranes
The Mbb. ruminantium A1A0-ATPase was expressed in the E. coil DK8 (Aatp) and
BL21
(Figure 24A-D). Previously, we were able to solubilize the Mbb. ruminantium
AlA, using
2% DDM,_ and we are able to semi-purify the Mbb. ruminantium Alk ATP synthase
by
exploiting the introduced hexa-histidine tag on the A subunit (elution at 120
mM
imidazole, see Figure 25). However, solubilisation was considered limited with
only
about 40% of the tagged protein being extracted from membranes. Therefore, to
determine the detergent with the optimal solubilisation effect, E. colt BL21
inverted
membrane vesicles were diluted in solubilsation buffer supplemented with
different
detergents to concentrations of 0.5, 1,2 or 4%, and a concentration of 5 mg
protein/ml.
Solubilisation was performed under gentle stirring overnight at 4 C or at room
temperature with DM, DDM, Triton X-100, Cymal-6, CHAPS, cholate,
octylglucoside
and fos-choline. SDS was used as a positive control as it solubilized 100% of
the
membranes. The soluble and insoluble fractions were analyzed by SDS-PAGE and
immunoblotting. Comparison of the immunoblots revealed that fos-choline had
the best
solubilisation effect (90-100%), followed by DDM (40%), DM (35%) and cymal-6
(39%).
Triton X-100 and octylglucoside were weak (>20%). CHAPS and cholate led to a
significant degradation of the enzyme (a smear) and were therefore not
feasible.
However, after activity was measured of the solubilized membrane protein, the
best
detergent was DDM or cymal-6 both liberating 40% of the ATPase and maintaining
activity the highest activity. The Fos-choline samples did not contain enzyme
activity
regardless of concentration.
Purification of the Mbb. ruminantium A1A0-ATPase
BL21 containing pTrMbbrA1kHis9 induced with 1 mM IPTG and expression conducted
for 4 hours at 37 C. Membranes were prepared by French-press, and solubilized
with
2% DDM at 4 C overnight in the presence of 0.1% TCEP (a reductant). The
purification
was then routinely performed by IMAC and the bound protein subjected to 10,
20, 40
then 60 mM imidazole wash steps before elution at 100 mM imidazole. It should
be
noted that elution of a significant amount of Alk protein is observed at 60 mM
imidazole, however to ensure a very clean preparation this step was essential
to
remove contaminating proteins. To remove additional contaminants, the eluant
was
PEG-precipitated for 1 h at room temperature with 10 % PEG6000 followed by 15
%
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
111
=
PEGe000= The first step precipitation removed contaminating proteins (fraction
split), the
second precipitates the AlA, ATPase.
The purified Mbb. ruminantium AlAo-ATPase contained all 9 subunits, which was
confirmed by MS/MS (Figure 25A and B). The K-subunit appears both as a monomer
and as an oligomer on a 14% SDS-PAGE gel. When this enzyme preparation was TCA-
treated, the oligomers were no longer seen, and a strong band of the K monomer
was
observed. This observation is indicative of an SDS-stable K ring. This was
further
confirmed by isolation of monomers of the K-subunit by methanol/chloroform
extraction
(Figure 27). This K ring preparation is being used for antibody trials.
Purified Mbb. ruminantium AA, ATP synthase characterization
Two enzyme preparations were studied. Purified ATP synthase from BL21 and
recombinant enzyme expressed in E colt DK8 membranes. Purified Alk, ATP
synthase
was examined for ATPase activity using the inorganic phosphate assay. After
examining current literature on the Methanosarcina mazei and Methanococcus
jannaschii AiA, ATP synthases, it was decided to examine activity at 39 C and
a pH
value of 6.5 in a buffering system that mimics the Na + concentration in the
rumen (70-
137 mM Na).
To determine the kinetics of ATP hydrolysis, the reaction was started at 39 C
by
addition of Na2-ATP to a final concentration of 2.5 mM. 16 pg of protein was
used in
each end-point assay. Background ATPase activity generated by thermal
hydrolysis of -
ATP or contaminant ATP in buffer or enzyme was subtracted (these totalled <5 %
of the
final value shown; Figure 29A). The influence of Mg2+ on the kinetics of ATP
hydrolysis
was also evaluated. The reaction was started by addition of Tris-ATP to a
final
concentration of 2.5 mM. 16 pg of protein was used in each end-point assay.
Background ATPase activity generated by thermal hydrolysis of ATP or
contaminant
ATP in buffer or enzyme was subtracted (these totalled <5 % of the final value
shown;
Figure 29B).
ATPase activity was tested over a pH value range from 5.5 to 8.5 and in
presence and
absence of Na' using the purified recombinant AlAo-ATPase. The reaction was
started
at 39 C by addition of Na2-ATP or Tris-ATP to a final concentration of 2.5 mM.
16 pg of
protein was used in each end-point assay. Background ATPase activity generated
by
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
112
thermal hydrolysis of ATP or contaminant ATP in buffer or enzyme was
subtracted=
(these totalled <5 % of the final value shown; Figure 29C). The stability of
the purified
and membrane-bound (in DK8 membranes) Mbb. ruminantium A1A0 ATP synthase was
also tested. ATPase activity was examined each day after the preparation of
either
purified recombinant or DK8 membrane bound Mbb. ruminantium Alk ATP synthase.
The reaction was started at 39 C by addition of Na2-ATP to a final
concentration of 2.5
mM. 16 pg of purified protein or 0.5 mg inverted membrane vesicles was used in
each
end-point assay. Background ATPase activity generated by thermal hydrolysis of
ATP
or contaminant ATP in buffer or enzyme was subtracted (these totalled <5 % of
the final
value shown; Figure 29D).
To gain insight to whether the purified or membrane-bound Mbb. ruminantiumAr,
ATP
synthase is functionally coupled to an ionic driving force (F1+ or Nat),
tributylin (TBT)
was tested as an inhibitor. TBT is a well characterized F, and Ao channel
inhibitor of
ATP synthases. To examine the coupling ion used by the Mbb_ ruminantium Aik
ATP
. synthase, the effect of amiloride, a known Na-coupled V-type ATPase and
Na l" channel
inhibitor was also examined.
The effects of TBT and DCCD on ATPase activity were investigated as follows.
E. coil
DK8 (Aatp) inverted membranes containing the recombinant Aik-ATPase were used
to determine the effects of the inhibitors TBT (200 pM) and DCCD (250 pM) at
different
pH values. ATPase activity was measured in presence of 130 mM Na + (Figure
30A) and
in absence of Na + (Figure 30B). After preincubation with the inhibitor for 20
min at room
temperature (TBT or DCCD), the reaction was started at 39 C by addition of Na2-
ATP
or Tris-ATP to a final concentration of 2.5 mM. 0.5 mg inverted membrane
vesicles was
used in each end-point assay. Background ATPase activity generated by thermal
hydrolysis of ATP or contaminant ATP in buffer or enzyme has been subtracted
(these
totalled <5% of the final value shown; Figure 30A and 30B).
Tributylin inhibition of ATP hydrolysis by purified recombinant Mbb.
ruminantium
ATP synthase was then evaluated. After preincubation with the inhibitor
tributylin (TBT)
for 20 min at room temperature, the reaction was started at 39 C by addition
of Na2-
ATP to a final concentration of 2.5 mM. 16 pg of protein was used in each end-
point
assay. Background ATPase activity generated by thermal hydrolysis of ATP or
contaminant ATP in buffer or enzyme was subtracted (these totalled <5 % of the
final
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
113
value shown; Figure 30C). Amiloride inhibition of ATP hydrolysis of the Mbb.
ruminantium AlA, ATP synthase was evaluated next. After preincubation with the
inhibitor tributylin (TBT) for 20 min at room temperature, the reaction was
started at
39 C by addition of Na2-ATP to a final concentration of 2.5 mM. 16 pg of
protein was
used in each end-point assay. Background ATPase activity generated by thermal
hydrolysis of ATP or contaminant ATP in buffer or enzyme was subtracted (these
totalled <5 % of the final value shown; Figure 30D).
Tributylin inhibition of ATP hydrolysis by the Mbb. ruminantiumAo ATP synthase
was
also tested in 0K8 and native membranes. After preincubation with the
inhibitor
tributylin (TBT) for 20 min at room temperature, the reaction was started at
39 C by
addition of Na2-ATP to a final concentration of 2.5 mM. 0.5 mg inverted
membrane
vesicles was used in each end-point assay. Background ATPase activity
generated by
thermal hydrolysis of ATP or contaminant ATP in buffer or enzyme was
subtracted
(these totalled <5 % of the final value shown; Figure 31A). Amiloride
inhibition of ATP
hydrolysis of purified recombinant Mbb. ruminantium AlA, ATP synthase was
further
tested in DK8 and native membranes. After preincubation with the inhibitor
tributylin
_ (TBT) for 20 min at room temperature, the reaction was started at 39 C by
addition of
Na2-ATP to a final concentration of 2.5 mM. 0.5 mg inverted membrane vesicles
was
used, in each end-point assay. Background ATPase activity generated by thermal
hydrolysis of ATP or contaminant ATP in buffer or enzyme was subtracted (these
totalled <5 % of the final value shown; Figure 31B).
ATP synthesis in E. coil DK8 inverted membrane vesicles was further evaluated.
Time-
course of ATP synthesis was assessed at pH 6.5, 125 mM Na + and 39*C with 0.5
mg of
inverted membrane vesicles using the ATP synthesis inverted membrane vesicle
assay
using NADH as a driving force. Membranes were preincubated for 2 min with 2.5
mM
NADH with stirring before the reaction was initiated using 0.75 mM ADP and 2.5
mM P.
Closed squares with no DCCD; closed triangles, a 20 min preincubation with 250
pM
TBT (Figure 32).
Overview
We have successfully cloned, expressed and characterized the AlAo-ATP synthase
from Mbb. ruminantium. SDS-PAGE revealed 9 subunits and an SOS-stable K ring,
which was purified. The A1A0 synthase is active in both synthesis and
hydrolysis, but
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
114
the enzyme activities can be improved. This is consistent with other published
A1A0-
ATP synthases from methanogens. The coupling ion for the enzyme is being
identified
with studies suggesting both Fl+ and Na + ions being important. ATP hydrolysis
activity is
sensitive to TBT, DCCD, and amiloride in high concentrations, suggesting these
inhibitors will be ineffective against the growth of rnethanogens, hence the
need to find
a better inhibitor.
=
We will proceed to test antibodies generated towards the Alk-ATP synthase in
sheep
against the purified enzyme in a Western blot. The antibodies have been
directed
against the soluble Al part of the enzyme and therefore are probably
inaccessible
during growth inhibition studies. This suggests that targeting the A1 portion
will be
ineffective. We will also use the membrane-embedded sector (K ring) of the
enzyme for
new sheep antibody trials. This component would be accessible in the whole
cell, so
could be very useful as a target. We will also use the recombinant enzyme to
identify
inhibitors of activity using LOPAC1280Tm which is a versatile compound library
for
assay validation and high throughput screening.
EXAMPLE 5: Cloning and expression of non-ribosomal-peptide-synthase (NRPS)
genes
Experiments are being performed to obtain expression of full-length NRPS
genes,
.. isolate the expression product and submit for structural determination and
activity
testing. Two non-ribosomal peptide synthetase gene sequences have been
identified in
M. ruminantium M1 (Leahy et al., 2010). We have been able to clone and amplify
most
of the functional modules of the NRPS1 gene of M. ruminantium. This allows us
to
investigate the substrate specificity and mode-of-action of individual NRPS
domains.
We have also completed the design and in vitro assessment of predicted
synthetic
peptides from both the NRPS1 and NRPS2 gene products from M. ruminantium Ml.
The unexpected outcomes of these experiments (an opposed reaction to known
siderophores) has prompted significant interest in the nature of these NRPs
and their
native molecular structure.
We have shown the presence of the native NRPs in M. ruminantium M1 growth
supernatant, and are obtaining further information on secondary and tertiary
native
structures and on any further modification to the peptide backbone, such as
cyclisation
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
115
or acylation. Therefore, expression of full length NRPS genes in a
heterologous
expression system will give us an opportunity to purify the active peptide
compounds
synthesized and make the genes available for large scale production.
Suitable primer sets were designed to ensure full length amplification and
subsequent
cloning into suitable expression systems. We have obtained full length
amplification of
the M1 NRPS2 gene and are carrying out further experiments to clone this
amplicon
into a vector system. Small amounts of NRPS1 from M1 have been amplified and
inserted into entry vectors. These vectors are now being sequenced to confirm
that the
inserted amplicons reflect the NRPS gene and are free of non-silent nucleotide
mutations.
We are also synthesizing NRPS genes using GeneArt's gene optimization service.
This
optimization not only adapts codon usage to the heterologous expression host
E_ coil
but also accounts for factors that may compromise the stability of mRNA, such
as
extreme GC content, ribosomal binding sites, repeats and secondary structures_
Substrate feeding studies with E. coil crude extract and E. coil /I M.
ruminantium crude
extract mix will be carried out to evaluate the amount of functional NRPS
units.
Biological active NRP molecules will subsequently be detected using the CAS
colorimetric assay.
We are also working to induce gene expression by providing a range of stimuli
(i.e.,
increasing amounts of ion chelators) in the growth medium and monitor gene
expression levels. An alternative approach to isolate the non-ribosomal
peptides is to
identify induction conditions of the NRPS genes via Northern Blot analyses.
Induction of
those genes will lead to -increased mRNA levels, more active NRPS units and,
subsequently, more NRP molecules in the growth supernatant. Based on our
preliminary results reported earlier, we have identified two initial stress
conditions to
test. The interaction of NRP1 and 2 with Chrome Azure S (CAS) and Fe-ions,
make iron
scavengers and chelators such as Desferal and EDTA prime candidates for
initial
induction testing. The addition of both compounds to the growth medium may
trigger the
NRPS sensor system and cause elevated gene expression.
For these experiments, M. ruminantium M1 was grown to an 0D600 of 0.1 and then
aliquoted in 5 mL amounts into anaerobic tubes. Different concentrations of
EDTA and
CA 02772224 2012-02-24
_ WO 2011/025394 PCT/NZ2010/000169
116
Desferal were added to the cultures. Each culture was sampled at 1, 2, and 4
hours and
overnight after the addition of EDTA or Desferal. Samples were centrifuged to
pellet the
cells, supernatant was removed and the samples stored at -80 C. Subsequently,
total
RNA has been isolated and tested for integrity. To test semi-quantitatively
the level of
gene expression, RNA dilution series for each sampled time point and each
inducer
concentration will be established in a Northern Dot Blot system.
Three different probes have been designed targeting one house-keeping gene as
positive control, and both NRPS genes. We are currently in the process of
labelling
these probes and will commence testing. When appropriate induction conditions
have
been established, we will purify the native non-ribosomal peptides from M.
ruminantium
culture supernatant using , HPLC and assess for functionality using CAS
assays.
Furthermore, purified NRPs will be subjected to structural analyses were
possible. A
comparison between the non-ribosomal peptides from M. ruminantium and those
purified from the heterologous gene expression system will allow upscaling the
production of NRPs and derivatives thereof.
EXAMPLE 6: Vaccination of sheep using candidate proteins identified from the
M. ruminantium genome and other rumen methanogens
Experiments are being performed to use up to ten selected gene targets from
M. ruminantium for heterologous expression, vaccinate sheep and test resulting
serum
antibodies against methanogen cultures. The first step in identifying
candidate proteins
for vaccine development is to determine which M. ruminantium proteins are cell
surface-
located and potentially accessible to antibody binding. In silico analysis of
the M.
ruminantium M1 open reading frames (ORFs) identified an initial pool of 572
ORFs
containing one or more transmembrane helices (TMH) or signal peptide (SP)
indicating
a cell membrane or cell surface location. Those ORFs with a top BLAST hit to a
non-
methanogen or with no homology to the non-redundant database were removed and
adhesin-like ORFs were dealt with separately. This gave a new total of 337
ORFs.
Examination of the remaining 337 ORFs, assessing their predicted function,
degree of
conservation among methanogens and the nature of their transmembrane
structures,
refined the list to 71 ORFs (Leahy et al., 2010). Heterologous expression of
membrane
proteins with more than 4 TMHs has been difficult in reverse vaccinology
studies of
other microbes, so a cut-off of 4 THMs was applied to define two final groups:
Group A
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
117. =
with 47 ORFs with 4 or fewer TMHs suitable for cloning and heterologous
expression
studies; and Group B composed of 24 ORFs with more than 4 TMHs more suitable
for a
synthetic peptide-directed vaccine approach (see below).
M. ruminantium surface and membrane proteins selected as vaccine targets
Functional Category and Locus tag Annotation
Energy Metabolism
mru0697 AhaK
mru1405*, 1406*, 1407, 1408, 1411, 1412 EhaHGFEAB
mru2006, 2007, 2008, 2010, 2012, 2013 EhbIHGECB
mru1917, 1918, 1921*, 1922*, 1923* MtrGFCDE
Protein Fate
mru0239 SecG
mru0482 SecE
mru1234* type IV leader peptidase family protein
Vitamins & Cofactors
mru0540 CbiN1
Hypothetical
mru0542*, 0840*, 1693,2156*, 0233, 0234*,
0330, 1021, 1144, 1231, 1480*, 1585, 1635*,
1955, 2015*, 2046*, 2056, 2146* 0529 0081,
0196, 0225*, 0328*, 0412, 0428*, 0499, 0596,
0597, 0693, 0832, 1098, 1385, 0147, 0377*, Hypothetical proteins
1375*, 1641, 0543, 0833, 1991, 0545*, 0716*,
0718*, 0838, 0968, 1232, 1550, 1884*, 2202,
1694
* ORFs 5 TMHs
Many of these candidate genes correspond to proteins whose function is under
investigation. However, some are involved in energy production and are
therefore prime
candidates for vaccine development (Figure 11). Of particular interest are the
membrane-embedded ATP synthase enzyme complex which generates ATP from
either a sodium or proton gradient (Aha), the H4MPT methyl transferase enzyme
complex that catalyses the -second to last step in the methanogenesis pathway
(Mtr)
and two membrane-bound energy converting [Ni-Fe] hydrogenases (Eha and Ehb).
The HIMPT methyl transferase enzyme complex is the penultimate step in the
nnethanogenesis pathway. The methanogenic archaea use this pathway to generate
energy (Figure 11). The first five steps result in the sequential reduction of
CO2 by
electrons sourced from H2 to form N5-methyl-H4MPT, then the methyl group is
transferred to coenzyme M via the action of the methyl-H4MPT:CoM-
methyltransferase.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
118
Mtr is made up of multiple enzyme subunits (Mtr E, D, C, B, A, F, G, H; Figure
33) and
couples the methyl transfer reaction to the efflux of Na + ions out of the
methanogen cell.
This creates a Na + gradient that is used, either directly, or via a Na+/H+
antiporter, to
drive ATP synthesis. Our analyses indicate that Mtr subunits are sufficiently
conserved
and specific to methanogens to be prime candidates for vaccine development.
Therefore we are sub-cloning and expressing these subunits in order to obtain
protein
to vaccinate into sheep.
In order to clone the genes into the pTrc99A vector, 2 pairs of primers were
designed
for each of the 9 target ORFs (8 individual subunits and complete operon-
mtrEDCBEFGH). One set of primers introduces a His (6 x Histidine) tag to the N-
terminal of the translated proteins, while the other set of primers do not
include this tag.
The primers were designed for insertion of the fragments between the Ncol site
and
Xbal site of the pTrc99A vector. Each open reading frame (ORF) from the mtr
operon
was amplified from M. ruminantium M1 genomic DNA, and ligated into the
appropriately
digested vector. The ligation, products were transformed into DH5a competent
cells
and insert-containing colonies were selected and analysed to verify insertion
of the
correct gene. In order to improve the chances of mtr gene expression, a codon-
optimised mtr construct was designed for synthesis by GENEART (Germany) and
cloned into pTrc99A. Each mtr gene contains an RBS site, an N-terminal 6 x His
tag, a
TEV protease cleavage site, and is flanked by unique restriction site
compatible with
sub-cloning individual synthetic ORFs into the pTrc99A.
The table below summarises the mtr gene cloning and expression results. All 9
ORFs
were successfully amplified with the two pairs of primers, giving 18
constructs, of which
17 were successfully cloned into the pTrc99A vector. All constructs were
sequenced to
confirm correct gene inserts.
Mtr cloning and expression
=
Constructs Cloned Sequenced Expression
mtrA -His Yes Yes In progress
mtr13-His Yes Yes In progress
mtrC-His Yes Y
mtnD-His Yes Yes progress
mtrE-His Yes Yes In progress
mtrF-His Yes Yes In progress
mtrG -His Yes Yes In progress
mtri-l-His Yes Yes Yes
mtrEDCBAFGH-His Yes Yes In progress
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
119
mtrA YesYes M Yes In progress
mtrBM Yes In progress
mfiC - Yes PI Yes In progress
mtrD - Yes Yes In progress
mtrE Yes M Yes In progress
mtrF Yes 1- Yes In progress
mtrG Yes Yes In progress
mt11-1 Yes - Yes -1-A Yes
Only the complete mtr operon without the 6 x His tag failed to clone from the
PCR
product. Expression of all 17 mtr constructs has been obtained using the
expression
host OverExpress C41 (DE3) (Lucigen) cells grown in YT medium and induced by
IPTG
at either 37.0 or 30.C. MtrH with or without the 6 x His tag was successfully
over-
expressed, and there also appeared to .be some mtrH expression from the
mtrEDCBAFGH-His construct (Figure 34).
. The MtrC construct with 6 x His tag had low level expression in the
OverExpress C43
(DE3) (Lucigen) expression host (Figure 35). After lysis of the cells
expressing the mtrH
and mtre proteins, the two proteins were solubilised and put through the
nickel columns
for purification, but the proteins did not bind well to the columns,
preventing column
purification. The codon-optimised mtr synthetic construct has also been tested
for
expression in OverExpress C43 (DE3) (Lucigen) cells in YT medium, and Rosetta
II
= (DE3), OverExpress C43 (DE3), OverExpress C41 (DE3), OverExpress C43
(DE3)
pLysS, OverExpress C41 (DE3) pLysS cells in auto-induction medium (ZYP5052).
No
expression from any of the mtr genes was detected under any of these
conditions.
Further expression of codon-optimised mtr synthetic construct in Rosetta!!
(DE3) and
induction temperature at 20 to 37 C for a range of induction times (4 to
16 hrs), with
IPTG concentration ranging from 0.1 mM to 1 mM is being assessed. We also plan
to
sub-clone each of the individual mtr subunits from the synthetic construct and
obtain
expression individually.
EXAMPLE 7: Vaccination of sheep using cell surface protein fractions isolated
from M. ruminantium cells
Traditional vaccine development against bacteria relies on using cell
fractions to elicit
antigenic responses in the host animal. This approach, while generally
accepted,
carries the inherent risk of contamination from other cell fractions. As an
alternative to
traditional cell fractionation, cell surface proteins isolated via non-
destructive treatment
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
120
are investigated for their ability to create effective and specific antigens.
Chaotropic
salts, such as guanidine hydrochloride, have been widely used to remove non-
covalently bound proteins such as S-layers from the cell surface, while
maintaining
viable microbial cells.
In our experiments, M. ruminantium M1 cells have been subjected to chaotropic
salt
treatment removing non-covalently bound proteins from the cell surface. In a
second
step the treated methanogen cells have been subjected to a trypsin digest.
This cleaved
off exposed protein epitopes from the cell surface, without disrupting the
cell wall or cell
membrane. For this protocol, M. ruminantium M1 cells were grown in RM02 media
for 7
days. The culture was then centrifuged to spin down the cells and the
supernatant
discarded. The cell pellet was washed three times in sterile distilled water,
resuspended
in 2 mL of 4M guanidine-HCI pH 7.0 and incubated at 37 C for 1 hour. The
suspension
was centrifuged in a microfuge at 13 000 rpm. The supernatant (containing non-
covalently bound surface proteins) was carefully removed and dialysed twice
against
100 mM ammonium bicarbarbonate buffer (pH 8). A sample was run on an SDS-PAGE
to estimate number and quantity of isolated proteins.
These samples have been used in a sheep vaccination trial. A Western blot
analysis of
M1 cell fractions against sheep antisera from the trial has been carried out
and a
significant level of cross-reaction has been observed (Figure 36).
A trypsin digest was then carried out for both the non-covalently bound
isolated proteins
and the exposed cell-surface epitopes. This digest also served to prepare the
samples
for subsequent MALDI-TOF analyses. Trypsin was added at an approximate ratio
of
1:50 relative to estimated amount of protein (based on SDS-PAGE). 10% (v/v)
HPLC
grade acetonitrile was added and the samples were incubated at 370 for 12
hours. The
digests were centrifuged, supernatants were collected and subsequently
dialysed
against 100 mM ammonium bicarbonate buffer (pH 8). Where appropriate, DTI"
(dithiothreitol) was added to a final concentration of 50 mM and the samples
were
incubated at 60 C for 30 minutes to reduce disulphides. IAM (iodoacetamide)
was then
added to a final concentration of 150 mM and samples were incubated in the
dark at
room temperature for 30 minutes. DTT and IAM were removed from the sample by
dialysis or washing with 100 mM ammonium bicarbonate buffer (pH 8). Both
preparations were sent to Dr Stefan Clerens (AgResearch, Lincoln) for MALDI-
TOF.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
121
=
Protein fragments were subsequently compared to the non-redundant amino-acid
database hosted by NCB! and a custom ORFeome based M. ruminantium database
(Leahy et al., 2010), associating epitopes with their respective ORFs.
Six independent culture trials were conducted, including cells from normal
growth
conditions, cells from medium supplemented with methanol and cell subjected to
oxygen stress. A total of 57 fragments could be assigned to their respective
M.
= ruminantium M1 genes (see table, below). It is noteworthy that a
significant level of
variety in identified ORFs was encountered in between the individual protein
isolations.
This may be caused by a sensitive and rapid adaptation of M. ruminantium to
even
= small changes in culture conditions. Notable targets such as adhesion-
like proteins or
an ABC transporter substrate-binding protein were detected which will be
merged into
the priority target list. Interestingly, a number of cytosolic proteins (i.e.,
ribosomal
proteins and other predicted highly expressed genes) were also detected,
indicating a
certain amount of ongoing cell lysis during the sample preparation. In this
context, we
also identified the M1 bacteriophage (pmru integrase protein, pointing to a
continued
phage lysis process. This is supported by the detection of (pmru particle in
culture
supematant during normal growth conditions. The active prophage not only
points to a
highly stressed M. ruminantium cell condition but also may cause an altered
biochemical and phenotypical profile.
It therefore is important to improve culture conditions and also develop a
prophage-free
strain that lacks the potential of accelerated and uncontrolled cell lysis.
Consequently,
we have initiated a phage curing experiment which aims to create a phage-free.
M.
ruminantium derivative. Because M. ruminantium cannot be cultivated on solid
media
traditional phage curing methods cannot be employed. We have therefore opted
for an
evolutionary approach that uses ProteinaseK to remove free phage particles and
= continuous minimal subculturing. We will enrich for a prophage-free M.
ruminantium
population that eventually evolves in a fully cured derivative. This
derivative would then
also be used in further experiments, such as the assessment of the prophage
(pmru.
Most interestingly, we were also able to identify the M1 NRPS1 gene product,
indicating
that the NRPS system can be active and expressed under laboratory growth
conditions.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
122
List of open reading frames identified by MALDI-TOF analyses.
ORF number , Annotation Count
82 Adhesin-like protein+ = 1
113 (1228) Exopolysaccharide
biosynthesis polyprenyl 1
glycosylphosphotransferase"
115 RadA DNA repair and recombination protein+ 1
117 HdrA CoB¨CoM heterodisulfide reductase subunit A'"** 7
201 ModA molybdate ABC transporter substrate-binding protein+ 1
205 Copper ion binding protein" 1
247 ThiC1 thiamine biosynthesis protein*** 7
256 Phage integrase*** 5
326 Adhesin-like protein*** 2 =
333 FdhA1*** 3
334 FdhB1"** 3
351 Nrps1+ 1
455 Acetyltransferase+ 1
481 FtsZ cell division protein-I- 1
498 Fbp fructose 1,6-bisphosphatase*** 3
520 Translation elongation factor aEF-1 beta+ 1
526 Hmd coenzyme F420-dependent N(5),N(10)- 5
methenyltetrahydromethanopterin reductase***
550 PorA pyruvate ferredoxin oxidoreductase alpha subunit*** 4
551 PorB pyruvate ferredoxin oxidoreductase beta subunit*** 2
569 Mer 5,10-methylenetetrahydromethanopterin reductase*** 7
582 PhoU phosphate uptake regulator PhoU*** 2
595 PurP+ = 1
629 Hypothetical protein+ 1
735 Rbr1+ 1
727 (2191) Adhesin-like protein
with cysteine protease domain" 1
774 Archaeal histone+ 1 -
816 HdrC CoB¨CoM heterodisulfide reductase subunit C*** 6
817 Hdr13*** 3
851 Rpl3p ribosomal protein L3P+ 1
872 Rps5p ribosomal protein S5P+ 1
CA 02772224 2012-02-24
WO 2011/025394
PCT/NZ2010/000169
123
949 BtcC bicarbonate ABC transporter substrate-binding protein***
8
1052 Transcriptional regulator+ 1
1091 SucD^ 2
1490 Ribosomal protein L7Ae+ 1
1228 (113) Hypothetical protein' 1
1371 Hypothetical protein+ 1
1491 Archaeal histone"" 8
1499 Adhesin-like protein with transglutaminase domain+ 1
1570 Acs, ADP-dependent acetyl-CoA synthetase"** 5
1645 Thermosome subunit*" 2
1683 Hypothetical protein"' 3
1686 Archaeal histone*** 2
1691 MoaA molybdenum cofactor biosynthesis protein+ = 1
1730 Heat shock protein Hsp20+ 1
1805 Translation elongation factor aEF-1 alpha*" 7
1836 Cell shape determining protein MreB/Mr1 family*** 3
1888 PycB+ 1
1897 PpsA1 phosphoenolpyruvate synthase+ 1
1900 Hypothetical protein+ 1
1901 Peptidyl-prolyl cis-trans isomerase+ 1
1906 MvhA methyl viologen-reducing hydrogenase alpha*** 7
1907 MvhG methyl viologen-reducing hydrogenase gamma*** 4
1916 _MtrH tetrahydromethanopterin S-methyltransferase*" 9
1919 MtrA1 tetrahydromethanopterin S-methyltransferase*** 4
1920 MtrB tetrahydromethanopterin S-methyltransferase+ 1
1921 MtrC tetrahydromethanopterin S-methyltransferase*** 2
1924 McrA methyl-coenzyme M reductase alpha subunit*" 5
1925 McrG methyl-coenzyme M reductase gamma subunit ** 12
1928 McrB methyl-coenzyme M reductase beta subunit*** 9
1993 CBS domain-containing protein+ 1
1994 CBS domain-containing protein"' 8
2022 Ftr2 formylmethanofuran-tetrahydromethanopterin
formyltransferase*" 7
2074 FdhA2 formate dehydrogenase alpha chain+ 1
2110 IlvC ketol-acid reductoisomerase+ 1
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
124
2121 Hcp hydroxylamine reductase*** 5
2131 Fae/Hps bifunctional formaldehyde-activating enzyme/3- hexulose-6-
1
phosphate synthase+
2142 Mtd F420-dependent methylenetetrahydromethanopterin 11
dehydrogenase*''
.2159 TrpB2 tryptophan synthase beta subunit ** 6
2191 (727) CD P-g lycerol:poly(g
lycerophosphate) glycerophosphotransferaseA 1
ORFnumber indicates the respective locus tag within the M. ruminantium M1
genome; Annotation shows
the corresponding functional annotation; Count highlights the number of
occurrences found for each ORF
throughout the six individual runs. ***: ORFs found multiple times, +: unique
hits, A: Hits with sequence
mismatch to M1 deduced aa sequences. ORF numbers in brackets indicate
ambiguous hits.
EXAMPLE 8: Expression and purification of enzymes from the methanogen
M. ruminantium for the discovery of inhibitors
Novel inhibitors have great potential to provide mitigation of the greenhouse
gas
methane from ruminants. This area of research has significance in the
stabilisation of
greenhouse gas concentrations in the atmosphere to prevent climate change. The
principal methane forming organisms in the rumen are archaea belonging to the
genus
Methanobrevibacter. We are targeting genes of rumen methanogens for cloning
and
expression in E. coil with the aim of obtaining purified enzymes that can be
used for:
.. high-throughput screening (HTS) assays of chemical compound libraries; and
in silico
screening of inhibitors. The majority of the targets are from five main
functional classes;,
methanogenesis/energy metabolism, central carbon metabolism, cofactor
synthesis,
cell wall synthesis and lipid synthesis. Work will continue to advance the
targets through
the pipeline. This project brings together diverse disciplines including
chemogenomics,
in silica modelling, structural biology, and the in vitro biological screening
of targeted
. compounds to formulate the development of potent anti-methanogen compounds
that
are non-toxic to host ruminant animals and have negligible environmental
impact.
Our aim is to discover small molecule inhibitors of methanogens, based on
genomics,
biochemistry, and structural biology. This is a powerful means to search for
inhibitors of
methanogens, which are difficult to culture and are not amenable to high-
throughput
screening with cells. In this study, a number of enzymes from
Methanobrevibacter
ruminantium were found to be solubly expressed in E. coli e.g. 3-hydroxy-3-
methylglutaryl-CoA reductase (HMGR),
methenyltetra hyd rometha no pterin
cyclohydrolase (MCH), 3-hexulose-6-phosphate isomerise (PHI) and bifunctional
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
125
formaldehyde-activating enzyme/3-hexulose-6-phosphate synthase (FAE/HPS). HMGR
catalyzes the rate-limiting step in the synthesis of isoprenoid units, which
are
components of archaeal membrane lipids. MCH is an enzyme involved in the
methanogenesis pathway. PHI and FAE/HPS are key enzymes of the ribulose
monophosphate pathway, used by methanogens to generate ribose for nucleotide
synthesis.
As a first step, we made a selection of targets. Information from literature
and genome
of Methanobrevibacter ruminant/urn (Leahy et. al., 2010) was studied and 34
targets
(see table in Example 9, below) were chosen. Next, we carried out cloning in
E. coll.
Genes were amplified and cloned into expression vector pET151D (Invitrogen).
Plasmids were used to transform E. coil BL21-Rosetta 2 cells (Novagen). From
this, 27
positive clones (see table, below) were obtained. We then looked for
expression of
recombinant proteins. Cells were grown in auto induction medium ZYP-5052
(Studier,
2005), with shaking at 25''6or 30 C for approx 16 hr. Cells were lysed using
lysozyme
at 4 C. The hexa-histidine-tagged enzymes were then purified from cell free
extracts by
nickel-affinity chromatography. Imidazole was removed and buffer was
exchanged. We
found 15 proteins were expressed while 10 (12) were soluble (see table,
below).
Clones that failed to express were grown in LB media induced with IPTG. Clones
with
insoluble expression were subjected to varying lysis conditions. To overcome
the lack of
expression in some clones, the genes have been synthesised for optimum
expression
in E. coil (GeneArt).
Biochemical characterisation of HMGR has also been carried out. This included
assays
to measure the oxidation of NADHP (366 nm, E 3,300 M-1 cm-1), which were
performed
at 37 C. Activity was expressed in U mg-1 of enzyme. One unit was defined as
the
turnover of one pmol of NADPH per minute (2 NADPH molecules are required to
reduce 1 HMG-CoA). Standard assays contained 50 mM BTP (Bis-Tris propane) pH
6.5, 400 mM NaCI, 0.05 mM DTT, 2.5% glycerol, 338 pM NADPH and 250 pM (R,S)-
HMG-CoA. Prior to use, the enzyme stock (0.6 mg/mL) was incubated in 400 mM
NaCl
and 10 mM DTT for 2 hours at 4 C and for 20 minutes at 37 C then kept on ice.
Assays were carried out in duplicate. HMGR was susceptible to oxidation and
could be
reactivated by incubation with 10 mM dithiothreitol for two hours. Highest
activity was
found at pH 6.5 and at 0.4-1.5 M NaCl. HMGR was able to oxidize NADPH but not
NADH. The enzyme had Km values of 165 35 and 12.4 1.83 pM for NADPH and
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
126
HMG-CoA, respectively. The statins simvastatin and lovastatin inhibit HMGR
with high
Kic values of 3.7 0.65 and 8.5 1.1 pM, respectively. Statins have
previously been
shown to inhibit the growth of strains of rumen Methanobrevibacter (Wolin and
Miller,
2006). Notably, structure-based alignment showed that the enzyme is a Class I
HMGR.
EXAMPLE 9: In silico modelling of enzymes from the methanogen
Methanobrevibacter ruminantium for the discovery of novel inhibitors
Analysis of the Methanobrevibacter ruminantium genome (Leahy et. al., 2010)
has
revealed archaeal and methanogen-specific enzyme pathways involved in methane
production, energy metabolism, protein, lipid, cofactor and cell wall
synthesis. Making
comprehensive use of this genomic data, essential protein targets have been
analysed
based on already available structural data of related enzymes for homology
modelling,
or when possible, purified protein has been submitted to protein
crystallisation trials for
x-ray structure determination.
The presence of published crystal structures that are similar in sequence is a
factor for -
selecting targets, due to the fact that they can be used to guide the
determination of our
own crystal structures, thus saving time. In addition, high-resolution
structures can also
be used as models to develop inhibitors, as is now being performed. For
example,
0Dcase (orotidine-5'-phosphate decarboxylase; PyrF) is a key enzyme in the
biosynthesis of the pyrimidine uridine-5'-monophosphate (UMP) (Nyce and White
1996). 0Dcase from other sources have provided crystal structures and high-
quality
work on developing inhibitors as part of its kinetic characterisation (Wu and
Pal, 2002;
Poduch et al., 2006; Poduch et al., 2008; Fujihashi et al., 2009; Bello et
al., 2007).
Other researchers are using 0Dcase for the development of novel anti-
pathogenic
compounds =based on small differences between the pathogen and mammalian
structures (Bello et al., 2007). There are over 40 crystal structures for
methanogen
0Dcases. Similarly, HMG CoA reductase from other sources has been identified
as the
presumed target of statins (Miller and Wolin, 2001). The work of Miller and
Wolin
validated HMG CoA reductase as a target in methanogens, and indeed, the entire
mevalonate pathway for lipid synthesis. There are >20 HMG CoA reductase
crystal
structures available.
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
127
The target sn-glycerol-1-phosphate dehydrogenase (NAD(P)-dependent glycerol-1-
phosphate dehydrogenase, EgsA) is a well known archaeal enzyme that forms the
stereo-specific glycerol-1-phosphate backbone for archaeal lipids (Koga and
Maui,
2007). GGPS (geranylgeranylphosphate= synthase) is another lipid synthesis
enzyme
that is being targeted as it also has an archaeal stereospecific catalytic
site (Koga and
Morii, 2007). A crystal structure is available for a methanogen GGPS (Payandeh
et at.
2006). As another example, DAPDC (diaminopimelate decarboxylase; LysA) helps
catalyse the formation of lysine which is a key cross-linking component of
M. ruminantium cell walls. Lysine is an essential amino acid in mammals and
therefore
the lysine biosynthesis pathway has been targeted for the development of novel
antibiotics (Hutton et al., 2007). A crystal structure is available for DAPDC
(Ray et al.,
2002).The methanopterin biosynthesis pathway enzyme RFA-P synthase has been
validated as a target in other systems, and a vast array of potential
inhibitors have been
identified (Dumitru et al., 2003; Dumitru and Ragsdale, 2004; Scott and
Rasche, 2002;
Miner et al., 2003). The methanopterin pathway enzyme CitG is presumed to be .
essential and is nearly-methanogen specific (Chistoserdova et al., 2003,
Chistoserdova
et at., 2004, Schneider et al., 2000). HisAF is also indicated to be part of
the
methanopterin pathway (Chistoserdova et al., 2003, 2004).
The F420 biosynthesis pathway includes the targets of CofA, PLT (CofD), CofC
(PLGT),
and creatinine amidohydrolase (CA) and FucA are nearly methanogen-specific and
presumed to be essential for methanogen survival (Graham and White 2002). Most
of
the remainder of the F420 pathway is also being targeted. A crystal structure
is
available for PLT (Forouhar et at. 2008). Several methanogenesis pathway
enzymes
are included which are encoded by single genes. These are essential for
survival and
there are crystal structures for all four (Hiromoto et al., 2009; Shinna et
al., 2008;
Grabarse et at., 1999; Aufhammer et at., 2005; Hegemeier et al., 2003). The
Coenzyme
M (CoM) pathway is also thought to be essential for survival and crystal
structures are
available for ComA and ComC (Irimia et al. 2004; Wise et al. 2003). The
Coenzyme B
biosynthesis pathway is the only methanogen cofactor pathway that is fully
methanogen-specific and should be absolutely essential (Graham and White,
2002).
Despite this, all the known enzymes (AksA, AksD, AksE and AksF) share
significant
homology with bacterial proteins, and three have some homology with mammalian
enzymes (AksD, AksE and AksF have some homology with mammalian aconitase
subunits and/or isocitrate dehydrogenase).
= =
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
128
The RuMP, or ribulose monophosphate pathway, is used by methanogens to
generate
ribose for nucleotide synthesis. Although it is not methanogen-specific it is
only found in
a very limited number of organisms and these are not typically found in the
rumen
(Werken et al., 2008; Growchowski and White, 2005). Several of the key enzymes
of
the pathway are of interest, and in M. ruminantium two of these are linked to
form a
bifunctional enzyme (HPS-FAE, formaldehyde-activating enzyme/hexulose-6-
phosphate
synthase). Phil (hexulose-6-phosphate isomerase) is also being targeted and is
also
part of the RuMP pathway. Interestingly, M. ruminantium also has another Phi
(Phi2),
although sequence analyses suggests that Phil is the more likely to be
involved in the
RuMP pathway. A methanogen crystal structure is available for Phi (Martinez-
Cruz et
al., 2002).
The targets Mur 53, 78, 520, 873 and 874 are murein ligases involved in
synthesising
the amino acid peptide linkages of the cell walls of members of the
Methanobacteriales.
Murein ligases are also found in bacteria. Disruption of cross-linking peptide
biosynthesis would be akin to discovering an 'anti-methanogen antibiotic' with
similar
effects to penicillin-like drugs (Zoeiby et al., 2003; Hartmann and Ktinig,
1990). Most of
these methanogen murein ligases (53, 520, 873 and 874) are quite distinct from
their
.. bacterial homologues, whereas Mur 78 retains quite high levels of homology
with its
bacterial counterparts. Interestingly, Mur 520 is also found in some
Methanococci and
Methanosarcina. Bacterial murein ligase structures (>40) are available that
could be
used as scaffolds for performing molecular replacement, thus aiding future
structure
determination of the methanogen enzymes.
Several enzymes are targeted as they are considered 'key' enzymes that are
central to
metabolism (gluconeogenesis that is required for amino acid, cell wall sugar
synthesis,
DNA and RNA synthesis and the TCA cycle), and therefore essential. Most of
these
have relatively easy spectrophotometric assays (Acs, PEP syn, AcsA,
SdhA/SdhB). Acs
and SdhA/B have archaeal-specific features (Bobik and Wolfe, 1989; Musfeldt
and
Schonheit, 2002). GatD and GatE are involved in translation and are archaeal-
specific
and provide glutaminyl-tRNA for protein synthesis (Possot et al. 1988). tRNA
synthetases have been successfully targeted for the development of novel
antibiotics
(Ahel et al. 2005). There is a methanogen crystal structure for GatD/E
(Oshikane et al.
2006).
CA 02772224 2012-02-24
WO 2011/025394 = PCT/NZ2010/000169
129
The current targets (see below) represent 12 different cellular processes or
metabolic
pathways with a large share (16 targets) being derived from methanogen
cofactor
synthesis pathways. Taking everything into consideration, methanogen cofactors
represent -strong targets overall as they are typically restricted to
methanogens, are
likely to be' essential and the cofactors themselves are fairly small
molecules.
Advantages of small molecules are that it can be easier to synthesise
substrates for
enzyme assays and easier to synthesise potential inhibitors that share similar
structural
features. Due to potential off-target effects, we have decided to avoid most
vitamin
synthesis pathways as almost all of the methanogen enzymes have counterparts
in =
bacteria and therefore, inhibitors would have a high chance of also inhibiting
beneficial
rumen bacteria. -
, Current chemogenomics targets
Target pathway PC R clone expressed/ Pure X-ta I comments
sokble
Nese (PyrF) pyrimidines no early target
HMG CoA red lipids yes yes yes/ yes yes yes early
target
sn-Gl P deh2 lipids yes yes no GA; early target
DAPDC (LysA) lysine/CW yes yes yes/ yes yes early
target
RFA-P (MptG) methanopterin yes yes yes/ no GA
PLT (CofD) F420 yes yes no GA
CofA(1093) F420 yes yes GA
. FucA F420 yes yes yes/ yes
Phil RuMP yes yes yes/ yes yes yes
Hmd methanogenesis yes yes no GA
Mch methanogenesis yes yes yes/ yes yes yes
=
-Mer methanogenesis yes yes yes/ no GA
Mtd methanogenesis yes yes GA
AksA Coenzyme B yes
AksD Coenzyme B yes yes, yes/ no
AksE Coenzyme B yes
AksF Coenzyme B yes yes no
CitG methanopterin yes yes yes/ yes
ComA (Homolog) coenzyme M yes yes yes/ yes
ComB (Homolog) coenzyme M yes yes no GA
ComC (Homolog) coenzyme M yes yes yes/ yes yes
ComD (Homolog) coenzyme M yes yes no = GA
ComE (Homolog) coenzyme M yes yes no GA
Hps-Fae = RuMP yes yes yes/ yes yes yes
Acs key-enzyme yes
AcsA key enzyme yes
PEP syn key enzyme yes yes yes/ yes
GGPS lipids - yes yes yes/
some
SdhA TCA yes
SdhB TCA yes
Mur 53 cell wall yes yes GA
Mur 78 cell wall yes yes . yes/ yes
some
130
Mur 520 cell wall yes yes
Mur 873 cell wall no . GA
Mur 874 cell wall no GA
GatD translation yes yes
GatE translation yes
CofC F420 no
CA F420/riboflavin no
HisAF methanopterin no
GA, gene synthesised by GeneArt and will be redoned; CA, creatinine
amidohydrolase
=
For specific experiments, in silico screening of large commercial compound
libraries
using the docking programme GOLD (Verdonk et. al., 2003; Cole, J.C. et. al.,
2005) has
formed the basis of our selection and design of high affinity ligands for our
modelled
proteins. Work is currently underway for evaluating the essential methanogen
enzyme
methyl-coenzyme M reductase (MCR), an enzyme that catalyzes the terminal step
in
methane production. A number of enzymes are being crystallised and include 3-
hydroxy-3-methylglutaryl-CoA reductase (HMGR), the rate-limiting step in the
synthesis
of isoprenoid units, which are components of archaeal membrane lipids. Others
include
3-hexulose-6-phosphate isomerase (PHI), a key enzyme of the ribulose
monophosphate pathway and methenyltetrahydromethanopterin cyclohydrolase
(Mch),
an enzyme involved in the methanogenesis pathway.
The programme GOLD (Verdonk et. al., 2003; Cole, J.C. et. al., 2005) has been
tasked
in carrying out the in silica docking process in order to obtain novel
inhibitors of
archaeal and methanogen-specific enzyrnes, in particular MCR (Grabarse et.
al.,- 2001;
Figure 37A). MCR has an a262y2 subunit structure which contains two nickel
porphinoid F430 rings which forms the centre of enzyme activity and two
molecules
each of methylcoenzyme M (CoM) and coenzyme B (CoB). These cofactors operate
under strictly anaerobic conditions to carry out the final reaction of the
energy
conserving pathway of methanogenic archaea in which CoM and CoB are converted
to
methane and the heterodisulfide product CoM-S-S-CoB (Figure 37B). This product
is
subsequently reduced with H2 back to the thiol forms of the separate cofactors
by
heterodisulfide reductase.
A number of structural databases from commercial suppliers have been
downloaded
from the ZINC (Irwin and Shoichet, 2005)
and screened with GOLD (Verdonk et. al., 2003; Cole, J.C. et. al., 2005).
Commercial
databases available from the zinc website such as Asinex, Chembridge building
blocks
and the LOPAC library of proven pharmacologically-active compounds were
screened
CA 2772224 2019-03-25
CA 02772224 2012-02-24
WO 2011/025394 PCT/NZ2010/000169
131
with MCR. Utilizing a ruminant model of MCR based on a crystal structure from
Methanothem7obacter marburgensis (pdb code 1 HBM4) active site regions were
subjected to specific and targeted docking attempts to find inhibitors that
could mimic
the natural substrates and product of the enzyme. The effectiveness of
candidate
inhibitors were then monitored in pure culture experiments. Cell density was
measured
over time to assess the effectiveness of potential inhibitors (Figure 38).
=
=
o
N)
,i
,1 =
IV
N) Table 1. Comparison of the M1 genome features with methanogens
from the order Methanobacteriales
iv
0. !77/1114tiirri iii0TitiiiiiiR M.
smithii PS M. smithii ALla M. smithii Fla
Methanothermobacter Methanosphaera
[34]
thennoautotrophicus AH stacitmanae MCB-3
0
,
ki)
1 Source ailElairikIi-;uviTeri1A Sewage digester Human
Human faeces Sewage sludge Human faeces
c)
w Project status MWeeiriii5let-eiWN
complete draft draft complete complete
1 Genome size (bp) .41.7.:21537:44203MNIVI
1,853,160 1,704,865 1,707,624 1,751,377 1,767,403
1.)
in G+C content (%) 6=4,0334_,Ata 31
31 31 50 28
Number of ORFs -;:tt; 7 ' )421,10. ' 61.':c; 1795 1709
1710 _ 1873 1534
Coding area (%) .M*,V8'4111Waiff 90 90 90
90 84
rRNA operons 2=PMEN 2 rid nd
2 4
_
tRNAs (with intron) i.-!_i::=31-'5ey(gp-,-,,,iz 34 (1) 34 34
39 (3) 40 (1)
Non-coding RNA "ft.-1,24a0.3Wg14".iiil 3 nd
nd 2 2
Insertion 1,-.7-7-4,-N,witqw-;-,tr ,-; 8 rid
nd 0 4
sequences _
Prophage ' MAZYide044041 Yes nd nd
No No
CRISPR regions Attt;Z.4-2V.AetViti. 1 rid nd
2 2
_
Adhesin-like wow, La - 48 nd nd
0 37 ¨
,
,
proteins
N)
_
; LPxTG motif iffAVASS1M.'K'= 2 nd
nd 0 0
_
Sortases wit-mm=4 1 nd nd
2 0
a Draft genome data obtained from National Centre for Biotechnology
Information ,
nd Data not determined from draft genome
34 Boekhorst et al., 2005
- 46 Smith et at. 1997
20 Fricke et al., 2006
'
-
' ,
-
C
w
o
Table 2. Potential chemogenomic gene targets of the M1 genome based on in-
mru1499 domain I--
1--,
depth literature.and metabolic analyses. mru1604
adhesin-like protein with transglutaminase --
o
'
Locus Annotation I Reference domain
(..3
(A
(.4
adhesin-like.protein with transglutaminase
AMINO ACID METABOLISM domain
i
adhesin-like protein with transglutaminase
domain
mru0997 phospho-2-dehydro-3-deoxyheptonate aldolase/ [S8,9]
fructose- bisphosphate aldolase mru1836 cell
shape determining protein MreB/Mrlfamily [S45-47]
mru0998 AroB [S9-11] mru1047 poly-
gamma-glutamate biosynthesis protein [S48,49]
= mru1577 AroA [S9,10,12]
mru2175 cell wall biosynthesis glycosyl transferase
mru1676 AroK [S13] mru0707 cell
wall biosynthesis protein Mur ligase family [S50-541
rnru0350 G1nA1 [S14] mrii1042 cell
wall biosynthesis protein Mur ligase family a
mru2078 G1nA2 mru1118 cell
wall biosynthesis protein Mur ligase family
o
mru1745 cell
wall biosynthesis protein Mur ligase family
mru0122 GlyA [S15-17]
IV
mru2091 cell
wall biosynthesis protein Mur ligase family -.3
mru2139 HisB [S18]
.-1
mru2092 cell
wall biosynthesis protein Mur ligase family NJ
mru0152 LysA [S19-21]
IV
mru0964 cell
wall biosynthesis protein phospho-N- [S55-57] IV
mru0153 DapF [S21,22]
acetylmuramoyl-pentapeptide-transferase family
mru1743 PdaD [S23, 241 mru1041 cell
wall biosynthesis protein phospho:N- i-4µ.3 .. "
o
'
mru0208 TrpE [S25]
acetylmuramoyl-pentapeptide-transferase family o.) H
IV
I
mru0410 11vB1 [S26, 27] mru2126 . cell
wall biosynthesis protein UDP- o
mru2112 11vB2
glycosyltransferase family = "
,
.
I
mru2111 IlvN mru1293 GlmS1
[S52, 58] "
mru1414 CimA [S28, 29] mru1536 GlmS2
mru1388 NAD
dependent epimerase/dehydratase [S59]
CELL CYCLE ' mru1413 NAD
dependent epimerase/dehydratase
mru0458 .GImM1
. [S52, 58]
mru0481 FtsZ [S30, 31] mru0449 GImM2
mru0240 PolD2 ' [S32] mru1733
phosphosugar-binding protein
mru2212 PolD1 mru2136
polysaccharide biosynthesis protein
n
.
mru1864 DNA topoisomerase VI subunit A [S33, 34] mru1470 GalE
[S59] 1-3
mru1865 DNA topoisomerase VI subunit B mru0456 G1mU
[S52, 58]
N
mru1005 UppS
[S60, 61]
= CELL ENVELOPE
mru2108 UppP [S52,62-64] o
1--,
o
mru1524
polysaccharide biosynthesis protein [S65]
mru0824 adhesin-like protein with transglutaminase
[S35- 44]
o
mru0828 domain CENTRAL CARBON
METABOLISM 1-,
o= .
mru1497 adhesin-like protein with transglutaminase
=
.
.
.
.
.
.
,
'
mru1434 AcsA [S66] , mru1408
EhaE
0
mru1570 Acs = .' TS66-68] mru1407 EhaF
= ,
(,..
mru0550mru PorA [S69-74]= mru1406 EhaG
c'
1¨,
0551 ' PorB mru1405
= EhaH = = 1¨
,
=. o
mru0549 PorD =
mru1404 Ehar
(A
mru0548 PorD mru1403 EhaJ
(.4
mru0552 PorE . mru1402
.EhaK
mru0553 PorF mru1401
= EhaL
, mru0957 RpiA [S75-77] mru1400 EhaM
mru1634 Prs [S78] mru1399 EhaN
mru0250 Phil [S75,77, 79- mru1398 = Eha0
.
mru1310 Phi2 81] mru1397 EhaP
.
mru2131 Fae/Hps . [S75-77,81, ,. mru1396
EhaQ .
mru1394 EhaR . = '
82]
mru1255 Mdh [S83-86] mru2014 EhbA
. [S108 110] a
mru2013 EhbB .
mru0847. PycA = [S87-901 =
0
mru2012 . EhbC = IV
mru1888 PycB
.-.1
mru201-1 EhbD
= mru0088 SdhA
[S91, 92] NJ
= ru
. m2010 EhbE = . IV
mru0655 = SdhB
' IV
' ' mru2009 ,
EhbF
mru2008 =EhbG NJ
ENERGY METABOLISM ' mru2007 EhbH
F. so 0
I-.
.
IV
' . mru2006 Ehbl
1
mru0701 AhaA [S93-105] ,mru2005
EhbJ o
n)
mru0702 AhaB mru2004 EhbK
. N31
mru0699 AhaC = mru2003
. EhbL .. .1,
.mru0703 AhaD mru2002 EhbM
=
mru0698 AhaE mru2001 EhbN
mru0700 AhaF = = mru2000
Ehb0
. .
mru0695 ' AhaH . = = mru1999 =
EhbP
mru0696 Ahal " nnru1998
EhbQ
mru0697 AhaK
.
= . mru1906
MyhA [S111-113]
, mru2064 FrhA .. [S106, 107] mru1905 MvhB
00
. mru2061 . FrhB1 ' mru1908
MvhD1 . n
1-
nnru2081 FrhB2 mru2076
MvhD2 -
mru2063 FrhD = . mru1907
MyhG - N
mru2062 FrhG mru0569 =
Mer = . . [S106, 114-
1--,
mru1412 EhaA [S108, 1091
= 116] c'
= .j(E3 mru1411
EhaB mru0117 HdrA . = ' [S106, 115,
o
mru1410 EhaC mru0817
' HdrB 117-122] 1..,
= c7,
mru1409 EhaD mru1212
HdrB2
.
.
_
mru0816 HdrC
1691 0
mru0526 Hmd [S106, 123- mru1092 HmgA
[S170-175] w
o
127] mru1640
hydroxymethylglutaryl-CoA synthase [S169, 172, 1--
1--,
mru2142 Mtd [S126, 128-
173, 176, 177] --.
o
131] mru0922 Fni
[S77, 176, k..i
uri
c..i
mru1393 Ftr1 [S106,115-
178-180]
mru2022 Ftr2 116, 132-133] .
mru0921 isopentenyl diphosphate kinase [S77, 181]
mru1619 Mch [S106, 114, mru0920 Mvk
[S170-175]
115, 129, mru0919
phosphomevalonate decarboxylase [S77]
134-136] mru1102
digeranylgeranylglyc,eryl phosphate synthase [S182]
mru1924 McrA . [S106, 114, mru0924 IdsA
[S177]
mru1928 McrB 115,137-151]
mru1926 McrC MOBILE ELEMENTS
. mru1927 McrD
a
mru1925 McrG
mru0317 phage-
related protein [S35]
mru1262 AtwA1
0
i 0320 mru
endosopeptidase PeiR [S171] n)
mru1850 AtwA2
...3
mru1919 MtrA1 . [S106, 115,
.-.1
NJ
mru0441 MtrA2 121, 152-155] PROTEIN FATE
IV
1\)
mru1920 MtrB
.1,.
mru1921 MtrC mru2021
transglutaminase domain-containing protein [S35]
c.,.)
0
mru1922 MtrD mru0391
oligosaccharyl transferase cn H
IV
mru1923 MtrE mru1832
sortase family protein [S183, 184] 1
o
mru1918 MtrF
"
1
mru1917 MtrG PROTEIN SYNTHESIS
.1,
mru1916 MtrH mru2169 GatA
[S185-189]
mru0344 FwdA S121, 156- mru2029 GatB
mru0343 FwdB 162] mru1142 GatC
mru0345 FwdC mru1571 CysS
[S185, 186,
mru0342 FwdD
190]
mru0254 FwdE mru1427 GatD
[S185-187,
mru0340 FwdF . mru1426 GatE
" 191-193]
mru0341 FwdG = mru0126 IleS
[S186, 194- n
1-
mru0339 FwdH
196] ,
mru0242 LysS
[S185-187] N
LIPID METABOLISM mru0954 ProS
[S197, 198]
1--,
mru1947 SerS
= [S186, 199, =
C3
mru1031 FabG1 [S163-167]
200] o
mru1630 = FabG2
o
1-,
mru0955 EgsA [S77, 168, PURINES AND
PYRIMIDINES c7,
,
,
,
VITAMINS AND COFACTORS
mru1839 Pur0 - [S77, 201, mru1560 HemB
[S153, 210- 0
202]
212]t=-4
o
1-.
mru0595 PurP , [S77, 203] mru1541 CobA
[S153, 210- 0.,
.
--.
mru1055 PyrF [S204, 2051
212] o
k..4
mru1853 HemA
[S153, 210- (A
(.4
TRANSCRIPTION .
215]
mru1544 HemD
[S153, 210-
mru1482 RpoE1 [S206, 2071
212, 216]
mru1481 RpoE2 mru0999 HemL
[S210-214]
mru1815 RpoA1 mru1746 HemC
[S153, 211-
mru1814 RpoA2
212]
mru1816 RpoB1 mru0384 AksD
[S217-219]
mru1817 RpoB2 mru1689 AksE
[S217-219]
mru0908 RpoD mru0385 AksA
. [S217, 219, 0
mru0161 RpoF ,
220] 0
. mru1818 RpoH mru1033 AksF
[S217, 219, ' IV
.-.3
mru0913 RpoK
221] .-1
NJ
mru0169 RpoL mru1283 ArfB
[S222] IV
IV
mru0912 RpoN
mru0953 CofC
[S217, 223]
mru1350 RpoP
1.)
mru1253 FtsA1 ,
[S217, 224- c0) .
,_.
mru1787 FtsA2
226] IV
TRANSPORTERS
1
mru0479 F420-
0:gamma-glutamyl ligase [S2171 o
IV
= = mru1842
CofE [S217, 227- 1
mru0405 transporter Na+/H+ antiporter family
r [S208] 229] "
.1,
mru1974 CofG
[5230-232]
UNKNOWN FUNCTION mru1266 CofH
mru2213 FucA
[S233-236]
mru0668 methanogenesis marker protein 1 [S209] mru0672 CofA
[S235]
mru1929 methanogenesis marker protein 10 . mru1844
CofD [S237, 238]
mru0097 methanogenesis marker protein 11 mru1949 ComB
[S239, 240]
mru0181 methanogenesis marker protein 13
1-:
mru1980 ComC
[S82, 217, n
mru1915 . methanogenesis marker protein 14 ,
241, 242] 1-3
mru1771rnru methanogenesis marker protein 15 mru1896 MfnA
[S217, 243]
1778 methanogenesis marker protein 2
mru1690 MptG
[S244-249] N
mru1774 methanogenesis marker protein 3
o
mru1962 MptA
[S250, 251]
mru1931 methanogenesis marker protein 7 ,
1--,
o
inru0436 methanogenesis marker protein 8 mru1559 CitG
[S252-254] C3
o
mru1695 H4MPT-linked Cl transfer pathway protein mru1845 ArfA
[S255, 256] =
1--,
mru1215 RibC
[S257-262] o
CA 02772224 2012-02-24
WO 2011/025394
PCT/NZ2010/000169
137
_
r= o-
Lo
45 t
N N CV
U) (1) õ ,
ic
= :0
cc cc cc
co r=-=
co N-
o
N
2 2 2
EEE
-
_
S8 White & Xu, 2006' S49 Scorpio et at., 2007 S90 Shieh &
Whitman, 1987 S131 Jacobson et al., 1982 0
.
S9 Porat et al., 2006 S50 Smith, 2006 S91 Bobik & Wolfe,
1989 . S132 Acharya et al., 2006 t.4
o
S10 White, 2004 S51 Silver, 2006 S92 Heim et at.,
1998 S133 Mamat et al., 2002 0-
S11 Porat et at., 2004 S52 Kotnik et al., 2007 S93 Lemker et al.,
2001 S134 DiMarco et al., 1986 --.
o
S12 Morar et al., 2007 S53 Katz & Caufield, 2003 594 Lemker et at.,
2003 5135 Donnelly et at., 1985 (..3
(A
S13 Daugherty at at., 1999 S54 Zoeiby et al., 2003 S95 Lewalter &
Mailer, 2006 S136 Vaupel et al., 1996 (.4
S14 Possot et al., 1998 S55 de Kruijff et al., 2008 S96 Schafer et
at., 2006 S137 Whitman & Wolfe, 1985
515 Lin & Sperling, 1998 S56 Kimura & Bugg, 2003 S97 Schafer et al.,
2006 S138 Whitman & Wolfe, 1987
' S16 Hoyt et al., 1986 S57 Hi'pert et al., 1981 S98 Coskun et at.,
2002 S139 Harmer et al., 2008 ..
S17 Angelaccio et al., 2003 S58 Namboori & Graham, 2008 S99 Coskun et
at., 2004 S140 Ermler, 2005
S18 Sment & Konisky, 1989 S59 Hartmann & KOnig, 1990 S100 Lingl et
al., 2003 S141 Grabarse et al., 2001
S19 Hutton et al., 2007 S60 Guo et al., 2007 S101 Sprott &
Jarrell, 1982 S142 Selmer et al., 2000
S20 Born & Blanchard, 1999 S61 Scholte et al., 2004 .
S102 Gruber & Marshansky, 2008 S143 Ermler et al., 1997
S21 Girodeau et at., 1986 S62 Hammes et al., 1979 S103 Pisa et at.,
2007 S144 Prins et at., 1972
S22 Pillai et al., 2006 S63 Kandler & Kainig, 1998 8104 Willer et
at., 1999 = S145 Attwood & McSweeney, 2008
a
S23 Tolbert et al., 2003 S64 Bouhss et al., 2008 S105 Muller et al.,
2004 S146 Rospert et at., 1992 0
S24 Graham et al., 2002 S65 Ruiz, 2008 S106 Ferry, 1999
. S147 Goenrich et al., 2004 "
-.3
- S25 Kalyazhnaya et at., 2005 S66 Lindahl & Chang,
2001 S107 Alex et al., 1990 S148 Bucket &Golding, 2006 .-.1
NJ
S26 Xing & Whitman, 1987 S67 Musfeldt & Schonheit, 2002
S108 Tersteegen & Hedderich, 1999 S149 Ellermann et al., 1988 IV
I\ )
S27 Tan et al., 2006 S68 Eggen et al., 1991 S109 Anderson et
at., 2009 S150 Sauer, 1991 .1,.
=
S28 Hernandez-Montes et at., 2008 S69 Ragsdale, 2003
S110 Porat et at., 2006 S151 Wackett et at., 1987 1.)
S29 Howell et at., 1999 S70 Dermouni & Ansorg, 2001 S111 Woo et at.,
1993 S152 Gottschalk & Thauer, 2001
c....) 1-=
,
co IND
i
S30 Huang et al., 2007 871 Ansorg et al., 2003 S112 Shah & Clark,
1990 S153 Kenealy & Zeikus, 1981 o
S31 LOwe & Amos, 1998 572 Bock et at., 19.96 , t S113 Stojanowic
et at., 2003 S154 Stupperich, 1993 , n)
i
S32 lshino & Cann, 1998 573 Lin et at., 2003 ' S114 Shima et at.,
2002 S155 Becher at at., 1992 i\D
S33 Graille et at., 2008 S74 Lin & Whitman, 2004 S115 Thauer et at.,
1993 S156 Andreesen & Makdessi, 2008
S34 Gadelle et at., 2005 S75 Kato et al., 2006 S116 Aufhammer et
al., 2005 S157 Hochheimer et at., 1995
S35 Makarova et al., 1999 S76 Grochowski et at., 2005 S117 Hedderich
at at., 2005 S158 Hochheimer et at., 1996
836 Esposito et al., 2007 S77 Grochowski & White, 2008
S118 Mauer et at., 2002 S159 Hochheimer et at., 1998
'
S37 Griffin et at., 2002 S78 Kadziola et al., 2005 S119 Deppenmeier,
2002 5160 Deppenmeier, 2002
S38 Yokoyama et at., 2004 , S79 Martinez-Cruz et at., 2002
S120 Shokes et al., 2005 S161 Vorholt, 1997
S39 lranzo et at., 2002 S80 Goenrich et at., 2005 S121 de Poorter et
at., 2003 S162 Wasserfallen, 1994 oci
S40 Kato et al., 2008 S81 Werken van de et at., 2008
S122 Schafer et at., 1999 S163 Heath & Rock, 2004 n
S41 Hartmann & Konig, 1990 S82 Soderberg, 2005 S123 Pilak et at.,
2006 S164 Heath et at., 2001 1-3
S42 Lee et al., 1985 S83 Lee et at., 2001 S124 Shima et at.,
2008 S165 Campbell & Cronan, 2001
N
S43 Luo et at., 2002 S84 Sprott et at., 1979 S125 Vignais et at.,
2001 5166 Payne et at., 2001 o
S44 Steenbakkers et at., 2006 S85 Storer et at., 1981 S126 Hendrickson &
Leigh, 2008 S167 Payne, 2008 1--,
o
S45 Divakaruni et at., 2007 - S86 Thompson et at., 1998 S127 Klein et
at., 1995 S168 Daiyasu et at., 2002
S46 Osbom & Rothfield, 2007 S87 Mukhopadhyay et at., 1998
S128 Hagemeier et at., 2003 S169 Koga & Morii, 2007
o
= S47 Daniel & Errington, 2003
S88 Mukhopadhyay et at., 2000 S129 Mukhopadhyay & Daniels, 1989
S170 Miller & Wolin, 2001 1..,
c7,
S48 Candela & Fouet, 2006 S89 Mukhopadhyay et at., 2001
S130 Mukhopadhyay et at., 1995 S171 Samuel et al., 2007
S172 de Ruyck & Wouters; 2008 S213 Pfaltz et at., 1987
S254 Bauer et at., 2004
S173 Bonanno et at., 2001 S214 Schulz et at., 2006 S255 Morrison et at.,
2008 0
(,..
S174 Friesen & Rodwell, 2004 S215 Moser et al., 2002
S256 Graham et at., 2002 = ==
1--
S175 Istvan, 2001 S216 Gilles & Thauer, 1983 S257 Ungerfeld et
al., 2004
=-.
S176 Smit & Mushegian, 2000 S217 Graham & White, 2002
3258 Ungerfeld et at., 2007 o
(..4
S177 Boucher et al., 2004 S218 Drevland et al., 2008 =
S259 Nagar-Anthal et at., 1996 (A
(.4
S178 Barkley et al., 2004 " = S219 White, 2001
S260 Fischer et at., 2004
. 8179 Hoshino et at., 2006 S220 Howell et at.,. 1998
S261 Fischer et at., 2005 '
S180 Wouters et at., 2004 S221 Howell et al., 2000 S262 Osterman et at.,
2003
S181 Grochowski et at., 2004 S222 Grochowski et at.,
2009 5263 Romisch-Margl et at., 2008 .
S182 Payandeh et at., 2006 S223 Grochowski et at., 2008 8264 Mashhhadi
et at., 2008
S183 Mareso & Schneewind; 2008 S224 Kengen et at., 1991 ,
S265 Ammelburg et at., 2009
S184 Mareso et at., 2007 S225 Vermeij et at., 1994
S185 Prffitorius-Ibba & lbba, 2003 S226 Vermeij et at.,
1995 S266 Konig et at., 1994
S186 Kim et at., 2003 S227 Li et at., 2003 S267 Kandler & Konig,
1978 a
5187 Tumbula et at., 1999 S228 Kwang-Pit et at., 2001 - S268 Perez-
Bercoff et at., 2006
S188 Tumbula et at., 2000 S229 Nocek et at., 2007 S269 Larkin et at.,
2007 . = 0
IV
S189 Sheppard et at., 2008 S230 Kwang-Pil et at, 2002
S270 Waterhouse et at., 200' = ...3
.-.1
S190 Klipcan et at., 2008 S231 Guerra-Lopez et at., 2007
n)
IV
S191 Sheppard et at., 2008 S232 Graham et at., 2003 =
m
S192 Schmitt et at., 2005 S233 Joerger et at., 2000 -
.
S193 Oshikane et at., 2006 S234 Schumperli et at., 2007
F;
1-'
S194 Ataide & lbba, 2006 S235 Grochowski et at., 2006 .
CD IV
I
S195 Jenal et al., 1991 5236 Nam Shin et at., 2007 .
o
S196 Pohlmann & Brbtz-Oesterhelt, 2004 S237 Forouhar et at., 2008
IVi
i\)
S197 Ambrogelly et at., 2005
S238 Graupner et at., 2002 .1,.
3198 Ahel et at., 2002 S239 Wise et at., 2003
S199 Ahel et at., 2005 S240 Graham et at., 2002
S200 Kim et al., 1998 '
S241 Graupner & White, 2001 ,
3201 Yang et at., 2007 S242 Graupner et at., 2000 '
S202 Graupner et at., 2002 S243 Kezmarsky et at., 2005 ,
S203 Zhang et al., 2008 . S244 Dumitru et at., 2003
S204 Bello et at., 2007 S245 Dumitru & Ragsdale, 2004
oci
n
S205 Nyce & White, 1996 S246 Scott & Rasche, 2002
1-3
S206 Sarkar et at., 1977 S247 Chistoserdova et at., 1998 .
S207 Hilpert et at., 1981 S248 Rasche & White, 1998
N
S208 Surin et at., 2007 S249 Chistoserdova et at., 2004
, o
S209 Hunter et at., 2009 S250 Howell & White, 1997
1--,
o
S210 DiMarco et at., 1990 S251 Grochowski et at., 2007
=-=:E3
.
o
S211 Thauer, & Bonacher, 1994
S252 Schneider et al., 2000 =
1..,
S212 Vermeij et at., 1997 S253 Chistoserdova et at., 2003 .
o
,
' .
= .
,
C
Pfaml & TigrFam domains2
w
o
To 1--
c 1¨.
o
k..)
= Table 3.
Predicted cell surface associated adhesin-like proteins in M1. There are 105
9 to'
a)
w un
ORFs annotated as an adhesin-like protein in the genome of M1. Seventy-five of
these E .% iti o_
2
.d
a
(2)
,0
0 E ¨ a)
2
ORFs show a signal peptide or are predicted to be cell'wall located and are
displayed in c E 0_ g 0 0 c l-
a) =
-co 2
0
a
this table. These ORFs can be largely assembled into four main groups based on
their = cT, o -o ¨= 0
'5
E ,3) a) CL C (I) il-) "Q Z W a)
cell-anchoring domains. The remaining 30 ORFs are likely to be remnants of
former o
.,.., co 0) 2 15 0 o o 0 co
.... co co
Z. a) -17:0 'a 0 C6
adhesin-like proteins, intracellular proteins or proteins which currently
cannot be identified -a To 0 _ .c C c 2 g 2
CL co c
as cell wall associated. Group three, contains 13 ORFs which currently have a
signal .g. .c E 2 , 0 '' .ET2 cn 111 2' Ci .. CD 4-
7 .0 .,... .5
z n'
peptide but do not have any visible cell-anchoring domain. These may be
extraCellular E E P. "L a' c` -5 in E a") E
a) a) = 0 " 2 co) co 0 N-6 a) o = (1 0 'a E .2
proteins or remnants of former adhesin-like proteins. Shaded boxes indicate
domains .-= .- "5). .S >= 1- i co in o.L.c9 co ....Ø,... >,
a a CO 0 E .-= =E E v- ¨ N -9 Of Fi2
identified in these ORFs _c v; (0 Li-
al a) 1-1-. I i 0 .c .c E < a.) m o.
E D I co D a) _i co a
1-- 2 j¨ 0 0 0 co I 0 iii 'ci3 a. CO >- (..) ...1 0< 2 ce E.' 0 u)
. Locus tag Annotation Size (bp) Signal's TM4
o
n)
-.3
Group 1: Transmembrane" C-terminal Group
mru0019 adhesin-like protein 1220 SP 2
IV
mru0327 adhesin-like protein 2090 SP 2
m
.1,.
mru0687 adhesin-like protein 2963 SP 2
mru1210 adhesin-like protein 7250 SP 2 , ____ ' __
r., 1
.o. o
F-'mru1222 ' adhesin-like protein 4055
SP 2
,
o
mru1506 adhesin-like protein 857 SP 2
1
mru2053 adhesin-like protein 3494 SP 2
,\3
.1,.
mru2134 adhesin-like protein 17957 SP 2
mru2147 adhesin-like protein 16955 SP 4 -..,.... ,
__________
r I-- ¨
_______________________________________________________________________________
______________________
mru2178 adhesin-like protein 9239 SP 2 . . .
,
,
mru0031 adhesin-like protein 4415 CW 1 r...,õ-
-, ....
mru0704 adhesin-like protein 2858 SP 1
mru0963 adhesin-like protein 8159 CW 0
mru0976/0977 adhesin-like protein 4775 SP 2
Kli ro
,
n
Group2: M1-Big_1 like C-terminal Group
1-3
_______________________________________________________________________________
__________________ ___________
adhesin-like protein f,
,
with cysteine protease
s
'LsJ
nnru0020 domain 6014 SP 1 lr
4
mru0064 adhesin-like protein 3536 SP 1
=
mru0072 adhesin-like protein 2918 SP 1
o
, .
_______________________________________________________________________________
______________________________ o
mru0076 adhesin-like protein 6605 SP 1
..,
o,
mru0077 adhesin-like protein 9161 SP 1
.
.
. .
mru0079 adhesin-like protein 3560 SP 1 ____ ,
_______________________________________________ .0
mru0083 adhesin-like protein 839 SP 1
mru0084 adhesin-like protein 14477 SP 1
= =
,--
mru0085 adhesin-like protein 8030 SP 1
.--.
_ _
o
mru0086 adhesin-like protein 10175 SP 1
un
adhesin-like protein
with cysteine protease
. ,
mru0143 domain 3284 SP 0 ¨
mru0160 adhesin-like protein 3176 SP 1
adhesin-like protein 7'
with cysteine protease 1 = ' ¨
mru0222 domain = 3302 SP 1 i
I -
mru0327 adhesin-like protein 2060 SP 1 I
, mru0338 adhesin-like protein 6929 SP 1
. _____________________
mru0417/0418 adhesin-like protein 1391 SP 1
mru0419 adhesin-like protein 4175 SP 1 j
adhesin-like protein
...i
with cysteine protease ,
N)
'
IV
mru0727 domain 3788 SP 0'
N)
adhesin-like protein
with cysteine protease
4t.
mru0772 domain 3281 SP 1
_. 1-=
IV
. adhesin-like protein
1
o
-
N)
with cysteine protease
1
= mru0839 domain 8639 SP
1 N3
adhesin-like protein
with cysteine protease
, ,-, 'I
,. mru0842 domain = 3977 SP 1 , H.
mru0978 adhesin-like protein 6606 SP 0
/ mru0979 adhesin-like protein 8753 SP 1
24
mru1076 adhesin-like protein 2681 SP 1 j
.. ________________________
mru1077 adhesin-like protein 2273 SP 1 mru1246 adhesin-
like protein ___________________ 4619 SP 1 I,
_________________________________________________________________ , 5
n
1-
mru1247 adhesin-like protein . 5060 SP 1
mru1465 adhesin-like protein 2882 _ SP 1 .,
__________________________________________________ N
mru1513 adhesin-like protein 1853 SP 1
1--,
mru1650 adhesin-like protein 9161 SP 1 -.-
_____________ n= 4::
mru1726 adhesin-like protein 6767 SP 1 -
o
.
_______________________________________________________________________________
__________________________________ o
mru1971 adhesin-like protein 1937 . SP, 1
1--,
mru1996 adhesin-like protein 4496 SP 1 1 ii =
_______________________________________
,
,
,
mru2043 adhesin-like protein 9530 SP 1 ___ : ;
_______________________________________ : 0,
0
mru2048 adhesin-like protein 5417 SP 1
t=-)
,
_______________________________________________________________________________
____________________________
mru2049 adhesin-like protein 10355 SP 1
=
I--
-----=
mru2052 adhesin-like protein 4112 SP 1
--.
o
mru2054 adhesin-like protein 5054 SP 1
Ik..i
ur;
,
mru2055 adhesin-like protein 8906 SP 1
o
,
mru2059 adhesin-like protein 4415 SP _ 0 ;
mru2090 adhesin-like protein 15200 SP 1
1 = ., ,
,,
mru0004 adhesin-like protein 2237 SP 1
mru0331 adhesin-like protein 1622 SP 1
;ill
adhesin-like protein
with cysteine protease
mru0843 domain 6197 SP 1 4 R I
I .
Group 3 : Other
.
adhesin-like protein
with cysteine protease ,
n)
mru0015 domain 3845 SP 1
...3
.-.1
mru0090 adhesin-like protein 2063 SP 1 _,,j _
= IV
-
_ ________________________ IV
mru0255 adhesin-like protein 4187 SP 1
n)
mru0450 adhesin-like protein 803 SP 1
mru0723 adhesin4ike protein 7727 ' SP 1
VA au 4, o
mru0962 adhesin-like protein 14789 SP 1
__________________________________________________
1
mru0970 adhesin-like protein 2483 SP 1
o
n)
mru1263 adhesin-like protein 2585 SP _ 1 ,_.
-
1
mru1358 adhesin-like protein 2243 SP 1 _
.1,
_
_______________________________________________________________________________
_____________________________
mru1386 adhesin-like protein 1841 ' SP 1
adhesin-like protein ,
________________________________________
with cysteine protease
mru1387 domain ' 2957 SP 1
mru1424 adhesin-like protein 1445 SP 1
mru1500 adhesin-like protein 3896 SP 1 g..-7
=
oo
Group 4: Pseudomurein-binding Group
n
mru0493 adhesin-like protein 2447 . SP 1
1-3
adhesin-like protein
=
N
' with transglutaminase
mru0824 domain 2027 SP 1
1--,
,
o
adhesin-like protein
o
with transglutaminase
I =
1--,
mru1499 domain 3032 SP 0
I
o
,
=
=
adhesin-like protein Ir.
____________ It-121
with transglutaminase
mru1604 domain 2996 SP 0
1Pfam: Transglutaminase (PF01841), Papain family cysteine protease (PF00112),
Domain of unknown function DUF11 (PF01345), Haemagluttinin repeat (PF05594),
Protein of unknown function DUF1565 (PF07602), Group 1 Bacterial Ig-like
domain Big_1 (PF02369), Group 2 Bacterial Ig-like domain Big_2 (PF02368),
Pseudomurein-
binding repeat (PF09373), CNA protein B-type domain (PF05738), Cell wall
binding repeat (PF01473), Alpha-2-macroglobulin (PF01835), Morn repeat variant
(PF07661),
RNA polymerase Rpb1 C-terminal repeat (PF05001), GLUG motif (PF07581),
Staphylocoagulase repeat (PF04022).
2TigrFam: Chlamydial POMP repeat (TIGR01376), conserved repeat domain
B_ant_repeat (TI0R01451), YD repeat (TIGR01643).
35igna1: SP, signal peptide as determined by SignalP3.0; CW, cell wall as
determined by PSORT
4TM: TM, transmembrane domain predictions completed using
www.cbs.dtu.dkiservices/TMHMM/
0
=
74
19
(A)
n)
=
IV
=
=
=
. ,
Table 4. M1 genes postulated to have originated from horizontal gene transfer
events based on analysis by the Darkhorse algorithm (Podell and 0
,
t..)
Gaasterland, 2007). Hits with a Unease probability score LPI) of less than 0.6
are shown. =
,--
Functional Sub
0-
¨. = -
o
Locus tag Sequin Annotation Classification Classification LPI
E-value Species Lineage ks.i
vi
f...)
o
cysteine synthase Amino acid
Clostridium leptum DSM
mru2096 CysKM2 biosynthesis Cysteine 0.134 7E-65 753
Bacteria; Firmicutes
glutamate synthase
Anaerofustis
domain-containing Amino acid glutamate/gluta
stercorihotninis DSM
mru0810 protein biosynthesis mine 0.13 2E-177 = 17244
Bacteria; Firmicutes
tryptophan synthase Amino acid
Clostridium the rmocellum
mru0213 beta subunit TrpB1 biosynthesis Tryptophan 0.134
1E-179 ATCC 27405 Bacteria; Firmicutes
HIRAN domain- Chromosome 0.000000
Anaerococcus prevotii 0
mru1667 containing protein Cell cycle replication 0.138
01 DSM 20548 Bacteria; Firmicutes 0
RecF/RecN/SMC N
-.1
-A
terminal domain- Genome
Desulfatibacillum Bacteria;
I.)
mru1158 containing protein Cell cycle segregation 0.088
4E-13 alkenivorans AK-01 Proteobacteria N)
.1,.
Cell surface
Trypanosome brucei Eukaryota;
mru0004 adhesin-like protein , Cell envelope proteins 0.003
1E-22 TREU927 Kinetoplastida
4s.
1-=
I.)
Cell surface
1 . .0
mru1299 adhesin-like protein Cell envelope proteins =
0.003 2E-36 Trichoplax adhaerens Eukaryota; Metazoa
1
Cell surface
Planctomyces mans DSM Bacteria; N.) , .1,
mru0687 adhesin-like protein Cell envelope proteins 0.082
1E-25 8797 Planctomycetes
Cell surface
Planctomyces marls DSM Bacteria;
mru1263 adhesin-like protein Cell envelope proteins 0.082
2E-22 8797 Planctomycetes
Cell surface
Planctomyces malls DSM Bacteria;
mru1417 adhesin-like protein Cell envelope proteins 0.082
1E-10 8797 . Planctomycetes
=
Cell surface Bacteria;
mru2178 adhesin-like protein Cell envelope proteins 0.085
5E-96 Polaribacter sp. MED152 Bacteroidetes n
1-
. Cell surface
Chloroflexus aurantiacus J-
mru1312 adhesin-like protein Cell envelope proteins 0.088
2E-17 10-fl Bacteria; Chloroflexi
N
. Cell surface
Lactobacillus johnsonii
,--,
mru0036 adhesin-like protein Cell envelope proteins 0.112
8E-33 NCC 533 Bacteria; Firmicutes
¨
o
Cell surface Listeria
monocytpgenes str. o
o
mru1315 adhesin-like protein Cell envelope proteins 0.112
2E-13 4b H7858 Bacteria; Firmicutes
c7,
mru2134 adhesin-like protein Cell envelope Cell surface 0.13
6E-165 Coprococcus eutactus Bacteria; Firmicutes
= , ,
,
,
-
proteins ' ,
, ATCC
27759 -
2
UDP-glucose/GDP- ,
1
o
mannose Exopolysacchari 1
Bifidobacterium 'Bacteria; ,--
,-,
mru1051 dehydrogenase Cell envelope des
0.084 , 3E-146 adolescentis L2-32 Actinobacteria .--.
o
glycosyl transferase Exopolysacchari
Dinoroseobacter shibae Bacteria; vi
f...)
mru0099 GT4 family Cell envelope des 0.087 8E-62
DFL 12 .. Proteobacteria
glycosyl transferase ,, Exopolysacchari 0.000000
Hydrogenobaculum sp.
mru1074 GT2 family Cell envelope des 0.087 001
YO4AAS1 Bacteria; Aquificae
Exopolysacchari
Bacteroides fragilis NCTC Bacteria;
mru1527 glycosyl transferase Cell envelope des
0.087 2E-61 9343 Bacteroidetes
Exopolysacchari Hahella
chejuensis KCTC Bacteria;
mru1879 sialyltransferase Cell envelope des
0.087 3E-54 2396 Proteobacteria
Exopolysacchari
Clostridium botulinum A2 a
mru1528 glycosyl transferase Cell envelope des
0.134 2E-55 str. Kyoto Bacteria; Firmicutes
CMP-N-
0
i.)
acetylneuraminic acid Exopolysacchari
Clostridium botulinum A3
-A
mru1876 synthetase NeuA Cell envelope des
0.134 .. 0 .. str. Loch Maree .. Bacteria; Firmicutes
I.)
m
N-acetyl neuramic Exopolysacchari
Clostridium botulinum B1 . .1,.
mru1878 acid synthetase NeuB Cell envelope
des 0.134 1E-124 str. Okra Bacteria; Firmicutes
. glycosyl transferase Exopolysacchari
0
IV
mru2183 012 family Cell envelope des 0.134 5E-40
Clostridium sp. SS2/1 Bacteria; Firmicutes ,
.0
.
i.)
UDP-glucose/GDP-
I
IV
mannose Exopolysacchari
Anaerococcus prevotii
mru1075 dehydrogenase Cell envelope des
0.138 = 3E-139 DSM 20548 Bacteria; Firmicutes
glycosyl transferase
GT2 family/CDP- .
,
glycerol:poly(glycerop
hosphate) -
glycerophosphotransf
mru2181 erase Cell envelope Other 0.134
2E-129 Clostridium sp. SS2/1 Bacteria; Firmicutes cn
1-3
glycosyl transferase
GT2 family/CDP- '
N
glycerol:poly(glycerop
,--,
- o
hosphate)
glycerophosphotransf
. o
mru2182 erase Cell envelope Other 0.134
2E-127 Clostridium sp. SS2/1 Bacteria; Firmicutes
. .
-
..
= CDP-
, 0
glycerol:poly(glycerop
r..)
o
hosphate)
,--
0-,
glycerophosphotransf
. ¨.
o
mru2191 erase Cell envelope Other 0.134 5E-123
Clostridium sp. SS2/1 Bacteria; Firmicutes vi
f...)
NADPH-dependent Cellular Electron Treponema
denticola Bacteria; o
mru1260 FMN reductase processes transport 0.088 3E-43 ATCC
35405 Spirochaetes
NADPH-dependent ' Cellular Electron
Heliobacterium
mru1369 FMN reductase processes transport 0.129 2E-24
modesticaldum Ice1 Bacteria; Firmicutes
NADPH-dependent Cellular Electron
Clostridium leptum DSM
mru0580 FMN reductase processes transport 0.134 4E-72 753
Bacteria; Firmicutes
NADPH-dependent Cellular Electron
Clostridium beijerinckii
mru1732 FMN reductase processes transport 0.134 2E-55 NCIMB
8052 Bacteria; Firmicutes r)
NADPH-dependent Cellular Electron
Clostridiales bacterium
mru1609 FMN reductase processes transport 0.141 2E-76 1 7 47
FAA Bacteria; Firmicutes 0
_ _ _
.
i.)
Cellular Oxidative stress
Syntrophus aciditrophicus Bacteria;
-A
mru1258 rubredoxin Rub2 processes . response 0.088
1E-13 SB Proteobacteria
I.)
.
m
Cellular Oxidative stress Slackia
heliotrinireducens Bacteria; .1,.
mru1367 rubrerythrin Rbr2 processes ' response 0.093
4E-65 DSM 20476 Actinobacteri6
..t=.
0
Cellular Oxidative stress
a) 1-=
I.)
mru1259 rubredoxin Rub3 processes response 0.13 3E-11
Oribacterium sinus F0268 Bacteria; Firmicutes '
.0
Cellular Oxidative stress
Anaerocellum
1
NJ
mru1564 desulfoferrodoxin Dfx processes = response 0.138
2E-31 thermophilum DSM 6725 Bacteria; Firmicutes
pyruvate-formate Central carbon
Desulfitobacterium
mru1519 lyase Pfl metabolism Formate 0.129 2E-54
hafniense Y51 Bacteria; Firmicutes
Central carbon Gluconeogenesi
Anaerocellum
mru0635 pyruvate kinase PykA metabolism s 0.138 6E-103
thermophilum DSM 6725 Bacteria; Firmicutes
,
3-hexulose-6-
phosphate isomerase Central carbon
Staphylothermus marinus Archaea; 1-Lt
mru1310 = Phi2 metabolism RUMP pathway 0.424 _ 4E-23 Fl =-
Crenarchaeota cn
1-3
=
iron-sulfur cluster Energy Electron = Dorea
longicatena DSM =¨
mru0155 binding protein metabolism transport 0.13
4E-35 13814 Bacteria; Firmicutes N
4Fe-4S binding
o
,--,
=
domain-containing Energy Electron' Dorea
longicatena DSM =
mru1711 protein metabolism transport 0.13 2E-40 13814
Bacteria; Firmicutes . o
,-,
mru0009 flavodoxin domain Energy Electron 0.134
5E-54 Clostridium kluyveri NBRC Bacteria; Firmicutes
o
,
,
containing protein metabolism transport 12016
0
4Fe-4S binding
t..)
o
domain-containing Energy Electron
Clostridium perfringens ,--
,-,
mru2036 protein , metabolism transport 0.134 2E-87 CPE
str. F4969 _ Bacteria; Firmicutes ,
o
NADP-dependent
un
f..)
alcohol Energy
Clostridium sporo genes
mru0065 dehydrogenase Adh1 metabolism Ethanol 0.134
6E-112 ATCC 15579 Bacteria; Firmicutes
mru1297 hypothetical protein Hypothetical ,Conserved 0.003 ,
2E-110 Ralstonia phage RSL1 .. Viruses; Caudovirales
.
Mycoplasma
mru0643 hypothetical protein Hypothetical Conserved 0.081
2E-26 hyopneumoniae J Bacteria; Tenericutes
Chlorobium chlorochromatii
mru0899 hypothetical protein Hypothetical Conserved 0.081
1E-13 CaD3 ' Bacteria; Chlorobi
Corynebacterium
Bacteria; a
mru0751 hypothetical protein Hypothetical Conserved 0.085
8E-11 urealyticum DSM 7109 Actinobacteria
Psychrotlexus torquis
Bacteria; 0
i.)
mru0945 hypothetical protein Hypothetical Conserved 0.085
1E-20 ATCC 700755 Bacteroidetes
-A
IV
Geodermatophilus
Bacteria; I.)
.
m
mru1579 hypothetical protein Hypothetical Conserved 0.085
1E-33 obscurus DSM 43160 Actinobacteria
.1,.
Actinomyces odontolyticus Bacteria;
mru1964 hyikthetical protein Hypothetical Conserved 0.085
8E-57 ATCC 17982 Actinobacteria -4 H
I.)
1
Havobacterium
Bacteria; .0
i.)
mru2185 hypothetical protein Hypothetical Conserved
0.085 0.000003 psychrophilum JIP02/86
Bacteroidetes I
NJ
Bacteroides dorei DSM
Bacteria; .1,.
mru0021 hypothetical protein Hypothetical Conserved 0.087
8E-47 17855 Bacteroidetes
-
Tolumonas auensis DSM Bacteria;
, mru0134 hypothetical protein Hypothetical Conserved 0.087
2E-35 9187 Proteobacteria
Bacteria;
mru0223 hypothetical protein Hypothetical Conserved 0.087
2E-29 Dichelobacter nodosus Proteobacteria
=
Bacteria; ro
mru0644 hypothetical protein Hypothetical Conserved 0.087
4E-13 Roseobacter sp. AzwK-3b Proteobacteria
n
1-
Bacteria;
z
mru0836 , hypothetical protein Hypothetical Conserved 0.087
9E-14 Roseobacter sp. AzwK-3b Proteobacteria
N
= Bacteria;
mru1073 hypothetical protein Hypothetical Conserved 0.087
5E-15 Opitutus terrae PB90-1 Verruconlicrobia
o
Hahella chejuensis KCTC
Bacteria; =
,-,
mru1290 hypothetical protein Hypothetical Conserved . 0.087
1E-13 2396 Proteobacteria c,
,
.
,
'
Janthinobacterium sp.
Bacteria; 0
mru1932 hypothetical protein Hypothetical Conserved
0.087 1E-132 Marseille . Proteobacteria t..)
,--
Bacteroides cellulosilyticus Bacteria;
0-
mru1966 hypothetical protein Hypothetical Conserved
0.087 5E-10 DSM 14838 Bacteroidetes o
Desulfovibrio vulgaris str.
Bacteria; f...)
mru0028 hypothetical protein Hypothetical , Conserved
0.088 5E-11 Miyazaki F = Proteobacteria
'
Leptotrichia buccalis DSM Bacteria;
mru0118 hypothetical protein Hypothetical Conserved
0.088 2E-28 1135 Fusobacteria
Syntrophobacter
Bacteria;
mru0573 hypothetical protein Hypothetical = Conserved
0.088 1E-12 fumaroxidans MPOB Proteobacteria
0.000000
Bacteria;
mru0780 hypothetical protein Hypothetical Conserved
0.088 2 Rhizobiurn etli CIAT 652 Proteobacteria
Bacteria;
a
mru0803 hypothetical protein Hypothetical Conserved
0.088 5E-34 Nitrobacter sp. Nb-311A
Proteobacteria 0
Rhodopseudomonas
Bacteria;
-.1
mru1063 hypothetical protein Hypothetical Conserved
0.088 1E-28 palustris HaA2 Proteobacteria
-A
IV
0.000000 Rhizobium leguminosarum Bacteria;
I.)
m
mru1172 hypothetical protein Hypothetical Conserved
0.088 2 by. trifoliiWSM1325 Proteobacteria
1.)
Desulfovibrio salexigens
Bacteria; _. 0
mru1749 hypothetical protein Hypothetical Conserved
0.088 1E-45 DSM 2638 Proteobacteria
1
Geobacter bemidjiensis
Bacteria; o
i.)
1
mru2194 hypothetical protein Hypothetical Conserved
0.088 1E-55 , Bern = Proteobacteria NJ
.I,
0.000000 Microcoleus
Bacteria;
mru1389 hypothetical protein Hypothetical Conserved
0.096 2 chthonoplastes PCC 7420 Cyanobacteria
mru0642 hypothetical protein Hypothetical , Conserved
0.108 1E-31 Mollicutes bacterium D. Bacteria; Tenericutes
Staphylococcus camosus
mru0745 hypothetical protein Hypothetical Conserved
0.112 8E-11 subsp. camosus TM300 Bacteria; Firmicutes
Brevibacillus brevis NBRC
mru1886 hypothetical protein Hypothetical Conserved
0.112 ' 2E-30 100599 Bacteria; Firmicutes , 'A
Lactobacillus vagina/is
1-
mru1937 hypothetical protein Hypothetical Conserved
0.112 7E-43 , ATCC 49540 Bacteria; Firmicutes
N
Bubacterium biforme DSM
mru1242 hypothetical protein Hypothetical , Conserved
0.117 2E-24 3989 Bacteria; Firmicutes o
Eubacterium biforme DSM
-C7
mru2130 hypothetical protein Hypothetical Conserved
0.117 5E-112 3989 Bacteria; Firmicutes o
,-,
c7,
mru0791 hypothetical protein Hypothetical Conserved
0.12 -- 8E-10 - Coprothermobacter -- Bacteria; Firmicutes
' ,
proteolyticus DSM 5265
.
0
Veil/one/la parvula DSM
t..)
mru0078 hypothetical protein Hypothetical Conserved 0.129
2E-100 2008 Bacteria; Firmicutes =
=--
,--,
Dethiobacter alkaliphilus
,
o
mru0776 hypothetical protein Hypothetical Conserved 0.129
3E-80 AHT 1 Bacteria; Firmicutes ks.i
un
f..)
Dethiobacter alkaliphilus
o
mru0785 hypothetical protein Hypothetical Conserved 0.129
9E-81 AHT 1 Bacteria; Firmicutes
,
Desulfotomaculum
mru1229 hypothetical protein Hypothetical. Conserved 0.129
1E-24 acetoxidans DSM 771 Bacteria; Firmicutes
Desulfitobacterium
mru1323 hypothetical protein - Hypothetical Conserved 0.129
4E-23 hafniense Y51 Bacteria; Firmicutes
Desulfotomaculum
mru1724 hypothetical protein Hypothetical Conserved 0.129
=2E-23 reducens MI-1 Bacteria; Firmicutes a
Ruminococcus gnavus
mru1967 hypothetical protein Hypothetical Conserved 0.129
2E-33. ATCC 29149 Bacteria; Firmicutes 0
i.)
Ruminococcus obeum
-A
mru2143 hypothetical protein _ Hypothetical Conserved 0.129
1E-39 ATCC 29174 Bacteria; Firmicutes
I.)
m
Roseburia intestinalis L1-
mru0100 hypothetical protein r Hypothetical ' Conserved 0.13
1E-43 -- 82 -- Bacteria; Firmicutes
Anaerofustis
CD H
IV
I
stercorihominis DSM
o
mru0156 hypothetical protein = Hypothetical Conserved 0.13
3E-47 17244 Bacteria; Firmicutes "
I
IV
Anaerofustis
.1,
stercorihominis DSM
mru0185 hypothetical protein Hypothetical Conserved 0.13
4E-112 17244 Bacteria; Firmicutes
Ahaerofustis
0.000000 stercorihominis DSM
mru0558 hypothetical protein Hypothetical Conserved 0.13
4 17244 Bacteria; Firmicutes
Dorea longicatena DSM
ro
mru0753 hypothetical protein Hypothetical Conserved 0.13
8E-112 13814 Bacteria; Firmicutes n
1-
=
Dorea formicigenerans
z
mru0806 hypothetical protein Hypothetical Conserved 0.13
1E-56 ATCC 27755 Bacteria; Firmicutes N
Roseburia intestinalis L1-
' =
=--,
=
mru1065 hypothetical protein Hypothetical Conserved 0.13
1E-32 82 Bacteria; Firmicutes o
-C7
:Coprococcus eutactus
=
mru1227 hypothetical protein Hypothetical Conserved
0.13 0.000008 ATCC 27759 Bacteria; Firmicutes
=-,
o
o
=
-
. ,
Eubacterium siraeum DSM
0
mru1287 hypothetical protein Hypothetical Conserved 0.13
1E-24 15702 Bacteria; Firmicutes t..)
o
Coprococcus eutactus
,--
0-
mru1938 hypothetical protein Hypothetical Conserved 0.13
, 4E-44 ATCC 27759 Bacteria; Firmicutes ¨.
o
Clostridium hiranonis DSM
vi
f...)
mru0047 hypothetical protein Hypothetical Conserved 0.134 ,
3E-44 13275 Bacteria; Firmicutes o
Clostridium botulinum E3
mru0150 hypothetical protein Hypothetical Conserved 0.134
6E-39 str. Alaska E43 Bacteria; Firmicutes
Clostridium scindens ATCC
mru0167 hypothetical protein Hypothetical Conserved 0.134
3E-29 35704 Bacteria; Firmicutes
Clostridium acetobutylicum
mru0186 hypothetical protein Hypothetical Conserved 0.134
9E-158 ATCC 824 Bacteria; Firmicutes
mru0572 hypothetical protein Hypothetical Conserved 0.134
3E-32 Clostridium sp. M62/1 Bacteria;
Firmicutes a
Clostridium hiranonis DSM
mru0947 hypothetical protein Hypothetical Conserved 0.134
4E-136 13275 Bacteria; Firmicutes 0
i.)
-.1
Clostridium thermocellum
-A
IV
mru1173 hypothetical protein Hypothetical Conserved 0.134
5E-30 ATCC 27405 Bacteria; Firmicutes N)
m
Clostridium thermocellum
mru1177 hypothetical protein Hypothetical , Conserved 0.134
3E-42 ATCC 27405 Bacteria; Firmicutes
c.ri
o
Clostridium the rmocellum
. o H'
I.)
1
mru1311 hypothetical protein Hypothetical Conserved 0.134
2E-122 ATCC 27405 Bacteria; Firmicutes .0
i.)
1 Clostridium asparagiforme .
NJ
mru1330 hypothetical protein Hypothetical Conserved 0.134
1E-14 DSM 15981 Bacteria; Firmicutes .1,.
Clostridium hylemonae
mru2085 hypothetical protein Hypothetical Conserved 0.134
5E-53 DSM 15053 Bacteria; Firmicutes
0.000000 Clostridium beijerinckii
mru2154 hypothetical protein Hypothetical Conserved 0.134
02 NCIMB 8052 Bacteria; Firmicutes
Blautia hydrogenotrophica
mru0835 hypothetical protein Hypothetical Conserved 0.138
1E-33 DSM 10507 Bacteria; Firmicutes
cn
Clostridiales bacterium
1-
mru0939 hypothetical protein Hypothetical Conserved 0.141
5E-38 1_7_47_FAA Bacteria; Firmicutes
0.000000
Archaea; = N
mru2205 hypothetical protein Hypothetical Conserved 0.424
04 Sulfolobus tokodaii str. 7 Crenarchaeota
,--,
o
=
0.000000 Parabacteroides johnsonii Bacteria; -C7
o
mru1521 hypothetical protein Hypothetical Hypothetical 0.086
02 =DSM 18315 Bacteroidetes =
,-,
mru0017 hypothetical protein Hypothetical Hypothetical 0.087
5E-21 Bacteroides sp. D1 Bacteria; o
o
=
,
,
..
Bacteroidetes
0
.
0.000000
. Bacteria; t..)
o
mru0018 hypothetical protein Hypothetical Hypothetical
0.087 003 Bacteroides sp. D1 Bacteroidetes
,-,
,
Bacteroides ovatus ATCC , Bacteria;
mru0053 hypothetical protein Hypothetical Hypothetical
0.087 2E-15 8483 Bacteroidetes un
f..)
Slackia heliotrinireducens
Bacteria;
\ mru0432 hypothetical protein Hypothetical Hypothetical
0.093 2E-100 DSM 20476 Actinobacteria
0.000000 Slackia heliotrinireducens
Bacteria;
mru1582 hypothetical protein Hypothetical Hypothetical
0.093 4 DSM 20476 Actinobacteria
0.000000 Lactobacillus saliva rius
mru2173 hypothetical protein Hypothetical Hypothetical
0.112 2 ATCC 11741 Bacteria; Firmicutes
mru1161 hypothetical protein Hypothetical Hypothetical
0i14 2E-10 Bacillus cereus BGSC 6E1 Bacteria;
Firmicutes ..
Anaerococcus lactolyticus
a
mru0024 hypothetical protein Hypothetical Hypothetical
0.138 6E-26 ATCC 51172 Bacteria; Firmicutes
0
0.000000 Anaerococcus lactolyticus
-.1
mru0025 hypothetical protein Hypothetical Hypothetical
0.138 04 ATCC.51172 Bacteria; Firmicutes
-A
IV
3-oxoacyl-(acyl-
I.)
m
_ carrier-protein)
mru1031 reductase FabG1 Lipid metabolism Bacterial 0.112 3E-48
Paenibacillus sp. JDR-2 -- Bacteria; Firmicutes
cyl ,
diacylglycerol kinase
Clostridium scindens ATCC " I.)
1
nnru1289 DagK Lipid metabolism Bacterial 0.134 7E-19
35704 Bacteria; Firmicutes o
n)
' 3-oxoacyl-(acyl-
NJ
carrier-protein)
Clostridium cellulolyticum
mru1630 reductase FabG2 Lipid metabolism Bacterial 0.134 2E-74
H10 Bacteria; Firmicutes .
CRISPR-associated CRISPR- ,
protein TIGR02710 associated Syntrophus aciditrophicus Bacteria;
mru1178 family Mobile elements genes 0.088 1E-28 SB
Proteobacteria
Methanobacterium phage
mru0256 phage integrase Mobile elements Prophage 0.003 1E-11
psiM2 Viruses; Caudovirales ro
n
Methanothermobacter
1-
mru0307 phage-related protein Mobile elements Prophage 0.003 3E-16
phage psiM100 Viruses; Caudovirales -- z
type II restriction
s
w
o
enzyme, methylase
Robiginitalea biformata Bacteria; ,--,
o
mru0324 subunit Mobile elements Prophage 0.085 5E-110
HTCC2501 Bacteroidetes O'
o
Alistipes putredinis DSM
Bacteria;
mru0282 phage-related protein Mobile elements Prophage 0.086 2E-16
17216 Bacteroidetes c,
= ,
f
.
= =
,
,
.
. .
,
_______________________________________________________________________________
__________________________
terminase large
Burkholderia pseudomallei Bacteria;
0
mru0285 subunit Mobile elements Prophage 0.087 4E-32
Pasteur 52237 Proteobacteria t..)
o
Sulfurospirillum deleyianum Bacteria;
,--
0-,
mru0298 hypothetical protein Mobile elements
Prophage 0.087 1E-13 DSM 6946 Proteobacteria
¨.
o
Eukaryota;
un
.
f..)
mru0267 hypothetical protein Mobile elements
Prophage 0.111 9E-13 . Ricinus communis
Viridiplantae o
ParB-like nuclease
domain-containing
Lactobacillus reuteri CF48-
mru0280 protein Mobile elements Prophage 0.112 3E-19 3A
- Bacteria; Firmicutes
mru0287 phage portal protein Mobile elements Prophage
. 0.113 1E-67 Geobacillus sp. G1.1MC16 Bacteria;
Firmicutes
mru0308 phage-related protein Mobile elements Prophage 0.113 1E-40
Geobacillus sp. G11MC16 Bacteria; Firmicutes
Geobacillus
mru0310 phage-related protein Mobile elements Prophage 0.113
0.000002 thermodenitrificans NG80-2
Bacteria; Firmicutes a
mru0057 phage-related protein Mobile elements Prophage 0.114 7E-39
Bacillus coahuilensis m4-4 , Bacteria; Firmicutes
mru0058 phage-related protein Mobile elements Prophage 0.114
0.000001 Bacillus coahuilensis m4-4
Bacteria; Firmicutes 0
i.)
-.1
mru0288 phage-related protein Mobile elements Prophage 0.114 2E-51
Bacillus coahuilensis m4-4 Bacteria;
Firmicutes -A
NJ
mru0311 phage-related protein Mobile elements Prophage 0.114 2E-17
Bacillus coahuilensis m4-4 ' Bacteria;
Firmicutes I.)
m
0.000000
.1,.
mru0313 phage-related protein Mobile elements Prophage 0.114 3
Bacillus coahuilensis m4-4 Bacteria;
Firmicutes Z Pi ,K2
H'
. = dnd system-
rvI.)
1
mru0321 associated protein 3 Mobile elements
Prophage 0.114 5E-107 Bacillus cereus MM3
Bacteria; Firmicutes .0
i.)
1 dnd system-
Bacillus cereus BDRD- NJ
mru0322 associated protein 1 Mobile elements
Prophage 0.114 3E-73 ST196 Bacteria;
Firmicutes .1,.
dnd system-
mru0323 associated protein 2 Mobile elements,
Prophage 0.114 0 . Bacillus cereus m1293 Bacteria;
Firmicutes
Anaerostipes caccae DSM
,
mru0299 hypothetical protein Mobile elements
Prophage 0.13 2E-13 14662 Bacteria; Firmicutes
Blautia hansenii DSM
mru0296 hypothetical protein Mobile elements
Prophage 0.138 2E-17 20583 -Bacteria; Firmicutes
n
nitroreductase family Nitrogen
Clostridium cellulolyticum . 1-
mru0094 protein metabolism General 0.134 3E-38 H10
Bacteria; Firmicutes
N nitroreductase family
Nitrogen . o
mru0995 protein metabolism General 0.134 3E-40
Clostridium sp. M62/1 Bacteria; Firmicutes ,--,
o
ADP-
o
' ribosylglycohydrolase Nitrogen
Bacteroides caccae ATCC Bacteria; =
,-,
mru1580 family protein metabolism Other 0.087 2E-64
43185 Bacteroidetes c=
o
,
=
ADP-
.
0
ribosylglycohydrolase Nitrogen
Synechococcus sp. PCC Bacteria; t..)
mru0749 family protein metabolism Other 0.097 1E-71 7002
Cyanobacteria o
,
ADP-
0-
,
o
ribosylglycohydrolase Nitrogen
Synechococcus sp. PCC Bacteria; ks.i
un
mru1455 family protein metabolism Other 0.097 8E-58 7002
Cyanobacteria f..)
helicase RecD/TraA Nucleic acid
Bacteroides dorei DSM Bacteria;
mru1157 family metabolism Helicase 0.087 3E-46
17855 Bacteroidetes
Wolbachia endosymbiont
. .
Nucleic acid of
Culex quinqyefasciatus Bacteria;
mru0620 helicase SNF2 family metabolism Helicase 0.088
1E-152 JHB Proteobacteria --,
DEAD/DEAH box
" helicase domain- Nucleic
acid
. mru0778 containing protein metabolism Helicase 0.134
0 . Clostridium butyricum 5521
Bacteria; Firmicutes a
Bifidobacterium longum
0
i.)
DNA mismatch Nucleic acid Recombination subsp.
infantis ATCC Bacteria;
-A
mru1429 endonuclease Vsr metabolism and repair 0.084 6E-49
15697 Actinobacteria
I.)
m
6-0-methylguanine
.1,.
'
DNA Nucleic acid Recombination
Syntrophus aciditrophicus Bacteria;
cri
0
mru1575 methyltransferase Ogt metabolism and repair 0.088
3E-26 SB Proteobacteria c.,,a 1-=
I.)
1
excinuclease ABC A Nucleic acid Recombination
Catenibacterium mitsuokai
mru1256 subunit UvrA2 metabolism and'repair 0.109 0 DSM
15897 Bacteria; Firmicutes " ,
NJ
,
exodeoxyribonucleas
.1,.
e VII small subunit Nucleic acid Recombination
mru0813 XseB metabolism and repair 0.111 3E-12
Enterococcus faecium DO Bacteria; Firmicutes
uracil-DNA Nucleic acid Recombination
Geobacillus kaustophilus
mru0218 glycosylase Ung metabolism and repair 0.113 , 2E-71
HTA426 Bacteria; Firmicutes
exodeoxyribonucleas -
e VII large subunit Nucleic acid Recombination
ro
mru0812 XseA ' metabolism and repair 0.114 3E-87
Bacillus sp. SG-1 Bacteria; Firmicutes n
1-
Anaerofustis
DNA-3-methyladenine Nucleic acid Recombination
stercorihominis DSM N
w
mru2068 glycosylase I Tag metabolism and repair 0.13 4E-66
17244 - =Bacteria; Firmicutes ,
,¨
Nucleic acid Recombination
Blautia hansenii DSM o
mru0770 exonuclease metabolism and repair 0.138 2E-41
20583 Bacteria; Firmicutes
=
,¨
mru1165 restriction enzyme Nucleic acid , Restriction and
0.097 0 Synechococcus sp. PCC Bacteria; .
c,
,
= .
,
methylase subunit metabolism modification 7002
Cyanobacteria
0
type II restriction Nucleic acid Restriction and
Streptococcus equi subsp. r..)
mru1166 endanuclease metabolism modification 0.111 3E-
119 equi 4047 ' Bacteria; Firmicutes , o
.--
0-
= DNA-cytosine Nucleic acid
Restriction and ,
o
mru0027 methyltransferase metabolism modification
0.114 2E-70 Bacillus cereus 03BB102 Bacteria;
Firmicutes vi
f...)
DNA modification Nucleic acid Restriction and
Eubacterium hallii DSM
mru1167 methylase metabolism modification 0.13
2E-128 3353 Bacteria; Firmicutes
DNA-cytosine Nucleic acid Restriction and
Anaerococcus lactolyticus
mru0026 methyltransferase metabolism modification
0.138 5E-91 ATCC 51172 Bacteria; Firmicutes
5-methylcytosine
restriction system Nucleic acid Restriction and
Br/ante/la formatexigens
mru0029 component protein metabolism modification
0.138 9E-72 DSM 14469 . Bacteria; Firmicutes
transglutaminase
a
= domain-containing
Protein Bifidobacterium breve DSM Bacteria;
mru2021 protein. Protein fate degradation 0.084
4E-42 20213 Actinobacteria 0
i.)
Chthoniobacter tlavus
Bacteria;
-A ,
. mru1305 DnaK-related protein Protein fate Protein
folding 0.096 3E-78 Ellin428 Verrucomicrobia
I.)
m
.
Coprothermobacter
mru1812 DnaK-related protein Protein fate Protein
folding 0.12 1E-100 proteolyticus DSM 5265 Bacteria; Firmicutes
Protein
Thermosipho africanus Bacteria; ,
IV
mru1510 rRNA methylase synthesis Other
0.087 1E-42 TCF52B Thermotogae 1
o
n)
queuosine
I
IV
biosynthesis protein Protein
Bacteria; .1,.
mru0437 QueC synthesis RNA processing 0.087
6E-45 ldiomarina loihiensis L2TR Proteobacteria
7-cyano-7-
deazaguanosine
biosynthesis protein Protein
Bacteria;
mru0439 QueE synthesis RNA processing 0.087
4E-21 Erythrobacter sp. SD-21 Proteobacteria
queuosine
biosynthesis protein Protein
Beijerinckia indica subsp. Bacteria; cn
1-3
mru0438 QueD synthesis RNA processing 0.088
4E-11 indica ATCC 9039- = Proteobacteria
RNA.ligase DRB0094 Protein
Chitinophaga pinensis Bacteria; N
mru1025 family synthesis RNA processing 0.09
8E-30 DSM 2588 Bacteroidetes
.--,
tRNA-dihydrouridine Protein
Clostridium ramosum DSM o
mru0096 synthase DusA1 synthesis RNA
processing 0.117 3E-94 1402 Bacteria; Firmicutes
o
mru1286 CMP/dCMP Purines & Pyrimidine 0.111 - 2E-29
Abiotrophia defective Bacteria; Firmicutes ...
c7,
,
,
,
deaminase Pyrinlidines '
interconversion ATCC 49176
0
sugar fermentation
r..)
o
stimulation protein
Dehalococcoides ,--
0-
mru0576 SfsA1 Regulation Other 0.096 4E-55
ethenogenes 195 Bacteria; Chloroflexi ,
o ,
carbon starvation . 1
Clostridium bartlettii DSM vi
f...)
mru0187 protein CstA Regulation Other 0.134 ; 9E-126
16795 Bacteria; Firmicutes
= TPR repeat- Protein 1
Paramecium tetraurelia Eukaryota;
mru1316 containing protein Regulation interactions
0.003 ' 6E-17 strain d4-2 Intramacronucleata
Protein
Eggetthella lenta DSM Bacteria;
' mru0066 serine phosphatase Regulation interactions
0.085 1E-47 2243 Actinobacteria
serine/threonine Protein Bacteria;
_
mru1306 protein kinase Regulation interactions 0.085 3E-38
Frankia sp. EAN1pec Actinobacteria
= anti-sigma factor
Protein Sphingomonas wittichii Bacteria; a
mru0513 antagonist Regulation interactions 0.087 8E-10
RW1 Proteobacteria
,
anti-sigma regulatory
0
i.)
factor
-A
.
IV
serine/threonine Protein
Bacteria; I.)
mru0516 protein kinase Regulation interactions 0.087 2E-13
.Bacteroides fragilis YCH46
Bacteroidetes m
serine/threonine
cil
'
protein kinase with. Protein
I.)
'
mru1168 TPR repeats Regulation . interactions 0.088 1E-
33 Roseiflexus sp. RS-1 Bacteria; Chloroflexi o
n)
serine/threonine Protein
Stigmatella aurantiaca Bacteria; 1
mru1288 protein phosphatase Regulation interactions
0.088 3E-20 DW4/3-1 Proteobacteria NJ
.I,
=
TPR repeat- Protein
Bacteria;
mru2116 containing protein Regulation interactions
0.096 8E-23 Cyanothece sp. PCC 7424 Cyanobacteria
= TPR repeat-
Protein Microcoleus Bacteria;
mru2166 containing protein Regulation interactions
0.096 0.000002 chthonoplastes PCC 7420 Cyanobacteria
4'-
=
=
phosphopantetheinyl
1-Lt
, '
transferase family Protein
cn
.
1-3
mru0514 protein Regulation interactions 0.114 5E-25
Bacillus mycoides Rock1-4 Bacteria; Firmicutes
phosphotyrosine Protein )
Halothermothrix orenii H N
mru1295 protein phosphatase Regulation interactions 0.12
4E-25 168 Bacteria; Firmicutes
,--,
,
o
Syntrophomonas wolfei
C-3,
Protein
subsp. wolfei str. o
o
,-,
mru0044 serine phosphatase Regulation interactions
0.129 2E-15 Goettingen Bacteria; Firmicutes
c7,
,
Syntrophomonas wolfei
0
anti-sigma factor , Protein subsp.
wolfei str. t..)
o
mru0067 =antagonist Regulation ' interactions 0.129 3E-13
Goettingen Bacteria; Firmicutes ,--
0-
biotin-binding and
o
phosphotyrosine '
vi
f...)
protein phosphatase
o
domain-containing Protein
Clostridium thermocellum
mru0039 protein Regulation interactions 0.134 1E-13
ATCC 27405 Bacteria; Firmicutes
Protein ,
Clostridium bolteae ATCC
mru0515 serine phosphatase Regulation interactions 0.134
1E-55 'BAA-613 Bacteria; Firmicutes
,
Fusobacterium nucleatum
transcriptional Transcriptional subsp.
polymorp hum - Bacteria; =
mru0577 regulator TetR family Regulation regulators 0.088
_ 8E-22 ATCC 10953 Fusobacteria a
iron dependent Transcriptional
Pelobacter propionicus Bacteria;
mru1338 repressor Regulation regulators 0.088 6E-35
DSM 2379 Proteobacteria 0
i.)
transcriptional Transcriptional
Sebaldella termitidis ATCC Bacteria; -A
IV
mru1629 regulator MarR family Regulation _ regulators 0.088
5E-18 33386 Fusobacteria I.)
m
transcriptional Transcriptional -
Abiotrophia defectiva
mru1160 regulator Regulation regulators = 0.111 6E-12
= ATCC 49176 Bacteria; Firmicutes " ¨ 0
(71
H
transcriptional Transcriptional
Shuttleworthia satelles , CO N)
I
mru0662 regulator LytR family Regulation regulators 0.13
6E-21 DSM 14600 Bacteria; Firmicutes .0
_ i.)
transcriptional Transcriptional
_ i)
mru1739 regulator TetR family Regulation regulators 0.13
6E-57 Oribacterium sinus F0268 Bacteria;
Firmicutes .1,.
_
non-ribosomal peptide Secondary
mru0351 synthetase metabolites NRPS 0.112 0
Brevibacillus parabrevis Bacteria; Firmicutes .
. .
Syntrophomonas wolfei
non-ribosomal peptide Secondary subsp.
wolfei str.
mru0068 synthetase metabolites NRPS 0.129 0
Goettingen Bacteria; Firmicutes =
amino, acid carrier =
1-:
n
, mru1759 protein AGCS family Transporters Amino acids
0.129 2E-164 Acidaminococcus sp. D21 Bacteria; Firmicutes
amino acid acid ABC .
transporter substrate- Dorea
longicatena DSM N
mru1945 binding protein Transporters Amino acids 0.13 3E-70
13814 Bacteria; Firmicutes o
,--,
ferrous iron transport
Geobacter uraniireducens Bacteria; ¨
o
mru1340 protein B FeoB1 Transporters Cations 0.088 5E-158
Rf4 Proteobacteria o
=
,-,
mru0821 transporter CDF Transporters Cations 0.093 4E-90
Slackia heliotrinireducens Bacteria; c7,
o
= .
,
, . . ,
'
= ,
family DSM
20476 Actinobacteria _ 0
transporter Na+/H+
t..)
. o
mru0405 antiporter family Transporters Cations.
0.134 3E-174 Clostridium sp 72 Bacteria; Firmicutes
,--
0-,
.
,
transporter small
o
multidrug resistance
=Yersinia bercovieri ATCC Bacteria; - vi
f...)
mru0358 (SMR) family. Transporters Other
0.087 . 2E-16 43970 . Proteobacteria
transporter small
multidrug resistance
Yersinia bercovieri ATCC Bacteria;
mru0369 (SMR) family Transporters , Other
0.087 2E-16 , 43970 Proteobacteria
Lactococcus lactis subsp.
mru1201 MFS transporter Transporters Other
0.111 9E-49 lactis Bacteria; Firmicutes
Staphylococcus
,
saprophyticus subsp.
a
mru0559 MFS transporter Transporters Other 0.112
2E-29 saprophyticus ATCC 15305 Bacteria; Firmicutes
ABC transporter ATP-
0
i.)
binding/permease ,
Eubacterium biforme DSM
. mru0366 protein Transporters Other
0.117 1E-21 3989 Bacteria; Firmicutes
I.)
m
transporter permease
Eubacterium biforme DSM
mru2176 family protein Transporters Other
0.117 . 6E-22 .3989 Bacteria; Firmicutes VµI IV
0
transporter permease
Clostridium spiroforme -4 H
IV
I
' mru2177 family protein Transporters Other
0.117 5E-83 DSM 1552 Bacteria; Firmicutes .0
i.)
The rmosinus
. 1
NJ
mru1002 MFS transporter Transporters Other
0.129 4E-46 carboxydivorans Non 1 Bacteria; Firmicutes
Na+ dependent
= transporter SBF
mru1332 family = - Transporters Other
0.129 4E-47 ACidaminococcus sp. 021 Bacteria; Firmicutes
transporter SDF
Eubacterium hallii DSM
mru0986 family Transporters Other
0.13 2E-121 3353 Bacteria; Firmicutes
ABC transporter ATP: Anaerofustis
.
binding/permease
stercorihominis DSM cn
1-3
mru1628 protein Transporters = Other
0.13 6E-175 17244 Bacteria; Firmicutes
= MatE
efflux family Clostridium hiranonis DSM N
mru0069 protein Transporters Other
0.134 7E-28 13275 Bacteria; Firmicutes
o
=
MatE efflux family --
o
mru0352 protein Transporters Other
0.134 2E-22 Clostridium sp. L2-50 Bacteria;
Firmicutes o
o
,-,
mru0993 transporter TOT Transporters Other
0.134 2E-26 Clostridium sp. 7_2_43FAA
Bacteria; Firmicutes c7,
. ,
'
, =
= ,
.
.
'
,
. family
_
0
ABC. transporter ATP-
binding/permease
o
.--
mru1627 protein Transporters Other 0.134 1E-112
Clostridium difficile 630 Bacteria; Firmicutes .¨
o
,
Blautia hansenii DSM ks.i
vi
mru0141 transporter Transporters Other . 0.138 4E-58
20583 Bacteria; Firmicutes f...)
transporter SDF '
Epulopiscium sp. N.t.
mru1789 family Transporters Other 0.138 3E-107
morphotype B Bacteria; Firmicutes
Unknown
Sphingobacterium Bacteria;'
mru0248 acyltransferase function Enzyme 0.084 2E-15
spiritivorum ATCC 33300 Bacteroidetes
NADP-dependent
alcohol Unknown
Saccharopolyspora Bacteria;
mru1847 dehydrogenase Adh2 function Enzyme 0.085 5E-65
etythraea NRRL 2338 Actinobacteria
a
glycyl-radical enzyme Unknown
Bacteroides Bacteria;
mru0052 activating protein function Enzyme 0.087
6E-35 thetaiotaomicron VPI-5482
Bacteroidetes 0
i.)
acetyltransferase Unknown
Oxalobacter formigenes Bacteria; -A
mru0574 GNAT family function Enzyme 0.087 5E-50
HOxBLS Proteobacteria
I.)
m
oxidoreductase
.1,.
aldo/keto reductase Unknown Bacteroides capillosus
Bacteria; mru0579 family function Enzyme 0.087 1E-131
ATCC 29799 Bacteroidetes cx) 1-=
I.)
' NADH-dependent
Unknown Campylobacter curvus Bacteria; .0
-
mru1757 flavin oxidoreductase function Enzyme 0.087 1E-54
525.92 Proteobacteria
1
NJ
Unknown
Bacteroides capillosus Bacteria; .1,.
mru1758 acetyltransferase function Enzyme 0.087 3E-49
ATCC 29799 Bacteroidetes
Unknown
Bacteria;
mru1881 acetyltransferase function Enzyme 0.087 1E-51
ldiomarina loihiensis L2TR Proteobacteria
acetyltransferase Unknown
Bacteria;
mru0612 GNAT family function Enzyme 0.088 3E-40
Geobacter sp. FRC-32 Proteobacteria
' hydrolase alpha/beta Unknown
Nostoc punctiforme PCC Bacteria;
mru1036 fold family function Enzyme 0.088 3E-24
73102 Cyanobacteria cn
1-3
. Unknown
Catenibacterium mitsuokai
mru1502 methyltransferase function Enzyme 0.109 4E-57
DSM 15897 Bacteria; Firmicutes N
w
Unknown
Pediococcus pentosaceus
,--,
mru2170 acetyltransferase function Enzyme 0.111 1E-52
ATCC 25745 Bacteria; Firmicutes o
manganese- Unknown
o
mru0929 dependent inorganic function Enzyme 0.114
3E-85 Bacillus sp. SG-1 Bacteria; Firmicutes ...
c7,
,
'
= pyrophosphatase
=
0
PpaC
r..)
o
' hydrolase alpha/beta Unknown
0-,
mru1508 fold family function Enzyme 0114 2E-14
Bacillus clausii KSM-K16 Bacteria; Firmicutes --.
Bacillus thuringiensis
vi
f...)
o
Unknown
serovar konkukian str. 97-
mru1534 acyltransferase function Enzyme 0.114 2E-16 27
Bacteria; Firmicutes
SAM-dependent Unknown .
Eubacterium biforme DSM
mru0637 methyltransferase function Enzyme 0.117 2E-34
3989 Bacteria; Firmicutes
.
Unknown
Ruminococcus obeum .
mru0512 acyltransferase function Enzyme 0.129 9E-25
ATCC 29174 Bacteria; Firmicutes
_
Unknown
Eubacterium siraeum DSM .
mru0610 amidohydrolase function Enzyme 0.13 , 1E-67
15702 Bacteria; Firmicutes a
SAM dependent Unknown
Anaerostipes caccae DSM
mru0779 methyltransferase function Enzyme 0.13 2E-42
14662 Bacteria; Firmicutes 0
i.)
short-chain
-A
IV
dehydrogenase family Unknown
Alkaliphilus oremlandii I.)
m
mru0443 protein function = , Enzyme 0.132 7E-72
OhILAs = Bacteria; Firmicutes .1,.
= SAM dependent
Unknown Clostridium sporogenes = 1.)
¨ 0
mru0195 methyltransferase function Enzyme 0.134 3E-39
ATCC 15579 = Bacteria; Firmicutes . (0 1\)
1
hydrolase alpha/beta Unknown
Clostridium kluyveri DSM .0
i.)
mru0511 fold family function Enzyme 0.134 5E-39
555 Bacteria; Firmicutes I
NJ
acetyltransferase Unknown -
Clostridium acetobutylicum
mru0633 GNAT family function Enzyme 0.134 8E-14
ATCC 824 Bacteria; Firmicutes
radical SAM domain- Unknown
Clostridium botulinum A3
mru0646 containing protein function Enzyme 0.134
4E-102 str. Loch Maree Bacteria; Firmicutes
oxidoreductase
,
aldo/keto reductase . Unknown
=Clostridium kluyveri DSM
mru1120 family function Enzyme 0.134 3E-93
555 Bacteria; Firmicutes
acetyltransferase Unknown
Clostridium scindens ATCC =n
1-
mru1374 GNAT family function Enzyme 0.134 7E-50
35704 Bacteria; Firmicutes .
short-chain '
N
dehydrogenase family Unknown
Clostridium cellulolyticum ,--,
o
mru1958 protein function Enzyme 0.134 2E-45
H10 Bacteria; Firmicutes -O
o
= NADH:flavin
Unknown Clostridium cellulolyticum =
,-,
mru2164 oxidoreductase/NADH function Enzyme 0.134 2E-77
H10 Bacteria; Firmicutes o
o
-
,
oxidase family protein
0
Clostridium
t..)
o
acetyltransferase Unknown '
methylpentosum DSM ,--
0-,
mru2198 GNAT family function Enzyme 0.134 1E-19 5476
Bacteria; Firmicutes ¨.
=
)..)
hydrolase alpha/beta Unknown
Bo/anteIla formatexigens un
f..)
mru0491 fold family function Enzyme 0.138 2E-55 DSM
14469 Bacteria; Firmicutes o
hydrolase alpha/beta= Unknown , Blautia
hydrogenotrophica
mru0771 fold family ' function Enzyme 0.138 2E-93 DSM
10507 Bacteria; Firmicutes
SAM-dependent , Unknown
Bryantella formatexigens
mru1001 methyltransferase function Enzyme 0.138
, 8E-60 DSM 14469 Bacteria; Firmicutes
Unknown
Parvimonas micra ATCC
mru0226 hydrolase TatD family function Enzyme 0.141 2E-38 33270
. Bacteria; Firmicutes
WD40 repeat- .Unknown
a
mru1304 containing protein function General 0.003
4E-26 Branchiostoma floridae Eukaryota;
Metazoa ),
0
CAAX amino terminal .
.
protease family Unknown
Bacteria; -A
IV
mru0547 protein function General 0.085 1E-17
Kocuria rhizophila DC2201 Actinobacteria
"
IV_
von Willebrand factor
d..
" 1.)
type A domain- , Unknown
Campylobacter curvus Bacteria; _. 0
a)
H
mru1593 containing protein function General 0.087
2E-43 525.92 Proteobacteria o I.)
isoprenylcysteine
.
.
i.)
carboxyl
Streptococcus infantarius i)
methyltransferase Unknown subsp.
infantarius ATCC .1..
mru0095 family protein function General 0.111 9E-56
BAA-102 Bacteria; Firmicutes
pyridoxannine 5'-
phosphate oxidase Unknown
Mitsuokella multacida DSM
mru0228 family protein function General 0.129 2E-32
20544 Bacteria; Firmicutes
CAAX amino terminal
.
protease family Unknown
1-:
n
mru1738 protein function General 0.13 5E-83
Oribacterium sinus F0268 Bacteria; Firmicutes 1-
thioesterase family Unknown
Eubacterium siraeum DSM
mru1756 protein function General 0.13 4E-22 15702
Bacteria; Firmicutes N
Unknown
Coprococcus eutactus ,--,
o
mru1860 ATPase function General 0.13 3E-92 ATCC
27759 Bacteria; Firmicutes -,C3
o
TfoX N-terminal Unknown
Clostridium hiranonis DSM =
,-,
mru0517 domain-containing function General 0.134
7E-29 13275 Bacteria; Firmicutes c,
o
,
=
'
,
'
protein
0
, TfoX C-terminal
)..)
domain-containing Unknown
Clostridium kluyveri DSM ,--
0-
mru1848 protein function General 0.134 3E-23
555 , Bacteria; Firmicutes ¨.
)..)
pyridoxamine 5'-
vi
f...)
phosphate oxidase Unknown
Br/ante/la formatexigens =
mru0191 family protein function General 0.138 6E-23
DSM 14469 Bacteria; Firmicutes
adenosylmethionine-
.
8-amino-7- .
oxononanoate
.
aminotransferase Vitamins and Bra
chyspira . Bacteria;
mru2084 BioA = , cofactors Biotin 0.088 3E-179
hyodysenteriae WA1 Spirochaetes
Vitamins and =
Brachyspira Bacteria; a
mru2087 biotin synthase BioB1 cofactors Biotin 0.088 .3E-95
hyodysenteriae WA1 Spirochaetes
6-carboxyhexanoate- Vitamins and
Thermosinus 0
i.)
mru2041 CoA ligase BioW cofactors Biotin 0.129 2E-47
carboxydivorans Non 1 Bacteria; Firmicutes = -.1
-A
IV
8-amino-7-
I.)
t.)
oxononanoate Vitamins and
.1,.
mru2042 synthase BioF , cofactors Biotin 0.129
3E-73 Acidaminococcus sp. D21 Bacteria; Firmicutes N)
¨ 0
. a) ,
dethiobiotin Vitamins and
._., ,)
mru2086 synthetase BioD cofactors Biotin 0.134 2E-59
Clostridium butyricum 5521 Bacteria; Firmicutes _ 1
o
iv
cobyrinic acid a,c-
I
NJ
diamide synthase Vitamins and
Bacteroides capillosus Bacteria; .1,.
mru0893 CbiA3 cofactors Cobalamin 0.087 2E-121
ATCC 29799 Bacteroidetes
cobalamin
.
biosynthesis protein Vitamins and
Faecalibacterium '
mru0889 CbiG cofactors Cobalamin 0.129 , 4E-67
prausnitzii M21/2 Bacteria; Firmicutes
precorrin-3B C17- ,
methyltransferase . Vitamins and
Ruminococcus gnavus ro
mru0890 CbiH1 cofactors Cobalamin 0.129 1E-86
ATCC 29149 Bacteria; Firmicutes n
1-
magnesium- ,
,
protoporphyrin IX .
N
=
monomethyl ester . = o
,--= .
o
. anaerobic oxidative
Vitamins and Anaerotruncus colihominis C-3,
mru1643 cyclase BchE cofactors Cobalamin 0.129 0 DSM
17241 Bacteria; Firmicutes
o
mru0886 precorrin-2 C20- Vitamins and Cobalamin
0.134 1E-51 Clostridium Bacteria; Firmicutes
c7,
,
,
'
,
,
=
methyltransferase cofactors
phytofermentans ISDg , 0
CbiL
t..) cobalamin
biosynthesis protein Vitamins and
=Clostridium ,
o
,
mru0887 CbiD cofactors Cobalamin 0.134 1E-87
phytofermentans ISDg Bacteria; Firmicutes ks.i
un
f..)
precorrin-4 C11-
o
methyltransferase Vitamins and Blautia
hydrogenotrophica
mru0888 CbiF cofactors Cobalamin 0.138 3E-102 DSM
10507 Bacteria; Firmicutes
precorrin-6x Vitamins and Blautia
hansenii DSM
mru0891 reductase CbiJ cofactors Cobalamin 0.138 2E-55 20583
Bacteria; Firmicutes
precorrin-6Y C5,15-
methyltransferase
(decarboxylating) Vitamins and Blautia
hansenii DSM a
mru0892 CbiET cofactors Cobalamin 0.138 3E-87 20583
Bacteria; Firmicutes
precorrin-8X Vitamins and
Blyantella formatexigens 0
i.)
mru0894 methylmutase CbiC cofactors Cobalamin 0.138
7E-82 DSM 14469 Bacteria; Firmicutes
-A
Vitamins and
Bryantella formatexigens
I.)
mru0895 cobalt chelatase CbiK cofactors Cobalamin 0.138
2E-82 DSM 14469 Bacteria; Firmicutes m
.1,.
glutathione-disulfide Vitamins and Glutathione
Staphylococcus
0
mru1377 redUctase Gor2 cofactors metabolism 0.112 3E-94
haemolyticus JCSC1435 Bacteria; Firmicutes N)
I.)
1
glutathione Vitamins and Glutathione
Clostridium o . n)
mru1935 peroxidase GpxA cofactors metabolism 0.134 5E-54
phytofermentans ISDg Bacteria; Firmicutes 1
. NJ
NAD+ synthetase Vitamins and
Veil/one/la parvula DSM .1,.
,
mru1430 NadE cofactors' Nicotinate 0.129 1E-66 2008
Bacteria; Firmicutes
Vitamins and
Coprococcus comes ATCC
mru0189 ATP-NAD kinase cofactors Nicotinate 0.13 1E-37 27758
Bacteria; Firmicutes
3,4-dihydroxy-2- .
-
butanone 4-
phosphate synthase Vitamins and
Desulfovibrio salexigens Bacteria;
mru0089 RibB cofactors Riboflavin 0.088 4E-84 DSM
2638 = Proteobacteria n
1-
hydroxymethylpyrimidi Vitamins and Dorea
formicigenerans . z
mru0198 ne transporter CytX cofactors Thiamine 0.13
3E-61 ATCC 27755 = Bacteria; Firmicutes N
phosphomethylpyrimi Vitamins and
Eubacterium ventriosum ,--,
o
mru0199 dine kinase,ThiD1 cofactors Thiamine 0.13
4E-59 ATCC 27560 Bacteria; Firmicutes C-3,
o
Vitamins and
Clostridium thermocellum . =
,-,
mru0227 ThiF family protein cofactors Thiamine 0.134
5E-68 ATCC 27405 Bacteria; Firmicutes = o
Table 5. Selection of upregulated genes of the M1 genome when grown in co-
culture with . 0
t..)
Butyrivibrio proteoclasticus B316.
=
,--
Locus
0-,
,
o
tag Annotation . Fold difference
ks.i
vi
f...)
Energy metabolism - formate metabolism
o
mru0333 formate dehydrogenase alpha subunit FdhA 3.46
.
mru0334 formate dehydrogenase beta subunit.FdhB 2.34
Energy metabolism - methanogenesis
mru1928 methyl-coenzyme M reductase beta subunit McrB , 2.26
mru1926 methyl-coenzyme M reductase C subunit McrC 2.39 '
mru1927 methyl-coenzyme M reductase D subunit McrD 2.23
= a
. .
mru1925 methyl-coenzyme M reductase gamma subunit McrG 3.41
0
i.)
.
-.1
mru1919 tetrahydromethanopterin S-methyltransferase subunit A MtrA
2.02 -A
.
IV
mru1920 tetrahydromethanopterin S-methyltransferase subunit B MtrB
2.23 I.)
m
mru1921 tetrahydromethanopterin S-methyltransferase subunit C MtrC
3.23 1.)
¨ 0
d)
mru1916 tetrahydromethanopterin S-methyltransferase subunit H MtrH
2.14
1
mru1907 methyl viologen-reducing hydrogenase gamma subunit MvhG
2.25 , .0
mru0344 tungsten formylmethanofuran dehydrogenase subunit A FwdA
2.12 ' I
NJ
.1,
Cell envelope - Cell surface .
.
= mru2090 adhesin-like
protein 2.31
'
mru2134 adhesin-like protein 2.28
mru1222 adhesin-like protein 4.08
.
mru0076 adhesin-like protein 2.01
.
cn
mru1499 adhesin-like protein with transglutaminase domain 2.14
, 1-
mru0828 adhesin-like protein with transglutaminase domain 2.50
.
N
.
-
.
--..3
. .
-
c.
,
.
.
'
=
=
Table 6: Biosynthetic pathway for pseudomurein in M. ruminantium M
C44
Step Locus tan. Annotation
Step 1 mru2126 cell wall biosynthesis protein UDP-
glycosyltransferase family
Steps 2 & 3 mru0707, cell wall biosynthesis protein Mur ligase family
71ru1042 cell wall biosynthesis protein Mur ligase family
mru1118 cell wall biosynthesis protein Mur ligase family
0
-nru1745 cell wall,biosynthesis protein Mur ligase family
mru2091 cell wall biosynthesis protein Mur ligase family
mru2092 cell wall biosynthesis protein Mur ligase family
1-=
Step 4 hiru0964 cell wall biosynthesis protein phospho-N-
acetylmuramoyl-pentapeptide-transferase family
cell wall biosynthesis protein phospho N-
= mru1041 acetylmuramoyl-
pentapeptide-transferase family
Step 5 mru2136 polysaccharide biosynthesis protein
Step 6 mru0470 conserved hypotheticaltransmembrane protein
mru2175 glycosyl transferase GT2 family
Step 7 mru0824 adhesin-like protein with transglutaminase domain
mm1499 adhesin-hlceprotein with transglutaminase domain
-nru1604 adhesin-like protein with transglutaminase domain
Table 7: Gene clusters involved in secondary metabolite metabolism in M.
ruminantium M1
Locus tag Product
Cluster 1JI
=
C44
mru_0066 serine phosphatase
mru_0067 anti-sigma factor antagonist
mru_0068 non-ribosomal peptide synthetase
mru_0069 MatE efflux family protein
mru_0070 hypothetical protein
mru_0071 serine phosphatase
0
Cluster 2
=
mru_0351 non-ribosomal peptide synthetase
mru_0352 MatE efflux family protein(71
H.
Cluster 3
mru_0513 ' anti-sigma factor antagonist
mru_0514 4'-phosphopantetheinyl transferase family protein
mru_0515 serine phosphatase
mru_0516 anti-sigma regulatory factor serine/threonine protein kinase
=
=
.
.
. , .
.
= .
= =
=
,
C
=
Table 8. Summary of the Functional Genome.
Conserved genes, Cluster specific genes,
o
-
Distribution Analysis. Identification of conserved M1
M. ruminantium-centric M. ruminantium-centrie) -
,
specific (left table panel) and sub-cluster 1.1 specific
NJ
' gene sets (right table panel). Conserved and cluster M.
Cl M. _ M. M. vi
f...)
Cluster
o
specific gene sets are referenced to Ml. Cluster ruminan ruminan PMP(c)
ruminan Cluster PMP(c)
1
designations refer to those shown in Figure 16. Cut-offs tium tium -
- tium tium .1 1 .1
---
--- indicate
---
the e-value thresholds for conserved (low cut --- --- Cluster
--- ¨ Cluster ' Cluster Cluster
off) and unique (high cut-off) gene selection criteria. Cluster
Cluster 1 + 2 Cluster Cluster 1 + 2
Mismatch tolerance indicates the number of organisms 1.1(B) '
1 + 2 1 + 2 1.1 1 + 2 1 + 2
specific ORFs allowed outside the respective thresholds.
.
Low cutoff le-100 1e-100 le-100 le-100 le-100 le-60
le-60 le-60 .
'
Thresholds: High cutoff
le-10 le-10 le-10 le-10 a
Mismatch tolerance 0 0 0 0 =
0 2 2 2
Classification Sub classification
0
NJ
= 41 10 10
10 -A
Amino acid biosynthesis
NJ
NJ
- N)
2 1 1 2
.1,.
.
' Amino acid degradation
=
_. ,N2
Cell cycle 18 3 3 3
2 1 2 1 03 H
CS) N)
I
' Pseudomurein 10 . 1 1
1 . 5 5 o . . NJ
I
biosynthesis 4
.. 4 5 5 4 NJ
,
Cell envelope Exopolysaccharides
7 9, 10 2 .1,.
Cell surface proteins .
3 1
.
= other
Oxidative stress response 2
1 1
Cellular processes Stress response
I 1 .
24 2 2 2
1 4 1 3 ro
Central carbon metabolism
cn
1-3
. Methanogenesis 22 = 6 6 _____ 5
2 2
-
Hydrogen metabolism 7 1 1 1
' 2 1 N
' Electron transport 4 2 2 2 .
3' 3 . o
,--,
.
o
,
Formate
Energy
C-3,
o
=
metabolism
,--
.
c7, . .
= .
,
=
'
C
t,..)
o
-_.
o
Hypothetical Conserved 15 1 1 1
306 22 330 18 ks.i
vi
Lipid metabolism 6 2 2 2
1 1 1 1 f...)
Transposase
3 3
=Prophage
52 55
Mobile elements
CRISPR-associated
1 1 7 1
genes
Nitrogen metabolism 3
1
Nucleic acid metabolism 16 4 4 4
9 2 8
Protein fate 15 3 3 4
1 = 4 2 a
Protein synthesis 46 23 23 24
3 1 6 1
Purines & pyrimidines 17 4 4 4
0
i.)
,
Regulation 1 22
= 23
-A
IV
' Transcription 5 3 3
3 I.)
. m
Transporters 6 10
3 4 1
= Enzyme 9 1
20 3 13 2 8 ',I ))
lJnknown function
-4 H
General 8 1 1 1 15
2 11 2 I.)
1
Cobalamin 6 1 1 1
1 2 .0
i.)
1
Coenzyme F420 3
IV Methanopterin 1
Ubiquinone 2
Thiamine 3 1 1 1
2 2
Other 3
1
.
Vitamins and cofactors Metal-binding pterin
1 1
Methanofuran 1
.
Coenzyme B 2
1-:
, .
Coenzyme M 1
n
1-
- Biotin 3
Glutathione metabolism
1 4 N
Nicotinate
1 1 o
,--,
o
-C7
o
o
,-,
c7,
,
,
= .
= .
, ,
' .
.
.
,
Table 9. Manual functional annotation of the Methanobrevibacter General
0
ruminantium Ml open reading frames. Table excludes hypothetical
mru0506aspartate aminotransferase
o
Proteins .
.
.
¨.
Glutamate/glutamine
,
.
W
Uri
mru1761glutamate dehydrogenase GdhA
(.4
. ,
o
= mru2082g1utamate synthase alpha subunit GItA
AMINO ACID METABOLISM mru2080glutamate
synthase beta subunit GltB .
mru0810glutamate synthase domain-containing protein
Arginine , ' mru2079g1utam1ne
amidotransferase
mru1029acetylglutamate kinase ArgB mru050glutamine
synthetase GInA1 =
mru0149acetylomithine aminotransferase ArgD mru2078g1utam1ne
synthetase GInA2
mru1476arg1ninosuccinate lyase ArgH
mru0349transcr1pt1ona1 repressor of nif and glnA operons NrpR
a
mru2017argininosuccinate synthase ArgG -
mru1023b1functi0na1 ornithine acetyltransferase/N-acetylglutamate synthase
protein Glycine . , 0
NJ
ArgJ
,J
mru0122serine hydroxymethyltransferase GlyA
,J
mru1719N-acetyl-gamma-glutamyl-phosphate reductase ArgC
NJ
NJ
.
NJ
mru2115ornithine carbamoyltransferase ArgF
Histidine
NJ
mru0010ATP phosphoribosyltransferase HisG1
(3)
03 0
I-.
Aspartate/asparagine
NJ
mru1050ATP phosphoribosyltransferase HisG2
= ' I
mru1143asparagine synthase (glutamine-hydrolyzing) AsnB
o
mru2139bifunctional imidazoleglycerol-phosphate dehydratase HisB
NJ
I
_
mru1337hisA/hisF family protein HisAF
NJ
.I,
Chorismate = mru1015histidinol
dehydrogenase HisD .
mru10893-dehydroquinate dehydratase type I AroD
.
mru0454h15t1din01-phosphate aminotransferase HisC '
=
mru09983-dehydroquinate synthase AroB mru0182imidazo1e
glycerol phosphate synthase glutamine amidotransferase subunit
mru15773-phosphoshikimate 1-carboxyvinyltransferase AroA =HisH
mru1561chorismate synthase AroC
mru0135imidazoleglycerol-phosphate synthase cyclase subunit HisF ,
mru1244shik1mate 5-dehydrogenase AroE ',
mru1249phosphor1b0sy1-AMP cyclohydrolase Hisl
n
mru1676shikimate kinase AroK mru2031phosphor1bosy1-
ATP pyrophosphohydrolase HisE 1-3
mru1717phosphoribosylformimino-5- aminoimidazole carboxamide ribotide
Cysteine . isomerase HisA
N
mru1574cysteine synthase CysKM1
1--,
o ,
mru2096cysteine synthase CysKM2 Homoserine
C3
o
o
mru1573serine 0-acetyltransferase CysE mru1141allosteric
regulator of homoserine dehydrogenase 1..,
o
mru1140homoserine dehydrogenase Horn
o
- =
. .
_
" . . =
.
mru0414am1notransferase class V family " 0
w
Lysine .
mru0678ph0sphog1ycerate dehydrogenase SerA , o
.
I--
mru1672aspartate kinaSe Ask mru0388phosphoserine
phosphatase SerB 1--,
--.
o
mru1669aspartate-semialdehyde dehydrogenase Asd
k,.3
= uri
mru0941diaminopimelate aminotransferase DapL . Threonine
mru0152d1am1n0p1me1ate decarboxylase LysA mru1492threonine
synthase ThrC
mru0153diaminopime1ate epimerase DapF
.
mru1670d1hydr0d1pic01inate reductase DapB = Tryptophan
mru1671dihydrodipicolinate synthase DapA mru0210anthran11ate
phosphoribosyltransferase TrpD
=
mru0208anthrani1ate synthase component I TrpE .
Methionine ' =
mru0209anthran11ate synthase component II TrpG .
mru0611homoserine 0-acetyltransferase MetX1 mru0211indole-3-
glycerol phosphate synthase TrpC a
mru1205homoserine o-acetyltransferase MetX2
mru0212ph0sphor1bosy1anthran11ate isomerase TrpF 0
= n)
mru1620methion1ne synthase MetE , mru0214trypt0phan
synthase alpha subunit TrpA
, mru11480-acetylhomoserine/0-acetylserine sulfhydrylase MetZ/CysK1
mru2159tryptophan synthase beta
subunit TrpB K)
IV
mru15690-acetylhomoserine/0-acetylserine sulfhydrylase MetZ/CysK2
mru0213trypt0phan synthase beta
subunit TrpB1 n)
mru0477trypt0phan-binding repressor TrpY
n)
Phenylalanine/tyrosine
co IV
I
. mru1674chorismate mutase AroH
Valine/leucine/isoleucine o
IV
mru1992prephenate dehydratase PheA , mru21552-is-
opropylmalate synthase LeuA 1
i\)
.1,
. mru0468prephenate dehydrogenase TyrA1 = mru01053-
isopropylmalate dehydratase large subunit LeuC
mru1975prephenate dehydrogenase TyrA2 . mru01043-
isopropylmalate dehydratase small subunit LeuD . =
= mru01033-isopropylmalate dehydrogenase LeuB
Polyamines mru0410aceto1actate synthase large subunit 11vB1
_
mru1741arginase/agmatinase family protein rnru2112aceto1actate
synthase large subunit 11vB2
mru0603N-carbamoyl-D-amino acid amidohydrolase AguB mru2111acetolactate
synthase Small subunit IlvN
mru1743pyruvoy1-dependent arginine decarboxylase PdaD mru2107branched-
chain-amino-acid aminotransferase IlvE " n
. mru1414c1trama1ate
synthase CimA
Proline ' ' mru2119dihydroxy-
acid dehydratase IlvD N
(,)
o
mru0518de1ta 1-pyrroline-5-carboxylate synthetase mru2110ketol-acid
reductoisomerase IlvC.
.
, o
mru1509pyrr01ine-5-carboxylate reductase ProC
' 'a
.
o
o
Salvage- general
. c7,
Serine = mru06561nd01epyruvate ferredoxin oxidoreductase alpha
subunit lorA
,
'
.
,
' = '
' . .
,
mru06571ndo1epyruvate ferredoxin oxidoreductase beta subunit lorB
mru0969DNA primase large subunit PriB r
0
mru1432ketoisova1erate ferredoxin oxidoreductase alpha subunit VorA
mru0974DNA primase small subunit PriA
w
o
mru1431ketoisovalerate ferredoxin oxidoreductase beta subunit VorB
mru0685DNA-binding protein 1--
1.-
mru1433ketoisova1erate ferredoxin oxidoreductase gamma subunit VorC
mru0382flap endoniklease Fen
k..i
uri
mru1667HIRAN domain-containing protein
f...)
\
Salvage- methionine mru0446013 fold
nucleic acid binding domain-containing protein ,
mru0427methy1thi0aden0s1ne phosphorylase MtnP mru0114replication
factor A
mru0380S-adenosyl-L-homocysteine hydrolas,e AhcY mru1129replication
factor C large subunit RfcL
mru01255-adenosylmethionine synthetase MetK mru1130rep1icati0n
factor C small subunit RfcS
mru0591replicative DNA helicase Mom
Salvage- tyrosine mru1838ribonuc1ease
HII RnhB =
mru00074-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase
mru0710tyrosine recombinase XerC a
HpaG
' Genome segregation
0
n)
mru1603chromosome partitioning ATPase ParA
NJ
mru0390DNA topoisomerase I TopA
1\)
CELL CYCLE mru1864DNA
topoisomerase VI subunit A .1,.
.
.
mru1865DNA topoisomerase VI subunit B
--.1
0 '
0
I-.
Cell division mru1158RecF/RecN/SMC
N terminal domain-containing protein IV
I
0
mru2160ce11 division ATPase MinD
n)
1
mru0435ce11 division contra protein Cdc48
Fl,
,
mru0481cell division protein FtsZ .
CELL ENVELOPE
Chromosome replication
mru0445ATP-dependent DNA ligase DnI1 Cell surface
proteins
mru0001cdc6 family replication initiation protein Cdc6-1 mru0004adhes'in-
like protein
mru0423cdc6family replication initiation protein Cdc6-2 mru0019adhesin-like
protein = 1-:
n
mru1983DNA polymerase family B PolB1 mru0031adhesin-like
protein 1-3
mru1553DNA polymerase family B PolB2 mru0032adhesin-like
protein
N
mru0240DNA polymerase large subunit DP2 PolD2 mru0033adhesin-like
protein =
1--,
mru0173DNA polymerase sliding clamp subunit PCNA family Pcn
mru0036adhesin-like protein , o
-c-6,
mru2212DNA polymerase small subunit DPI PolD1 mru0038adhesin-like
protein o
mru0711DNA primase DnaG = mru0048adhesin-like
protein
c7,
..,
=
.
.
mru0064adhesin-like protein .
mru0970adhesin-like protein = C
mru0072adhes1n-like protein mru0977adhesin-like protein
w
o
I--
mru0076adhesin-like protein mru0978adhesin-like protein
o
mru0077adhesin-like protein mru0979adhesin-like protein
k-.3
uri
mru0079adhesin-like protein mru1076adhesin-like protein
\ f....3
mru0082adhesin-like protein mru1077adhesin-like protein
mru0083adhesin-like protein mru1124adhesin-like protein
,
mru0084adhesin-like protein mru1210adhesin-like protein
rnru0085adhesin-like protein mru1222adhesin-like protein
mru0086adhes1n-like protein = mru1246adhesin-like protein
v , mru0090adhes1n-like protein . mru1247adhes1n-
like protein
mru0160adhes1n-like protein ,
mru1263adhesin-like protein a
mru0245adhesin-like protein mru1299adhesin-like protein
0
n)
mru0255adhesin-like protein mru1312adhesin-like protein
.-.1
mru0326adhesin-like protein mru1313adhesin-like protein
N)
IV
.
I \ )
mru0327adhesin-like protein mru1314adhesin-like protein
' .
mru0331adhes1n-like protein mru1315adhesin-like protein
n)
.
0
mru0338adhes1n-like protein ,mru1342adhesin-like protein
IV
I
. mru0417adhesin-like protein mru1358adhes1n-like protein
o
mru0418adhesin-like protein mru1376adhesin-like protein
i)
.1,
mru0419adhesin-like protein mru1386adhesin-like protein
mru0450adhesin-like protein mru1417adhesin-like protein
mru0451adhesin-like protein , mru1424adhes1n-like protein
mru0493adhesin-like protein mru1465adhesin-like protein
mru0687adhesin-like protein . mru1500adhes1n-like protein
mru0704adhesin-like protein mru1506adhesin-like protein
mru0723adhesin-like protein mru1513adhesin-like protein
n
mru0775adhesin-like protein mru1650adhesin-like protein
mru0811adhesin-like protein mru1651adhesin-like protein
. N
mru0881adhesin-like protein =
mru1659adhesin-like protein 1--,
o
mru0896adhes1n-like protein mru1661adhesin-like protein
= 'a
o
mru0962adhesin-like protein mru1726adhesin-like protein
o
1--,
c7,
mru0963adhesin-like protein mru1798adhesin-like protein
,
.
.
_ . .
3-
.
,
,
=
. ,
,
mru1971adhesin-like protein . mru0433d01ich01
kinase 0
mru1996adhes1n-like protein mru0109dTDP-4-
dehydrorhamnose 3,5- epimerase RfbC1 (.4
o
mru2043adhes1n-like protein mru1061dTDP-4-
dehydrorhamnose 3,5-epimerase RfbC 1--
1--
mru2048adhesin-like protein (.4 mru0107dTDP-4-dehydrorhamnose reductase
RfbD = o
=
= (A
mru2049adhesin-like protein mru0110dTDP-glucose 4,6-dehydratase RfbB1
(.4
'
mru2052adhesin-like protein mru1060dTDP-
glucose.4,6-dehydratase RfbB2
mru2053adhes1n-like protein
mru0113exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
mru2054adhes1n-like protein mru1062glucose-1-
phosphate thymidylyltransferase RfbA
mru2055adhes1n-like protein mru0108glucose-1-
phosphate thymidylyltransferase RfbA1
mru2059adhes1n-like protein mru1527g1ycosy1
transferase
mru2090adhes1n-like protein mru1528g1yc0sy1
transferase
mru2134adhesin-like protein mru0101glycosyl
transferase GT2 family a
'
" mru2147adhesin-like protein mru0111glycosyl
transferase GT2 family ,
mru2178adhesin-like protein mru0112glycosyl
transferase GT2 family
...3
mru2189adhesin-like protein mru1049g1yc0sy1
transferase GT2 family .-.1
NJ
.
IV
mru0015adhesin-like protein with cysteine protease domain mru1064glycosyl
transferase GT2 family n)
mru0020adhesin-like protein with cysteine protease domain mru1069g1yc0sy1
transferase GT2 family .
.
_.. n)
mru0143adhesin-like protein with cysteine protease domain mru1074glycosyl
transferase GT2 family -A
IQ
0
I-.
IV
I
mru0222adhes1n-like protein with cysteine protease domain mru1214glycosyl
transferase GT2 family o
i.)
mru0727adhesin-like protein with cysteine protease domain rnru1264g1yc0sy1
transferase GT2 family 1
i\)
mru0772adhesin-like protein with cysteine protease domain mru1458g1yc0sy1
transferase GT2 family
mru0839adhesin-like protein with cysteine protease domain mru1525g1yc0sy1
transferase GT2 family
. mru0842adhesin-like protein with cysteine protease domain mru1545g1yc0sy1
transferase GT2 family
mru0843adhesin-like protein with cysteine protease domain mru0099glycosyl
transferase GT4 family
mru1387adhes1n-like protein with cysteine protease domain mru1066glycosyl
transferase GT4 family
mru0824adhesin-like protein with transglutaminase domain '
mru1068glycosyl transferase GT4 family
1-Lt
mru0828adhesin-like protein with transglutaminase domain mru1378g1yc0sy1
transferase GT4 family n
mru1497adhesin-like protein with transglutaminase domain mru1679g1ycosy1
transferase GT4 family. .
mru1499adhes1n-like protein with transglutaminase domain mru1883g1ycosy1
transferase GT4 family N
mru1604adhesin-like protein with transglutaminase domain mru1072glycosyl
transferase GT2 family
1--,
o
mru1459NAD dependent epimerase/dehydratase
o
Expolysaccharide synthesis mru1526nucleotidyl
transferase ' o
1..,
mru1067acety1transferase mru1522po1ysaccharide
biosynthesis protein c7,
'
=
mru1523po1ysaccharide biosynthesis protein
mru0458ph0sph0g1uc0sam1ne mutase GImM1 0
mru1524po1ysaccharide biosynthesis protein =
mru0449ph0sph0g1uc0samine mutase GImM2
(,..
o
mru1071polysaccharide/polyol phosphate ABC transporter ATP-binding protein
mru1733ph0sph05ugar-binding protein I--
1--,
,
o
mru1457polysaccharide/polyol phosphate ABC transporter ATP-binding protein
mru2136p01ysaccharide biosynthesis protein
(..4
(A
mru1070polysaccharide/polyol phosphate ABC transporter permease protein
mru147OUDP-glucose 4-epimerae GalE (.4
mru1456po1ysaccharide/polyol phosphate ABC transporter permease protein
mru0456UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-
'
mru1529UDP-galactopyranose mutase Glf acetyltransferase
GImU
mru1461UDP-glucose pyrophosphorylase Galli mru1005undecaprenyl
pyrophosphate synthetase UppS
mru1051UDP-glucose/GDP-mannose dehydrogenase mm2108undecapreny1-
4iphosphatase UppP
mru1075UDP-glucose/GDP-mannose dehydrogenase
.
mru0106UDP-N-acetyl-D-mannosaminuronate dehydrogenase Sialic acid
biosynthesis
' mru1697UDP-N-acetylglucosamine 2-epimerase mru1876CMP-N-
acetylneuraminic acid synthetase NeuA a
mru1878N-acetyl neuramic acid synthetase NeuB
0
Other
mru1879s1a1y1transferase IV
.-.3
mru1836ce11 shape determining protein MreB/Mr1 family
mru1880p01ysacchar1de biosynthesis protein .-.1
NJ
IV
mru1047po1y-gamma-glutamate biosynthesis protein
IV
.1,
Teichoic acid biosynthesis
_
-
_.,.
NJ
Pseudomurein biosynthesis mru0715g1yc0sy1
transferase GT2 family -.1
c.)
0
H
IV
I
rnru2175ce11 wall biosynthesis glycosyl transferase mru10562-C-methyl-D-
elythritol 4-phosphate cytidylyltransferase o
mru0707ce11 wall biosynthesis protein Mur ligase family mru1057a1c0h01
dehydrogenase "
1
i\)
'
mru1042ce11 wall biosynthesis protein Mur ligase family mru1058glycosyl
transferase GT2 family
mru1118cell wall biosynthesis protein Mur ligase family mru1078glycosyl
transferase GT2 family/CDP-glycerol:poly(glycerophosphate)
mru1745ce11 wall biosynthesis protein Mur ligase family
glycerophosphotransferase
mru2091cell wall biosynthesis protein Mur ligase family mru1079CDP-
glycerol:poly(glycerophosphate) glycerophosphotransferase
nnru2092ce11 wall biosynthesis protein Mur ligase family mru1718glycerol-3-
phosphate cytidylyltransferase¨ ¨
mru0964ce11 wall biosynthesis protein phospho-N-acetylnnuramoyl-pentapeptide-
mru2181glycosyl transferase GT2 family/CDP-glycerol:poly(glycerophosphate)
transferase family
glycerophosphotransferase
oc
n
1-3
mru1041cell wall biosynthesis protein phospho-N-acetylmuramoyl-pentapeptide-
mru2182g1yc0sy1 transferase GT2 family/CDP-glycerol:poly(glycerophosphate
transferase family
glycerophosphotransferase
N
mru2126ce11 wall biosynthesis protein UDP-glycosyltransferase family
mru2183g1yc0sy1 transferase GT2 family
mru2190nuc1eotidy1 transferase
mru1293g1ucosamine-fructose-6-phosphate aminotransferase GlmS1
mru1536g1ucosamine-fructose-6-phosphate aminotransferase GlmS2
mru2191CDP-glycerol:poly(glycerophosphate) glycerophosphotransferase
-..,
=
o
mru1388NAD dependent epimerase/dehydratase
1..,c7,
mru1413NAD dependent epimerase/dehydratase
=
mru0553pyruvate ferredoxin oxidoreductase-
associated PorF 0
CELLULAR PROCESSES mru1786transporter
SSS family w
o
I--
1--,
,
--
.
o
Oxidative stress response Aromatic compounds
k..)
,
uri
mru1564desu1f0ferr0doxin Dbc mru10454-
oxalocrotonate tautomerase family enzyme Dmpl c..3
mru1507F420H2 oxidase FprA1
mru1381carboxymuconolactone decarboxylase family protein PcaC1
mru0131F420H2 oxidase FprA2
mru1382carboxymucono1actone decarboxylase family protein PcaC2
= mru1257ferritin-like domain-containing protein
mru0958NADH oxidase Nox Bicarbonate
,
mru0457rubredox1n Rub1 mru0951bicarbonate
ABC transporter ATP-binding protein BtcA
. .
mru1258rubred0xin Rub2 mru0950b1carbonate
ABC transporter permease protein BtcB
mru1259rubred0xin Rub3 mru0949bicarb0nate
ABC transporter substrate-binding protein BtcC a
mru0735rubrerythrin Rbr1 mru1602oarbonic
anhydrase Cab
.
0
mru1367rubrerythrin Rbr2
"
-.3
.-.1
Butanol
IV
' Stress response mru09903-
hydroxybutyryl-CoA dehydrogenase Hbd m
,
mru1368bi1e salt hydrolase
1.)
mru0183pr0tein disulfide-isomerase thioredoxin-related Formate
-V
H
IV
I
mru1261universal stress protein UspA1 mru0453pyruvate
formate-Iyase-activating enzyme PflA1 o
n)
mru0440universa1 stress protein UspA2 mru1965pyruvate
formate-Iyase-activating enzyme P11A2 1
i\)
mru1519pyruvate-formate lyase Pfl
Gluconeogenesis
CENTRAL CARBON METABOLISM mru04592,3-
bisphosphoglycerate-independent phosphoglycerate mutase ApgM1
mru11392,3-bisphosphoglycerate-independent phosphoglycerate mutase ApgM2
.
Acetate mru06282-
phosphoglycerate kinase Pgk2A
oci
mru1434acety1-CoA synthetase AcsA . mru08222-
phosphoglycerate kinase Pgk2B n
.
mru1570ADP-dependent acetyl-CoA synthetase Acs mru1017cyclic 2,3-
diphosphoglycerate-synthetase
mru0550pyruvate ferredoxin oxidoreductase alpha subunit PorA
mru0498fructose 1,6-bisphosphatase Fbp
N
mru0551pyruvate ferredoxin oxidoreductase beta subunit PorB
mru1856glyceraldehyde-3-phosphate
dehydrogenase Gap o
1--,
o
mru0549pyruvate ferredoxin oxidoreductase delta subunit PorD
mru1897phosphoeno1pyruvate synthase PpsA1
C3
o
mru0548pyruvate ferredoxin oxidoreductase gamma subunit PorC
= mru2083phosphoeno1pyruvate
synthase/pyruvate phosphate dikinase o
1--,
mru0552pyruvate ferredoxin oxidoreductase-associated PorE
mru1821phosphoglycerate kinase Pgk c7,
,
,
-
meu0914ph0sph0pyruvate hydratase Eno mru0556fumarate
hydratase FumAl 0
' mru0997phosph0-2-dehydro-3-deoxyheptonate aldolase/fructose-bisphosphate
mru0690fumarate hydratase FumA2 (,..
,
o
aldolase mru08411umarate
hydratase FumA3 1--
0-,
.--.
mru0635pyruvate kinase PykA mru1895fumarate
hydratase FumA4 (..4
mru1822tr1oseph0sphate isomerase TpiA mru1255ma1ate
dehydrogenase Mdh (A
(.4
. , mru0847pyruvate
carboxYlase subunit A PycA
Glycolate salvage pathway mru1888pyruvate
carboxylase subunit B PycB '
mru1874ph0sph0g1yc01ate phosphatase Gph mru0088succinate
dehydrogenase/fumarate reductase flavoprotein subunit SdhA
, mru0655succinate
dehydrogenase/fumarate reductase iron-sulfur protein SdhB
Inositol biosynthesis ' mru1091succinate-CoA
ligase alpha subunit SucD '
mru1744bifunctional inositol-1 monophosphatase/fructose-1,6-bisphosphatase/ATP-
mru1824succinyl-CoA synthetase beta subunit SucC
NAD kinase
.
a
mru1887my0-inosito1-1-phosphate synthase
Other
0
mni1685de0xyr1bose-phosphate aldolase DeoC
IV
Propanoate '
.-.1
.-.1
NJ
mru06882-methylcitrate dehydratase PrpD
IV
I \ )
mru06912-methylcitrate synthase/citrate synthase II PrpC/CitZ
ENERGY METABOLISM
" 0
4
cri
H
PRPP synthesis '
"
I
mru0957r1b0se 5-phosphate isomerase A RpiA
o
IV
Electron transfer
,
mru1634rib0se-phosphate diphospho' kinase Prs
"
mru09154Fe-4S binding domain-containing protein
mru13454Fe-4S binding domain-containing protein
.
Ribulose monophosphate pathway . ,
mru17114Fe-4S binding domain-containing protein
mru02503-hexulose-6-phosphate isomerase Phil mru17154Fe-4S binding
domain-containing protein '
mru13103-hexulose-6-phosphate isomerase Phi2
mru17164Fe-45 binding domain-containing protein
mru2131bifunctional formaldehyde-activating enzyrne/3- hexulose-6-phosphate
mru20364Fe-4S binding domain-containing protein
synthase Fae/Hps
1-:
mru12114Fe-4S binding domain-containing protein
. n
mru01884Fe-4S ferredoxin binding domain-containing protein
Tricarboxylic cycle
mru03624Fe-4S ferredoxin binding domain-containing protein
N mru18282-oxoglutarate ferredoxin oxidoreductase subunit alpha KorA
mru03734Fe-48 ferredoxin binding domain-containing protein
mru18272-oxoglutarate ferredoxin oxidoreductase subunit beta KorB
1--,
mru0123archae0flavopr0te1n AfpA
mru18292-oxoglutarate ferredoxin oxidoreductase subunit delta KorD
o
mru0184cytochr0me C-type biogenesis protein DsbD
o
mru18262-oxoglutarate ferredoxin oxidoreductase subunit gamma KorC
1--,
mru05441erredoxin
mru2095ac0nitase
"
mru0653ferredoxin mru0991NADPH-dependent
F420 reductase NpdG1 0
mru0830ferredox1n = mru1444NADPH-dependent
F420 reductase NpdG2
o
mru1472ferredoxin
o
mru2138fefredox1n Formate metabolism
(,.3
(A
mru0363flavod0xin mru0681formate
dehydrogenase accessory protein FdhD1 (.4
o
mru1003flavodoxin mru1939formate
dehydrogenase accessory protein FdhD2 .
mru1720flavod0xin mru2074formate
dehydrogenase alpha chain FdhA2 .
mru0009f1avodoxin domain containing protein . mru0333formate
dehydrogenase alpha subunit FdhA1
mru0006flavodox1n domain-containing protein mru2075formate
dehydrogenase beta chain FdhB2
mru01551ron-sulfur cluster binding protein mru0334formate
dehydrogenase beta subunit FdhB1
mru0219NADPH-dependent FMN reductase mru0332formate/nitrite
transporter FdhC .
mru0364NADPH-dependent FMN reductase
a
mru0580NADPH-dependent FMN reductase H2 metabolism
0
= mru1260NADPH-dependent
FMN reductase mru2064c0enzyme F420 hydrogenase alpha
subunit FrhA n)
-.3
.-.1
mru1369NADPH-dependent FMN reductase mru2061c0enzyme F420
hydrogenase beta subunit FrhB1 n)
IV
mru1609NADPH-dependent.FMN reductase ' mru2081coenzyme F420
hydrogenase beta subunit FrhB2 n)
nn ru1732NADPH-dependent FMN reductase mru2063coenzyme F420
hydrogenase delta subunit FrhD n)
_..
0
mru2135NADPH-dependent FMN reductase mru2062c0enzyme F420
hydrogenase gamma subunit FrhG ====1 H.
0)
IV
I
mru1391polyferredoxin , mru1412energy-
converting hydrogenase A subunit A EhaA . o
n)
mru0701A1A0 archaeal ATP synthase subunit A AhaA mru1411energy-
converting hydrogenase A subunit B EhaB 1
.
i\)
mru0702A1A0 archaeal ATP synthase subunit B AhaB mru1410energy-
converting hydrogenase A subunit C EhaC
mru0699A1A0 archaeal ATP synthase subunit C AhaC mru1409energy-
converting hydrogenase A subunit D EhaD
mru0703A1A0 archaeal ATP synthase subunit 0 AhaD mru1408ener9y-
converting hydrogenase A' subunit E EhaE
mru0698A1A0 archaeal ATP synthase subunit E AhaE mru1407energy-
converting hydrogenase A subunit F EhaF =
mru0700A1A0 archaeal ATP synthase subunit F AhaF mru1406energy-
converting hydrogenase A subunit G EhaG .
mru0695A1A0 archaeal ATP synthase subunit H AhaH mru1405energy-
converting hydrogenase A subunit H EhaH
1-:
mru0696A1A0 archaeal ATP synthase subunit I Ahal mru1404energy-
converting hydrogenase A subunit I Ehal n
mru0697A1A0 archaeal ATP synthase subunit K AhaK mru1403energy-
converting hydrogenase A subunit J EhaJ
mru1402energy-converting hydrogenase A subunit K EhaK
N
Alcohol metabolism mru1401energy-
converting hydrogenase A subunit L EhaL 1--,
o
mru0065NADP-dependent alcohol dehydrogenase Adh1 mru1400energy-
converting hydrogenase A subunit M EhaM
o
mru1847NADP-dependent alcohol dehydrogenase Adh2 mru1399energy-
converting hydrogenase A subunit N EhaN =
1--,
o=
mru1445NADP-dependent alcohol dehydrogenase Adh3 mru1398energy-
converting hydrogenase A subunit 0 Eha0 o
,
mru1397energy-converting hydrogenase A subunit P EhaP
0
mru1396energy-converting hydrogenase A subunit Q EhaQ Methanogenesis
pathway o
1--
mru1394energy-converting hydrogenase A subunit R EhaR mru05695,10-
methylenetetrahydromethanopterin reductase Mer 1¨
o
mru2014energy-converting hydrogenase B subunit A EhbA mru0117CoB¨CoM
heterodisulfide reductase subunit A HdrA (..4
(A
(.4
mru2013energy-converting hydrogenase B subunit B EhbB mru0817CoB¨COM
heterodisulfide reductase subunit B HdrB o
mru2012energy-converting hydrogenase .B subunit C EhbC mru1212CoB¨CoM
heterodisulfide reductase subunit B HdrB2
mru2011energy-converting hydrogenase B subunit D EhbD mru0816CoB¨CoM
heterodisulfide reductase subunit C HdrC
mru2010energy-converting hydrogenase B subunit E EhbE mru0526c0enzyme F420-
dependent N(5),N(10)-methenyltetrahydromethanopterin
mru2009energy-converting hydrogenase B subunit F EhbF reductase Hmd
mru2008energy-converting hydrogenase B subunit G EhbG mru2142F420-
dependent methylenetetrahydromethanopterin dehydrogenase Mtd
mru2007energy-converting hydrogenase B subunit H EhbH
mru1393formy1methanofuran-tetrahydromethanopterin formyltransferase Ftr1
mru2006energy-converting hydrogenase B subunit I Ehbl
mru2022formy1methanofuran-tetrahydromethanopterin formyltransferase Ftr2 a
-
mru2005energy-converting hydrogenase B subunit J EhbJ
mru1619methenyltetrahydromethanopterin cyclohydrolase Mch
0
mru2004energy-converting hydrogenase B subunit K EhbK mru1924methy1-
coenzyme M reductase alpha subunit McrA
.-.3
.-.1
mru2003energy-converting hydrogenase B subunit L EhbL mru1928methy1-
coenzyme M reductase beta subunit McrB NJ
IV
mru2002energy-converting hydrogenase B subunit M EhbM mru1926methy1-
coenzyme M reductase C subunit McrC IV
.1,
mru2001energy-converting hydrogenase B subunit N EhbN mru1262methy1-
coenzyrne M reductase component A2 AtwA1 NJ
.-n
0
mru2000energy-converting hydrogenase B subunit 0 Ehb0 mru1850methyl-
coenzyme M reductase component A2 AtwA2 -.I
--4
I-.
IV
mru1999energy-converting hydrogenase B subunit P EhbP mru1927methy1-
coenzyme M reductase D subunit McrD I
o
n)
' mru1998ener9y-converting hydrogenase B subunit Q EhbQ mru1925methyl-
coenzyme M reductase gamma subunit McrG 1
i\)
=mru1632hydrogenase accessory protein HypB mru1919tetrahydromethanopterin S-
methyltransferase subunit A MtrA1
mru0473hydrogenase assembly chaperone HypC
mru0441tetrahydr0methan0pterin S-methyltransferase subunit A MtrA2
mru1875hydrogenase expression/formation protein HypD
mru1920tetrahydromethan0pterin S-methyltransferase subunit B MtrB
mru0190hydrogenase expression/formation protein HypEl
mru1921tetrahydromethanopterin S-methyltransferase subunit C MtrC
mru1551hydrogenase maturation factor HypE2
mru1922tetrahydromethanopterin S-methyltransferase subunit D MtrD
mru2034hydr0genase maturation factor HypF
mru1923tetrahydr0methan0pterin S-methyltransferase subunit E MtrE
oc
mru1039hydrogenase maturation protease Hycl
mru1918tetrahydromethanopterin S-methyltransferase subunit F MtrF n
mru1633hydrogenase nickel insertion protein HypA
mru1917tetrahydromethanopterin S-methyltransferase subunit G MtrG
mru1906methy1 viologen-reducing hydrogenase alpha subunit MvhA
mru1916tetrahydromethanopterin S-
methyltransferase subunit H MtrH = N
mru1905methy1 viologen-reducing hydrogenase beta subunit MvhB
mru0344tung5ten formylmethanofuran
dehydrogenase subunit A FwdA o
1¨
o
mru1908methy1 viologen-reducing hydrogenase delta subunit MvhD1
mru0343tungsten formylmethanofuran
dehydrogenase subunit B FwdB =-=:E3
o
mru2076methy1 viologen-reducing hydrogenase delta subunit MvhD2
mru0345tungsten formylmethanofuran
dehydrogenase subunit C FwdC 1..,
mru1907methy1 viologen-reducing hydrogenase gamma subunit MvhG
mru0342tungsten formylmethanofuran
dehydrogenase subunit D FwdD = c7,
o
=
mru0254tungsten formylmethanofuran dehydrogenase subunit E FwdE
mru0340tungsten formylmethanofuran dehydrogenase subunifF FwdF Mevalonate
pathway
mru0341tungsten formylmethanofuran dehydrogenase subunit G FwdG
mru1639acety1-CoA acetyltransferase
mru0339tungsten formylmethanofuran dehydrogenase subunit H FwdH
mru1092hydroxymethylglutaryl-CoA reductase (NADPH) HmgA
mru1640hydroxymethylglutaryl-CoA synthase
Other mru09221s0penteny1
diphosphate delta-isomerase Fni
mru16951-14MPT-linked Cl transfer pathway protein mru0921isopentenyl
diphosphate kinase
mru0920meva1onate kinase Mvk
mru0919phosphomeva1onate decarboxylase
Elongation of isoprenoid side chains
LIPID METABOLISM mru0924bifunctiona1
short chain isoprenyl diphosphate synthase IdsA
Biosynthesis bacterial ***********. *****
.***** ********* ***** 0
mru10313-oxoacyl-(acyl-carrier-protein) reductase FabG1
mru16303-oxoacyl-(acyl-carrier-protein) reductase FabG2 MOBILE ELEMENTS
mru1289diacylglycerol kinase DagK =
1.)
mru2188g1ycero1-3-phosphate dehydrogenase (NAD) CRISPR-associated genes
co
0
mru0796CRISPR-associated helicase Cas3
Biosynthesis general mru0798CRISPR-associated
protein Cas1-1
mru1341geranylgeranyl reductase family protein mru1174CRISPR-associated
protein Cas1-2 i\)
mru1441geranylgeranyl reductase family protein mru1647CRISPR-associated
protein Cas1-3
mru1471geranylgeranyl reductase family protein mru1648CRISPR-associated
protein Cas1-4
mru0799CRISPR-associated protein Cas2-1
Lipid backbone mru1176CRISPR-associated
protein Cas2-2
mru0955NAD(P)-dependent glycerol-1-phosphate dehydrogenase EgsA
mru1649CRISPR-associated protein Cas2-3
mru0797CRISPR-associated protein Cas4-1
Phospholipid biosynthesis mru1549CRISPR-associated
protein Cas4-2
mru1885digerany1gerany1g1ycery1 phosphate synthase mru0795CRISPR-associated
protein Cas5 Hmari subtype
mru1102geranylgeranylglyceryl phosphate synthase mru0792CRISPR-associated
protein Cas6
=
mru0503phosphat1dy1g1ycerophosphate synthase PgsA mru1183CRISPR-associated
protein Csm1 family
mru1833phosphatidy1serine decarboxylase Psd mru1182CRISPR-associated
protein Csm2 family
mru1834phosphatidylserine synthase PssA mru0794CRISPR-associated
protein CT1132 family c7,
=
,
.
,
e
,
,
. ,
mru1178CRISPR-associated protein TIGRO2710 family mru0281hypothetical
protein 0
mru1188CRISPR-associated protein TIGRO2710 family mru0282phage-related
protein (,..
o
I--
mru1181CRISPR-associated RAMP protein Csm3 family mru0283hypothetica1
protein 1--,
--.
o
mru1180CRISPR-associated RAMP protein Csm4 family mru0284phage-related
protein (..)
(A
mru1179CRISPR-associated RAMP protein Csm5 family mru0285terminase large
subunit = (.4
o
mru0286hypothetica1 protein
Prophage miu0287phage portal
protein
mru0057phage-related protein mru0288phage-related
protein
mru0058phage-related protein mru0289hyp0thetica1
protein
mru0256phage integrase mru0290hypothetical
protein
mru0257hypothetica1 protein mru0291hypothetical
protein
mru0258hyp0thetica1 protein mru0292hyp0thetica1
protein a
mru0259cdc6 family replication initiation protein Cdc6-3
mru0293hypothetica1 protein
(2.
mru0260hyp0thet1ca1 protein mru0294hyp0thetica1
protein "
.-.1
mru0262hypothet1ca1 protein mru0295hyp0thetica1
protein
.
IV
mru0263hyp0thet1ca1 protein mru0296hyp0thet1ca1
protein m
mru0264hyp0thetica1 protein mru0297hyp0thet1ca1
protein n)
mru0265hyp0thet1ca1 protein mru0298hyp0thetica1
protein
IV
mru0266hyp0thet1ca1 protein mru0299hypothetica1
protein 1
o
i.)
" mru0267hyp0thet1ca1 protein , mru0300hyp0thetica1
protein 1
i.)
mru0268hyp0thetica1 protein mru0301hypothetical
protein
mru0269ATPase involved in DNA replication control MCM family
mru0302hyp0thetica1 protein
mru0270phage-related protein mru0303hyp0thetica1
protein .
mru0271hypothetical protein - mru0305hyp0thetica1
protein
mru0272hypothetica1 protein mru0306hyp0thetica1
protein
mru0273hypothetica1 protein mru0307phage-related
protein
od
mru0274hyp0thet1ca1 protein mru0308phage-related
protein n
.
1-
mru0275hypothetical protein , .
mru0309hypothetica1 protein
mru0276hypothet1ca1 protein mru0310phage-related
protein N
mru0277hypothetica1 protein mru0311phage-related
protein o
1--,
.
o
mru0278hypothetica1 protein mru0312hyp0thetica1
protein
o
mru0279hypothetica1 protein mru0313phage-related
protein =
1..,
mru0280ParB-like nuclease domain-containing protein mru0314hyp0thetica1
protein c7,
o
L.
,
-
,-
,
mru0315phage tail tape measure protein
0
mru0316phage-related protein Fixation
(,..
o
I--
mru0317phage-related protein mru14664Fe-4S iron
sulfur cluster binding protein NifH/frxC family
=--.
o
mru0318hypothetical protein mru1567NifU-like
FeS cluster assembly scaffold protein (..)
(A
,
mru0319hyp0thetica1 protein
(.4
o
mru0320end0is0pept1dase PeiR General
mru0321dnd system-associated protein 3
mru0094nitroreductase family protein
= mru0322dnd system-
associated protein 1 mru0471nitroreductase family protein
mru0323dnd system-associated protein 2
mru0692nitroreductase family protein
mru0324type ll restriction enzyme, methylase subunit
mru0994n1tr0reductase family protein
mru0325 hypothetical protein
mru0995n1troreductase family protein .
mru1941nitroreductase family protein
0
J ,
Transposase
0
mru0119transposase Other
"
-.3
.-.1
r mru0355transp0sase mru0749ADP-
ribosylglycohydrolase family protein
IV
mru0434transp0sase , mru1455ADP-
ribosylglycohydrolase family protein
mru0578transp0sase mru1580ADP-
ribosylglycohydrolase family protein
c)
mru0623transp0sase remnant
mru2121hydroxylamine reductase Hcp co
IV
I
mru0624transp0saSe remnant ,
o
=
i.)
1
mru0625transposase remnant Regulation
mru0777transposase mru1324nitrogen
regulatory protein P-IIGInK
mru0948transposase
mru0992transp0sase Transport
mru1162transposase mru1325ammon1um
transporter Amt '
mru1583transposase .
mru1608transposase
'
mru1662tran5p0sase
n
,
,
mru1766transposase NUCLEIC ACID
METABOLISM
mru2148transposase ,
N
.
=
DNA-binding proteins
1--,
o
**.=*-Int****A-**********************?r*** mru0011archaeal
histone
.
,
o
mru0774archaea1 histone
==
1--,
=
o=
NITROGEN METABOLISM mru1491archaea1
histone = o
,
= ,
'
mru1686archaea1 histone mru2068DNA-3-
methyladenine glycosylase I Tag
0
mru1731archaeal histone mru1576end0nuc1ease III
Nth (,..
o
mru1760archaea1 histone mru1855endonuclease IV
I--
1--
mru0397DNA-binding protein mru1127excinuclease ABC
A subunit UvrA1
(..4
(A
mru1684histone acetyltransferase ELP3 family mru1256excinuc1ease ABC
A subunit UvrA2 = (.4
mru0606NAD-dependent protein deacetylase , mru1087excinuc1ease ABC
B subunit UvrB
mru1223excinuc1ease ABC C subunit Uvre
Helicase
mru1115exodeoxyribonuclease III Xth1
mru0133ATP-dependent DNA helicase
mru1557exodeoxyr1bonuc1ease III Xth2
mru1111ATP-dependent DNA helicase UvrD/REP family
mru0812ex0de0xyr1b0nuc1ease VII large subunit XseA
mru1138ATP-dependent DNA helicase UvrD/REP family
mru0813exodeoxyribonuc1ease VII small subunit XseB
mru1184ATP-dependent DNA helicaie UvrD/REP family, nnru0770exonuc1ease
= a
mru0600DEAD/DEAH box helicase domain-containing protein
mru2089Hef nuclease '
mru0778DEAD/DEAH box helicase domain-containing protein
mru2099RdgB/HAM1 family non-canonical
purine NTP pyrophosphatase n)
...3
mru1110DEAD/DEAH box helicase domain-containing protein
mru1547ssDNA exonuclease RecJ2 .-.1
NJ
.
IV
mru1319DEAD/DEAH box helicase domain-containing protein
mru2097ssDNA exonuclease Rec,11 m
nnru1681DEAD/DEAH box helicase domain-containing protein
mru1383staphy10c0cca1 nuclease domain-
containing protein 1.)
mru1121DNA helicase mru0218uraci1-DNA
glycosylase Ung ol
=_, 0
I-.
mru1157helicase RecDfTraA family
I.)
1
o
mru0620he11caseSNF2 family Restriction and
modification 1\)
1
iv
mru0981Rad3-related DNA helicase mru00295-methylcytosine
restriction system component protein
,
mru1167DNA modification methylase
Recombination and repair = mru0026DNA-cytosine
methyltransferase
mru15756-0-methylguanine DNA methyltransferase Ogt . . mru0027DNA-
cytosine methyltransferase
mru01368-oxoguanine DNA glycosylase Ogg mru1135restriction
endonuclease
,
niru2027archaea1 Holliday junction resolvase Hjc mru1165restriction
enzyme methylase subunit
mru1566archaea-specific RecJ-like exonuclease mru0927type I
restriction-modification enzyme S subunit HsdS oe
n
mru1105DNA double-strand break repair protein Mre11 mru0928type I
restriction-modification system M subunit HsdM 1-3
mru1106DNA double-strand break repair protein Rad50 mru1166type II
restriction endonuclease
tj
mru1429DNA mismatch endonuclease Vsr mru2106type II
restriction endonuclease 1--,
mru0583DNA mismatch repair ATPase MutS family
o
=-=:E3
mru0115DNA repair and recombination protein RadA
o
1..,
mru0505DNA repair and recombination protein RadB
c7,
.
.
-
_______________________________________________________________________________
_____________________________ ,
PROTEIN FATE mru0403prefo1d1n
alpha subunit PfdA = 0
. mru1347prefo1d1n
beta subunit PfdB (.4
o
Protein degradation degradation mru1357proteas0me
alpha subunit 1--,
.
o
mru1637ATP-dependent protease S16 family mru1977pr0teas0me
beta subunit (..3
(A
mru2100glycoprotease M22 family mru1448proteasome-
activating nucleotidase (.4
o
mru2058methionine aminopeptidase Map mru1501thermosome
subunit
mru1028peptidase C39 family mru1645thermosome
subunit
mru2168peptidase C39 family
.
mru1128peptidase M48 family . Protein secretion
mni1504peptidase M48 family
mru0391oligosaccharyl transferase '
= mru1755pept1dase
M48 family mru0482preprotein translocase subunit SecE
mru0130peptidase M50 family mru0239prepr0tein
translocase subunit SecG a
mru0238peptidase M50 family - mru0875prepr0tein
translocase subunit SecY .
0
mru1698pept1dase M50 family mru1722pr0te1n
export membrane protein SecD "
-.3
mru1677peptidase S49 family mru1721protein
export membrane protein SecF
IV
mru0584peptidase U32 family nnru1581signal
peptidase I "
.1,
mru0585peptidase U32 family mru2118signal
peptidase I , n)
.
mru1467peptidase U32 family '
= mru0404s1gna1 recognition particle
receptor FtsY co
m
H'
IV
mru1727peptidase U62 family = mru1546signa1
recognition particle SRP19 protein 1
o
n)
mru1873pept1dase U62 family mru0164signal
recognition particle SRP54 protein 1
i\)
mru2021transg1utaminase domain-containing protein mru1832sortase
family protein .1,.
mru0669Xaa-Pro aminopeptidase mru0522type II
secretion system protein E GspE
,
mru0524type II secretion system protein F GspF1
Protein folding mru0670type II
secretion system protein F GspF2
mru1305DnaK-related protein mru1234type IV
leader peptidase family protein
mru1812DnaK-related protein
oe
mru1730heat shock protein Hsp20/alpha crystallin family
************** *********************** n
mru2040m01ecu1ar chaperone DnaJ =
mru2039m01ecu1ar chaperone DnaK . PROTEIN SYNTHESIS
=
=
N
mru2038molecular chaperone GrpE =
o
=
mru1511nascent polypeptide-associated complex protein Other
C3
o
mru0145peptidy1-proly1 cis-trans isomerase mru0507ATPase RIL
=
1..,
mru1901peptidyl-proly1 cis-trans isomerase mru0508pept1dy1-
tRNA hydrolase o=
o
,
,
. .
,
.
.
'
mru0163pseudouridylate synthase mru0485r1bosoma1
protein LIP Rpl1p 0
mru2127ribonuc1ease . mru0162ribosoma1
protein L21e RpI21e (.4
o
I--
mru2128ribonuclease inhibitor mru0856r1bosoma1
protein L22P Rp122p 1--,
=--.
o
mru1349ribos0ma1 biogenesis protein mru0853ribosoma1
protein L23P Rp123p (..4
(A mru1038RNA methylase
mru0483ribosoma1 protein L24 family (.4
mru1440RNA methylase mru1488ribosomal
protein L24e Rp124e
, mru0159RNA-binding protein mru0863rib0soma1
protein L24P Rp124p ,
,
mru0395RNA-binding protein mru0858ribosoma1
protein L29P Rp129p
mru0475RNA-binding protein mru0854ribos0ma1
protein L2P Rpl2p ,
mru0519RNA-binding protein mru1810ribosomal
protein L30e Rp130e
mru0729RNA-binding protein mru0873rib0soma1' protein L3OP RpI30p
,
mru1284RNA-binding protein = mru0400r1bosoma1
protein L31e RpI31e a
mru1866RNA-binding protein mru0869ribosoma1
protein L32e Rp132e
mru0923RNA-metabolising metallo-beta-lactamase mru0878r1bosoma1
protein L34e Rp134e n)
-.3
.-.1
mru1978RNA-metabolising metallo-beta-lactamase mru1351ribosomal
protein L37Ae Rp137ae = , n)
IV
mru1610rRNA methylase mru1281ribosomal
protein L37e Rp137e "
mru0504Sua5NciO/YrdCNwIC family translation factor mru0399ribosoma1
protein L39e Rp139e
(z)
mru0180trans1ation-associated GTPase mru0851r1bosoma1
protein L3P Rpl3p co
(a
H
IV
.
I
mru0398tRNA methyltransferase subunit mru1100ribosomal
protein L40e RpI40e o
n)
mru0175r1bosoma1 protein L44e Rp144e
1
' i\) .
. .
Ribosomal proteins mru0852r1bosoma1
protein L4p Rpl4p
mru0486acidic ribosomal protein PO Rp1P0 mru0865r1bosoma1
protein L5P Rpl5p
mru1898r1b0s0ma1 protein L10e Rp110e mru0868r1b0soma1
protein L6P Role!)
' mru0484r1b0soma1 protein L11P Rp111p mru1490rib0s0ma1
protein L7Ae Rpl7ae
,
mru0487ribosomal protein L12P Rp112p mru0402r1bosoma1
protein LX RpIX ,
mru0910ribosoma1 protein L13P Rp113p mru1804rib0soma1
protein S1OP Rps1Op
oe
mru0880ribosoma1 protein L14e Rp114e mru0907ribosoma1
protein S11P Rps11p n
1-
mru0862ribosomal protein L14P Rp114p mru1808ribosomal
protein S12P Rps12p
mru1362ribosoma1 protein L15e Rp115e mru0905ribosorna1
protein S13P Rps13p N
mru0874ribosoma1 protein L15P Rp115p ,
mru0866ribosoma1 protein S14P Rps14p 1--
,
o
mru0909ribosoma1 protein L18e Rp118e mru2098ribosoma1
protein S15P Rps15p
o
mru0871rib0s0ma1 protein L18P Rp118p mru1673rib0soma1
protein S17e Rps17e o
1--,
mru0870r1bosoma1 protein L19e Rp119e mru0861rib0s0ma1
protein S17P Rps17p c7,
_
= .
. .
=
mru0396ribosoma1 protein S19e Rps19e mru0713H/ACA RNA-protein
complex component Gart
mru0855ribosoma1 protein S19P Rps19p mru0178H/ACA RNA-protein
complex component Nop1Op
mru1478ribosoma1 protein S24e Rps24e mru1687MiaB-like tRNA
modifying enzyme
mru1477r1bosoma1 protein S27ae Rps27ae mru1960N2,N2-
dimethylguanosine tRNA methyltransferase Trm1 (.4
mru0176ribosoma1 protein S27e Rps27e mru0589NMD3 family
protein
mru1489ribosoma1 protein S28e Rps28e mru1987pre-mRNA splicing
ribonucleoprotein PRP31
mru0916ribos0ma1 protein S2P Rps2p mru0437queuos1ne
biosynthesis protein QueC
= mru0461ribosomal protein
S3Ae Rps3ae mru0438queuosine biosynthesis protein QueD
mru0857r1b0s0ma1 protein S3P Rps3p mru1359r1bonuc1ease P
subunit P14
mru0864ribosoma1 protein S4e Rps4e mru0860r1bonuc1ease P
subunit P29
mru0906ribosoma1 protein S4P Rps4p mru1360r1bonuc1ease P
subunit P30
mru0872ribosoma1 protein S5P Rps5p mru0394r1bonuc1ease P
subunit RPR2
mru1485ribosoma1 protein S6e Rps6e mru0617r1bonuc1ease Z
Rnz
mru1807ribosoma1 protein S7P Rps7p mru0849ribosoma1 protein
L11 methyltransferase PrmA
mru1552ribosoma1 protein S8e Rps8e mru0593r1bosoma1 RNA
large subunit methyltransferase J RrmJ
mru0867rib0soma1 protein S8P Rps8p mru1800ribo5oma1-protein-
alanine acetyltransferase Riml
mru0911ribosomal protein S9P Rps9p mru1025RNA ligase
DRB0094 family
mru0654RNA methyltransferase TrmH family
co
RNA processing mru0157SAM-dependent
methyltransferase HemK-related
=
mru09962'-5 RNA ligase LigT mru1495tRNA intron
endonuclease EndA i\)
mru04397-cyano-7-deazaguanosine biosynthesis protein QueE mru0987tRNA
nucleotidyltransferase Cca
mru0525archaea1 fibrillarin-like protein mru1703tRNA
pseudouridine synthase A TruA
mru0814archaeosine tRNA-ribosyltransferase TgtA1 mru0197tRNA
pseudouridine synthase D TruD
mru1512archaeosine tRNA-ribosyltransferase TgtA2 mru2088tRNA(1-
methyladenosine) methyltransferase
mru0158dimethy1adenos1ne transferase KsgA mru0676tRNA(His)
guanylyltransferase ThgL
mru1353exosome complex exonuclease Rrp41 = mru0096tRNA-
dihydrouridine synthase DusAl 1-Lt
=
mru0168exosome complex RNA-binding protein CsI4 mru1846tRNA-
dihydrouridine synthase DusA2
mru1354exosome complex RNA-binding protein Rrp4 mru1823tRNA-modifying
enzyme
mru1352exosome complex RNA-binding protein Rrp42
mru1355ex0s0me subunit . Translation factors
mru1361exosome subunit mru0469ce11 division
protein pelota PelA
mru1988fibri11arin mru1053deoxyhypusine
synthase Dys
mru0898H/ACA RNA-protein complex component Cbf5p mru0166diphthamide
biosynthesis protein
, .
.
. mru1764diphthine synthase DphB , mru0959methi0ny1-
tRNA synthetase MetG
0
mru0728pept1de chain release factor aRF1 mru1558phenylalanyl-
tRNA synthetase alpha subunit PheS r-4
o
mru1805trans1ati0n elongation factor aEF-1 alpha mru1586pheny1a1any1-
tRNA synthetase subunit beta PheT I--
1--
mru0520trans1ation elongation factor aEF-1 beta mru0954pro1y1-tRNA
synthetase ProS
(..4
(A
mru1300translation elongation factor aEF-2 ,
mru1947sery1-tRNA synthetase SerS (.4
o
mru1806trans1ati0n elongation factor aEF-2 mru2129thre0ny1-
tRNA synthetase ThrS
mru1869trans1at10n initiation factor alF-1A mru0673tRNA binding
domain-containing protein
mru0177trans1ation initiation factor alF-2 alpha subunit
mru1494trypt0phany1-tRNA synthetase TrpS
mru0590trans1ation initiation factor alF-2 beta subunit , mru0588tyr0sy1-
tRNA synthetase TyrS
mru1484trans1ati0n initiation factor alF-2 gamma subunit mru1584va1y1-tRNA
synthetase ValS
mru17681rans1ation initiation factor alF-2B alpha subunit
=
mru1742trans1ation initiation factor alF-5A
(-)
mru0401translation initiation factor alF-6
o
mru0859trans1ati0n initiation factor aSUI1 PURINES AND
PYRIMIDINES ' iv
.-.3
mru1486trans1ation initiation factor IF-2
N)
N)
Purine biosynthesis
N)
.1,
tRNA aminoacylation - mru05955-
formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'- _. NJ
. mru0492a1any1-tRNA synthetase AlaS monophosphate-
formate ligase PurP 03
(31
0
H
NJ
mru2117arginyl-tRNA synthetase ArgS mru0165adenine
phosphoribosyltransferase Apt 1
0
mru1014aspartyl-tRNA synthetase AspS
niru0229adeny10succinate lyase PurB 1\)
1
NJ
mru2169Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit A GatA
mru1468amid0ph0sph0r1b05y1transferase PurF
mru2029Asp-tRNA(Asn)/G1u4RNA(Gln) amidotransferase subunit B GatB
mru1839IMP cyclohydrolase Pur0
,
mru1142Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit C GatC
mru2113phosphoribosylamine-glycine ligase PurD
mru1571cysteinyl-tRNA synthetase CysS -
mru2033phosphoribosy1amin0imidaz01e carboxylase catalytic subunit PurE1
mru0137D-tyrosyl-tRNA(Tyr) deacylase '
mru2186phosphoribosy1aminoimidazo1e carboxylase purE2
mru0938g1utamy1-tRNA synthetase GItX
mru1538phosphoribosy1amino1midazo1e-succinocarboxamide synthase PurC
mru1427g1utamy1-tRNA(Gln) amidotransferase subunit D GatD
mru0127phosphor1bosy1formy1g1ycinamidine (FGAM) synthase ll PurL oo
n
mru1426g1utamy1-tRNA(Gln) amidotransferase subunit E GatE
mru1540phosphoribosy1formy1g1ycinam1dine (FGAM) synthase PurQ 1-3
mru0651g1ycy1-tRNA synthetase GlyS
mru1539phosphor1bosy1formy1g1ycinarnidine (FGAM) synthase PurS
N
mru1248histidy1-tRNA synthetase HisS
mru1979ph0sph0rib0sy1formy1g1ycinamidine cyclo-ligase PurM 1--,
o
_mru0126is01eucy1-tRNA synthetase IleS
= =-=:E3
mru2077Ieucy1-tRNA synthetase LeuS ' Purine
interconversion
o
1-,
mru02421ysy1-tRNA synthetase LysS mru1736adenine
deaminase Ade , o
o
.
'
,
.
.
,
=
,
_
mru0386adeny1ate cyclase CyaA mru21045'-nucleotidase
SurE1 ,
0
mru0876adenylate kinase Adk mru04205'-nucleotidase
SurE2 (.4
o
mru0946adeny1osuccinate synthetase PurA '
mru0241anaerobic ribonucleoside-triphosphate
reductase NrdD I--
1--
=--.
mru1487nuc1eos1de diphosphate kinase Ndk
(..4
Pyrimidine biosynthesis
o
mru0005aspartate carbamoyltransferase PyrB Salvage
mru1725aspartate carbamoyltransferase regulatory subunit Pyrl
mru1372urac11phosphoribosyltransferase Upp
mru1791carbamoyl-phosphate synthase large subunit CarB .
mru1792carbamoy1-phosphate synthase small subunit CarA = Transport
,
mru1904dihydroor0tase PyrC mru1373xanth1nefuracil
pemnease
mru1985dihydr00r0tate dehydrogenase electron transfer subunit PyrK
=
mru1986dihydr00r0tate dehydrogenase PyrD
a
mru1748orotate phosphoribosyltransferase PyrE1
mru1783orotate phosphoribosyltransferase PyrE2 REGULATION =
n)
...3
mru10550r0tid1ne 6-phosphate decarboxylase PyrF
.-.1
NJ
Protein interaction .
= IV
1\)
.1,
Pyrimidine interconversion mru0039biotin-binding
and phosphotyrosine protein phosphatase domain-containing
mru1286CMP/dCMP deaminase protein
Co
cr) 0
I-.
mru2199CMP/dCMP deaminase = mru0582phosphate uptake
regulator PhoU "
I
0
nnru1237CTP synthase PyrG mru1295phosphotyr0sine
protein phosphatase n)
1
mru0879cytidy1ate kinase Cmk nn ru0044serine
phosphatase N)
.1,.
mru0652deoxycyt1d1ne triphosphate deaminase Dcd = ,=
mru1306ser1ne/threonine protein kinase
mru1048dUTP diphosphatase Dut = mru1868serine/threonine
protein kinase R101 family
mru1425th1oredoxin-disulfide reductase Trx6 = :0(,,;
mru1168serine/threonirie protein kinase with TPR repeats
mru0586thym1dy1ate kinase Tmk1 = :4`,.,
mru1288serine/threonine protein phosphatase
mru1226thymidy1ate kinase Tmk2 mru0452TPR repeat-
containing protein
, ..
e ,
mru2105thymidy1ate synthase ThyA mru0748TPR repeat-
containing protein
n
mru0722ur1dy1ate kinase PyrH mru0800TPR repeat-
containing protein = 1-3
-..
mru1018TPR repeat-containing protein
. N
Interconversion . mru1086TPR repeat-
containing protein o
1--,
mru1422GMP synthase subunit A GuaA _ mru1186TPR repeat-
containing protein
.:(E3
mru1420GMP synthase subunit B GuaAb ,
mru1216TPR repeat-containing protein =
o
1..,
mru1208inosine-6-monophosphate dehydrogenase GuaB mru1316TPR repeat-
containing protein o
o
,
.
=
mru1813TPR repeat-containing protein mru1961transcriptional
regulator AsnC family 0
mru1825TPR repeat-containing protein mru0359transcr1pt10na1
regulator HxIR family
mru2116TPR repeat-containing protein mru0370transcriptiona1
regulator HxIR family
mru2166TPR repeat-containing protein miu0249transcriptiona1
regulator LysR family
=
mru2167TPR repeat-containing protein mru0662transcriptiona1
regulator LytR family
mru0138transcr1ptiona1 regulator MarR family
Transcriptional regulator mru0217transcript1ona1
regulator MarR family
mru13381r0n dependent repressor mru0442transcripti0na1
regulator MarR family
mru14961r0n dependent repressor mru0815transcript10na1
regulator MarR family
mru1037nicke1 responsive transcriptional regulator NikR
mru0988transcript1ona1 regulafor MarR family
mru0073transcripti0na1 regulator mru1548transcr1ptiona1
regulator MarR family
mru0232transcript10na1 regulator mru1556transcriptiona1
regulator MarR family
mru0430transcript10na1 regulator mru1629transcriptiona1
regulator MarR family (2.
=
mru0649transcripti0na1 regulator mru0142transcriptiona1
regulator TetR family
mru0650transcripti0na1 regulator mru0577transcripti0na1
regulator TetR family
mru0684transcripti0na1 regulator mru0645transcriptiona1
regulator TetR family
mru1052transcripti0na1 regulator mru1739transcr1ptiona1
regulator TetR family 1.)
_n
mru1134transcriptional regulator mru0220transcriptiona1
regulator TetR family with acetyltransferase GNAT family co
mru1160transcriptional regulator domain
mru1327transcripti0na1 regulator
i\)
mru1435transcript1ona1 regulator Other
mru1447transcr1pt10na1 regulator mru0187carbon starvation
protein CstA
mru1449transcripti0na1 regulator mru0576sugar
fermentation stimul6tion protein SfsA1
mru1871transcriptional regulator = mru2018sugar
fermentation stimulation protein SfsA2
mru1944transcriptional regulator
mru2122transcripti0na1 regulator ************
=
mru2165transcripti0na1 regulator AbrB family
mru0132transcript10na1 regulator ArsR family SECONDARY METABOLITES
mru0365transcr1pt10na1 regulator ArsR family
mru1334transcriptiona1 regulator ArsR family NRPS
mru1446transcripti0na1 regulator ArsR family . mru0068non-
ribosomal peptide synthetase
mru1762transcripti0na1 regulator ArsR family mru0351non-ribosomal
peptide synthetase
c7,
mru2037transcript10na1 regulator ArsR family
=
.
.
Other mru1809transcript10n
elongation factor NusA-like protein 0
mru05144'-phosphopantetheinyl transferase family protein
mru0546transcr1pt1on factor S Tfs1 r-4
o
mru0512acy1transferase mru0171transcription
factor S Tfs2 1--,
.
--
mru0067anti-sigma factor antagonist mru0714transcripti0n
initiation factor TFIIB Tfb1 o
(..4
(A
mru0513ant1-sigma factor antagonist mru1600transcript10n
initiation factor TFIIB Tfb2 (.4
mru0516anti-sigma regulatory factor serine/threonine protein kinase
mru0478transcripti0n initiation factor TFIIE alpha subunit Tfe
mru0069MatE efflux family protein .
,
mru0352MatE efflux family protein Other
mru0066ser1ne phosphatase mru1282LSM domain-containing protein ,
=
mru0071ser1ne phosphatase mru2102LSM domain-containing protein
,
mru0515ser1ne phosphatase mru1278ribonuc1ease
III Rnc
mru0837RNA-binding protein
,
a
=
mru2179RNA-binding S1 domain-containing protein
=
mru1119glutamyl aminopeptidase PepA
n)
...3
'
TRANSCRIPTION
.-.1
NJ
*************************************
IV
.
1\)
.1,
RNA polymerase
mru1482DNA-dependent RNA polymerase subunit E' RpoE1 TRANSPORTERS.
op
co 0
H
IV
I
mru1481DNA-dependent RNA polymerase subunit E" RpoE2
o
mru1815DNA-directed RNA polymerase subunit A' RpoA1 Amino acids
"
1
i\)
mru1814DNA-directed RNA polymerase subunit A" RpoA2 mru1775amin0 acid
ABC transporter ATP-binding protein
mru1816DNA-directed RNA polymerase subunit B' RpoB1 mru1776amino acid
ABC transporter permease protein
mru1817DNA-directed RNA polymerase subunit B" RpoB2 mru1777amino acid
ABC transporter substrate-binding protein
mru0908DNA-directed RNA polymerase subunit D RpoD , mru1945amino
acid ABC transporter substrate-binding protein
mru0161DNA-directed RNA polymerase subunit F RpoF mru1759amino acid
carrier protein AGCS family
,
'
mru1818DNA-directed RNA polymerase subunit'll RpoH
oe
mru0913DNA-directed RNA polymerase subunit K RpoK Anions
n
mru0169DNA-directed RNA polymerase subunit L RpoL mru0467v0ltage-gated
chloride channel protein 1-3
mru0912DNA-directed RNA polymerase subunit N RpoN
N
mru1350DNA-directed RNA polymerase subunit P RpoP Cations
o
mru1802cati0n-transporting P-type ATPase
o
Translation factors mru0205c0pper ion
binding protein o
mru0387TATA-box binding protein Tbp mru2208diva1ent
cation transporter mgtE family c=
,
-
,
=
mru0536ferrous iron transport protein A FeoA mru1438ABC
transporter ATP-binding protein 0
mru1340ferrous iron transport protein B FeoB1 mru1701ABC
transporter ATP-binding protein (,..
o
I--
mru0537ferrous iron transport protein B FeoB2 mru1714ABC
transporter ATP-binding protein 1--,
=--.
o
mru0206heavy metal translocating P-type ATPase mru2207ABC
transporter ATP-binding protein k..a
.
un
mru1612heavy metal translocating P-type ATPase mru0366ABC
transporter ATP-binding/permease protein (.4
mru1861heavy metal translocating P-type ATPase mru1627ABC
transporter ATP-binding/permease protein
mru1333heavy metal-translocating P-type ATPase mru1628ABC
transporter ATP-binding/permease protein
, mru078910n transport protein mru0003ABC
transporter permease protein
mru0808ion transport protein mru0252ABC
transporter permease protein
mru1083ion transport protein mru1437ABC
transporter permease protein
mru1953K+-dependent Na+/Ca+ exchanger mu1702ABC
transporter permease protein .
.
.
mru1615nickel ABC transporter ATP-binding protein NikD1 mru1713ABC
transporter permease protein a
mru1706nicke1 ABC transporter ATP-binding protein NikD2 mru2206ABC
transporter permease protein
n)
mru1614nickel ABC transporter ATP-binding protein NikE1 mru0251ABC
transporter substrate-binding protein ...3
.-.1
mru1705nicke1 ABC transporter ATP-binding protein NikE2 mru0216MatE efflux
family protein N)
IV
I \ )
= .
mru1617nickel ABC transporter permease protein NikB1 mru0609MatE efflux
family protein
mru1709nicke1 ABC transporter permease protein NikB2 mru1439MatE efflux
family protein
(z)
mru1616nickel ABC transporter permease protein NikC1 mru1658MatE efflux
family protein 03
IV
I
mru1708n1cke1 ABC transporter permease protein NikC2 ,
mru1735MatE efflux family protein
o
i.)
1
mru1618nickel ABC transporter substrate-binding protein NikA1
mru1751MatE efflux family protein
i\)
.1,
mru1710nickel ABC transporter substrate-binding protein NikA2 mru1765MatE
efflux family protein
mru2020potass1um channel protein
mru0616mechanosensitive ion channel protein
mru2025p0tassium uptake protein TrkA family mru0046MFS
transporter ,
mru0207potassium uptake protein TrkH family mru0139MFS
transporter
mru2024potass1um uptake protein TrkH family mru0140MFS
transporter
mru0821tran5p0rter CDF family mru0215MFS
transporter 1-Lt
mru0827transporter CDF family , _
mru0379MFS transporter n
mru0405transp0rter Na+/H+ antiporter family mru0559MFS
transporter
N
= mru2209TrkA domain-
containing protein mru1002MFS transporter
mru1191MFS transporter
1--,
o
Other mru1201MFS
transporter .j(E3
o
o
' '
mru0002ABC transporter ATP-binding protein mru1968MFS
transporter 1..,
c7,
mru0253ABC transporter ATP-binding protein
mru0329MotA/ToIQ/ExbB proton channel family protein
=
,
'
.
'
.
,
S
mru0705MotAirolQ/ExbB proton channel family protein
mru1758acety1transferase 0
mru1082MotArfolQ/ExbB proton channel family protein
mru1881acety1transferase (,..
o
- 1--
' mru2045MotAffolQ/ExbB proton channel family protein
mru2170acety1transferase 1¨
o
mru2051MotA/ToIQ/ExbB proton channel family protein
mru0500acety1transferase GNAT family (..4
(A
mru1332Na+ dependent transporter SBF family
mru0574acety1transferase GNAT family (.4
o
mru0008Na+-dependent transporter SNF family
mru0612acety1transferase GNAT family
mru0406Na+-dependent transporter SNF family
mru0633acety1transferase GNAT family
mru0407Na+-dependent transporter SNF family
mru1374acetyltransferase GNAT family
mru0636Na+-dependent transporter SNF family
rnru1707acety1transferase GNAT family
mru1202Na+-dependent transporter SNF family
mru1712acetyltransferase GNAT family
mru1285Na+-dependent transporter SNF family
mru2032acety1transferase GNAT family
.
mru2197Na+-dependent transporter SNF family
mru2198acety1transferase GNAT family a
mru0116transporter mru1328acy1-CoA synthetase
,
mru0141transporter mru0248acy1tr3nsferase
n)
...i
.-.1
mru1840transp0rter ExbDfrol family mru1534acy1transferase
"
n) ,
IV
mru1370transp0rter MIP family mru0087amidohydr01ase
"
mru1841transporter MotAirolQ/ExbB proton channel family
mru0203amid0hydro1ase 1.)
mru2176transp0rter permease family protein mru0610amidohydrolase
co
0 H
IV
I
mru2177transp0rter permease family protein , ,
mru0664amid0hydr01ase o
iv
1
mru0986tran5porter SDF family mru0826am1notransferase
i\)
,
.
mru1789transp0rter SDF family mru1940amin0transferase
mru0358transporter small multidrug resistance (SMR) family
mru1959aminotransferase DegT/DnrJ/EryC1/StrS family
mru0369transp0rter small multidrug resistance (SMR) family _ mru1032AMP-
binding enzyme
mru0993transporter TDT family mru1298archaea1 ATPase
mru2140archaea1 ATPase
mru0014ATPase
1-: rnru0040ATPase
n
1-3
,
UNKNOWN FUNCTION mru0075ATPase
mru0534ATPase
N
Enzyme mru0560ATPase
1--,
= o
_mru0422acety1transferase mru1104ATPase
o
mru0455acety1transferase mru1170ATPase
=
1..,
o=
mru1473acety1transferase mru1854ATPase
o
= , .
,
,
,
mru1860ATPase ' , mru0151NUDIX domain-
containing protein 0
mru2070ATPase mru0170NUDIX domain-
containing protein w
o
,
I--
mru0415calcineurin-like phosphoesterase ' mru0737NUDIX domain-
containing protein
¨.
mru0820ca1cineur1n-like phosphoesterase mru0148oxidoreductase
aldo/keto reductase family o
k..i
uri
mru2019carb0hydrate kinase mru0354oxidoreductase
aldo/keto reductase family c.4
o
mru0246carbohydrate kinase PfkB family mru0416oxidoreductase
aldo/keto reductase family
mru1329carbohydrate kinase PflcB family mru0579ox1doreductase
aldo/keto reductase family ,
mru1392carbohydrate kinase PfkB family mru0773oxidoreductase
aldo/keto reductase family
mru0708CobB/CobQ-like glutamine amidotransferase domain-containing protein
mru0932oxid0reductase aldo/keto reductase family
mru1040D-alanine-D-alanine ligase mru1120oxidoreductase
aldo/keto reductase family
mru0712demethy1menaquinone methyltransferase - mru1747oxidoreductase
GFO/IDH/MOCA family
mru0825Fe-S oxidoreductase
mru0594phosph0diesterase MJ0936 family a
mru0052g1ycy1-radical enzyme activating protein
mru1623phosphodiesterase MJ0936 family
mru2192g1ycy1-radical enzyme activating protein mru0845pyridoxa1
phosphate enzyme
.-.1
mru1723GMC oxidoreductase family protein mru0568rad1ca1 SAM
domain-containing protein n)
IV
mru0491hydrolase alpha/beta fold family mru0646radica1 SAM
domain-containing protein . m
.1,.
mru0511hydrolase alpha/beta fold family mru1453radica1 SAM
domain-containing protein , n)
mru0771hydro1ase alpha/beta fold family mru1675radica1 SAM
domain-containing protein co
H
IV
mru1036hydrolase alpha/beta fold family mru2028rad1ca1 SAM
domain-containing protein 1
o
n)
- mru1508hydr01ase alpha/beta fold family mru2152rad1ca1 SAM
domain-containing protein 1
i\)
mru1554hydr01ase HAD superfamily nnru0195SAM dependent
methyltransferase .1,.
mru2163hydro1ase HAD superfamily mru0779SAM dependent
methyltransferase
' mru0226hydro1ase TatD family mru0925SAM dependent
methyltransferase
mru0648hydr01ase TatD family mru0637SAM-dependent methyltransferase
,
mru0721hydrolase TatD family mru0933SAM-dependent
methyltransferase =
mru1006hydr01ase TatD family mru0934SAM-dependent
methyltransferase
oe
mru0929manganese-dependent inorganic pyrophosphatase PpaC mru0935SAM-
dependent methyltransferase n
1-3
mru0834meta110-beta-lactamase superfamily protein mru1001SAM-dependent
methyltransferase
, mru1562meta110-beta-lactamase superfamily protein mru1011SAM-dependent
methyltransferase N
k=-)
o
mru2026meta110-beta-lactamase superfamily protein . mru1026SAM-dependent
methyltransferase 1--,
o
mru1502methyltransferase niru1520SAM-dependent
methyltransferase C3
o
mru2164NADH:flavin oxidoreductase/NADH oxidase family protein
mru1613SAM-dependent methyltransferase
=
1--,
c7,
mrul 757NADH-dependent flavin oxidoreductase mru1643SAM-dependent
methyltransferase o
,
.
.
.
_
mru0443short-chain dehydrogenase family protein mru0194HEAT repeat-
containing protein 0
mru1958short-chain dehydrogenase family protein .
mru0601HEAT repeat-containing protein
(,..
o
I--
mru1034HEAT repeat-containing protein
=--.
o
Other
mru0095isoprenylcysteine carboxyl methyltransferase family protein (..4
(A
mru1443ABM family protein nriru1194LemA family
protein = (.4
mru1254ACT domain-containing protein mru1767Met-10+ like-
protein
mru1788ACT domain-containing protein mru1976Met-10+ like-
protein '
. mru1728AMMECR1 domain-containing protein
mru0668methanogenes1s marker protein 1
mru1231ATP-binding protein . ,
mru1929methan0genes1s marker protein 10
mru0709ATP-grasp domain-containing protein
mru0097methanogenesis marker protein 11
,
mru0598band 7 family protein
mru2109methanogenesis marker protein 12
mru1019C_GCA:c(G_C_C family protein
mru0181methanogenesis marker protein 13 a
mru1611C_GCAxxG_C_C family protein
mru1915methanogenesis marker protein 14
mru0230CAAX amino terminal protease family protein
mru1771methanogenesis marker protein 15 . = n)
...3
.-.1
mru0231CAAX amino terminal protease family protein
mru1934methan0genesis marker protein 16 N)
IV
I \ )
mru0237CAAX amino terminal protease family protein
mru1770methan0genesis marker protein 17
mru0547CAAX amino terminal protease family protein
mru1778methan0genesis marker protein 2 1.)
- mru1738CAAX amino terminal protease family protein
mru1774methanogenesis marker protein 3 co
IV
I
mru0665CBS domain-containing protein
mru1004methanogenesis marker protein 4 o
n)
mru0823CBS domain-containing protein
mru1772methan09enesis marker protein 5 1
i\)
.1,
mru1390CBS domain-containing protein .
mru1773methanogenes1s marker protein 6 .
mru1952CBS domain-containing protein
mru1931methanogenesis marker protein 7 =
mru1993CBS domain-containing protein
mru0436methan0genesis marker protein 8
mru1994CBS domain-containing protein
mr61851methanogenesis marker protein 9
mru20300BS domain-containing protein mru0531NIF3 family
protein
mru0566cytidy1transferase-related domain-containing protein mru1009PHP
domain-containing protein
1-:
mru1982DGC domain-containing protein mru2157PHP domain-
containing protein n
mru1308FHA domain-containing protein mru1250PIN domain-
containing protein 0
mru0221Fic family protein mru0562PP-loop
family protein .
N
mru0392GTP-binding protein mru0666PP-loop
family protein 1--,
o
mru1729GTP-binding protein mru1956PP-loop
family protein
o
mru0474HD domain-containing protein mru1913PRC-barrel
domain-containing protein o
1..,
c7,
mru1274HD domain-containing protein mru019-1pyridoxamine
6-phosphate oxidase family protein
'
=
mru0228pyrid0xamine 5-phosphate oxidase family protein mru1891alpha-
ribazole phosphatase CobZ 0
mru1678redox-active disulfide protein mru2218coba1am1n
biosynthesis protein CbiB
o
mru1848TfoX C-terminal domain-containing protein mru0887coba1amin
biosynthesis protein CbiD I--
0-,
=--.
mru0517TfoX N-terminal domain-containing protein mru0889coba1amin
biosynthesis protein CbiG o
(..4
(A
mru0667TfuA-like protein mru0539coba1amin
biosynthesis protein CbiM1 (.4
o
mru1756th10esterase family protein mru0885coba1amin
biosynthesis protein CbiM2
mru1145toxic anion resistance protein mru2200c0ba1am1n
biosynthesis protein CbiX =
mru1950TraB family protein mru1892cobalamin-5-
phosphate synthase CobS
mru0561von Willebrand factor type A domain-containing protein mru0882c0ba1t
ABC transporter ATP-binding protein Cbi01
mru1593von Willebrand factor type A domain-containing protein mru1217cobalt
ABC transporter ATP-binding protein Cbi02
mru1304WD40 repeat-containing protein mru1220coba1t ABC
transporter ATP-binding protein CbiO3
mru1251xylose isomerase-like TIM barrel domain-containing protein
mru0541cobalt ABC transporter permease
protein CbiQ1 a
mru1252xy1ose isomerase-like TIM barrel domain-containing protein
mru0883coba1t ABC transporter permease protein CbiQ2
mru2162xy10se isomerase-like TIM barrel domain-containing protein
mru1221cobalt ABC transporter permease
protein CbiQ3 1\)
...3
mru0659YhgE/Pip-like protein mru0895coba1t chelatase
CbiK . .-.1
NJ
,
IV
mru1510YhgE/Pip-like protein . mru0540c0ba1t transport
protein CbiN1 n)
.1,.
=
mru0627ZPR1 zinc-finger domain-containing protein .
mru0884coba1t transport protein CbiN2
n)
mru1638cobyric acid synthase CbiP
. co
oi 0
H
IV
*********** ****** * ***** ************** mru0360cobyrinic acid
a,c-diamide synthase CbiA1
.
o'
n)
mru0371cobyrinic acid ac-diamide synthase CbiA2
1
. i\)
,
VITAMINS AND COFACTORS mru0893cobyrinic acid
ac-diamide synthase CbiA3 =
.1,.
=. mru2151cobyrinic acid a,c-diamide synthase CbiA4
Biotin mru1560delta-
aminolevulinic acid dehydratase HemB
mru20416-carboxyhexanoate-CoA ligase BioW mru0999g1utamate-1-
semialdehyde-2,1-aminomutase HemL
mru20428-amino-7-oxononanoate synthase BioF mru1853g1utamy1-tRNA
reductase HemA
mru2084adenosy1methionine-8-amino-7-oxononanoate aminotransferase BioA
mru1914GTP:adenosylcobinamide-phosphate guanylyltransferase CobY
oe
mru2087biotin synthase BioB1 mru2047magnesium
chelatase H subunit BchH n
,
1-3
mru0527biotin synthase BioB2 mru1543magnesium-
protoporphyrin IX monomethyl ester anaerobic oxidative
mru0846biotin-acetyl-CoA-carboxylase ligase BirA .
N cyclase BchE
mru2086dethiobiotin synthetase BioD . mru2101nicotinate-
nucleotide-dimethylbenzimidazole phosphoribosyltransferase 1--,
CobT
Cobalamin
mru1746p0rph0bi11n0gen deaminase HemC
=
mru1218adenosylcobinamide amidohydrolase CbiZ
mru0886prec0rrin-2 C20-methyltransferase CbiL
1..,
o
,
o
mru2210prec0rrin-3B C17-methyltransferase CbiH
,
'
mru0890precorr1n-3B 017-methyltransferase CbiH1
mru0465bifunctiona1 glutamate-cysteine ligase/glutathione synthetase gshF
c:;)
mru0888prec0rrin-4 C11-methyltransferase CbiF mru0463gamma-
glutamylcysteine synthetase GshA1 w
o
.--
mru0891precorrin-6x reductase CbiJ mru0464gamma-
glutamylcysteine synthetase GshA2 .--,
--.
o
mru1276precorr1n-6Y C5,15- methyltransferase (decarboxylating) CbiT
mru0462g1utam1ne amidotransferase
k..i , uri
mru0892prec0rrin-6Y C5,15-methyltransferase (decarboxylating) CbiET
mru1935g1u1athione peroxidase GpxA
c.4
mru0894prec0rrin-8X methylmutase CbiC mru0472g1utath1one-
disulfide reductase Gor1
' mru1852siroheme synthase CysG mru1377g1utathione-
disulfide reductase Gor2
mru1541uroporphyrin-III C-methyltransferase CobA
,
mru1544uroporphyrin09en-III synthase HemD Metal-binding
pterin
mru0348m01ybdate ABC transporter ATP-binding protein ModC
Coenzyme B mru0200mo1ybdate
ABC transporter permease protein ModB .
mru0384homoaconitase large subunit AksD mru0201molybdate
ABC transporter substrate-binding protein ModA a
mru1689homoaconitase small subunit AksE mru2137mo1ybdate
transport system regulatory protein ModE 0
n)
mru0385homocitrate synthase AksA mru1782m01ybdenum
cofactor biosynthesis protein B MoaB
.-.1
mru10331sohomocitrate dehydrogenase AksF mru1680m01ybdenum
cofactor biosynthesis protein C MoaC n)
IV
mru1691molybdenum cofactor biosynthesis protein MoaA
n)
.1,..
= , .
,
Coenzyme F420 mru1268mo1ybdenum
cofactor biosynthesis protein MoaE n)
_.s 0
mru09532-phospho-L-lactate guanylyltransferase CofC mru1269mo1ybdenum-
pterin binding protein Mop1 co
IV
I
mru1253coenzyme F390 synthetase FtsA1 mru1270m01ybdenum-
pterin binding protein Mop2 o
,
n)
1
mru1787coenzyme F390 synthetase FtsA2 mru1271molybdenum-
pterin binding protein Mop3 i\)
mru0479F420-0:gamma-glutarnyl ligase " mru1272mo1ybdenum-
pterin binding protein Mop4
mru1842F420-0:gamma-glutamyl ligase CofE mru1273mo1ybdenum-
pterin binding protein Mop5
mru1974F0 synthase subunit 1 CofG
mru0128m01ybdopter1n biosynthesis protein MoeA1
mru1266F0 synthase subunit 2 CofH
mru1870molybdopterin biosynthesis protein MoeA2
mru2213fuculose 1-phosphate aldolase FucA
mru0353mo1ybdopter1n biosynthesis protein MoeB
mru06721acta1dehyde dehydrogenase CofA
mru0337m01ybdopter1n cofactor biosynthesis protein A MobA1
oe
mru1844LPPG:F0 2-phospho-L-lactate transferase CofD
mru1277mo1ybdopterin-guanine dinucleotide biosynthesis protein A MobA2 n
mru0335m01ybdopterin-guanine'dinucleotide biosynthesis protein B MobB
Coenzyme rin
N
mru19492-phosphosulfolactate phosphatase ComB Methanefuran
.--,
o
mru1980L-sulfolactate dehydrogenase ComC mru1896L-tyrosine
decarboxylase MfnA .E3
= o
o
...,
c7,
Glutathione Methanopterin
a
, .
'
.
. .
,
,
mru1690beta-ribofuranosylaminobenzene 5'-phosphate synthase MptG
mru00986,7-dimethy1-8- ribityllumazine
synthase RibH 0
mru1962GTP cyclohydrolase MptA
mru1007diam1n0hydroxyph0sph0rib0sy1aminopyrim1d1ne reductase RibD c.)
o
mru1283creat1nine amidohydrolase Ar113 mru1845GTP
cyclohydrolase III ArfA
1--,
--
mru1559ATP:dephospho-CoA triphosphoribosyl transferase CitG
mru2174riboflav1n kinase RibK o
c..3
uri
mru1215riboflavin synthase RibC
c..3
o
Nicotinate
mru0189ATP-NAD kinase Thiamine
mru0675L-aspartate dehydrogenase mru1568cyste1ne
desulfurase NifS
mru1430NAD+ synthetase NadE ,
mru1819hydroxyethylthiazole kinase ThiM
mru1704NADH pyrophosphatase NudC
mru0198hydroxymethy1pyr1m1d1ne transporter CytX
mru1267n1cot1nam1de-nucleotide adenylyltransferase
mru0199phosphomethy1pyrim1dine kinase ThiD1
mru1750nic0t1nate phosphoribosyltransferase .
mru0952ph0sph0methy1pyr1midine kinase ThiD2 a
mru0618n1cotinate-nucleotide pyrophosphorylase NadC mru0494th1amine
biosynthesis ATP pyrophosphatase Thil = c2.
mru0615qu1no1inate synthetase.A protein NadA mru0247th1amine
biosynthesis protein ThiC1 "
-.3
.-.1
mru0444thiamine biosynthesis protein ThiC2
= i.)
IV
Others mru0563thiamine
biosynthesis protein ThiS m
mru07345-formyltetrahydrofolate cyclo-ligase ' , mru1820thiamine
monophosphate synthase ThiE n)
co
mru0557dihydr0pter0ate synthase-related protein mru2193thiamine
monphosphate kinase ThiL cri H
IV
mru1912FeS assembly ATPase SufC mru0227ThiF family
protein 1
o
i.)
mru1911FeS assembly protein SufBD mru0532ThiF family
protein 1
,
i.)
= mru1769nitr0genase cofactor biosynthesis protein NifB
mru0466dinitrogenase iron-molybdenum cofactor biosynthesis protein
Ubiquinone
, mru06222-
polyprenylphenol 6- hydroxylase UbiB1
Pantothenate and coenzyme A mru07462-
polyprenylphenol 6- hydroxylase UbiB2
mru13202-dehydropantoate 2-reductase PanE mru15782-
polyprenylphenol 6- hydroxylase UbiB3
mru1989c0enzyme A biosynthesis bifunctional protein CoaBC mru19692-
polyprenylphenol 6- hydroxylase UbiB4
1-:
mru1224dephospho-CoA kinase CoaE mru12753-polypreny1-
4-hydroxybenzoate decarboxylase UbiX n
,-
mru1010pantothenate kinase CoaA rnru0897SAM-
dependent methyltransferase UbiE family =
mru1225pantothenate synthase PanC mru2187UbiD family
decarboxylase N
mru0829phosphopantetheine adenylyltransferase CoaD
1--,
CD
.-C-3
CD
Riboflavin
.
=
1¨,
o
mru00893,4-dihydroxy-2-butanone 4-phosphate synthase RibB .
o
,
,
,
,
\
=
Table 10. Genome sequences used in this
study o
Organism Genome accession number
w
=
Methanobrevibacter ruminantium M1 CP001719
,--
,-,
¨
Methanobrevibacter smithii PS CP000678
=
u,
Methanobrevibacter smithii F1 ABYV00000000
=
Methanobrevibacter smithii ALI ABYW00000000
' Methanocaldococcus jannaschii JAL-1 L77117
Methanococcoides burtonii ACE-M NC 007955
Methanococcus aeolicus Nankai-3 ' CP000743
Methanococcus maripaludis C5 CP000609
Methanococcus maripaludis C6 CP000867
Methanococcus maripaludis C7 CP000745
a
Methanococcus maripaludis S2 ' BX950229
0
i.,
Methanococcus vannielii SB CP000742
-,
Methanococcus voltae A3 ABHB00000000
m
Methanocorpusculum labreanum Z CP000559
Methanoculleus marisnigri JR1 CP000562
"
8
0
I-.
Methanopyrus kandleri AV19 AE009439
= 0'
i
Candidatus Methanoregula boonei 6A8 CP000780
.0
i.,
i
Methanosaeta thermophila PT CP000477
NJ
.I,
Methanosarcina acetivorans C2A AE010299
Methanosarcina barker! Fusaro CP000099
Methanosarcina mazei strain Goe1 AE008384
Methanosphaera stadtmanae MCB-3 CP000102
Candidatus Methanosphaerula palustris E1-9c CP001338
Methanospirillum hungatei JF-1 CP000254
00
n
Methanothermobacter thermautotrophicus AH NC 000916
1-
Uncultured methanogenic archaeon RC-I AM114193
z
N
Syntrophomonas wolfei subsp. wolfei Goettingen
CP000448 ,--,
=
-
¨
=
=
=
,-,
c,
,
. .
,
Table 11
' = 0
ORF nt aa Annotation
Left Right Classification (,..
number SEQ ID SEQID '
Boundary Boundary.
mru_0001 1 5867 cdc6 family replication initiation_protein_Cdc6-1
1 1167 Chromosomereplication --.
o
mru_0002 2 5868 ABC_transporter ATP-13inding_protein
1659 = 2363 Other (..4
(A
mru_0004 3 5869 adhesin-like_protein
5233 7470 Cellsurfaceproteins (.4
o
. mru_0005 4 5870 aspartate_carbamoyltransferase_pyrB
7745 8704 , P_yrimidine
mru_0006 5 5871 fiavodoxin_domain-containing_protein
8845 9348 , Electrontransport
mru_0007 6 5872 4-
9616 .10422 Tyrosinemetabolism
hydroxyphenylacetate_degradation_bifunctional jsomerase/decarboxylase
HpaG
mru_0009 7 5873 Tlavodoxin_domain_containing_prcitein
12403 13152 Electrontransport
mru_0010 8 5874 ATP_phosphoribosyltransferase_HisG1
13256 14119 Histidine
mru_0011 9 5875 archaeal histone
14771 .14974 DNA-bindingproteins (-)
mru 0012 10 5876 hypothetical_protein
15320 16339 Hypothetical
mru_0013 11 5877 hypothetical_protein
16923 17243 Hypothetical
.
n)
mru_0014 12 5878 ATPase
18027 19490 General
.-.1
mru_0019 13 5879 adhesin-like_protein
26345 27565 Cellsurfaceproteins N)
IV
mru_0020 14 5880 _ adhesin-like_protein_with_cysteine_protease_domain
27726 - 33740 Cellsurfaceproteins m
mru_0021 15 5881 hypothetical_protein =
34167 34724 Conserved n)
mru 0022 16 5882 hypothetical_protein
34825 35079 Hypothetical 8 0
I-.
mru_0024 17 5883 hypothetical_protein .
=
35713 36471 Hypothetical
I
mru_0025 18 5884 hypothetical_protein
36461 37042 . Hypothetical o
n)
mru 0026 19 5885 DNA-cytosine_methyltransferase
37238 38257 Restrictionand modification 1
i\)
mru_0027 20 5886 DNA-cytosine methyltransferase
38270 39298 Restrictionandmodification
mru_0028 21 5887 hypothetical_protein
39319 39492 Conserved
mru_0029 22 5888 5-methylcytosine_restriction_system_component_protein
39723 40475 Restrictionandmodification
,
mru_0034 23 5889 hypothetical_protein
49006 49281 Hypothetical
mru 0035 24 5890 hypothetical_protein
49382 49654 Hypothetical
mru_0037 25 5891 hypothetical_protein ,
51956 53299 Conserved
mru_0038 26 5892 adhesin-like_protein
53569 55485 Cellsurfaceproteins oc
mru_0039 27 = 5893 biotin-
binding_and_phosphotyrosine_protein_phosphatase_domain- 55508
56857 ' Proteininteractions n
- containing_protein
1-3
= mru 0040 28
5894 ATPase 57287 58522 General
mru_0041 29 5895 hypothetical_protein
59627 62761 Hypothetical N
mru_0042 30 5896 hypothetical_protein
62911 63204 Hypothetical 1--,
o
mru_0043 31 5897 hypothetical_protein
63600 64100 Hypothetical =-=:E3
o
mru_0045 32 5898 hypothetical_protein
65659 65964 Hypothetical =
1--,
mru_0046 33 5899 MFS transporter ,
66494 68038 'Other = o=
o
,
, .
=
=
Table 11
0
mru_0047 34 5900 hypothetical_protein
68632 69366 Conserved (,..
mru 0049 35 5901 hypothetical_protein .
73642
75615 Conserved o
I--
mru_0050 36 5902 hypothetical_protein
76353 . 76637 , Hypothetical
=--.
o
mru_0051 37 5903 hypothetical_protein ' ,
76641 76898 Hypothetical
(A
mru_0052 38 5904 glycyl-radical_enzyme_activating_protein
78234 78920 Enzyme = (.4
o
mru_0053 39 5905 hypothetical_protein
78913 79392 Hypothetical
mru_0054 40 5906 hypothetical protein
79764 80315 Hypothetical
mru_0055 41 5907 hypothetical_protein .
80384 80560 Hypothetical
mru_0056 42 5908 hypothetical_protein
80782 81000 Hypothetical
mru_0057 43 5909 phage-related_protein =
81016 83073 _Prophage
mru_0058 44 5910 phage-related_protein
83097 83594 Prophage
mru_0059 45 5911 hypothetical_protein
84290 85312 Hypothetical .
mru_0060 46 5912 hypothetical_protein
85314 85883 Hypothetical a
mru_0061 47 5913 hypothetical_protein
85889 86461 Hypothetical
mru 0062 48 5914 hypothetical_protein
87684 90452 Conserved
_
n)
mru_0063 49 5915 hypothetical_protein
90556 92199 Conserved ...3
.-.1
mru 0064 _ 50 5916 adhesin-like_protein '
92445 95981 Cellsurfaceproteins "
IV
mru_0065 51 5917 NADP-dependent alcohol_dehydrogenase_Adh1
96418 97482 Ethanol "
mru_0066 52 5918 serine_phosphatase
100711 102582 Proteininteractions 1.)
mru_0067 53 5919 anti-sigma_factor_antagonist =
102725 103024 Proteininteractions 8 =
H
co
IV
- mru_0068 54 5920 non-ribosomal_peptide_synthetase = ,
103653 111716 NRPS 1
mru_0069 55 5921 MatE efflux family protein
111817 113277 Other o
n)
1
mru_0070 56 . 5922 hypothetical_protein , =
113316 113552 Hypothetical i\)
mru 0073 57 5923 transcriptional_regulator
119050 120513 Transcriptionalregulators
mru:0074 58. 5924 hypothetical_protein
120856 121383 Conserved
mru 0075 59 5925 ATPase
121796 123025 General
mru_0077 60 5926 adhesin-like_protein =
130541 139702 Cellsurfaceproteins
mru 0078 61 5927 = hypothetical_protein
140711 141559 Conserved
mru_0080 62 5928 hypothetical_protein
146176 147636 General
mru_0084 63 5929 adhesin-like_protein
155468 169945 Cellsurfaceproteins oo
mru_0086 64 5930 adhesin-like_protein .
178972 189147 Cellsurfaceproteins n
mru 0087 65 5931 amidohydrolase .
189666 190988 Enzyme 1-3
mru:0088 66 5932
succinate_dehydrogenase/fumarate_reductase_flavoprotein_subunit_Sdh 191222
192877 TCA = =
N
=
A = o
mru_0089 67 ' 5933 3,4-dihydroxy-2-butanone_4-phosphate synthase
RibB 192985 193626 Riboflavin 1--,
o
mru_0091 68 5934 hypothetical_protein
196709 197092 Hypothetical
o
mru_0092 69 5935 hypothetical_protein .
197361 197597 Hypothetical , =
1..,
mru_0093 70 5936 hypothetical_protein '
198110 198274 Hypothetical c7,
o
.
,
'
, . =
.
,
.
= =
Table 11
0
mru 0094 71 5937 nitroreductase_family_protein
198439 199185 General (,..
o
mru_0095 72 5938
isoprenylcysteine_carbo*_methyltransferase_family_protein
199420 200091 General I--
1--,
mru 0096 73 5939 tRNA-dihydrouridine synthase DusA1
200339 201328 RNAprocessing T.--.
(..4
= mru_0097 74
5940 methanogenesis_marker_protein_11 201646 202584 Methanogenesis (A
(.4
mru_0098 75 _ 5941 6,7-dimethy1-8-_ribityllumazine_synthase_RibH
202829 203260 Riboflavin
mru 0100 76 5942 hypothetical_protein
205116 206303 Conserved
mru_0102 77 5943 hypothetical_protein
207447 208997 Conserved
mru 0103 78 5944 3-isopropylmalate_dehydrogenase_LeuB
209098 210084 Valine/Leucine/lsoleucine
mru 0104 79 5945 3-isopropylmalate_dehydratase_small_subunit_LeuD
210227, 210709 Val ine/Leucine/lsoleucine
mru_0105 80 5946 3-isopropylmalate_dehydratase_large_subunit LeuC
210743 212020 Val ine/Leucine/lsoleucine
mru_0114 81 5947 replication_factor_A
222947 225418 Chromosomereplication
mru 0115 82 5948 DNA_repair and recombination_protein RadA
225619 226554 Recombinationandrepair
mru_0117 83 5949 heterodisulfide_re¨ductase_subunit_A 'FIc-iA
228477 230459 Methanogenesis a
mru_0118 84 5950 hypothetical_protein
231305 231970 Conserved -
mru 0119 85 5951 transposase
232337 233833 Transposase o
n)
' mru_0120 86 5952 hypothetical_protein
233912 234205 Hypothetical ...r
.-.1
mru 0121 87 _ 5953 hypothetical_protein
234264 235262 Conserved . n)
IV
1
mru 0122 88 5954 serine_hydroxymethyltransferase_GlyA
235930 237201 Glycine m
nnru¨_0123 89 5955 archaeoflavoprotein_AfpA
237428 238129 Electrontransport n)
mru_0124 90 5956 hypothetical_protein ,
238278 238679 Conserved Fri 0
H
(0
IV
mru_0125 91 5957 S-adenosylmethionine synthetase_MetK
239166 240374 Methioninemetabolism 1
o
mru_0126 92 5958 isoleucyl-tRNA synthetase IleS
240787 244218 tRNAaminoacylation n) ,
1
mru_0127 93 5959
phosphoribosylformylgtycinamidine2FGAMLsynthase_11_PurL 244346 _ 246556
PurineBiosynthesis r\)
.1,
. mru_0128 94 5960 molybdopterin_biosynthesis_protein_MoeA1
246872 248098 Metal-binding pterin
mru 0129 95 5961 hypothetical_protein
248313 248972 Conserved
mru_0131 96 5962 F420H2 oxidase_FprA2
250758 251984 Oxidativestressresponse
mru_0132 97 5963 transcritional regulator ArsRiamily
252539 253333 Transcriptionalregulators .
mru 0133 98 6964 ATP-dependeFrt_DNA_helicase
253770 255752 Helicase
mru_0134 99 5965 hypothetical_protein
255881 256579 Conserved
mru 0135 100 5966 imidazoleglycerol-phosphate
synthase_cyclase_subunit_HisF 257045 , 257878 Histidine
' mru 0136 101 5967 8-oxoguanine_DNA_glycosylase_Ogg
257979 259055 Recombinationandrepair n
1-
mru_0137 102 5968 D-tyrosyl-tRNA(Tyr)_deacylase
259135 259578 tRNAaminoacylation
mru_0138 103 5969 transcriptional_re_gulator_MarRiamily
259769 260218 Transcriptionalre_gulators
N
,
mru_0141 104 5970 transporter
263405 264421 Other o
1--, ,
mru_0142 105 5971 ' transcriptional_regulator_TetRiamily
264949 265539 Transcriptionalregulators o
mru 0144 106 5972 hypothetical_protein
269419 269556 Hypothetical
o
mru¨_0145 107 5973 peptidyl-prolyl_cis-trans_isomerase
269994 270422 Proteinfolding o
1..,
o
mru_0146 108 5974 hypothetical_protein
270656 270931 Conserved
,
.
.
' .
,
. .
,
Table 11
= 0
mru 0148 109 5975 oxidoreductase_aldo/keto reductase_family
271427 272467 Enzyme (.4
,
mru_0149 110 5976 acetylomithine_aminotran-sferase_ArgD
272695 . 273870 Arginine =
1--
mru_0151 111 5977 NUDIX_domain-containing_protein
274857 275294 Enzyme 0-
o
mru 0152 112 5978 diaminopimelate_decarboxylase_LysA
275575 276858 Lysine (..4
(A
_
'
mru_0153 113 5979 diaminopimelate_epimerase_DapF
276920 277783 Lysine (.4
o
mru_0154 114 = 5980 hypothetical_protein
278000 278170 Hypothetical
mru_0155 115 5981 iron-sulfur_cluster binding_protein
278282 279010 Electrontransport
mru_0156 116 5982 hypothetical_protein
279083 279733 Conserved
=
mru 0157 117 5983 SAM-dependent_methyltransferase_HemK-related
279775 280359 RNAprocessing
mru_0158 118 5984 dimethyladenosine_transferase_KsgA
280448 281344 RNAprocessing .
mru 0159 119 5985 RNA-binding_protein
281498 282247 Other
mru_0161 120 5986 DNA-directed RNA_polymerase_subunit_F_RpoF
286317 286661 RNApolymerase
mru 0162 121 5987 ribosomal_prcTtein L21e_Rp121e
286702 286992 Ribosomalproteins
mru 0163 122 5988 pseudouridylate_synthase
287415 288683 Other
'
mru 0164 123 5989 signal_recognition_particle_SRP54_protein
288909 290243 Proteinsecretion 0
n)
mru 0165 124 5990 adenine_phosphoribosyltransferase Apt
290599 _ 291165 PurineBiosynthesis ...r
.-.1
mru_0166 125 5991 diphthamide biosynthesis protein
291370 . 292377 Translationfactors n)
IV
mru_0168 126 5992 exosome_complex_RNA-binding_protein CsI4
293311 293877 RNAprocessing m
mru_0169 127 5993 DNA-directed_RNA_polymerase_subunit-L RpoL
293988 294266 RNApolymerase
mru_0170 128 5994 NUDIX_domain-containing_protein
294412 294828 Enzyme = 0 0
0
I-.
.
mru_0171 129 5995 transcription_factor_S_Tfs2
295416 295811 = Translationfactors IV
I
mru_0173 130 5996 DNA_polymerase_sliding_clamp_subunit_PCNA_family_fcn
298072 298806 Chromosomereplication o
n)
1
mru_0174 131 . 5997 hypothetical_protein =
299016 300629 Conserved i\)
mru_0175 132 5998 ribosomal_protein_L44e_Rp144e
301053 , 301331 Ribosomalproteins
mru 0176 133 5999 ribosomal protein_S27e Rps27e
301342 301521 Ribosomalproteins
,
mru_0177 134 6000 translation initiation_fact-or_aIF-
2_alpha_subunit 301686 _ 302468 Translationfactors
=
mru_0178 135 6001 H/ACA_RNTA-protein_complex_component_Nop1Op
302480 302650 RNAprocessing
mru 0179 136 6002 hypothetical protein
302878 303666 Conserved
. .
mru_0180 137 6003 translation-associated GTPase
303960 305147 Other
mru_0181 138 6004 methanogenesis marler_protein_13
305221 306339 Methanogenesis
oc
mru_0182 139 6005
imidazole_glycerol_phosphate_synthase_glutamine_amidotransferase_su 306461
307054 Histidine n
=
bunit_HisH 1-3
mru_0185 140 6006 hypothetical_protein
308758 309450 Conserved
N
mru 0186 . 141 6007 hypothetical_protein
309475 310473 Conserved o
mru_0187 142 6008 carbon starvation_protein_CstA
310681 312087 Other 1--,
o
mru_0188 143 6009 4Fe-4S-Jerredoxin_binding_domain-containing_protein
312184 312429 'Electrontransport
o
mru_0189 144 6010 ATP-NAD_kinase , '
312571 313353 Nicotinate =
1..,
_ _mru0190 145 6011 hydrogenaseexpreasion/formation_protein_HypE
313635 315005 Hydrogenmetabolism c7,
.
o
= -
=
Table 11
. 0
mru_0191 146 6012 pyridoxamine_5'-phosphate_oxidase_family_protein
315146 315559 General (,..
o
mru 0192 147 6013
hypothetical_protein 315745 316140 Hypothetical
1--
0-,
mru:0193 148 6014 hypothetical_protein
316363 316521 Hypothetical ,
o
(..4
mru_0194 149 6015 HEAT_repeat-containing_protein
316655 317293 General (A
(.4
mru_0195 150 6016 SAM
dependent_methyltransferase ' 317628 318320 Enzyme
mru 0197 151 6017
tRNA¨_pseudouridine_synthase_D_TruD 319509 320867 RNAprocessing
mru:0200 152 6018
molybdate_ABC transporter_permease_protein_ModB 323610 324284 Metal-
bindingpterin
mru_0202 153 _ 6019
hypothetical_protein 326180 ' 326569 Conserved
mru_0203 154 6020 am
idohydrolase 326656 327894 Enzyme
mru_0204 155 6021
hypothetical_protein 328023 328271 , Hypothetical
mru 0205 156 6022
copper ion binding_protein 328483 328689 Cations*
mru:0207 157 6023
potassium_uptake_protei n_TrkH_fam ily 331586 . 333064 Cations
mru_0208 158 6024
anthranilate synthase_component_l_TrpE 333446 335101 Tryptophan
a
mru 0209 159 6025
anthranilate synthase component ll TrpG 335098 335724 Tryptophan
.
mru_0210 160 6026 anthranilate_phosphoribosyltransferase_TrpD
336844 336851 Tryptophan n)
-.3
mru 0211 161 6027
indole-3-glycerol_phosphate_synthase TrpC 336920 337696 Tryptophan
.-.1
NJ
mru:0212 162 _ 6028 _ _phosphoribosylanthranilate_isomerase TrpF
337750 338403 Tryptophan IV
1\)
mru 0213 163 6029
tryptop han_s_ynthase_beta_subunit_Trp B1 338576 339760 Tryptophan
mru 0214 164 6030
tryptophan_s_ynthase_alpha_subunit_TrpA , 339810 340592
Tryptophan K,
mru_0215 ' 165 6031
MFS_transporter 340962 342350 Other o
_
mru_0217 166 = 6032
transcriptional_regulator MarR _family 343985 344398
Transcriptionalregulators
I
'
o
mru 0218 167 6033 uracil-
DNA_glycosylase_ng . 344624 _ 345307
Recombinationa nd repair n)
1
mru_0219 168 6034 NAD P
H-de pen dent_FM N red uctase 345414 346145 Electrontransport
i\)
mru_0220 169 6035
transcriptional_regulator TetRiamily_with_acetyltransferase_GNAT_famil
346214 347557 Transcriptionalregulators
_y_domain
mru_0221 170 6036 Fic_family_protein
347969 348997 General
,
mru_0226 171 6037 hydrolase_TatDiamily
357833 358582 Enzyme
mru_0227 172 6038 ThiFlamily_protein "
358799 359515 Thiamine
mru_0228 173 6039
pyridoxamine_5'-phosphate_oxidase_family_protein 359565 _ 360035
General
mru 0229 174 6040
adenylosuccinate_lyase_PurB , 364141 365490
PurineBiosynthesis oci
mru_0230 175 6041 CAAX_amino_terminal_protease_family_protein
365925 366731 General n
1-
_
mru_0232 176 6042 transcriptional_regulator
368045 368287 Transcriptionalregulators
mru_0233 177 6043 hypothetical_protein
368567 369196 Conserved N
mru_0236 178 6044 hypothetical_protein
371322 371705 Hypothetical "
1--,
mru 0237 179 6045
CAAX_amino_terminal_protease_family_protein 372072 372848 - General =
.j(E3
mru 0240 180 6046
DNA_polymerase_large subunit_DP2_PolD 374651 378214
Chromosomereplication
o
mru:0241 181 6047
anaerobic_ribonucleosid¨e-triphosphate_reductase_NrdD 378831 381173 I
nterconversion c7,
mru_0242 182 6048 lysyl-
tRNA_synthetase_LysS 382020 383606 tR NAam inoacylation
,
= ,
,
=
=
Table 11
0
mru_0243 183 6049 hypothetical_protein
, 383998 384858 Conserved ' = (,..
mru 0244 184 6050
hypothetical_protein 385246 385485 Hypothetical
1¨=
_
0-
mru_0246 185 6051
carbohydrate_kinase_PfkBiamily 388763 389680 Enzyme . --
o
(..3
mru_0247 186 6052 thiamine_biosynthesis_protein_ThiC1 390266
391564 Thiamine (A
(.4
mru 0249 187 6053
transcriptional regulator LysR_family 393249 394178
Transcriptionalregulators
mru 0250 188 6054 3-
hexulose-6-phosphate isomerase Phil 394407 394997
ribulosemonophosphatepathway
mru_0253 189 6055 A BC_tra
nsporter ATP-b-inding_protein 397828 398658 Other _
mru 0254 190 6056
tungsten_formylmethanofuran_dehydrogenase_subunit_E_FwdE 398918 399490
Methanogenesis
mru_0256 191 6057 phage
integrase 404653 405993 Prophage
mru 0258 192 6058
hypothetical protein 406681 407565 Prophage
mru_0261 193 6059
hypothetical_protein = 409761 409979
mru 0264 194 6060 hypothetical_protein =
410474 410698 Prophage
_
mru_70270 195 6061 phage-related_protein 413337
413771 Prophage (-)
mru_0271 196 6062 hypothetical_protein 414001
414420 Prophage
mru_0272 197 6063 hypothetical_protein 414417
414692 Prophage (D
n) _
mru_0276 198 6064 hypothetical_protein 415532
415867 Prophage
.-.1
mru_0278 199 , 6065
hypothetical_protein 416905 417141 Prophage
n)
I.)
mru_0279 200 6066
hypothetical_protein 417125 * 417463 Prophage =
n)
mru_0280 201 6067 ParB-like_nuclease_domain-containing_protein 417460
418080 Prophage 1.)
_
m 0
mru_0281 202 6068 hypothetical_protein 418083
418283 Prophage
mru 0283 203 6069
hypothetical_protein 418820 419476 Prophage
1
mru_0285 204 6070
terminase_large subunit 420081 421556 Prophage
o
n)
1
mru_0286 205 6071 hypothetical_protein 421559
422884 Prophage i\)
.1,
mru_0287 206 6072
phage_portal_protein ' 422996 424654 , Prophage
mru 0288 207 6073 phage-
related_protein 424560 426887. Prophage
mru 0291. 208 6074
hypothetical_protein 428491 428781 Prophage .
mru_0292 209 6075 hypothetical_protein 428769
429098 Prophage
mru_0293 210 6076 hypothetical_protein 429095
429661 Prophage _
mru 0295 211 6077
hypothetical protein 429881 430162 Prophage
mru 0297 * 212 6078 ,
hypothetical_protein 430558 430755 Prophage oe
mru10301 213 6079 hypothetical_protein 431507
431695 Prophage n
1-
mru 0303 214 6080
hypothetical_protein 431961 432167 Prophage
mru¨__0306 215 6081 hypothetical_protein , 434960
435655 Prophage
N
mru_0309 216 6082 hypothetical_protein . 438005
438409 Prophage o
1--,
mru_0315 217 6083
phage_tattape_measure_protein 441704 445003 . Prophage _o
mru_0318 218 6084 hypothetical_protein 451366
452883 Prophage -4(E=5
o
mru_0320 219 6085
endoisopeptidase , 453037 453723 Prophage o
1..,
o
mru_0325 220 6086 hypothetical_protein 466396
466656 Prophage
,
,
Table 11
0
mru_0328 221 6087 hypothetical_protein 472432
473115 Conserved (,..
o
mru_0333 , 222 6088 formate_dehydrogenase_alpha subunit_FdhA1 .
478887 480938 Formate I--
1--,
mru_0334 223 6089 formate dehydrogenase beta_subunit_FdhB1 '480962
, 482170 Formate =¨.
o
(..4
mru_0335 224 , 6090 molybdirTpterin-
guanine_dinucleotide_bios_ynthesis_protein_B_MobB 482709
. 483416 Metal-bindingpterin (A
(.4
mru_0336 225 6091 hypothetical_protein . 483636
483821 Hypothetical
mru_0337 226 6092 molybdopterin_cofactor_biosynthesis_protein A_MobA
483924 484859 Metal-bindingpterin
mru_0338 227 6093 adhesin-like_protein 485468
492397 Cellsurfaceproteins '
mru_0346 228 6094 hypothetical_protein . 500055
500357 Conserved
mru 0347 229 6095 hypothetical_protein 500569
500871 Conserved
' mru_0348 230 6096 molybdate_ABC_transporter ATP-binding_protein_ModC
501229 502266 Metal-bindingpterin .
mru_0349 231 6097 transcriptional repressor_of nif and_glnA_operons_NrpR
502322 504016 glutamate/glutamine
mru 0350 232 6098 glutamine syrihetase_GInA1 504244
505590 glutamate/glutamine
mru:0351 233 6099 non-ribosomal_peptide_synthetase 506386
518949 NRPS a
mru_0353 234 6100 molybdopterin biosynthesis_protein_MoeB 520892
521641 Metal-bindingpterin
0
mru_0354 235 6101 oxidoreductase_aldo/keto_reductase_family 521883
523052 Enzyme n)
...3
mru_0355 236 6102 transposase 523519
525015 Transposase n)
mru_0356 237 6103 hypothetical_protein 525144
525389 Hypothetical IV
I\ )
mru_0357 238 6104 , hypothetical_protein '
525428 525697 Hypothetical
mru_0359 239 6105 transcriptional regulator Hx1Riamily 526034
526420 Transcriptionalregulators n)
Iv 0
mru_0360 240 6106 cobyrinic_acid¨a,c-diamide_synthase_CbiA1 526596
527396 Cobalamin o H' _ - co IV
mru_0362 241 6107 4Fe-4Sierred¨oxin_binding_domain-containing_protein
528281 528499 Electrontransport '
o
mru_0363 242 6108 flavodoxin 529515
529922 Electrontransport "
mru_0364 . 243 6109 NADPH-dependent_FMN reductase _
529949 530494 Electrontransport i)
mru_0365 244 ' 6110 transcriptional regulator -
ArsR_family , 530661 531362 Transcriptionalregulators
mru_0366 245 6111 ABC_transporr_ATP-binding/permease_protein 532047
533942 Other
mru_0367 246 6112 hypothetical_protein 534132
535175 Conserved
mru_0368 247 6113 hypothetical_protein 535404
535538 Conserved
_
mru_0369 248 6114 transporter_small_multidrug_resistance _(SMR) _family
535572 535904 Other .
mru_0370 249 6115 transcriptional regulator_HxIR family 535901
536287 Transcriptionalregulators
mru_0371 250 6116 cobyrinic_acidla,c-diamide_synthase CbiA2 536463
537263 Cobalamin
mru_0372 251 6117 hypothetical_protein 537309
538136 Conserved n
1-
mru 0373 252 6118 4Fe-4S_ferredoxin_binding_domain-containing_protein
538148 538366 Electrontransport
= mru_0374 253 6119
hypothetical_protein 538541 538657 ' Hypothetical N
mru_0377 254 6120 hypothetical_protein 542061
542888 Conserved
1--,
mru_0379 255 6121 MFS_transporter ' 543953
545473 Other =
'a
mru_0380 256 6122 S-adenosyl-L-homocysteine_hydrolase_AhcY 546317
, 547567 Methioninemetabolism
o
mru_0381 257 6123 hypothetical_protein 547788
548453 Conserved 1..,
c7,
mru_0382 258 6124 flap_endonuclease_Fen . .
548601 549587 = Chromosomereplication
=
' .
, ,
.
' .
=
. .
Table 11
,c)
mru 0383 259 6125
hypothetical_protein . 549680 550243 Conserved w
. _
o
mru 0384 260 6126
hornoaconitase_large_subunit_AksD 550345 , 551598 CoenzymeB I--
1--
mru_0385 261 6127 homocitrate_synthase_AksA 551891
553069 CoenzymeB
k..i
mru 0386 262 6128 adenylate
cyclase_CyaA 553252 553797 Purineinterconversions
uri
(.4
mru_0387 263 = 6129 TATA-
box_binding_protein Tbp 554216 554761 Transcriptionfactors =
mru_0388 264 6130
phosphoserine_phosphatase_SerB 554775 556838 Serine '
mru 0389 265 6131
hypothetical protein _ 557325 557570 Conserved ,
mru_0390 266 6132 DNA _topoisomerase_l_TopA 557952
560177 Genomesegregation
_
mru 0392 267 6133 GTP-
binding_protein 564189 ' 565283 General
mru 0393 268 6134
hypothetical_protein 565513 566115 Conserved
mru_0394 269 6135
ribonuclease_P_subunit_RPR2 566381 566752 RNAprocessing = , .
mru_0395 270 - 6136 RNA-
binding_protein 566766 566981 Other
mru 0396 271 6137
ribosomal_protein_S19e_Rps19e 567064 567504
Ribosomalproteins a
mru 0397 272 6138 DNA-
binding_protein 567592 567963 DNA-bindingproteins
0
mru 0398 273 6139
tRNA_methyltransferase subunit 568024 568605 Other n)
-.3
mru 0399 274 6140
ribosomal_protein_L39e_Rp139e 568751 568906
Ribosomalproteins .-.1
NJ
mru_0400 275 6141
ribosomal_protein_L31e_Rp131e 568918 = 569178
Ribosomalproteins I.)
m
mru 0401 276 _ 6142
translation_initiation_factor alF-6 569394 570071 Translationfactors
mru¨_0402 277 6143
ribosomal_protein L.X_RplX = 570230 570454
RibosomalprOteins 1.)
iv
0
mru_0403 , 278 6144
prefoldin_alpha_sbunit_PfdA , 570528 570965 Proteinfolding CD
H
4
IV
I mru_0404 ' 279 6145 signal
recognition_particle_receptor FtsY 571376 573106 Proteinsecretion
o
mru_0407 280 6146 Na+-
d¨ependent transporter_SNF family 577716 580364 Other n)
1
mru 0409 281 6147
hypothetical_protein = 580357 581088 Hypothetical
i\)
mru_-0410 282 6148
acetolactate_synthase_lame_subunit_IlvB 581293 ' 583029 =
Valine/Leucine/lsoleucine
,
mru 0411 283 6149
hypothetical_protein 583065 583412 Hypothetical
mru_0413 284 6150
hypothetical protein 584469 584984 Conserved
mru_0414 285 6151
aminotransferase class_V family 585286 586452 Serine ,
mru 0415 286 6152
calcineurin-like_pFiosphoesterase 586854 587954 Enzyme _
= =
mru_0416 287 6153 oxidoreductase_aldo/keto_reductase_family 588089 589354
Enzyme .
mru_0417 '288 6154 adhesin-like_protein 589570
593877 Cellsurfaceproteins
mru_0419 289 , 6155 adhesin-
like_protein 595890 , 600065 Cellsurfaceproteins =
n
1-
mru_0420 290 6156 5'-nucleotidase_SurE2 S= 600503
601291 Interconversion ,
mru 0421 291 6157
hypothetical_protein 601455 602.189 Conserved
N
mru_0422 292 6158 acetyltransferase 602292
602918 Enzyme o
mru_0423 293 6159 cdc6 Jamily_replication_initiation_protein_Cdc6-2 603216
604334 Chromosomereplication
,
mru 0424 294 6160
hypothetical_protein 604999 605190 Hypothetical
o
,
o
mru_0425 295 6161 hypothetical_protein 605453
605926 Conserved 1--,
o ,
mru_0426 296 6162 hypothetical_protein 606302
607750 Conserved . _
' ,
,
,
, ,
= =
,
=
=
Table 11
,
0
mru_0427 297 6163 methylthioadenosine_phosphorylase_MtnP
607898 608671 Methioninemetabolism ),-)
mru_0429 298 6164 hypothetical_protein
610289 610681 Hypothetical =
mru 0430 299 6165 transcriptional_regulator
610792 610995 _ Transcriptionalregulators -- 1--,
,
o
mru_0431 300 6166 hypothetical_protein
611233 611532 Conserved k..)
urr
mru_0432 301 6167 hypothetical_protein
612054 613121 Hypothetical r.4
o
mru_0435 302 6168 cell division_control_protein_Cdc48
= 616786 618993 Celldivision
mru_0436 303 6169 met-hanogenesis_marker protein 8
, 619536 620375 Methanogenesis
mru_0437 304 6170 queuosine_biosynthesis_protein -QueC
620647 621282 RNAprocessing
mru_0438 305 6171 queuosine biosynthesis_protein QueD '
621364 621723 RNAprocessing
mru 0439 306 6172 7-cyano-7:deazaguanosine_biosynthesis_protein_QueE
621817 622425 RNAprocessing
mru_0440 307 6173 universaLstress_protein UspA2
622616 623038 Stressresponse
mru_0442 308 6174 , transcriptional_regulator_-MarR_family =
624493 624942 Transcriptionalregulators
mru 0444 309 6175 thiamine_biosynthesis_protein_ThiC2
626739 628013 Thiamine
(-)
mru_0445 310 6176 ATP-dependent_DNA Jigase_DnI1
628105 629757 Chromosomereplication
mru_0447 311 6177 hypothetical protein
630789 631001 Hypothetical 0
n)
mru_0448 312 ' 6178 hypothetical_protein
631026 631259 Conserved ...3
.-.1
mru_0449 313 6179 phosphoglucosamine_mutase_GlmM2
631444 632808 Pseudomureinbiosynthesis n)
IV
mru_0450 314 6180 adhesin-like_protein "
632975 633778 Cellsurfaceproteins m
mru 0451 315 6181 adhesin-like_protein
633782 634495 Cellsurfaceproteins N) n)
mru 0452 316 6182 TPR_repeat-containing_protein
634908 636044 Proteininteractions o 0
cri
I-.
mru 0453 317 6183 pyruvate formate-Iyase-activating enzyme PflA1
636319 637041 Formate IV
I
mru_-_0454 318 6184 histidinol-phosphate_aminotransferase_HisC
637168 , 638292 Histidine o
n)
'
mru_0455 319 6185 acetyltransferase
638616 639092 Enzyme i\)
,
mru_0456 320 6186 UDP-N-acetylglucosamine_diphosphorylase/glucosamine-1-
phosphate_N- 639247 640566 Pseudomureinbiosynthesis
acetyltransferase_GlrnU
mru _ 0457 321 6187 rubredoxin _Rub1
640906 641067 Oxidativestressresponse
mru_0458 322 6188 phosphoglucosamine_mutase_GImM1
641356 642702 Pseudomureinbiosynthesis .
mru_0459 323 6189 2,3-bisphosphoglycerate-
independent_phosphoglycerate_mutase_ApgM1 642725 643963 Gluconeogenesis
mru_0461 324 6190 ribosomal_protein S3Ae Rps3ae
646473 = 646051 Ribosomalproteins
mru 0462 325 6191 glutamine_amidotransferase
646804 647607 Glutathionemetabolism oc
. mru_0463 326 6192 gamma-glutamylcysteine_synthetase_GshA
647633 648358 Glutathionemetabolism n
mru_0464 327 6193 gamma-glutamylqysteine_synthetase_GshA
648376 649029 Glutathionemetabolism 1-3
,
mru_0465 328 6194 bifunctional_glutamate-
cysteine_ligase/glutathione_synthetase_gshF 649142 650566
Glutathionemetabolism
0 N
mru_0466 329 6195 = dinitrogenase iron-
molybdenum_cofactor_biosynthesis_prolein 650676 651014
Others/Fixation -\= o
mru_0467 330 6196 voltage gates chloride_channel protein
' 651305 652876 Anions 1--,
o
mru_0468 331 6197 prephenate_dehydrogenase_TyrA1
653012 654319 Phenylalanine/Tryosine --..
=
=
o
mru_0469 332 6198 cell_division_protein_pelota_PelA
654502 655563 Translationfactors =
1-,
mru 0470 333 6199 hypothetical_protein
' 655751 656707 Conserved c7,
o
= .
, . =
,
=
,
' ,
,
_______________________________________________________________________________
______________________________
Table 11 = 0
=
mru 0471 334 6200 nitroreductase_family jyotein
656883 657428 General (.4 o
mru_0472 335 6201 glutathione-disulfide reductase_Gor1
657644 659017 Glutathionemetabolism I--
1--
mru 0473 336 6202 hyd rogenase_assenbly_chaperone_HypC
659350 659616 Hyd rog enmetabol ism --
o
mru_0474 337 6203 HD_domain-containing_protein
659799 660293 General (..3
(A
(.4
mru_0475 338 6204 RNA-binding_protein
660527 661435 _ Other
mru 0476 339 6205 hypothetical_protein
661753 662040 Conserved
mru_0477 340 6206 ACT_domain-conta in ing_protein
662534 663037 General
mru_0478 341 6207 transcription_initiation_factor_TF I lE_alpha
subunit_Tfe 663461 664534 Translationfactors
mru 0479 342 6208 F420-0:gamma-glutamyl li_gase
664675 665577 CoenzymeF420
mru 0480 343 6209 hypothetical_protein .
665869 '666438 Conserved
mru_0481 344 6210 cell_division_protein FtsZ
666972 668141 Celldivision
mru 0483 345 6211 ribosomal_protein J.:2-4 _family
669199 669690 Ribosomal proteins
mru_0484 346 6212 ribosomal_protein_L11P Rp111p
669690 670172 Ribosomalproteins
L1
a
mru_0485 347 6213 ribosomal_protein P-
pl1p 670729 671367 Ri bosomal prote ins
0
mru 0486 348 6214 acidic_ribosomal_protein PO Rp1P0
671369 672376 Ribosomal proteins iv
-.3
mru_0487 349 6215 ribosomal_protein_L12P_Rp112p
672496 672807 Rib oso mal proteins .-.1
. NJ
_ mru_0489 350 6216
hypothetical_protein 673968 674573 Hypothetical
'
iv =
iv
mru 0492 351 6217 alanyl4RNA_synthetase_AlaS
676316 679048 tRNAam inoacylation =
mru_0493 352 6218 adhesin-like_protein
679119 681566 Cellsurfaceproteins iv
ry
0
mru 0494 353 6219 thiamine_biosynthesis_ATP_pyrophosphatase_Thil
681920 683080 Thiamine
mru_0498 354 6220 fructose_1,6-bisphosphatase_F bp _
685591 686691 ' Gluconeogenesis 1
o
mru_0500 355 6221 acetyltransferase_GNAT _family 687305 687817 Enzyme iv
1
mru_0501 356 6222 hypothetical_ protein
687991 688182 , H_ypothetical iv
.1,
mru_0502 357 6223 hypothetical_protein
688561 _ 688803 Conserved
, mru 0504 358 6224 Sua5/YciO/YrdCNwICiamily_translation_factor
689684 690349 Other
mru_-0505 359 6225 DNA_repair_and_recombination_protein RadB
690705 = 691400 Recombinationandrepair .
= mru 0506 360
6226 aspa rtate_a m inotransferase 691872 693032 General
mru 0507 361 6227 ATPase_RIL
693672 695447 Other
mru_0508 362 6228 peptidyl4RNA_hydrolase
695860 696198 Other
mru 0509 363 6229 hypothetical_protein
696373 697188 Hypothetical 0:
mru_0511 364 6230 hydrolase_alpha/beta_fold family
698621 699535 Enzyme n
1-
mru_0513 365 6231 anti-sigma_factor_antagonist
701112 701411 Proteininteractions
N
mru 0514 366 6232 4'-phosphopantetheinyl_transferase_family_protein
701578 702210 Proteininteractions
. mru-_0515 367 6233 serine_phosphatase
702357 704231 Proteininteractions o
1--,
mru 0516 368 6234 anti-sigma_regulatory_factor
serine/threonine_protein_kinase 704367 704744 P rotein
interactions = o
mru-_0517 369 6235 TfoX_N-term inal_doma in-contain ing_protein
705211 705528 General
o. ,
mru_0518 370 6236 delta 1-pyrroline-5-carboxylate_synthetase
706014 706484 Proline o
1..i
mru_0519 371 6237 RNA7binding_protein
706654 706815 Other o
=
'
.
. ,
,
. Table 11
0
mru_0520 372 6238 translation_elongation_factor_aEF-1 beta
706882 707151 Translationfactors w
- o
mru_0522 373 6239 type_ll_secretion_system_protein_E:GspE
709362 710876 Proteinsecretion 1--
,
mru 0523 374 6240 hypothetical_protein
711082 711633 Hypothetical 1--
o
mru 0524 375 6241 , type Il_secretion_system_protein_F_GspF1
711663 712258 Proteinsecretion k..3
uri
mru:0525 376 - 6242 arch-aeal_fibrillarin-like_protein
712742 714250 RNAprocessing c..,
mru 0527 377 6243 biotin_synthase BioB2 =
716519 717520 Biotin
mru:0528 378 6244 hypothetical_protein
717735 718208 Conserved
,
mru_0530 379 6245 hypothetical_protein
718686 719126 Conserved
mru 0531 380 6246 NIF3 family_protein
719204 719932 General _
mru:0532 381 6247 _ ThiF-family_protein
720140 720784 Thiamine
=
mru_0533 382 6248 hypothetical_protein
720848 721456 Conserved
mru_0534 383 6249 ATPase _
721635 723803 General
mru_0536 384 6250. ferrous iron_transport_protein_A_FeoA
724939 725172 Cations
a
mru_0538 386 6251 hypothetical_protein
727815 728012 Conserved
,
mru 0539 386 6252 , cobalamin_biosynthesis_protein_CbiM1
728946 729572 Cobalamin o
iv
mru_0542 387 6253 hypothetical_protein =
732212 733561 Conserved
.-.1 ,
mru 0544 388 6254 ferredoxin
734575 734736 Electrontransport iv
iv
mru:0546 389 6255 transcription factor_S_Tfs1
736747 = 737064 Translationfactors m
.1,
mru 0547 390 6256 CAAX amino_terminal_protease_family_protein
737337 738230 General iv iv
mru_0548 391 6257 pyruve_ferredoxin_oxidoreductase_gamma
subunit_PorC 738681 ' 739202 Acetate ci o
--4
I-.
mru_0549 392 6258 pyruvate_ferredoxin oxidoreductase_delta_su-
bunit_PorD 739233 " 739475 Acetate T
= , . mru_0550 393
6259 pyruvate_ferredoxin oxidoreductase_alpha_subunit_PorA 739538
740686 - Acetate o
iv
i
mru_0551 394 6260 pyruvate_ferredoxin_oxidoreductase_beta_subunit_PorB
740688 741654 Acetate iv
mru_0552 395 6261 pyruvate_ferredoxin_oxidoreductase-associated_PorE
741567 _ 742067 Acetate
mru_0553 , 396 6262 pyruvate_ferredoxin_oxidoreductase-associated_PorF
742078 _ 742506 Acetate
'
mru_0554 397 6263 hypothetical_protein
742914 743348 Conserved
mru 0555 398 , 6264 hypothetical_protein
743380 743634 Hypothetical
mru:0556 399 6265 fumarate_hydratase FumA1
743916 , 744755 TCA
mru_0557 400 6266 dihydropteroate_synThase-related_protein
744873 746462 Other ,
mru_0560 401 , 6267 ATPase
748964 749836 General
0:
mru_0561 402 6268 = von Willebrand_factor_type_A_domain-
containing_protein 749949 751109 General n mru_0562
403 6269 PP-loop_family_protein 751292 , 752209 General 1-3
mru 0563 404 6270 N
thiamine biosynthesis_protein_ThiS 752368 = 752568 Thiamine
mru_0564 405 '6271 hypothetical_protein
753000 753719 Conserved o
mru_0565 406 6272 hypothetical_protein
753811 754635 Conserved i--,
mru 0566 407 6273 cytidyltransferase-related_domain-
containing_protein 754687 755985 General CE3
_ o
mru_0568 408 6274 = radical SAM_domain-containing_protein
756834 757922 Enzyme o
mru_0570 409 6275 hypothetical_protein
766871 767080 Hypothetical cr=
,
,
. =
'
,
Table 11 0 mru_0572 410 6276
hypothetical_protein 767428 767910 Conserved (,..
o
mru 0573 411 6277 hypothetical protein
767979 768230 Conserved I--
1--,
mru 0574 412 6278 acetyltransferase GNAT _family
768688 769143 Enzyme --
o
mru_0575 413 6279 hypothetical_prot-ein
769318 769977 Conserved (..3
(A
(.4
' mru 0576 414 6280 sugar_fermentation stimulation_protein_SfsA1
769991 770740 Other
mru_0577 415 6281 transcriptional_reguTator_TetR _family
770737 771246 Transcriptionalregulators
mru 0579 416 6282 oxidoreductase_aldo/keto_reductase_family
772598 773587 Enzyme
'
mru 0580 417 6283 NADPH-dependent_FMN_reductase
773719 774252 Electrontransport .
mru_0581 418 6284 hypothetical_protein
774946 776346 Conserved
mru_0582 419 6285 phosphate_uptake_regulator_PhoU
776354 777247 Proteininteractions
mru 0583 420 6286 DNA_misrnatch repair_ATPase_MutS _family
777975 779909 Recombinationandrepair
_
mru_0584 421 6287 , peptidase_U32 ¨family
780263 783316 Proteindegradation
mru_0585 422 6288 peptidase U32 _family
783591 . 786176 Proteindegradation a
mru 0586 423 6289 thymidylat¨e kinase_Tmk1
786457 787047 . Pyrimidineinterconversion
_
mru_0587 424 6290 hypothetical_protein
787184 787390 Conserved 0
n)
-.3
mru 0588 425 6291 tyrosyl-tRNA_synthetase_TyrS
787499 = 788464 tRNAaminoacylation .-.1
NJ
. mru_0589 426 6292 NMD3
family_protein 788586 789626 RNAprocessing IV
I\ )
mru 0590 427 6293 translalon initiation_factor_aIF-2 beta_subunit
789772 790179 Translationfactors .1,.
mru:0591 428 6294 replicative¨DNA helicase_Mcm
790714 792711 Chromosomereplication n)
_
m 0
mru_0593 429 6295 ribosomal ¨RNA large_subunit_methyltransferase J
RrmJ 793733 794374 RNAprocessing
CO
IV
I
mru_0594 430 6296 phosphodiesterase_MJ0936 _family
794532 795071 Enzyme o
mru_0595 431 6297 5-forma minoim idazole-4-carboxam ide-1-(beta)-D-
_ribofuranosyl_5.- 795397 796488
PurineBiosynthesis n)
1
monophosphate-formate ligase PurP
i\)
.1,
mru 0597 432 6298 hypothetical_protein
797740 798114 - Conserved
mru_0600 433 6299 DEAD/DEAH box helicase_domain-containing_protein
799823 802420 Helicase .
mru_0601 434 6300 HEAT_repeal=conTaining_protein
802736 803587 General
mru 0602 435 6301 hypothetical_protein
803901 _ 804707 Hypothetical
. mru_0603 436 6302 N-carbamoyl-
D-amino_acid_amidohydrolase_AguB 804807 805652 Polyamines
mru_0604 437 6303 hypotheticaLprotein =
805871 807049 Hypothetical
_
' mru 0605 438 6304
hypothetical protein 807448 807948 Conserved
mru_0606 439 6305 NAD-dependent_protein_deacetylase
808020 808745 DNA-bindingproteins n
_
1-
mru_0608 440 6306 hypothetical_protein
809357 810139 Conserved
mru_0610 441 6307 amidohydrolase
812097 812864 Enzyme
N
mru_0611 442 6308 homoserine_O-acetyltransferase_MetX1
813003 814001 Methionine o
1--,
mru_0612 443 6309 acetyltransferase_GNAT_family
814108 , 814512 Enzyme o
' mru_0613 444 6310 hypothetical_protein
814647 814931 Conserved
o _
o
mru 0615 445 6311 quinolinate synthetase A_protein_NadA
816140 817057 Nicotinate 1--,
o
mru_061 7 446 6312 ribonuclease_Z_Rnz
818198 ,819109 RNAprocessing
= =
,
'
,
,
.
,
=
Table 11 '
C
mru_0618 447 6313 n icoti nate-nu cleotide_pyrophosphorylase_NadC =
819384 820214 Nicotinate 4(Y,
mru_0619 448 6314 hypothetical_protein 820369
820626 Conserved :
mru j620 449 6315 helicase SNF2 _family 820699
822795 H el icase =--.
o
(..4
mru j621 450 6316 hypothetical_protein 823402
823636 Conserved (A
(.4
mru 0623 451 6317 transposase 824875
825540 Transposase
mru_0624 452 6318 transposase 825619
825729 Transposase '
mru_0626 453 6319 hypothetical_protein 826243
826590 Hypothetical
mru_0627 454 6320 ZPR1_zinc-finger_domain-containing_protein 826839
827438 General
mru_0629 455 6321 hypothetical_protein 829143
830195 Conserved
mru_0630 456 6322 = hypothetical_protein ' 830574
830858 Hypothetical
.
mru 0632 457 6323 hypothetical_protein 831473
832441 Hypothetical
mru_0633 458 6324 acetyltransferase GNAT _family 833365
833874 Enzyme
mru 0634 459 6325 hypothetical_protein 834468
835121 Conserved , a
mru:0635 460 6326 pyruvate_kinase PykA 835635
837041 Gluconeogenesis
mru_0636 461 6327 Na+-dependent Transporter SNF _family 837134
838624 Other 0
n)
mru j637 462 6328 SAM-dependent_methyltransferase 838933
839550 Enzyme
.-.1
mn.J_0638 463 ' 6329 hypothetical_protein 839647
840144 Conserved n)
IV
I\ )
mru j641 464 6330 hypothetical_protein 841615
841737 Hypothetical
mru_0642 465 6331 hypothetical_protein ' 841795
842133 Conserved n)
N.) 0
mru_0643 466 6332 hypothetical_protein 842332
842880 Conserved = o H
mru 0644 467 6333 hypothetical_protein 843115
843723 Conserved 1
o
mru_0645 468 6334 transcriptional regulator TetR _family 844150
844776 Transcriptionalregulators n)
1
mru_0646 469 6335 radical_SAM_Tiomain-containing_protein 844863
845954 Enzyme i\)
.1,
mru 0647 470 6336 hypothetical protein 846447
, 846860 Conserved
mru 0648 471 6337 hydrolase_TatD_family 847127
847882 Enzyme
mru:0649 472 6338 transcriptional regulator = 848823
849635 Transcriptionalregulators
mru 0650 473 6339 transcriptional regulator 849867
850685 Transcriptionalregulators
mru j651 474 6340 glycyl-tRNA_synthetase_GlyS 850976
852688 tR NAami noacylation
mru_0652 475 6341 deoxycytidine triphosphate_deaminase_Dcd 852889
853476 Pyri midi nei nterconversion
,
mru_0653 476 6342 ferredoxin 853554
854429 Electrontransport
mru_0654 477 6343 RNA_methyltransferase_TrmH _family 854670
855461 RNAprocessing n
.
1-
mru 0655 478 6344 succinate_dehydrogenaseffumarate_reductase iron-
sulfur protein_SdhB 855593 857119 TCA
. mru:0656 479 6345
indolepyruvate_ferredoxin_oxidoreductase_alph7a_subunit_lorA 857657 '
859648 aro matica minoacids N
mru_0657 480 6346
indolepyruvate_ferredoxin_oxidoreductase_beta_subunit_lorB 859650
860288 aromaticam in oacids c=
1--,
mru_0658 481 6347 hypothetical_protein 860421
861011 Conserved
.j(E3
mru_0660 482 6348 hypothetical_protein ' 863621
863896 Conserved o
o
rnru_0662 483 6349 transcriptional_regUlator_LytR_family 864712
865173 Transcriptionalregulators c7,
mru_0663 484 6350 hypothetical_protein = 865193
865624 Hypothetical
_
_
=
.
.
= Table 11
mru 0664 485 , 6351 amidohydrolase
865877 867016 Enzyme 0
(,..
mru 0665 486 6352 CBS_domain-containing_protein
867609 868484 General =
.
I--
= mru 0666 487
6353 PP-loop family_protein 868634 869713 General 1--,
--.
o
mru_0667 488 6354 TfuA-like_protein
869886 870527 , General (..3
(A
mru_0668 489 6355 methanogenesis_marker_protein_1
870527 871786 Methanogenesis (.4
o
mru_0669 490 6356 Xaa-Pro aminopeptidase
872047 873090 Proteindegradation
mru_0671 491 6357 _ hypothetical_protein '
874961 875185 Hypothetical
mru 0672 492 6358 lactaldehyde dehydrogenase_CofA
875464 876879 CoenzymeF420
mru_0673 493 6359 tRNA_binding_domain-containing_protein
877057 877782 tRNAaminoacylation
mru_0674 494 6360 hypothetical_protein
878591 879007 Conserved
mru 0675 495 6361 L-aspartate_dehydrogenase '
879493 880254 Nicotinate
mru10676 496 6362 tRNA(His)_guanylyitransferase_ThgL
880322 881080 RNAprocessing
mru_0677 497 6363. hypothetical_protein
881267 882187 Conserved
a
mru_0678 498 6364 phosphoglycerate dehydrogenase SerA
882376 883950 Serine
mru_0679 499 6365 hypothetical_protein
884517 = 884855 Conserved (D
n)
mru 0680 500 6366 hypothetical protein ,
885221 885994 Conserved .
.-.1
mru_0681 501 6367 , formate dehydrogenase accessory protein FdhD1
886153 ' 886962 Formate n)
IV
mru_0682 502 6368 hypothetical_protein
887258 887689 Conserved m
d,
mru_0683 503 6369 hypothetical_protein =
887732 887899 , Hypothetical
mru_0684 504 6370 transcriptional regulator
888107 889087' , Transcriptionalregulators a' 0
I-.
mru_0685 505 6371 DNA-binding_protein
889339 890631 Chromosomereplication "
1
mru_0688 506 6372 2-methylcitrate_dehydratase_PrpD
896055 897488 Propanoate n)
= mru 0689 = 507 = 6373
hypothetical_protein . 897602 898477
Conserved '
i\)
mru_0690 508 6374 fumarate_hydratase FumA2 .
898586 899470 TCA
mru_ 0691 509 6375 2-methylcitrate_synt-h-ase/citrate_synthase_II_PrpC/CitZ
900240 901088 Propanoate/TCA
mru_0692 510 6376 nitroreductase_family_protein .
901247 = 901756
General
mru_0695 511 6377 V-type_ATP_synthase_subunit_H_AtpH
903324 903638 Electrontransport
mru 0696 512 6378 V-type_ATP_synthase_subunit_l_Atpl
903648 905654 Electrontransport
mru_0698 513 6379 V-type_ATP_synthase subunit E_AtpE
906528 907208 Electrontransport .
mru_0700 514 6380 V-type ATP_synthase_subunit_F_AtpF =
908371 . 908688 Electrontransport
oo
mru_0701 515 6381 V-typeTATP synthase_subunit A_AtpA
908685 910439 Electrontransport n
mru_0702 516 6382 V-type_ATP_syrithase_subunit_B_AtpB .
910442 , 911827 Electrontransport =1-3
mru 0703 517 6383 V-type_ATP_synthase_subunit_D_AtpD
911967 912659 Electrontransport
mru 0704 518 ' 6384 adhesin-like_protein
913275 916133 Cellsurfaceproteins = N
mru 0705 519 6385 MotAiToIQ/ExbB_proton_channel_family_protein
916565 917203 Other 1--,
o
mru_0707 520 6386 cell_wall_biosynthesis_protein_Mur_ligase_family
918127 919449 Pseudomureinbiosynthesis
mru_0708 521 6387 CobB/CobQ-like_glutamine_amidotransferase_domain-
containing_protein 919586 920362 Enzyme =
1--,
_ _ mru 0709 522 6388 ATP-
graspdomainrcontainingprotein 920769 922055 General o=
_ _
o
= ,
,
- , = . . .
,
=
. . .
=
' Table 11
0 o
mru 0710 _523 6389 tyrosine_recombinase_XerC ,
922147 923127 Chromosomereplication (,..
.
mrui0711 524 6390 DNA_primase_DnaG
923622 924959 Chromosomereplication - I--
1--,
' mru_0712 525 6391 demethylmenaquinone_methyltransferase ' ,
927267 928022 General .---.
o
mru_0713 526 . 6392
H/ACA_RNA-protein_complex_component_Gar1 928475 _ 928837 RNAprocessing
(A
(.4
mru 0714 527 6393 ' transcription initiation factor TFIIB Tfb1
929143 929937 Translationfactors
mru 0719 528 6394 hypothetical_protein
935901 936227 Conserved
mru10720 = 529 6395 hypothetical_protein
936405 '936593 Conserved
.
mru_0721 530 6396 hydrolase_TatD_family
936743 ' 937537 Enzyme
mru_0722 531 6397 uridylate_kinase PyrH
937768 938445 Pyrimidineinterconversion
' mru_0723 532 6398 . adhesin-
like_proTein 938839 946566 Cellsurfaceproteins
mru_0724 533 6399 hypothetical_protein
946947 947513 Hypothetical
mru 0725 534 6400 hypothetical_protein
947573 947740 Hypothetical
mru=0728 535 6401 '= peptide_chain_release factor_aRF1 _
,' 953829 955073 Translationfactors a
mru_0729 536 6402 RNA-binding_protein
955368 956021 Other mru_0730 537 6403 hypothetical_protein 956243 956668
Conserved n)
-.3
mru_0731 538 6404 hypothetical_protein
956878 , 957681 Hypothetical .-.1
NJ
mru_0732 539 6405 hypothetical_protein=
957999 958502 Hypothetical IV
1\)
mru_0733 540 6406 hypothetical_protein =
958621 ; 959043 Hypothetical
=
mru 0734 541 6407 5-formyltetrahydrofolate_cyclo-ligase
959121 959732 Other 1.) .
mru_0735 542 6408 rubrerythrin_Rbr1 .
959914 960459 . Oxidativestressresponse
-u
IV
mru 0737 543 6409 NUDIX domain-containing_protein
960826 961254 Enzyme 1
o
mru_0738 544 6410 hypothetical_protein .
961341 961847 Conserved n)
1
=
mru_0739 545 6411 hypothetical_protein 961891 962295 Hypothetical r\)
.1,
'
mru_0740 546 - 6412
hypothetical protein 962816 963295 Conserved . .
_
mru_0742 547 6413 hypothetical_protein
964770 965618 Conserved
mru_0743 548 6414 hypothetical protein '
966282 966710 Conserved .
mru_0744 549 6415 hypothetical protein
967385 968830 Conserved
mru_0748 550 6416 TPR repeat-containing_protein
973817 974611 Proteininteractions
mru 0749 551 6417 ADP:ribosylglycohydrolase_family_protein
974668 975681 Other .
mru_0750 552 6418 hypothetical_protein
975685 976092 Hypothetical
mru_0751 553 6419 hypothetical_protein
976097 977062 Conserved n
1-
.
-
mru 0752 554 6420 hypothetical_protein . -
977127 977693 Hypothetical -
N
mru-0753 ' 555 6421 hypothetical_protein,
978706 980196 Conserved.
_
mrui0754 556 6422 hypothetical_protein
980458 980805 Hypothetical
1--,
mru_0756 557 6423 hypothetical 'protein
982348 982641 Conserved
o
mru_0757 558 6424 hypothetical_protein =
983247 984068 Conserved o
o
mru_0758 559 6425 hypothetical_protein
984547 :984840 Conserved 1--,
c7,
mru_0761 560 6426 hypothetical_protein
986779 987639 Conserved
_
. . ,
. .
= .
.
,
,
Table 11
= 0
mru_0762 561 6427 hypothetical_protein 987953
. 988096 Hypothetical _ _ (,..
mru 0763 562 6428 hypothetical_protein 988132
988452 Hypothetical o
I--
mru_ 0764 563 ' 6429 hypothetical_protein 988880
990211 Conserved --.
o
m ru_0765 564 6430 hypothetical protein 991435
991752 Hypothetical (..4
(A
mru_0766 565 _ 6431 hypotheticatprotein 991819
992142 Hypothetical (.4
mru_0767 566 6432 , hypothetical_protein 992295
992603 Hypothetical -
m ru_0770 567 6433 , exonuclease 994054
994839 Recombinationandrepair
mru_0771 568 6434 hydrolase al pha/beta_fold_fami ly = 995435
996382 Enzyme
mru_0772 569 6435 adhesin-like_protein with_cysteine_protease_domain
997123 , 1000404 Cellsurfaceproteins
mru_0773 570 6436 oxidoreductase_aldo/keto_reductase family 1000909
_ 1002054 Enzyme
mru_0774 571 6437 archaea I histone 1002318
-1002518 DNA-bindingproteins
mru_0776 572 6438 hypothetTCal_protein 1005731
1007137 Conserved
mru 0777 573 6439 transposase 1007315
1008373 Transposase a
mru_0778 574 6440 DEAD/DEAH box_helicase_domain-containing_protein
1008862 1011288 Helicase
mru_0779 575 6441 SAM_depend¨ent_methyltransferase ' 1011376
1011954 Enzyme 0
n)
' mru 0780 576 6442
hypothetical_protein 1012560 1013189 Conserved -
..3
.-.1
mru_0781 577 6443 _ hypothetical_protein 1013616
1014155 Hypothetical "
IV
mru 0782 578 6444 hypothetical_protein 1014152
1014988 Conserved m
.1,.
mru 0783 579 6445 hypothetical_protein 1015056
1015616 , Hypothetical , 1.)
N
mru 0784 580 6446 hypothetical_protein 1015688
1015807 Hypothetical 0
Ri
H
mru_0785 581 6447 hypothetical_protein 1016012
1017511 Conserved IV
1
,
mru_0786 582 6448 hypothetical_protein 1017611
1018453 Hypothetical o
n)
1
mru 0787 583 6449 hypothetical_protein 1018824
1019237 Hypothetical i\)
.1,.
mru 0788 584 6450 hypothetical_protein 1019714
1020667 Conserved
mru 0790 585 ) 6451
hypothetical_protein 1021746 ' 1022789 Hypothetical
mru10792 586 6452 CRISPR-associated_protein Cas6 1023766
1024497 CRISPR-associatedgenes
mru 0793 587 6453 hypothetical protein 1024497
1026656 Conserved
mru_0794 588 6454 CRISPR-associated_protein CT1132_family 1026658
1027617 CRISPR-associatedgenes
mru_0795 589 6455 CRISPR-associated_protein_Cas5_Hmari_subtype 1027652
1028515 CRISPR-associatedgenes
mru_0796 590 6456 CRISPR-associated_helicase_Cas3 1028497
1031322 CRISPR-associatedgenes oc
mru_0797 591 6457 CRISP R-associated_protei n_Cas4-1 1031319
1031825 CRISPR-associatedgenes n
1-
mru_0798 592 6458 CRISPR-associated_protein_Cas1-1 1031886
1032854 CRISPR-associatedgenes
mru_0799 593 6459 N
CRISPR-associated_protein Cas2-1 1032857 , 1033123 CRISPR-
associatedgenes
mru_0800 594 6460 TPR repeat-contain ing_p rot-ein 1038230
1038931 Proteininteractions o
1--,
mru_0801 595 6461 hypothetical_protein 1039164
1039343 Hypothetical o
mru_0804 596 6462 hypothetical_protein 1042247
1042423 Hypothetical
o
mru_0806 597 6463 hypothetical_protein 1043424
1044104 Conserved o
1--,
c7,
mru_0807 598 6464 hypothetical_protein 1044753
1045004 Hypothetical
,
.
,
Table 11
0
mru_0809 599 6465 hypothetical_protein .
1046198 1046776 Conserved (,..
_
o
' mru 0810 600 6466 glutamate_synthase_domain-containing_protein
1046906 1048318 glutamate/glutamine 1--
1-
mru_0812 601 6467 exodeoxyri bon uclease_VILla rge_subun it_XseA
1052403 1053929 Recombi nationand repair .--.
o
mru_0813 602 6468 ' exodeoxyri bon uclease_VI l_small_subun it_XseB
1053973 1054224 Recombi nationand repair (..4
(A
mru_0814 603 6469 archaeosine_tRNA-ribosyltransferase_TgtA1
1054380 1055114 RNAprocessing (.4
o
mru 0815 604 6470 transcriptional regulator_MarR_family
1055722 1056168 , Transcriptionalregulators
mru_0816 605 6471 CoB-CoM_heTerodisulfide_reductase_subunit_C HdrC
1056680 1057756 Methanogenesis ,
mru 0817 606 6472 Co B-CoM heterodisulfide_reductase_subunit_B=HdrB
1057778 1058662 Methanogenesis
mru_0818 607 6473 hypothetic;_protein
1059121 1059390 Conserved _
mru 0819 608 6474 hypothetical_protein
1059387 1059914 Conserved
mru_0820 609 6475 calcineurin-like_phosphoesterase
1060267 1060983 Enzyme
mru_0822 610 6476 2-phosphoglycerate_kinase_Pgk2B
1062744 1063658 Gluconeogenesis
mru_0823 611 6477 CBS_domain-containing_protein
1063672 1064040 General a
mru 0824 612 6478 adhesin-like_protein_with transglutaminase_domain
1065115 1067142 Cellsurfaceproteins
mru10825 613 6479 Fe-S oxidoreductase
1067664 1069337 Enzyme 0
_ -
n)
mru_0826 614 6480 am inotransferase
1069737 1070852 Enzyme ...r
.-.1
mru_0828 615 6481 adhesin-like protein with transglutaminase_domain
1073030 1075378 Cellsurfaceproteins
IV
mru_0829 616 6482 phosphopantetheine_adenylyltransferase_CoaD
1075885 1076337 PantothenateandcoenzymeA m
mru_0830 617 6483 ferredoxin
1076720 1077694 Electrontransport Iv ,N2
mru_0832 618 6484 hypothetical_protein
1078519 1078755 Conserved Z.73 H
mru 0834 619 6485 metallo-beta-lactamase_superfamily_protein ,
1081757 1082470 Enzyme IV
1
mru_0835 620 6486 hypothetical_protein
1082811 1083497 Conserved o
n)
_
mru_0836 621 6487 hypothetical_protein
1083902 1084564 Conserved 1
r\)
mru_0837 622 6488 RNA-binding_protein
1084726 1086735 Other
, mru_0839 623 6489 adhesin-
like_protein_with_cysteine_protease_domain 1087683 1096322
Cellsurfaceproteins
mru 0840 624 6490 hypothetical_protein
1096786 1098603 Conserved
mru_0841 625 6491 fumarate hydratase_FumA3
1098988 1099518 TCA
mru_0843 626 6492 adhesin-ce_protein_with_cysteine_protease_domain
1104234 1110431 Cellsurfaceproteins
mru_0844 .627 6493 hypothetical_protein
1110839 1111156 Conserved
mru_0845 628 6494 pyridoxal_phosphate_enzyme
1111296 1112438 Enzyme oo
mru_0846 629 6495 biotin-acetyl-CoA-carboxylase_ligase_BirA
1112767 1113678 Biotin n
mru_0847 630 6496 pyruvate carboxylase_subunit_A_PycA
1113871 1115364 TCA 1-3
mru_0848 631 6497 hypothetical_protein
1115887 1116006 Hypothetical
mru_0849 632 6498 , ribosomal_protein_L11_methyltransferase_PrmA
1116410 1117240 RNAprocessing N
mru_0850 633 6499 hypothetical_protein
1117945 1118760 Conserved 1--,
o
mru_0851 634 6500 ribosomal_protein_L3P Rpl3p
1119670 1120680 Ribosomalproteins
mru 0852 635 6501 ribosomal protein L4piRpl4p
1120691 1121455 Ribosomalproteins =
1..,
, mru_0853 636 6502 ribosomal_protein_L23P_Rp123p
1121500 1121760 Ribosomalproteins o.
o
,,
=
Table 11 0
=
mru_0854 637 6503 ribosomal_protein_L2P_Rpl2p
1121773 1122498 Ribosomalproteins (,..
mru_0855 638 6504 ribosomal_protein_S19P Rps19p
1122516 1122926 Ribosomalproteins o
1--
mru_0856 639 6505 _ ribosomal_protein322P¨Rp122p
1122939 1123406 Ribosomalproteins 1--
=---.
o
mru_0857 640 6506 ' ribosomal_protein_S3P lips3p=
1123410 1124165 Ribosomalproteins (..4
(A
mru_0858 641 6507 ribosomal_protein_L29P Rp129p
1124189 1124395 Ribosomalproteins
= mru_0859 642 6508
translation_initiation_faclor_aSUll 1124396 1124740 Translationfactors
mru 0860 643 6509 ribonuclease_P_subunit_P29 =
1124831 1125166 RNAprocessing
mru10861 644 6510 ribosomal_protein_S17P Rps17p
1125177 1125497 Ribosomalproteins
mru_0862 645 6511 ribosomal_protein_L14P1Rp114p
1125499 1125897 Ribosomalproteins
mru_0863 646 6512 ribosomal_protein_L24P Rp124p
1125911 1126252 Ribosomalproteins
mru_0864 647 6513 ribosomal_protein_S4e Tzps4e
1126254 1126988 ''Ribosomalproteins
mru_0865 648 6514 ribosomal_protein_L5P Rpl5p
1126985 1127506 Ribosomalproteins
.
mru_0866 649 6515 ribosomal_protein_S14i; Rps14p
1127560 1127712 Ribosomalproteins a
mru_0867 650 6516 ribosomal_protein_S8Pps8p
1127727 1128119 Ribosomalproteins
=
mru_0868 651 6517 ribosomal_protein L6P_Rpl6p
1128130 1128663 Ribosomalproteins 0
n) .
mru_0869 652 6518 ribosomal_proteint_32e_Rp132e
1128725 1129069 Ribosomalproteins ...3
.-.1
mru_0870 653 6519 ribosomal_protein_L19e_Rp119e
1129617 1130063 Ribosomalproteins "
I.)
mru_0871 654 6520 ribosomal_protein_L18P Rp118p
1130075 1130656 Ribosomalproteins
_
n)
=
mru_0872 655 6521 ribosomal_protein S5P iRps5p
1130653 1131294 Ribosomalproteins . NJ n)
mru_0873 656 6522 ribosomal_protein_L30P_Rp130p
1131306 1131764 Ribosomalproteins
' mru_0874 657 6523 ribosomal_protein_L15P_Rp115p
1131777 1132220 Ribosomalproteins I.)
,
mru_0875 658 6524 preprotein_translocase_subunit_SecY
1132821 1134191 Proteinsecretion o
n)
_
1
mru_0876 659 6525 adenylate_kinase_Adk
1134359 1134916 Purineinterconversions i\)
.
.1,
mru_0878 660 6526 ribosomal_protein L34e_Rp134e
1136337 . 1136603 Ribosomalproteins
' mru_0879 661 6527 -cytidylate
kinase ¨Cmk 1136603 1137121 Pyrimidineinterconversion
. ,
mru_0880 662 6528 ribosomar_protein_L14e_Rp114e
1137121 1137288 Ribosomalproteins
mru0881 663 6529 adhesin-like_protein .
1137864 1140397 Cellsurfaceproteins
mru_0882 664 6530 cobalt_ABC_transporter_ATP-binding_protein Cbi01
1140817 1141650 Cobalamin
mru_0883 665 6531 cobalt_ABC_transporter permease_protein_C¨biQ2
1141652 ' 1142344 Cobalamin
mru_0886 666 6532 precorrin-2_C20-methyltransferase_CbiL
1144186 1144872 Cobalamin
mru_0887 667 6533 cobalamin= biosynthesis_protein_CbiD
1144947 1146086 Cobalamin n
,-
mru_0888 668 6534 precorrin-41C11-methyltransferase_CbiF
1146206 1146961 Cobalamin
.
N
mru_0889 669 6535 cobalamin biosynthesis_protein_CbiG
1147023 1148045 Cobalamin
mru 0890 670 6536 precorrin-ii_C17-methyltransferase CbiH1
1148277 1149008 Cobalamin = o
1--,
mru¨_0891 671 6537 precorrin-6x_reductase CbiJ
1149025 1149783 Cobalamin o
mru_0892 672 6538 precorrin-6Y_C5,15-
meThyltransferase_(dec,arboxylatin_g)_CbiET 1149823 1151010 Cobalamin
o
mru_0893 673 6539 cobyrinic_acid_a,c-diamide_synthase_CbiA3 - '
1151003 1152364 Cobalamin o
1..,
'
mru_0894 674 6540 precorrin-8X_methylmutase_CbiC
1152477 1153115 Cobalamin cr,
, ,
,
. ,
=
,
-
.
= ,
=
,
Table 11
mru 0895 675 6541 cobalt_chelatase_CbiK
1153434 1154222 Cobalamin (,..
mru:0896 676 6542 , adhesin-like_protein
1154364 1154756 Cellsurfaceproteins o
1--
_
mru 0897 677 6543 SAM-dependent methyltransferase_UbiE _family
1154916 1155584 Ubiquinone 1-
,
o
mru_0898 678 6544 H/ACA_RNA-prcTtein_complex_component_Cbf5p
1155815 1156777 RNAprocessing (..4
(A
mru_0899 679 6545 hypothetical_protein
1156845 1157027 Conserved (.4
o
mru_0900 680 6546 hypothetical_protein =
1157197 1157748 Conserved
mru_0902 681 = 6547 hypothetical_protein
1158271 1158414 Hypothetical _
mru_0903 682 6548 hypothetical_protein
1158684 1159187 Hypothetical
mru_0904 683 6549 hypothetical_protein
1159296 1160162 Conserved
mru_0905 684 6550 ribosomal_protein_S13P Rps13p
1161107 , 1161556 Ribosomalproteins
mru_0906 685 6551 ribosomal_protein_S4P 17tps4p
1161685 1162221 Ribosomalproteins
mru 0907 686 6552 ribosomal_protein S1113 Rps11p
1162233 1162625 Ribosomalproteins
mru_0908 687 6553 DNA-directed RN-A_polymerase_su bun it_p_RpoD
1162638 1163486 RNApolymerase
mru 0909 688 , 6554 ribosomal protein L18e Rp118e
1163489 1163854 Ribosomalproteins a
mru 0910 689 6555 ribosomal_protein_Ll 3P_Rp113p
1163875 1164303 Ribosomalproteins 0
n)
mru:0911 690 6556 ribosomal_protein S9P Rps9p '
1164425 1164823 Ribosomalproteins ...3
mru_0912 691 6557 . DNA-directed_RN-A_polimerase_subunit_N RpoN
1165094 1165264 RNApolymerase n)
I.)
mru_0913 692 6558 DNA-directed_RNA_polymerase_subunit_K:RpoK '
1165515 1165733 RNApolymerase m
mru_0914 693 6559 phosphopyruvate_hydratase_Eno ,
1166393 1167640 Gluconeogenesis ry n)
mru_0915 694 6560 4Fe-4S_binding_domain-containing_protein
1168043 1168234 Electrontrans port " 0
mru_0916 695 6561 ribosomal_protein_S2P_Rps2p
1168432 1169028 Ribosomalproteins I.)
1
mru 0917 696 6562 hypothetical protein
1169561 1170457 Conserved o
n)
mru_0918 697 6563 hypothetical_protein
1170566 1170811 Hypothetical '
i\)
mru_0919 698 6564, phosphomevalonate_decarboxylase
1170999 1171847 Mevalonatepathway
mru_0920 699 6565 , mevalonate kinase_Mvk
1172330 1173313 Mevalonatepathway
mru 0921 700 6566 isopentenyl_diphosphate_kinase
1173526 1174341 Mevalonatepathway
mru10922 701 6567 isopentenyt_cfiphosphate_delta-isomerase_Fni
1175350 1176402 Mevalonatepathway
niru_0923 702 6568 RNA-metabolising_metallo-beta-lactamase =
1176701 1178050 Other
mru 0924 703 6569 bifunctional short_chain isoprenyl_diphosphate
synthase_IdsA 1178436 1179431 Elongationofisoprenoidsidechains _
mru:0925 704 6570 SAM depen-dent_methyTransferase
1179535 1180407 Enzyme
oo
mru_0926 705 6571 hypothetical_protein
1180961 1181620 Hypothetical n
mru_0927 706 6572 , type_l_restrictio n-modification_enzym e_S_su
bun it_HsdS 1183126 1183698 Restrictionandmodification
1-3
mru 0928 707 6573 type_l_restriction-modification_system_M
subunit_HsdM " 1183832 1185736 Restrictionandmodification
mru:0929 708 6574 mange nese-dependent_inorgan ic_pyroph-
osphatase_PpaC 1186463 1187383 Enzyme N
rnru_0931 709 6575 hypothetical_protein
1187881 1188276 Hypothetical 1--,
o
mru_0932 710 6576 oxidoreductase a Ido/keto_reductase_family
1188582 1189751 Enzyme
o
mru_0933 711 6577 SAM-dependent methyltransferase ,
1189904 1190731 Enzyme =
1..,
mru 0934 712 6578 SAM-dependent_methyltransferase
1190815 1191639 Enzyme = o=
o
. . ,
õ
'
-
Table 11 ,
0
mru_0935 713 6579 SAM-dependent_methyltransferase ,
1191738 1192553 Enzyme w
mru_0936 714 6580 hypothetical_protein ,
1192605 1193099 Hypothetical o
1--
1--,
mru_0937 715 6581 hypothetical_protein
1194827 1195090 Conserved ,
o
mru_0938 716 6582 glutamyl-tRNA_synthetase GItX
1196063 _ 1197733 tRNAaminoacylation k..i
uri
= mru_0939 717 6583.
hypothetical_protein = 1197887 1198333 Conserved r.4
mru_0940 718 6584 hypothetical_protein .,
1198565 1200853 Hypothetical
mru_0941 719 6585 diaminopimelate aminotransferase_DapL '
1201403 _1202635 Lysine
mru_0942 720 6586 hypothetical protein '
1202813 1203493 Conserved
mru_0943 721 6587 hypothetical_protein =
1203597 1204223 Conserved _
mru_0944 722 6588 hypothetical_protein
1204349 1205017 Conserved
mru_0945 723 6589 hypothetical_protein
1205111 1205653 Conserved .
mru 0946 724 6590 adenylosuccinate_synthetase_PurA
1206140 1207159 Purineinterconversions
mru 0947 725 6591 hypothetical_protein
1207559 1208743 Conserved c)
mru_0948 726 6592 transposase ' ,
1209099 1209482 Transposase
mru_0950 727 6593 bicarbonate_ABC_transporter_permease protein BtcB
1211602 1212345 Bicarbonate 0
n)
mru_0951 728 6594 bicarbonate_ABC_transporter_ATP-
binding_protein_BtcA 1212478 , 1213239 Bicarbonate ...3
.-.1
mru_0952 729 6595 phosphomethylpyrimidine_kinase_ThiD2
1213572 1214372 Thiamine ÷
IV ._
mru_0953 730 6596 2-phospho-L-lactate_guanylyltransferase_CofC ,
1214506 1215210 CoenzymeF420 m
mru_0954 731 6597 prolyl-tRNA_synthetase_ProS
1215307 1216713 tRNAaminoacylation r.) 1.)
mru_0955 732 6598 NAD(P)-dependent_glycerol-1-phosphate_dehydrogenase_E9sA
1217253 1218296 Lipidbackbone 8 0
H
mru_0956 733 6699 hypothetical protein ,
1218391. 1218831 Conserved IV
1
-
. mru_0957 734 6600 ribose 5-phosphate_isomerase A_RpiA
1219054 1219722 PRPPsynthesis o
n)
mru 0958 735 6601 NADI-Coxidase Nox
1220042 , 1221376 Oxidativestressresponse 1
i\)
mru 0959 736 6602 methionyl-tRNAisynthetase:MetG
1222037 1224112 tRNAaminoacylation
mrui0960 737 6603 hypothetical_protein =
1224302 1225879 Conserved .
mru 0961 738 6604 hypothetical protein
1226021 1226728 - Conserved
mru_0962 739 6605 adhesin-like_protein .
1228943 1243732 Cellsurfaceproteins
mru 0963 740 6606 adhesin-like_protein
1243757 1251916 Cellsurfaceproteins
mru_0965 741 6607 hypothetical_protein .
1254041 1254181 Hypothetical
mru 0966 742 6608 hypothetical_protein
1254530 1254754 Conserved
mru_0967 743 6609 hypothetical_protein S
1254812 1255318 Conserved n
1-
mru_0969 744 6610 DNA_primase_large_subunit_PriB
1256313 1257656 Chromosomereplication
mru_0971 745 6611 hypothetical_protein '
1260519 ' 1260812 Hypothetical .
N
mru_0972 746 6612 hypothetical_protein S
1261107 1262213 Hypothetical o
mru_0973 747 6613 hypothetical_protein ( '
1262856 1263347 Conserved 1--,
o
mru_0974 748 6614 DNA_primase_small_subunit_PriA =
1263642 1264634 Chromosomereplication C3
o
mru_0976 749 6615 adhesin-like_protein
1265784 1266086 Hypothetical o
1--,
mru_0977 750 6616 adhesin-like_protein
1266302 1270774 Cellsurfaceproteins o
,
.
_
,
=
,
,
, ,
_______________________________________________________________________________
____________________________
Table 11
0
mru_0978 751 6617 adhesin-like_protein =
1270792 1277397 Cellsurfaceproteins (,..
mru_0980 752 6618 hypothetical_protein
1287786 1288628 Conserved o
.--
mru_0981 753 6619 Rad3-related_DNA_helicase
1289582 1292956 Helicase .--,
=-.
o
mru_0982 754 6620 " hypothetical protein
1293772 1294269 Conserved . (..4
(A
=
mru_0983 755 6621 hypothetical_protein
1294346 1294510 Conserved (.4
o
mru_0984 756 6622 hypothetical_protein
1294620 1295222 Hypothetical
=
mru_0985 757 6623 hypothetical protein =
1295646 1296062 Hypothetical
mru_0987 758 6624 tRNA_nucleotidyltransferase_Cca
1297821 1299293 RNAprocessing .
mru_0988 759 6625 transcriptional_regulator Ma rR_fam ily
1299532 1299996 Transcriptional reg ulators .
mru 0990 760 6626 3-hydroxybutyryl-CoA_dehydro_genase Hbd
1300898 1301836 Butanol _
mru_0991 761 6627 NADPH-dependent_F420_reductase_F1pdG1
1302480 1303163 Ethanol
mru 0992 762 6628 transposase
1303331 1304827 Transposase 4
mru 0994 763 6629 nitroreductase_family_protein
1306178 1306687 General _ a
_ mru_0995 764 6630 nitroreductase_family_protein
1306753 1307274 General
mru_0996 765 6631 2'-5'_RNA_Iigase LigT
1307368 1307943 RNAprocessing (D
n)
mru_0997 766 6632 phospho-2-dehyd7o-3-deoxyhqptonate_aldolase
1308896 1309696 Gluconeogenesis/Chorismate ...3
.-.1
mru_0998 767 6633 3-dehydroguinate_synthase_AroB
1309895 1311001 Chorismate K)
IV
mru 0999 768 6634 glutamate-1-semialdehyde-2,1-aminomutase HemL
1311369 1312634 Cobalamin m
_mru1001 769 6635 SAM-dependent_methyltransferase
1313763 1314401 Enzyme
.
mru_1002 770 6636 MFS_transporter
1314608 1315846 Other
I-.
=
mru 1003 771 6637 flavodoxin
1316081 1316623 Electrontransport IV
I
mru_1004 772 6638 methanogenesis_marker_protein_4
1316865 1317653 Methanogenesis o
n)
'
mru_1005 773 6639 undecaprenyl_p_yrophosphate_synthetase_UppS
1318012 1318776 Pseudomureinbiosynthesis i\)
mru_1006 774 6640 hydrolase_TatD _family
1318993 1319754 Enzyme
mru_1007 775 6641
diaminohydroxyphosphoribosylaminopyrimidine_reductase_RibD 1319888 _
1320568 Riboflavin
mru_1008 776 6642 hypothetical_protein
1322196 1322342 Hypothetical
=
mru_1009 777 6643 PHP
mru 1010 778 6644 panGdomain-containing_protein
1322469 1323113 General
henate_kinase CoaA
1323570 1324628 PantothenateandcoenzymeA
mru_1011 779 6645 SAM-dependent_meilyltransferase
1324712 1325245 Enzyme
mru_1014 780 6646 aspartyl-tRNA synthetase_AspS
1328274 1329596 tRNAaminoacylation
oo
mru_1015 781 6647 histidinol_dehy-drogenase HisD
1329812 1331092 Histidine n
mru 1016 782 6648 hypothetical_protein
1331819 1332124 Conserved 1-3
mru_1017 783 6649 cyclic_2,3-diphosphoglycerate-synthetase
1332620 1334008 Gluconeogenesis
N
mru 1018 784 6650 TPR repeat-containing_protein ' =
1334176 1335237 Proteininteractions o
mru_1019 785 6651 C_GCA)ocG C_Ciamily_protein
1335266 1335706 General .--,
o
mru_1020 786 6652 hypothetical- protein
1336153 1337232 Conserved
o mru_1023 787
6653 bifunctional_ornithine acetyltransferase/N- 1339474 1340667
Arginine =
...,
'acetylglutamate_syntRase_protein_ArgJ
c7,
o
= =
. . (
.
. ,
= .
Table 11
0
mru 1024 788 6654 hypothetical_protein
1341451 1342092 Conserved _ (,..
'
mru_1025 789 6655 RNA_Iigase DRB0094_family
1342472 1343656 RNAprocessing =
1--
mru_1026 790 6656 SAM-dependent
1344006 1344617 Enzyme 1-
-
o
. mru_1029 791 6657 acetylglutamate_kinase_ArgB
1347429 1348307 Arginine (..4
,
(A
=
mru_1031 792 6658 3-oxoacyl-(acyl-carrier-proteinLreductase_FabG1 1349524
1350267 Bacterial (.4
,
mru_1032 793 6659 AMP-binding_enzyme-
1350665 1352149 Enzyme
mru_1033 794 6660 isohomocitrate_dehydrogenase_AksF
1352367 1353365 CoenzymeB _
mru_1034 795 6661 , HEAT_repeat-containing_protein
1354485 1354952 General _
' mru_1035 796 6662 hypothetical_protein
1356453 1356815 Conserved
mru_1036 797 6663 - hydrolase_alpha/beta_fold_family
1356927 1357718 Enzyme
mru_1037 798 6664 nickel_responsive_transcriptional_regulator_NikR =
1358014 1358508 Transcriptionalregulators
mru_1038 799 6665 RNA_methylase
1358689 1359624 Other
mru 1039 800 6666 hydrogenase_maturation_protease_Hycl
1359684 1360208 Hydrogenmetabolism _ a
mru 1040 801 6667 D-alanine-D-alanine_ligase =
1360461 1361600 Enzyme
mru_1041 802 6668 cell_wall_biosynthesis_protein_phospho-N-acetylmuramoyl-
pentapeptide- 1362077 1363150 Pseudomureinbiosynthesis o
n)
transferase_family
.-.1
mru_1042 803 6669 cell_wall biosynthesik_protein_Mur_ligase_family
1363412 1365031 Pseudomureinbiosynthesis n)
IV
mru_1043 804 6670 hypothetical_protein =
1365153 1367006 , Hypothetical n)
mru 1044 805 6671 hypothetical protein
1367080 1367673 Hypothetical
mru_1045 806 6672 4-oxalocrotonate_tautomerase_family_enzyme_Dmpl
1367835 1368023 Aromaticcompounds
lTs0 H
mru_1046 807 6673 hypothetical_protein
1368435 1369271 Conserved IV
I
mru_1048 808 6674 dUTP_diphosphatase Dut
1370640 1371098 _ Pyrimidineinterconversion o
n)
1
mru_1049 809 6675 glycosyl_transferase --6T2 family
1371179 1372324 Exopolysaccharides i\)
'
mru_1050 810 6676 ATP_phosphoribosyltransferase_HisG2
1372641 1373627 Histidine
mru_1052 811. 6677 transcriptional_regulator
1375426 1376352 Transcriptionalregulators
mru_1053 812 6678 deoxyhypusine_synthase_Dys
1376613 1377548 Translationfactors
mru_1055 813 6679 orotidine_5'-phosphate_decarboxylase PyrF
1378405 = 1379052 Pyrimidine
mru_1056 814 6680 2-C-methyl-D-erythritol_4-phosphate_cYtidylyltransferase
1379395 1380165 Other
mru_1057 815 6681 alcohol dehydrogenase
1380298 1381320 Other
_
mru_1066 816 6682 glycosyT transferase_GT4 _family
1391451 1394489 Exopolysaccharides
-
1-:
mru_1071 817 6683 polysaccharide/polyol_Phosphate_ABC_transporter_ATP-
binding_protein 1400679 1402145 Exopolysaccharides = n
mru_1075 818 6684 UDP-glucose/GDP-mannose_dehydrogenase
1409456 1410679 Exopolysaccharides 1-3
mru_1076 = 819 6685 adhesin-like_protein
1411489 1414170 Cellsurfaceproteins
N
mru_1078 820 6686 glycosyl_transferase GT2 _family/CDP-
1417231 = 1421088 Other c=
glycerol:poly(glycero-phosphate)_glycerophosphotransferase
1--,
o
mru_1080 821 6687 hypothetical_protein
1422844 1423128 Conserved
o
mru_1082 822 6688 MotAfrolQ/ExbB_proton_channel_family_protein
1424051 1424692 Other o
1..,
mru 1083 823 6689 ion_transport_protein ==
1424781 1425662 Cations =::,
.
.
'
.
,
.
.
=
-
= :
= =
Table 11
, 0
mru_1085 824 6690 hypothetical_protein '
1426929 1427288 Hypothetical . w
=
mru_1086 825 6691 TPR_repeat-containing_protein 1427522 1427995
Proteininteractions o
I--
. mru_1087 826 6692 excinuclease_ABC_B_subunit_UvrI3
1428339 1430294 Recombinationandrepair
,
mru_1088 827 6693 hypothetical protein
1430905 1432821 Hypothetical 2
.
(11
mru 1089 828 6694 3-dehydroquinate_dehydratase_type_l_AroD
1433027 1433719 Chorismate . (..i
o
mru 1090 829 6695 hypothetical_protein
1433960 1434685 Conserved
mru_1091 830 6696 succinate-CoA_Iigase_alpha_subunit SucD
1435246 1436115 TCA =
mru 1092 831 6697 hydroxymethylglutaryl-CoA_reductase_(NADPH)_HmgA
1436247 1437449 Mevalonatepathway =
mru_1093 832 6698 hypothetical_protein
1437594 1437998 Conserved .
mru_1094 833 6699 hypothetical protein ,
1438131 1438286 Conserved _
mru_1095 834 6700 hypothetical_protein
1438466 1438870 Conserved
mru_1097 835 6701 hypothetical_protein
1440019 1440408 Conserved.
mru 1099 836. 6702 hypothetical_protein
1441491 . 1442009 Conserved
c)
mru_1100 837 6703 ribosomal_protein_L40e_Rp140e '
1442259 1442420 Ribosomalproteins
mru_1101 838 6704 hypothetical_protein
1442580 '1442768 Hypothetical o
iv
mru_1102 839 6705 geranylgeranylglyceryl phosphate_synthase
1442861 1443610 Phospholipidbiosynthesis .-.1
mru 1103 840 6706 hypothetical_protein
1443938 1445023 Conserved "
_ _
iv
mru_1104 841 6707 ATPase
1445117 1446625 General "
mru_1105 842 6708 DNA double-strand break_repair protein_Mre11
1446764 1448155 Recombinationandrepair ro iv
mru_1106 843 6709 DNA_double-strand_break_repair_protein_Rad50
1448168 1450966 Recombinationandrepair
I-.
mru_1107 844 6710 hypothetical_protein
1451181 1451606 Conserved iv
1
mru 1108 845 6711 hypothetical_protein
1452322 1453395 Conserved iv
mru_1109 846 6712 hypothetical_protein
1453443 1453751 Conserved 1
iv
mru_1110 847 6713 DEAD/DEAH_box_helicase_domain-containing protein
1453856 1456612 Helicase
mru_1111 848 ' 6714 ATP-dependent_DNA_helicase_UvrD/REP _family =
1457147 1458754 Helicase
mru_1113 849 6715 hypothetical_protein
1460079 1461488 Conserved
mru_1114 850 6716 hypothetical protein ,
1461640 1462320 Conserved
mru_1115 851 6717 exodeoxyribonuclease_III_Xth1
1462763 .1463503 Recombinationandrepair
mru_1116 852 6718 hypothetical_protein =
1463941 1464597 Conserved
mru 1117 853 6719 hypothetical_protein =
1464741 1465970 Conserved 0:
mru_1118 854 6720 cell_wall_biosynthesis_protein Mur_ligase_family
1466236 1467567 Pseudomureinbiosynthesis n
mru_1119 855 6721 glutamyl aminopeptidase_Pep-A
1467924 1469027 Proteindegradation = 1-3
mru 1120 856 6722 oxidoreductase_aldo/keto_reductase_family
1469238 1470398 Enzyme
mru_1121 857 6723 DNA_helicase
1470867 1478765 Helicase N
mru_1122 858 6724 hypothetical_protein
1480692 1480979 Hypothetical 1--,
.
o
. mru_1123 859 6725 hypothetical_protein
1481004 1481183 Conserved C3
o
mru_1126 860 6726 hypothetical_protein
1485158 1487053 Conserved =
1-i
mru_1127 861 6727 excinuclease_ABC A subunitUvrA1
1487397 1490303 Recombinationandrepair o=
_ _ _
o
,
.
,
=
,
,
Table 11
0
mru_1129 862 6728 replication_factor_C_Iarge_subunit_RfcL
1491932 1493524 Chromosomereplication (,..
mru_1130 863 . 6729 replication_factor_C_small_subunit_RfcS
1493590 1494534 Chromosomereplication o
I--
mru_1131 864 6730 hypothetical_protein '
1495093 1496574 Conserved 1--
o
mru 1132 865 6731 hypothetical_protein
1496888 1497361 Conserved (..4
(A
=
=
mru_1133 866. 6732 hypothetical_protein' 1497737 1498513 Conserved (.4
o
' mru_1134 867 6733
transcriptional regulator 1498635 1498976 Transcriptionalregulators
mru_1135 868 6734 restriction_encionuclease
1499711 1500445 Restrictionandmodification
mru_1136 869 6735 hypothetical_protein
1500557 1501756 Conserved .
mru_1137 870 6736 = hypothetical_protein
1501855 1503051 Hypothetical
mru_1138 871 6737 ATP-dependent_DNA_helicase_UvrD/REP_family
1503315 1507955 Helicase
mru_1139 872 6738 2,3-bisphosphoglycerate-
independent_phosphoglycerate_mutase_ApgM2 1508057 1509265 Gluconeogenesis
mru_1141 ,873 6739 allosteric_regulator of homoserine_dehydrogenase =
1510672 1511157 Homoserine ,
mru_1142 874 6740 Asp4RNA(Asn)/Glu-tRIA(Gln)_amidotransferase_subunit_C_GatC
1511260 1511475 tRNAaminoacylation ,
(-)
mru 1143 875 6741 asparagine synthase (glutamine-hydrolyzing) AsnB
1511761 1513572 Aspartate/Asparagine
mru_1144 876 ' 6742 hypothetical_protein
1513967 1514233 Conserved o
iv
mru 1145 877 6743 toxic_anion resistance_protein
1514676 1515812 General ...3
.-.1
mru_1146 878 6744 hypothetical_protein"
1515927 1516379 Hypothetical iv
mru 1147 879 6745 hypothetical_protein
1516458 1516967 Hypothetical iv _ .1,.
mru_1148 880 6746 0-acetylhomoserine/0-acetylserine
sulfhydrylase_MetZ/CysK1 = 1517476 1518762 Methionine
mru_1149 881 6747 hypothetical_protein
1518950 1519597 Conserved IQ 0
0 1-`
\ mru_1150 882 6748 hypothetical_protein .
1519673 1521907 , Hypothetical ' iv
i
mru 1151 883 6749 hypothetical protein
1522074 1522820 Hypothetical o
iv
,
= '
- mru_1152 884 6750 hypothetical_protein
1522955 1523749 Hypothetical iv
' ' mru_1153 885 6751 hypothetical_protein
1523792 1524238 Hypothetical .1,.
mru_1154 886 6752 hypothetical_protein
1524255 1524614 Conserved .
mru_1155, 887 6753 hypothetical_protein
1524889 1525320 Hypothetical
mru_1156 888 6754 hypothetical_protein
1526085 1526486 Hypothetical _
mru 1157 889 6755 helicase_RecDTTraA family
1526581 1528416 Helicase
mru_1158 890 6756 RecF/RecN/SMC_N_terminal_domain-containing_protein
1528526 1530226 Genomesegregation
mru_1159 891 6757 hypothetical_protein - '
1530536 1531261 Conserved
oo
mru_1160 892 6768 transcriptional_regulator
1531264 1531611 Transcriptionalregulators n
,
mru_1161 893 6759 hypothetical_protein
1531882 1533003 Hypothetical 1-3
. .
.
mru_1162 894 6760 transposase
1533232 1534728 Transposase
N
mru 1163 895 6761 hypothetical_protein
1534996 1536138 Hypothetical _ o
mru_1164 896 6762 hypothetical_protein
1536141 1537511 Hypothetical 1--,
o
mru_1165 897 6763 restrictionLenzyme_methylase subunit
1537838 1540549 Restrictionandmodification
o
mru_1166 898 6764 type Il_restriction_endonuclease .
. 1540636 1541553 Restrictionandmodification =
1..,
=
mru_1167 899 6765 DNAlmodification_methylase 1541550 1543022
Restrictionandmodification cr,
o
=
. , ,
' .
=
= .
,
=
Table 11 .
mru_1168 900 6766 serine/threonine_protein_kinase_with_TPR_repeats 1543163
1545490 Proteininteractions 0
(,..
mru_1169 901 6767 hypothetical_protein 1545504
1546880 Conserved = =
I--
mru_1170 902 6768 ATPase . 1547016
1549874 General . 1--
=---.
o
mru_1171 903 6769 hypothetical_protein 1550026
1550166 Hypothetical W
CA
mru_1172 904 6770 hypothetical_protein 1550176
1550529 Conserved (.4
_
mru_1173 905 6771 hypothetical_protein 1550526
1552415 Conserved
mru_1174 906 6772 CRISP R-associated_proteinSas1-2 =
1554137 1555141 CRISPR-associatedgenes
.
mru_1175 907. 6773 hypothetical_protein 1555285
1555410 Hypothetical .
mru_1176 908 6774 CRISPR-associated_protein_Cas2-2 1555446
1555724 CRISPR-associatedgenes .
mru_1177 909 6775 hypothetical protein = 1556836
1557600 Conserved
mru_1178 910 6776 CRISPR-associated_protein_TIGR02710 _family 1557635
1559002 CRISPR-associatedgenes
. mru_1179 911 6777
CRISPR-associated_RAMP_protein_Csm5_family 1559095 1560297
CRISPR-associatedgenes .
mru_1180 912 6778 CRISPR-associated_RAMP_protein_Csm4 family 1560294
1561289 CRISPR-associatedgenes
mru_1181 913 6779 CRISPR-associated_RAMP_protein Csm3 family
'1561380 1562147 CRISPR-associatedgenes
(-)
mru_1182 914 6780 CRISPR-associated_protein_Csm2 amily
.1562161 1562592 CRISPR-associatedgenes 0
iv
mru_1183 915 6781 CRISPIR-associated_protein_Csm1_family 1562597
1565254 CRISPR-associatedgenes ...r
.-.1
mru_1184 916 6782 ATP-dependent_DNAJielicase_UvrD/REP family ,
1565554 1570251 Helicase iv
mru_1185 917 6783 hypothetical_protein 1570556
1571752 Conserved iv
mru_1186 918 6784 TPR repeat-containing_protein 1575395
1577089 Proteininteractions ry iv
mru_1188 919 6785 CRIgPR-associated_protein_TIGR02710 _family 1577751
1579040 CRISPR-associatedgenes Iv 0
.
..4. I-.
mru_1189 920 6786 hypothetical_protein 1579415
1579753 Hypothetical iv
1
mru_1190 921 6787 hypothetical_protein 1579830
1580099 Hypothetical o
iv
mru_1191 922 6788 MFS_transporter ' 1580436
1581842 Other 1
iv
mru_1193 923 6789 hypothetical_protein 1584639
1585388 Conserved
mru_1197 924 6790 hypothetical_protein 1590621
1591109 Hypothetical
mru_1203 925 6791 hypothetical_protein 1597170
1597382 Hypothetical
mru_1204 926 6792 hypothetical_protein 1597394
1597603 Hypothetical '
mru_1205 927 6793 homoserine o-acetyltransferase_MetX2 1598133
1599605 'Methionine
mru_1207 928 6794 hypothetical-protein 1601309
1601509 Conserved
mru_1208 929 6795 inosine-5'-monophosphate_dehydrogenase GuaB 1601814
1603307 Interconversions
0:
mm 1209 930 6796 hypothetical_protein 1603642
1604361 Conserved n
mru_1210 931 6797 adhesin-like_protein 1604798
1612048 Cellsurfaceproteins S1-3
= mru_1211 932 6798 4Fe-
43_binding_dorriain-containing_protein S 1613138 1614367
Methanogenesis
mru_1212 933 6799 CoB--CoM_heterodisulflde_reductase_subunitB_HdrB2 1614367
1615125 Methanogenesis N
mru_1215 934 6800 riboflavin_synthase_RibC 1616727
1617185 Riboflavin 1--,
o
mru 1216 935 _6801 TPR
repeat-containing_protein 1617454 . 1618644 Proteininteractions
mru_1217= 936 6802 ,cobalt_ABC_transporter_ATP-binding_protein_Cbi02
1618891 1619652 Cobalamin So
o
1..i
mru_1218 937 6803 adenosylcobinamide_amidohydrolase_CbiZ 1619929
1621032 Cobalamin cr,
. .
= ,
. _ .
,
,
=
Table 11
.
0
mru 1220 938 6804 cobalt_ABC_transporter_ATP-binding_protein CbiO3
1622112 1624049 Cobalamin (,..
.
mru_1221 939 6805 cobalt_ABC_transporter_permease_protein_C-biQ3
1624040 1625044 Cobalamin o
I--
mru_1223 940 6806 excinuclease_ABC_C_subunit_UvrC
1630257 1632104 Recombinationandrepair = 1--
=--.
o
-
mru 1224 941 6807 dephospho-CoA_kinase_CoaE
1632363 1632905 PantothenateandcoenzymeA (..4
(A
mru 1225 942 6808 pantothenate synthase PanC
1633098 1633844 PantothenateandcoenzymeA (.4
o
-
mru. 1226 943 6809 thymidylate kinase_Tml-a
1634038 1634616 Pyrimidineinterconversion
mru_1227 944 6810 hypothetical' protein
1634862 1635389 Conserved =
mru_1229 945 6811 hypothetical_protein =
1636733 1637299 Conserved
mru_1230 946 6812 hypothetical_protein
1637402 1638001 Hypothetical
.
mru_1231 947 6813 ATP-binding_protein
1638146 1638829 General -
mru_1233 948 6814 . hypothetical_protein
1640155 1640736 Conserved
mru 1234 949 6815- type_IV_Ieader_peptidase_family_protein
1640831 1641739 Proteinsecretion
mru 1235 950 6816 hypothetical_protein .
1641933 1642148 Hypothetical _ a
mru 1236 951 6817 hypothetical_protein
1642496 1643095 Hypothetical
nyu_1237 952 6818 CTP_synthase_PyrG
1643532 1645208 Pyrimidineinterconversion 0
n)
mru_1238 953 6819 hypothetical_protein
1645484' 1645864 Hypothetical ...3
.-.1
mru_1239 954 6820 hypothetical_protein
1646049. 1646501 Hypothetical I.)
mru_1240 , 955 6821 hypothetical_protein
1646555 1647298 Hypothetical m
mru_1241 956 6822 hypothetical_protein =
1647403 1652091 Hypothetical
iv iv
'
mru_1242 957' 6823 hypothetical_protein
1652461 1652760 Conserved ro 0
mru 1243 958 6824 hypothetical_protein .
1652870 1653643 Conserved 1
mru_1244 959 = 6825 shikimate_5-dehydrogenase_AroE
1653767 1654663 Chorismate o
.
n)
1
mru_1245 960 6826 hypothetical_protein
1654677 1655216 Conserved i\)
mru_1246 961 6827 _ adhesin-like_protein
1655539. 1660158 Cellsurfaceproteins
mru_1247 962 6828 adhesin-like_protein
1660506 1665566 Cellsurfaceproteins
mru_1248 963 6829 histidyl-tRNA synthetase HisS "
1665891 1667192 tRNAaminoacylation .
mru_1249 964 6830 phosphoribosyl-AMP cycTohydrolase Hisl
1667293 1667706 Histidine =
mru_1250 965 6831 PIN_domain-containing_protein
1667850 1670012 General
mru_1251 966 6832 xylose_isomerase-like_TIM_barrel_domain-containing_protein
1670199. 1670966 General
mru_1252 967 6833 xylose_isomerase-like_TIM_barrel domain-
containing_protein 1671030 1671803 General
1-:
mru_1253 968 6834 coenzyme_F390_synthetase FtsA-1
1672121 1673422 CoenzymeF420 n
mru_1254 969 6835 ACT_domain-containing_prot-ein ,
1673529 1673963 General 1-3
mru_1255 970 6836 malate_dehydrogenase_Mdh
1674165 1675115 TCA
N mru_1256 971 6837'
excinuclease_ABC_A_subunit_UvrA2 , 1675440 1677944
Recombinationandrepair cc
mru_1257 = 972 6838 ferritin-like_domain-containing_protein
1678196 1678693 Oxidativestressresponse 1--,
o
mru_1258 973 6839' rubredoxin_Rub2, , ,
1678761 1678916 Oxidativestressresponse
o
mru_1259 974 6840 rubredoxin_Rub3
1679230 = 1679388 Oxidativestressresponse =
1..,
mru_1260 975 6841 NADPH-dependent_FMN_reductase
1679502 '1680119 Electrontransport o=
o
'
,
,
.
.
=
Table 11
.
0
mru 1261 976 6842 - universal_stress_protein UspA1
1680137 1680547 Stressresponse (.4
I-- mru_1262 977 6843 methyl-
coenzyme_M recTuctase component_A2_AtwA1 1680711 1682432 Methanogenesis
o .
mru_1264 978 6844 glycosyl_transferase_-GT2 Jam iry
1685922 1686977 Exopolysaccha rides 1--,
--.
o mru_1265 979 6845
hypothetical protein 1687133 1687774 Conserved (..4
(A
mru_1266 980 6846 FO_synthase_subunit 2_Cofl-I
1687848 1688966 CoenzymeF420 (.4
mru 1267 981 6847 n icotinam ide-n ucleotide_adenylyltransferase
1689342 1689878 Nicotinate .
mru_1268 982 6848 molybdenum cofactor biosynthesis_protein MoaE
1690087 1690524 Metal-bindingpterin
mru 1269 983 6849 molybdenum-pterin_bindin9_protein_Mop1
1690881 1691087 Metal-bindingpterin
mru_1270 984 6850 molybdenum-pterin_binding_protein_Mop2
1691186 1691392 Metal-bindingpterin
=
mru_1271 985 6851 molybdenum-pterin_binding_protein_Mop3
1691502 1691708 Metal-bindingpterin
mru_1272 986 6852 molybdenum-pterin_bindin_g_protein_Mop4
1691770 1691976 Metal-bindingpterin
, mru_1273 987 6853
molybdenum-pterin_binding_protein_Mop5 1692141 , 1692347 Metal-
bindingpterin
mru_1274 988 6854 HD domain-containing protein
1692917 1694140 General (-)
mru_1275 989 6855 3-polypreny1-4-hydroxybenzoate_decarboxylase_UbiX
' 1694220 1694774 Ubiquinone
- mru_1276 990 _ 6856
precorrin-6Y_C5,15-_methyltransferase _(decarboxylating)_CbiT 1694876
1695439 Cobalam in o
n)
mru_1278 991 6857 ribonuclease III Rnc
1696469 1697167 Other
.-.1
mru_1279 992 6858 hypothetical_protein
= 1697964 1698422 Conserved "
IV
mru 1280 993 6859 hypothetical_protein
1698737 1699249 Conserved m
_
=
mru 1281 = 994 6860 ribosomal_protein L37e_Rp137e
1699643 1699828 Ribosomalproteins
,
mru_1282 995 6861 LSM_domain-cont-aining_protein
1700058 1700291 Other tv 0
ca
IV
mru_1283 996 6862 creatinine_amidohydrolase
1700819 1701523 Creatininemetabolism 1
mru 1284 997 6863 RNA-binding_protein
1701673 1702218 Other o
n)
1
mru_1286 998 6864 CM P/dCMP deaminase
1704241 1704690 Pyrimidineinterconversion i\)
mru_1287 999 6865 hypothetical- protein
1704848 1707271 Conserved
mru 1288 1000 6866 serine/threonine_protein_phosphatase
1707589 1708383 Proteininteractions ,
mru_1290 1001 6867 hypothetical_protein
1709396 = 1709650 Conserved
mru_1291 1002 6868 hypothetical protein
1709697 1710140 Hypothetical
mru 1292 1003 6869 hypothetical_protein
1710401 1711234 Hypothetical
mru_1293 1004 6870 glucosamine-fructose-6-phosphate_aminotransferase_GlmS1
1711594 1713333 Pseudomureinbiosynthesis
mru_l 294 1005 6871
hypothetical_protein 1713606 1716710 Hypothetical
mru_1295 1006 6872 phosphotyrosine_protein_phosphatase
1716892 1717353 Proteininteractions = n
mru 1296 1007 6873 hypothetical_protein
1717629 , 1718009 Conserved 1-3
mru_1297 1008 6874 hypothetical_protein
1718189 = 1720201 Conserved
N
mru_1298 1009 6875 archaeal_ATPase
1720875 1722059 _ General o
1--,
mru_1300 1010 6876 translation elongation_factor_aEF-2
1726905 1729100 Translationfactors o
,
mru_1301 1011 6877 hypothetic-al_protein
1729816 1731801 Hypothetical
o
mru_1302 1012 6878 hypothetical_protein
1731989 1732360 Hypothetical =
1..,
mru 1303 1013 6879 hypothetical_protein ,
1732484 1732960 Hypothetical o
,
= ,
'
,
õ
. ,
'
-
=
Table 11
._.
0
mru 1304 1014 6880 WD40_repeat-containing_prote in ,
1733138 1737901 General (,..
mru 1305 1015 6881 DnaK-related_protein 1738192
1740477 Proteinfolding o
I--
,
mru 1306 1016 6882
serine/threonine_protein_kinase 1740651 1742309
Proteininteractions' 1--,
.--.
o
mru_1307 1017 6883 _
hypothetical protein 1742871 1744016 Hypothetical
N.
(A
-
mru_1308 1018 6884 FHA_domain-containing_protein =
1744072' 1744875 General (.4
o
mru 1309 1019 6885
hypothetical_protein :1745096: 1745866 Hypothetical
mru_1310 1020 6886 3-hexulose-6-phosphate isomerase Ph i2 1746035
1746751 ribulosemonophosphatepathway
mru 1311 1021 6887
hypothetical_protein 1.746832 -1747944 Conserved
mru_1315 1022 6888 adhesin-like_protein 1750655
1752415 Cellsurfaceproteins.
,
mru 1316 1023 6889 TPR
repeat-containing_protein 1754064 1755008 Proteininteractions
.
mru 1317 1024 6890
hypothetical_protein . 1755212 1755814 Conserved .
mru_1319 1025 6891 DEAD/DEAH_box_helicase_domain-containing protein
1756715 1760227 Helicase
mru_1323 1026 6892 hypothetical_protein .
= 1763321 1763707 Conserved
a
mru_1324 1027 6893 nitrogen regulatory_protein_P-II_GInK 1764021
1764359 Regulation
mru_1327 1028 6894
transcriptional _regulator 1767318 1767887
Transcriptionalregulators 0
N)
mru 1328 1029 6895 acyl-CoA_syni-hetase 1768156
1769820 Enzyme .-.3
.-.1
mru_1329 1030 6896 carbohydrate_kinase_PfkB_family 1769954
1770832 Enzyme N)
N)
mru_1333 , 1031 6897
heavy_metal-translocating_P-type_ATPase 1773858 1776674 Cations
N)
mru_1334 1032 6898 transcriptional_regulator_ArsR_family 1776800
1777183 Transcriptionalregulators
mru_1335 '1033 6899 hypothetical_protein 1777985
1778623 Hypothetical ry 0
mru_1336 1034 6900 hypothetical_protein 1779497
1779679 Hypothetical N)
1
mru_1337 1035 6901 hisA/hisF_family_protein_HisAF 1779913 -
1780608 Histidine o
N)
'
mru_1338 1036 6902 iron
dependent_repressor 1780734 1781357 Transcriptionalregulators N3
mru_1339 1037 6903 hypothetical_protein 1781499 1781717 Conserved
mru_1341 1038 6904 geranylgeranyl_reductase_family_protein 1784874
1786070 Biosynthesis =
mru_1342 1039 6905 adhesin-like_protein 1786465
'1787373 Cellsurfaceproteins
mru_1343 1040 6906
hypothetical_protein 1787397 , 1787837 Hypothetical
mru_1344 1041 6907 hypothetical_protein 1788130
1789536 Hypothetical
mru_1345 1042 6908 4Fe-4S bind ing_domain-conta in ing_protein
1789688 1790446 Electrontransport
mru_1346 1043 6909 hypothetical_protein 1790504
1790800 Conserved
mru_1347 1044 6910 prefoldin_beta_subunit_PfdB 1791063
1791410 Proteinfolding n
mru_1348 1045 6911 hypothetical_protein 1791635
1791769 Conserved 1-3
mru_1349 1046 6912 N
ribosomal biogenesis_protein 1791976 1792467 Other
mru_1350 1047 6913 DNA-direaed RNA_polymerase subunit_P_RpoP 1792796
1792927 RNApolymerase o
mru_1351 1048 6914 ' ribosomal prc:Ttein_L37Ae_Rp137-ae 1792928
1793197 Ribosomalproteins 1--,
o
mru_1352 1049 6915 exosome_complex_RNA-binding_protein_Rrp42 1793734
1794525 RNAprocessing
o
mru 1353 1050 6916 exosome_complex_exonuclease_Rrp41 1794537
1795247 RNAprocessing =
1..,
.
mru_1354 1051 6917 exosome_complex_RNA-binding_protein_Rrp4 1795431
_ 1796906 RNAprocessing = c7,
o
,
. .
, .
'
,
=
Table 11
0
mru_1355 1052 6918 exosome_subunit
1797108 1797812 RNAprocessing (,..
.
'
mru 1356 1053 6919 _ hypothetical_protein
1798164 1798307 Hypothetical =
_
1--
mru_1357 1054 , 6920 proteasome_alpha_subunit
1798384 1799268 Proteinfolding 1--,
=---.
o
mru_1358 1055 6921 adhesin-like_protein
1800158 1802401 Cellsurfaceproteins (..3
(A
mru_1359 1056 6922 ribonuclease_P_subunit_P14
1802611 1802988 RNAprocessing (.4
o
mru 1360 1057 6923 ribonuclease P_subunit_P30
1802985 1803956 RNAprocessing .
. mru_1361 1058 6924 , exosome sula-unit
1804174 1804614 RNAprocessing
mru_1362 1059 6925 ribosomar_protein_L15e_Rp115e
1804758 1805309 Ribosomalproteins
mru_1363 1060 6926 hypothetical_protein
1806398 1806817 Hypothetical -
mru_l 364 1061 6927 hypothetical_protein
1806905 1807825 Conserved
mru_1365 1062 6928 hypothetical_protein
1807876 1808409 Conserved
mru 1366 1063 6929 hypothetical_protein
1808689 1809237 Conserved
mru_1367 1064 6930 , rubrerythrin Rbr2
1809442 1809987 Oxidativestressresponse
(-)
mru_1368 1065 6931 bile salt_hy-drolase
1810331 1811308 Stressresponse
mru 1369 1066 6932 NAEPH-dependent_FMN_reductase =
1811462 1812007 Electrontransport .0
n)
mru_1371 1067 6933 hypothetical_protein
1813383 1814879 Conserved
.-.1
rhru_1372 =- 1068 6934 uracil_phosphoribosyltransferase_Upp
1815497 1816123 Salvage . n)
IV
mru_1373 1069 6935 xanthine/uracil_permease
1816712 1817899 Transport n)
' mru_1374 1070 6936 acetyltransferase_GNAT_family
1818405 18.18884 Enzyme
mru_1377 1071 6937 glutathione-disulfide_reductase_Gor2
1822254 1823708 Glutathionemetabolism
c.n
1-=
mru_1379 1072 6938 hypothetical_protein
1825152 1825691 Conserved l\)
1
.
mru_1380 1073 6939 hypothetical_protein
1825862 1826353 = Conserved o
' mru_1381 1074 6940
carboxymudonolactone decarboxylase_family_protein_PcaC1 1826581 1826901
Aromaticcompounds i\)
mru_1382 1075 6941 carboxymuconolactone_decarboxylase_family_protein_PcaC2
1826988 1827323 Aromaticcompounds
mru_1384 1076 6942 hypothetical_protein '
1828559 1828972 Conserved .
mru 1389 1077 6943 hypothetical_protein
1836886 1837659 Conserved
mru_1390 1078 6944 CBS_donnain-containing_protein _ ,
1837875 1838510 General
mru_1391 1079 6945 , polyferredoxin =
. 1838722 1839549 Electrontranspoh
mru 1392 1080 6946 carbohydrate kinase PfkB_family
1839856 1840884 Enzyme
= mru_1394 1081
6947 energy-conveT-ting_hy-drogenase_A_subunit R_EhaR 1841932
1843038 Methanogenesis
mru_1395 1082 6948 hypothetical_protein
1843042 1843155 Hypothetical n =
- mru_1396 1083 - 6949 energy-converting_hydrogenase_A_subunit_Q EhaQ
1843296 1844747 Methanogenesis 1-3
N
mru_1397 1084 6950 energy-converting_hydrogenase_A_subunit_PlhaP
1844749 1845783 Methanogenesis
nnru_1398 1085 6951 energy-converting_hydrogenase_A subunit_O-Eha0
1845853 1846980 Methanogenesis o
mru_1399 1086 6952. energy-converting_hydrogenase_A_subunit_N EhaN
1847113 1847574 Methanogenesis 1--,
o
mru_1400 1087 6953 energy-converting_hydrogenase_A subunit_M-EhaM
1847711 1848094 - Methanogenesis
o
. mru_1402 1088 6954 energy-converting_hydrogenase_A_subunit_IEhaK
1848698 1848853 Methanogenesis =
1--,
'
mru_1412 1089 6955 energy-converting_hydrogenase_A_subunit_A_EhaA
, 1853837 1854163 Methanogenesis c7,
o
,
.
,
,-.
=
,
.
.
.
Table 11 . .
mru_1414 _ 1090 6956
citramalate synthase CimA 1855686 1857179
Valine/Leucine/lsoleucine 0
(,..
mru_1415 1091 6957 hypothetical_protein 1857287
1857811 Conserved =
I--
. mru 1418 1092 6958
hypothetical_protein 1862073 1863086 Conserved 1--
--
o
mru_1419 _ 1093 6959
hypothetical_protein 1863216 -1864409 Conserved
(..3
(A
mru_1420 1094 6960 GMP_synthase_subunit_B_GuaB . 1864637
1865563 Interconversions ' = (.4
mru_1421 1095 6961 hypothetical_protein 1865706
1865945 Conserved
mru 1422 1096 6962 GMP synthase subunit_A_GuaA
' = 1866077 1866643 Interconversions
mru 1425 _ 1097 6963 thiore-
doxin-distfide_reductase_Trx13 1868977 1869924 ,
Pyrimidineinterconversion
mru 1426 1098 6964 glutamyl-
tRNA(G1n)_amidotransferase_subunit_E GatE 1870075 - 1871922
tRNAaminoacylation
mru_1427 1099 6965 glutamyl-
tRNA(GIn)_amidotransferase_subunit DIGatD 1872043 1873350
tRNAaminoacylation ,
mru_1428 1100 6966 hypothetical_protein 1874104
1874943 Hypothetical
_
mru_1429 1101 _ 6967 DNA_mismatch_endonuclease Vsr
= 1875627 1876055 Recombinationandrepair
mru_1430 _ 1102 6968
NAD+_synthetase_NadE 1876124 1876738 Nicotinate
a
mru_1431 1103 6969
ketoisovalerate_ferredoxin oxidoreductase_beta_subunit_VorB 1877175
1878668 General =
mru_1432 1104 6970
ketoisovalerate_ferredoxin_oxidoreductase_alpha_subunit VorA 1878681
1879847 General o
iv
mru 1433 1105 _ 6971
ketoisovalerate_ferredoxin_oxidoreductase_gamma subunit VorC 1879847
1880101 General
mru 1434 1106 6972 acetyl-
CoA_synthetase_AcsA . 1880232 1881896 Acetate ,
= .-.1
NJ
IV
mru_1435 1107 _ 6973
transcriptional_regulator 1882043 1882603
Transcriptionalregulators iv
.1,.
mru_1436 - 1108 6974
hypothetical_protein 1883172 1883540 Conserved ry
iv
mru_1438 1109 6975 ABC_transporter_ATP-binding_protein 1885143
1885859 Other m o
mru_1440 1110 6976 RNA_methylase 1887731
1888765 Other iv
1
mru_1441 _ 1111 6977
geranylgeranyl_reductase family_protein 1888890 1890188 Biosynthesis
o
iv
mru_1443 1112 6978 _
ABM_family_protein = 1892581 1892877 General
1
iv
mru_1444 1113 6979 NADPH-
dependent_F420 reductase_NpdG2 1893174 , 1893857 Ethanol
mru_1446 1114 6980 transcriptional_regulator_ArsR_family 1896050
1896874 Transcriptionalregulators
mru_1447 1115 6981
transcriptional regulator 1896864 1897724
Transcriptionalregulators '
mru_1448 1116 6982 proteasome-ac7tivating_nucleotidase 1898298
1899572 Proteinfolding
mru_1449 1117 6983 transcriptional_regulator 1900127
1900630 Transcriptionalregulators
mru_1450 _ 1118 6984
hypothetical protein 1900894 1901316 Conserved
mru 1453 1119 6985 radical
SAM_domain-containing_protein 1904059 1905183 Enzyme
mru_1454 1120 6986 hypothetical_protein 1905422
1905640 Conserved 0:
n
mru_1455 1121 6987 ADP-ribosylglycohydrolase_family_protein 1905835
1906881 Other 1-3
mru_1456 1122 6988
polysaccharide/polyol phosphate_ABC_transporter_permease_protein 1907221
1908000 Exopolysaccharides
-mru_1457 1123 6989 _polysaccharide/polyol_phosphate_ABC_transporter_ATP-
binding_protein 1908182 1909411 Exopolysaccharides
... o
mru_1462 1124 _6990 hypothetical_protein , 1913420
1913833 Conserved 1--,
.o
mru 1463 1125 _ 6991
hypothetical_protein 1914037 1914975 Conserved
mru_1464 1126 _6992 hypothetical_protein 1915030
1915488 Conserved o
o
1--i
mru_1465 1127' _6993 adhesin-like_protein 1915628
1918510 Cellsurfaceproteins cr,
-
=
=
. ,
1
,
Table 11
0
mru_1466 1128 6994 4Fe-
4S_iron sulfur_cluster binding_protein_NifH/frxC_family 1918884 1919687
Fixation (,..
mru 1467 1129 6995 peptidase_U32_family
1919849 1921108 Proteindegradation =
I--
mru_1468 1130 6996 annidophosphoribosyltransferase_PurF
1921354 1922796 PurineBiosynthesis 1--,
--.
o
mru_1469 1131 6997 hypothetical_protein
1923129 1924541 Conserved =(,.3
(A
, mru_1471 1132 6998
geranylgeranyl_reductase_family_protein 1926113 1927303 Biosynthesis
(.4
o
mru_1472 1133 6999 ferredoxin
1927324 1927488 Electrontransport
mru 1473 1134 7000 acetyltransferase
1927891 1928496 Enzyme
mru 1474 1135 7001 hypothetical_protein
1928711 1929022 Conserved
mru 1475 1136 7002 hypothetical_protein '
1929094 1930254 Conserved -- =
_..
mru_1476 1137 7003 argininosuccinate_lyase_ArgH
1930408 1931826 Arginine
mru_1477 1138 = 7004 ribosomal protein_S27ae Rps27ae
1932294 1932452 Ribosomalproteins
mru_1478 1139 7005 ribosomal_protein_S24e_lips24e
1932476 1932790 Ribosomalproteins
mru_1479 1140 7006 hypothetical_protein =
1932935 1933435 Conserved
mru_1481 1141 7007 DNA-dependent_RNA_polymerase_subunit E" RpoE2
1935169 1935351 RNApolymerase a
mru 1482 1142 _ 7008 DNA-dependent_RNA_polymerase_subunit E'_RpoE1
1935351 1935920 RNApolymerase (D
,
mru 1483 1143 7009 hypothetical_protein ,
1936190 1936591 Conserved n)
-.3
,
.-.1
mru_1484 1144 7010
translation jnitiation_factor_aIF-2_gamma_subunit 1936680 1937903
Translationfactors n)
IV
mru_1485 1145 7011 ribosomal_protein_S6e Rps6e
1938443 1938832 Ribosomalproteins
mru 1486 1146 7012
translation initiation_fa-c-tor_IF-2 1939021 1940811
Translationfactors IV iv
' mru_1487 1147 7013,
nucleoside_diphosphate kinase_Ndk 1940998 1941450 Interconversion Iv
o
mru 1488 1148 7014 ribosomal_protein_L24e:Rp124e
1941447 1941608 Ribosomalproteins IV
,
I
mru 1489 1149 7015 ribosomal_protein_S28e_Rps28e
1941649 1941855 Ribosomalproteins o
n)
mru_1490 1150 7016 ribosomal protein_L7Ae_Rpl7ae
1941894 1942301 Ribosomalproteins 1
i\)
mru_1491 1151 7017 archaeal_histone
1943196 1943393 DNA-bindingproteins
mru_1492 1152 7018 threonine_synthase ThrC
1944140 1945336 threonine .
mru_1494 1153 7019 tryptophanyl-tRNA synthetase TrpS
1946479 1947570 tRNAaminoacylation
mru_1495 1154 7020 tRNA_intron endonuclease_EndA
1947807 1948322 RNAprocessing
_
mru_1496 1155 7021 iron depend-ent_repressor
1948743 1949396 Transcriptionalregulators
.
mru_1498 1156 7022 hypothetical_protein .
1951103 1951450 Hypothetical
mru_1499 1157 7023 adhesin-like_protein_with_transglutaminase_domain
1951843 1954875 Cellsurfaceproteins
1-:
mru_1500 1158 7024 adhesin-like_protein
1956027 1959923 Cellsurfaceproteins n
mru_1501 1159 7025 thermosome_subunit
1960369 1962018 Proteinfolding 1-3
mru 1502 1160 7026 methyltransferase
1962492 1963271 Enzyme
N
mru_1504 1161 7027 peptidase_M48_family
1964160 1965134 Proteindegradation o
mru 1505 1162 7028 _
hypothetical_protein 1965435 1965635 Hypothetical
1--,
o
mru_1507 1163 7029 F420H2_oxidase FprA1
1966891 1968111 "Oxidativestressresponse
o
nnru_1508 1164 7030 hydrolase alpharb-eta fold_family
1968358 1969200 Enzyme =
1..,
mru 1509 1165 - 7031 pyrroline-E-carboxylate_reductase_ProC
1969629 1970543 Proline c7,
o
,
.
= .
'
,
.
,
. ,
Table 11
,
0
mru_1511 1166 7032 nascent_polypeptide-associated complex_protein
1973411 1973749 Proteinfolding . (,..
o
mru 1512 1167 7033 archaeosine_tRNA-ribosyltransferase_TgtA2
1973957 1975957 RNAprocessing _ _ 1--
1--
mru1513 1168 7034 adhesin-likeprotein
1976185 1978038 Cellsuriaceproteins - .--.
.
o
,
(..4
mru_1514 1169 7035 , hypothetical_protein
1978189 1979412 Conserved (A
(.4
mru 1515 1170 7036 hypothetical_protein
1979499 1979972 Hypothetical
mru_1516 1171 7037 hypothetical_protein
1980055 1980432 Hypothetical
_
mru 1517 1172 7038 hypothetical_protein =
1980417 1980707 Hypothetical
mru_1518 1173 7039 hypothetical_protein
1980723 _ 1981244 Hypothetical
mru_1519 1174 7040 pyruvate-formate_lyase_Pfl
1981778 1983811 Formate
.
mru_1520 1175 7041 _ SAM-dependent_methyltransferase
1984363 1984998 Enzyme
mru_1521 1176 7042 hypothetical_protein
1986000 1985326 Hypothetical .
mru_1529 1177 7043 UDP-galactopyranose_mutase_Glf
1992523 1993620 , Exopolysaccha rides
mru_1530 1178 7044 hypothetical_protein
1994272 . 1994454 Hypothetical a
, mru_1531 1179 7045 hypothetical_protein .
1994451 1994606 Hypothetical =
mru_1532 1180 7046 hypothetical_protein
1994682 1995263 Hypothetical o
iv . -.3
mru 1533 1181 7047 hypothetical_protein
1995274 1995957 Hypothetical .-.1
NJ
mru_1536 1182 7048 glucosamine-fructose-6-phosphate_aminotransferase_GlmS2
1997308 1999092 Pseudomureinbiosynthesis iv
iv
mru_1537 1183 7049 hypothetical_protein
1999634 2000341 Hypothetical . = .1,.
mru_1538 1184 7050 phosphoribosylaminoimidazole-
succinocarboxamide_synthase_PurC 2000912
2001661 PurineBiosynthesis iv
IV
0
mru_1539 1185 7051 phosphoribosylformylglycinamidine_(FGAM)_synthase_PurS
2001821 2002093 PurineBiosynthesis Iv H
mru_1540 1186 7052 phosphoribosylformylglycinamidine_(FGAM)_synthase_PurQ
2002134 2002787 PurineBiosynthesis 1
o
mru_1541 1187 7053 uroporphyrin-III_C-methyltransferase_CobA =
2002940 2003653 Cobalamin iv
1
' mru_1543 1188 7054 magnesium- '
'2005241 2006752 Cobalamin iv
.1,
protoporphyrin_IX_monomethyl_ester anaerobic_oxidative_cyclase_BchE
mru_1544 1189 7055 uroporphyrinogen-11 l_synthase HemD _
2007335 2008111 Cobalamin
mru 1545 1190 = 7056 glycosyl_transferase_GT2_famly
2008514 2009419 Exo polysaccha rid es
mru_1546 1191 7057 signal recognition_particle SRP19_protein
2009602 2009925 Proteinsecretibn
mru_1547 1192 7058 ssDNK exonuclease_Recr =
2010007 2011449 Recombinationandrepair
mru 1548 1193 7059 transcriptional_regulator MarR_family
2011906 2012733 Transcriptionalregulators
mru_1549 1194 7060 CRISPR-associated_protein_Cas4-2
2012915 2013610 CRISPR-associatedgenes
mru 1550 1195 . 7061 hypothetical_protein
2013887 2014327 Conserved = n
1-
mru_1551 1196 7062 hydrogenaSe_maturation_factor_HypE1
2014434 2015444 Hydrogenmetabolism ----
mru_1552 1197 7063 ' N
ribosomal_protein S8e_Rps8e 2015918 2016295 Ribosomalproteins
mru 1553 1198 7064 DNA polymerase¨family B_PolB2 -
2016942 2017622 Chromosomereplication o
mru_1554 1199 7065 hydrolase_HAD_superfamily
2017767 2018447 Enzyme
o
mru_1556 1200 7066 transcriptional_regulator_MarR_family
2020092 2020364 Transcriptionalregulators
o
mru_1557 1201 7067 exodeoxyribonuclease III_Xth2
2020494 2021270 Recombinationandrepair 2
mru_1558 1202 7068 phenylalanyl-tRNA_syr7thetase_alpha_subunit_PheS
2021453 2023000 tRNAaminoacylation o
,
. .
. =
= .
=
-
,
,
Table 11
mru 1559 1203 7069 ATP:dephospho-
CoA_triphosphoribosyl_transferase_CitG 2023325
2024224 Citratefermentation 0
(,..
mru 1560 1204 7070 delta-aminolevulinic
_aciddehydratase_HemB 2024292 2025287 Cobalamin
.
1--
mru_1561 1205 7071 chorismate synthase AroC
2025644 2026744 Chorismate 1--,
--.
o
mru_1562 1206 7072 metallo-bet-a.-lactamase_superfamily_protein
2026917 2027858 Enzyme (..4
(A ,
mru_1563 1207 7073 hypothetical_protein '
2027909 2028835 Conserved
mru 1564 1208 7074 desulfoferrodoxin Db:
2029519 2029899 Oxidativestressresponse
mru_1566 1209 7075 archaea-specific -RecJ-like_exonuclease
2030975 2033317 Recombinationandrepair
mru_1567 1210 7076 NifU-like_FeS_cluster_assembly_scaffold_protein
2034025 ' 2034399 Fixation
-
nnru_1568 1211 7077 cysteine_desulfurase NifS
2034467 2035675 Thiamine
mru 1569 1212 7078 0-acetylhomoserine/0-acetylserine
sulfhydrylase_MetZ/CysK2 2036147 2037466 Methionine
mru_1570 1213 7079 ADP-dependent_acetyl-CoA_synth;tase_Acs
2044337 2046436 Acetate
mru_1571 1214 7080 cysteinyl-tRNA synthetase_CysS
2046915 2048357 tRNAaminoacylation ,
mru_1572 1215 7081 hypothetical_protein
2048701 2049081 Conserved
mru_1573 1216 7082 serine_O-acetyltransferase CysE
2049159 2049899 Cysteine a
mru_1574 1217 7083 cysteine_synthase_CysKM7
2050344 2051300 Cysteine 0
mru_1575 1218 7084 6-0-methylguanine DNA_methyltransferase_Ogt
2051911 2052516 Recombinationandrepair iv
...i
mru_1576 1219 7085 endonuclease III Wth
2052663 2053310 Recombinationandrepair .-.1
NJ
mru 1577 1220 7086 3-phosphoshikimite_1-carboxyvinyltransferase_AroA
2053314 2054681 Chorismate iv
iv
mrU_1579 1221 7087 hypothetical_protein
2056876 2057286 Conserved
.
mru' 1580 1222 7088 ADP-ribosylgtycohydrolase_family_protein
2057414 2058199 Other tv 0
mru 1581 1223 7089 signal_peptidase I
2058360 2059088 Proteinsecretion iv
1
mru_1582 1224 7090 hypothetical_protein
2059341 2059562 Hypothetical o
iv
mru_1583 1225 7091 transposase
2059958 2061016 Transposase . 1
iv
mru_1584 .1226 7092 valyl-tRNA_synthetase_ValS
2061165 2063984 tRNAaminoacylation
mru 1586 1227 7093 phenylalanyl-tRNA_synthetase_subunit_beta_PheT '
2064936 2066591 tRNAaminoacylation
mru_1587 1228 7094 hypothetical_protein
2066815 2067513 Hypothetical
mru_1589 1229 7095 hypothetical_protein
2069470 2069898 Conserved
m 111_1590 1230 7096 hypothetical_protein
2070084 2070206 Conserved
.
mru 1591 1231 7097 hypothetical_protein
2070388 2070906 Hypothetical
mru_1592 1232 7098 hypothetical_protein '
2071180 2071323 Hypothetical
nnru_1593 1233 7099 von Willebrand_factor_type_A_dornain-
containing_protein 2071697 2072329 General
n
,
mru 1594 1234 7100 hypothetical_protein
2072732 2074345 Conserved 1-3
mru_1595' 1235 7101 hypothetical_protein
2074651 2074956 Hypothetical
mru_1596 1236 7102 hypothetical protein
2075126 2075365 Hypothetical N
mru_1599 1237 7103 hypothetical_protein
2077257 2077472 Hypothetical
o
mru_1600 1238 7104 transcription initiation_factor TFIIB_Tfb2
2077583 2078614 Translationfactors _ .o
mru_1602 1239 7105 carbonic antydrase_Cab
2079757 2080284 Bicarbonate . o
1..i
mru_1603 1240 7106 chromosome_partitioning_ATPase_ParA
2080775 2081611 Genomesegregation c7,
=
-
.
' .
,
,
Table 11
0
mru 1604 1241 7107
adhesin-like_protein_with_transglutaminase_ddmain 2082149 2085145
Cellsurfaceproteins (,..
o
mru 1605 1242 7108
hypothetical_protein ' 2086283 2088760 Conserved
I--
1--,
mru 1606 1243 7109
hypothetical_protein 2088973 2089461 Hypothetical
o
(..4
mru_1607 1244 , 7110
hypothetical_protein 2089522 2090028 Hypothetical
(A
(.4
mru 1608 1245 , 7111 transposase
2090317 2091813 Transposase
mru_1609 1246 7112 NADPH-dependent_FMN_reductase
2092160 2092771 Electrontransport
mru_1610 1247 7113 rRNA_methylase
2093080 2093874 Other
mru 1611 1248 7114 C
GCAxxG C C famil rotein 2094058 - 2094450 General
= mru_1613 1249 7115
SAM-dependent_methyltransferase 2097303 = 2098124 Enzyme
mru_1614 1250 7116
nickel_ABC_transporter_ATP-binding_protein_NikE1 = _ _ 2098597
2099208 Cations
mru_1615 1251 7117
nickel_ABC transporter_ATP-binding_protein NikD1 2099255 2100319 Cations
_
mru_1617 1252 7118 ,
nickel_ABC_transporter_permease_proteiniaB1 2101181 2102173 Cations
mru_1620 1253 . 7119 methionine_synthase_MetE
2106417 2107373 Methionine a
mru_1621 1254 , 7120 hypothetical
protein ' 2107460 2107768 Conserved
0
mru_1622 1255 7121 hypothetical_protein
2107842 2108303 Conserved n)
...3
mru_1623 1256 7122 ,
phosphodiesterase_MJ0936 _family 2108477 2108986 Enzyme .-.1
NJ
mru_1624 1257 7123
hypothetical protein 2109072 2109188
Hypothetical IV
1\)
mru 1625 1258 7124
hypothetical_protein . 2109212 2109340 Hypothetical
mru_1626 1259 7125 hypothetical_protein
2109562 2110065 Conserved 1.)
m
0
mru 1629 1260 7126
transcriptional regulator MarR _family -- 2114234 -- 2114836 --
Transcriptionalregulators
0
IV
mru_1630 1261 7127 3-oxoacyl-(acyl-carrier-protein)_reductase_FabG2 '
2115012 2115749 Bacterial '
o
mru_1631 1262 7128
hypothetical protein ' 2116194 2117225
Conserved "
mru_1632 1263 7129 hydrogenase_accessory_protein_HypB
2117435 2118196 Hydrogenmetabolism r,
.1,
mru_1633 1264 7130
hydrogenase_nickel_insertion_protein HypA 2118404 2118778
Hydrogenmetabolism
mru_1634 1265 7131 ribose-phosphate_diphosphokinase_P¨rs ' 2119066
2119986 PRPPsynthesis
mru_1635 1266 7132 hypothetical_protein
2120475 2121578 Conserved
mru_1636 . 1267 7133
hypothetical_protein 2121971 2122450 Conserved
mru_1638 1268 7134 cobyric
acid_synthase CbiP . 2125518 , 2127158 Cobalamin
mru. 1639 1269 7135
acetyl-CoA_acetyltransTerase 2127364 2128524 Mevalonatepathway
mru_1640 1270 7136 hydroxymethylglutaryl-CoA_synthase
2128675 2129715 Mevalonatepathway
mru 1643 1271 7137 SAM-
dependent_methyltransferase 2131440 2132108 Enzyme = n
mru_1644 1272 7138
hypothetical_protein 2132385 ' 2132852 Hypothetical
mru 1645 1273 7139
thermosome_subunit ' 2133400 2135010
Proteinfolding N
(,)
mru_1646 1274 7140 hypothetical_protein
2135432 2136448 Conserved
1--,
mru_1647 1275 7141 CRISPR-associated_protein_Cas1-3
2136853 2137128 CRISPR-associatedgenes =
mru_1648 1276 7142 CRISPR-associated_protein_Cas1-4
2137163 2137858 CRISPR-associatedgenes
o
. mru_1649 1277' 7143
CRISPR-associated_protein_Cas2-3 2137871 2138146 CRISPR-
associatedgenes = 1.-,
c7,
mru_1652 1278 7144
hypothetical protein = 2150195 2150359 Conserved
= =
,
.
'
' .
, .
Table 11
0
mru 1654 1279 7145 " hypothetical_protein
2152173 2152982 Conserved (,..
, mru_1655 1280 7146 hypothetical_protein
2153442 2155502 Conserved
mru 1656 1656 1281 7147
hypothetical_protein 2155757 , 2155912 Hypothetical
0-
=---.
o
= mru_1660 1282 7148
hypothetical_protein 2160013 2160654 Conserved (..4
(A
mru_1661 1283 7149 adhesin-like protein
2161025 2162095 Cellsurfaceproteins
_
(.4
mru_1662 1284 7150 transposase
2162472 2163437 Transposase
mru_1663 1285 7151 hypothetical_protein
2163632 2163934 Hypothetical
mru_1664 1286 7152 hypothetical_protein
2163945 2164184 Hypothetical
=
mru 1665 1287 7153 hypothetical_protein
2164220 2164369 Conserved
= .
"
mru_1666 1288 7154 hypothetical_protein
2164733 2165467 Hypothetical =
mru_1667 1289 7155 HIRAN domain-containing_protein
2165542 = 2165946 Chromosomereplication ;
mru_1668 1290 7156 hypothical_protein
2165966 2166724 Conserved
mru 1669 1291 7157 aspartate-semialdehyde_dehydrogenase_Asd
2166859 2167908 Lysine
(-)
mru 1670 1292 7158 dihydrodipicolinate_reductase_DapB
2167962 2168780 Lysine
mru_l 671 1293 7159 d ihyd
rodi pica li nate synthase_DapA 2169004 2169897 Lysine 0
n)
mru_1672 1294 7160 aspartate_kinase Jksk
2170343 2171560 Lysine s ...3
.-.1
mru_1673 1295 7161 ribosomal_protein_S17e Rps17e
2171987 2172184 Ribosomalproteins n)
IV
mru_1674 1296 7162 chorismate mutase_Arocl
2172187 2172510 PhenylalaninefTryosine m
mru 1675 1297 7163 radical SAKI_domain-containing_protein
2173008 2173877 Enzyme
mru_1676 1298 7164 shikima-te_kinase_AroK
2174024 2174941 Chorismate co 0
nnru_1678 1299 7165 redox-
active_disulfide protein 2176288 2176539 General IV
I
mru_1680 ' 1300 7166 molybdenum_cofactor_biosynthesis_protein_C_MoaC
- 2178569 2179039 Metal-bindingpterin
o
n)
1
mru_1681 1301 7167 . DEAD/DEAH_box_helicase_domain-containing_protein
2179286 2181358 Helicase i\)
mru_1682 1302 7168 . hypothetical_protein
2181575 2181877 Conserved
mru_1683 1303 7169 hypothetical_protein
2181965 2182420 Conserved
mru_1684 1304 7170 histone acetyltransferase_ELP3 _family
2182881 2184587 DNA-bindingproteins
mru_l 685 1305 7171 deoxyrEose-phosphate_aldolase_DeoC
2184932 2185654 Other
mru_1686 1306 7172 archaeal_histone
2186619 2186819 DNA-bindingproteins
_
mru_1687 1307 7173 MiaB-like_tRNA_modifying_enzyme
2187454 2188740 RNAprocessing
'
mru_1688 1308 7174 hypothetical_protein
2188893 2189525 Conserved .
0:
mru_1689 1309 7175 homoaconitase_small_subunit_AksE
2190053 2190538 CoenzymeB n
-
mru_1690 1310 7176 beta-ribofuranosylaminobenzene_5'-phosphate_synthase_MptG
2190673 2191653 Methanopterin 1-3
mru 1691 1311 7177 molybdenum_cofactor_biosynthesis_protein_MoaA
2192075 2193604 Metal-bindingpterin
N
mru_1692 1312 7178 hypothetical_protein
2194147 2194449 Hypothetical c=
mru_1695 1313 7179 H4MPT-linked_C1 Jransfer_pathway_protein
2196508 2197530 Methanogenesis 1--,
o
mru_1696 1314 7180 hypothetical_protein
2197692 2198858 Conserved
o
mru_1699 1315 7181 hypothetical_protein '
2201399 2202076 Conserved o
1..,
mru_1700 1316 7182 hypothetical_protein =
2202201 2202665 Conserved
c7,
=
, .
= =
'
= .
,
Table 11
0 '
mru_1701 1317 7183
ABC2ransporter ATP-binding_protein 2202753 ' 2203559 Other (,..
mru_1702 1318 . 7184 ABC_transporter_permease protein
2203568 2204704 Other =
I--
mru_1703 1319 7185 tRNA_pseudouridine_synthase A_TruA
2204852 2205724 RNAprocessing 1--
.-. .
o
mru_1704 1320 7186 NADH_pyrophosphatase_NudC
2205892 2206788 Nicotinate (..3
(A
mru_1705 1321 7187 nickel_ABC_transporter_ATP-binding protein_NikE2
2207186 2207800 Cations (.4
mru_1706 1322 7188 nickel_ABC_transporter_ATP-binding_protein_NikD2
. 2207802 2209142 Cations
mru_1707 1323 7189 acetyltransferase_GNAT _family
2209258 2209764 Enzyme .
mru_1711 1324 7190 4Fe-4S_binding_domain-containing_protein
2214154 2214846 Electrontransport
mru_1712 1325 7191 acetyltransferase_GNAT _family
2215074 = 2216634 Enzyme _
mru 1713 1326 7192 ABC transporter permease_protein
2216145 2216870 Other
mru_1714 1327 7193 ABC transporter ATP-binding_protein
2216974 2218041 Other
mru_1715 1328 7194 4Fe-4S_binding_domain-containing_protein
2218481 2219236 Electrontransport
mru_1716 1329 7195
4Fe-4S binding_domain-containing_protein 2219246 2219365
Electrontransport
a
mru_1717 1330 7196 phosphoribosylformimino-5-
2219718 2220458 Histidine
,
'
_aminoimidazole carboxamide_ribotide isomerase_HisA
(D
mru_1718 1331 7197. glycerol-3-phosplTate_cytidylyltransferase
2220611 2221063 Other n)
-.3
_
.-.1
mru_1719 1332 7198 N-acetyl-gamma-glutamyl-phosphate_reductase_ArgC
2221125 2222141 Arginine n)
I.)
mru_1720 1333 7199
flavbdoxin . 2222322 2222810 =
Electrontransport n)
.1,.
mru_1721 1334 7200 protein_export_membrane_protein_SecF
2223191 2224348 Proteinsecretion
mru_1723 1335 7201 GMC oxidoreductase family_protein
2225876 2227117 Enzyme
mru_l 725 1336 7202
asparTate_carbamoyltransferase_regulatory_subunit_Pyr1 '
2227835 2228296 Pyrimidine I.)
1
mru_1726 1337 7203 adhesin-like_protein
2228646 2235413 Cellsurfaceproteins o
_
n)
1
mru_1727 1338 7204 peptidase U62 _family
2235888 2237258 Proteindegradation i\)
, mru_1728 1339 7205 ,
AMMECR1_domain-containing_protein 2237586 2238143 General .1,.
_
mru_1729 1340 7206 , GTP-binding_protein
2238260 2239315 , General
mru_1730 1341 7207 heat_shock_protein_Hsp20/alpha_crystallin_family
2239741 2240334 Proteinfolding
mru_1731 1342 7208
archaeal histone 2241088 2241291 DNA-bindingproteins
mru_1732 1343 7209 NADPH-lependent FMN_reductase
2241602 2242273 Electrontransport
mru_1733 1344 7210 phosphosugar-binding_protein
2242480 2243490 Pseudomureinbiosynthesis
mru_1734 1345 7211 hypothetical_protein
2243892 ' 2244437 Conserved
1-:
mru_1736 1346 7212 adenine deaminase_Ade
2246186 2248060 Purineinterconversions n
mru_1737 1347 7213 hypotheticalprotein
2248230 2249447 Conserved 1-3
mru_1739 1348 7214 transcriptional_regulator_TetRiamily
2250642 2251256 Transcriptionalregulators
N mru_1740 1349 7215 hypothetical_protein
2251300 2251509 Hypothetical _
mm 1741 1350 7216
arginase/agmatinase_family_protein = 2251652 2252626
Polyamines 1--,
o
*
mru_1742 1351 7217
translation jnitiation_factor_a1F-5A 2252963 2253370
Translationfactors = .--..
o
o
mru_1743 1352 7218 pyruvoyl-dependent_arginine_decarboxylase_PdaD
2253808 , 2254269 Polyamines =
1..,
mru_1744 1353 7219
bifunctional_inosito1-1_monophosphatase/fructose-1,6- ' 2254550 _ 2256415
Gluconeogenesis/inositolbiosynth Fa
..
,
=
,
. .
,
'
Table 11
0
-
-
bisphosphatase/ATP-NAD kinase
esis (.4
,
- o
mru_1745 1354 7220
cell_wall biosynthesis_proTein Mur_li_gase_family 2256658 2258292
Pseudomureinbiosynthesis 1--
0-
mru_1746 1355 , 7221 porphobinogen deaminase_FTemC
2258561 2259430 Cobalamin ,
o
mru_1747 1356 7222
oxidoreductase ¨GFO/IDH/MOCA family 2259845 2260792 Enzyme (..4
(A
(.4
mru_1748 1357 7223 orotate_phosphoribosyltransferase_PyrE1
2260961 2261575 Pyrimidine
mru 1749. 1358 , 7224 hypothetical_protein
2261799 2262605 Conserved
mru_1750 1359 7225
nicotinate_phosphoribosyltransferase 2263082 2264503 - Nicotinate _
.
mru 1752 1360 7226
hypothetical_protein 2266328 2266747 Conserved
mru 1753 1361 7227
hypothetical_protein 2267332 2268270 Hypothetical
mru_l 754 1362 7228
hypothetical protein 2268473 2269318 _ Hypothetical
mru 1755 1363 7229
peptidase_M48 _family 2269358 2270581
Proteindegradation .
mru 1756 1364 7230
thioesterase family protein 2270676 2271074 General
mru_l 757 1365 7231
NADH-dependent_flavin_oxidoreductase 2271166 2272173 Enzyme (-)
_
=
mru_1758 1366 7232 acetyltransferase 2272262 2272837 Enzyme
.
mru 1760 1367 7233
archaeal histone , 2275215 2275550 DNA-bindingproteins
. _ _
iv
mru 1761 1368 7234 _
glutamate_dehydrogenase GdhA 2275966 2277300
Glutamate/Glutamine =..3
.-.1
NJ
mru 1762 1369 , 7235
transcriptional_regulator_A¨ . rsR _family , 2278355
2278903 Transcriptionalregulators iv
iv
mru_1763 1370 7236 hypothetical_protein
2278993 2279232 Conserved
mru_1764 1371 7237 diphthine_synthase_DphB
2280307 2281146 Translationfactors iv
mru 1766 1372 7238
transposase ' 2283598 2285058 Transposase
oa H
mru_1767 1373 7239 Met-10+_like-protein
2285756 2286853 General 1
o
mru 1769 1374 7240
nitrogenase_cofactor_biosynthesis_protein_NifB 2289425 2290315 Others
iv
1
mru_1770 1375 7241 methanogenesis_marker_protein_17
2290774 2291340 Methanogenesis iv
.1,
mru_1771 1376 7242
methanogenesis_marker protein_15 2291514 2292749 Methanogenesis
mru_1772 1377 7243
methanogenesis_marker_protein 5 2292750 2293256 Methanogenesis
mru_l 773 1378 7244
methanogenesis_marker_proteini6 2293379 , 2293885 Methanogenesis
mru_1774 1379 7245
methanogenesis marker protein_3 ' , 2293976 2295544 Methanogenesis
mru 1775 1380 7246
amino_acid ABE transporter ATP-binding_protein 2296050 2296733
Aminoacids
' mru_1778 1381 7247 methanogenesis_marker_protein_2
2299061 2300038 Methanogenesis
mru 1779 1382 7248
hypothetical_protein 2300547 2301179 Hypothetical
oe
mru_1780 1383 7249 hypothetical_protein
2301151 2302392 Conserved n
1-
mru_1781 1384 7250 hypothetical_protein
2302650 2303156 Conserved
mru 1782 1385 7251
molybdenum_cofactor_biosynthesis_protein_B_MOaB 2303369 2303887 Metal-
bind ing pte rin
N
mru_1783 1386 7252
orotate_phosphoribosyltransferase PyrE2 , 2304096: 2304626 Pyrimidine o
1--,
mru_1784 1387 7253 hypothetical_protein
2304669 2304953 Conserved
mru 1786 1388 7254
transporter SSS family 2305592 2307241 Acetate
CE3,
o
mru_1787 , 1389 ' 7255
coenzyme_F390_synthetase_FtsA2 .2307808 ' 2309109 CoenzymeF420 o
1..,
cr,
mru 1788 1390 7256 ACT_domain-
containing_protein - 2309188 2309619 General
.
.
. .
.
.
,
Table 11
. = 0
mru_1790 1391 7257 hypothetical_protein 2311522
2311800 Conserved
mru_1791 1392 7258 _ carbamoyl-phosphate_synthase_large subunit_CarB
2311971 2315147 Pyrimidine =
I--
mru_1792 1393 7259 carbamoyl-phosphate_synthase_smaIrsubunit_CarA 2315255
2316334 Pyrimidine
---.
o
mru_l 793 1394 7260
hypothetical_protein 4, 2316742 2317491
Hypothetical (.3
.
(A mru_1796 1395 7261 hypothetical_protein
2318447 2318677 Hypothetical (.4
o
mru_1797 1396 7262
hypothetical_protein = 2318772 2319122 , Hypothetical
mru_1799 1397 7263 hypothetical_protein 2320909
_ 2321838 Hypothetical _
_
mru_1800 1398 7264 ribosomal-protein-alanine_acetyltransferase_Riml
2321926 . 2322372 RNAprocessing
mru_1801 1399 7265 hypothetical_protein 2322593
2323006 Conserved
mru_1804 1400 7266 ribosomal_protein S1OP Rps1Op , 2327270
2327578 , Ribosomalproteins
mru_1805 1401 7267 translation_elonga-tiontor_aEF-1_alpha 2327904
2329145 Translationfactors =
mru_1806 1402 7268 translation elongation_factor_aEF-2 2329784
2331991 Translationfactors
mru_1807 1403 7269 ribosomal_protein_S7P Rps7p 2332246
2332806 Ribosomalproteins
a
mru 1808 1404 7270 ribosomal_protein S1213 Rps12p 2332821
2333246 Ribosomalproteins
mru_1809 1405 7271 transcription_elongation_factor_NusA-like_protein
2334413 2334817 Translationfactors 0
iv
mru_1810 1406 7272 ribosomal_protein_l_30e_Rp130e 2334851
2335150 Ribosomalproteins
.-.1
mru 181,1 1407 7273 hypothetical
protein = 2335598 2336842 Hypothetical
iv
iv
mru 1812 1408 '7274 Dnak-rel
ated_protein 2337015 2338769 Proteinfolding iv
.1,
mru_1813 1409 7275 TPR repeat-containing_protein ' 2338881
2340932 Proteininteractions
mru_1814 1410 7276 DNA---directed_RNA_polymerase_subunit A" RpoA2
2341654 2343291 RNApolymerase
4
I-.
mru_l 815 1411 7277 DNA-di rected_RNA_polymerase_su bunit_A'_-RpoA1
2343304 2346084 RNApolymerase iv
1 _
mru_1816 1412 7278 DNA-directed_RNA_polymerase_subunit B'_RpoB1
2346192 2348003 RNApolymerase 0
iv
mru_1817 1413 7279 DNA-directed_RNA_polymerase_subunit_B" RpoB2
2348016 2349578 RNApolymerase 1
iv
mru_1818 1414 7280 DNA-directed RNA_polymerase_subunit_H -RpoH
2349702 2349950 RNApolymerase
mru 1819 1415 7281
hydroxyethylifTiazole_kinase_ThiM ' ' 2351539 2352453 Thiamine
mru_1820 1416 7282 thiamine_monophosphate_synthase_ThiE 2352459
2353118 Thiamine c
. mru 1821 1417 7283
phosphoglycerate_kinase_Pgk 2353404 2354636 Gluconeogenesis
mru_1822 1418 7284 triosephosphate_isomerase_TpiA 2354745
2355419 Gluconeogenesis
mru_1823 1419 7285 tRNA-modifying_enzyme 2355645
2356559 RNAprocessing
mru_1824 1420 7286 succinyl-CoA_synthetase_beta subunit_SucC 2356647
2357771 TCA oo
mru 1825 1421 7287 TPR
repeat-containing protein 2358035 2358616 Proteininteractions
n
mru_l 827 1422 7288 2-oxog luta
rate_ferredoxin_oxidoreductase_subunit_beta_KorB
2359406 2360269 TCA 1-3
mru_1828 1423 7289 2-oxoglutarate_ferredoxin_oxidoreductase_subunit
alpha_KorA 2360278 2361408 . TCA
mru_1829 .1424 7290 2-
oxoglutarate_ferredoxin_oxidoreductase_subunit_delta KorD
2361409 , 2361615 TCA N
mru_1830 1425 7291 hypothetical_protein 2361682
2361810 Hypothetical 1--,
o
--_.
mru 1831 1426 7292
hypothetical_protein 2362032, 2362403 Conserved =
o
mru 1835 1427 7293 hypothetical_protein 2367402
2368253 Conserved =
1--,
mru_1836 1428 7294 cell_shape_determining_protein_MreB/Mrl_family 2368615
2369685 Other o=
o
,
,
.
..
=
-
=
Table 11
0
mru_1837 1429 7295 hypothetical_protein
2369991 2370230 Hypothetical k.)
. o
mru 1838 1430 7296 ribonuclease_HILRnhB
2370584 2371243 Chromosomereplication I--
1--,
mru_1839 1431 7297 - IMP cyclohydrolase_Pur0
2371436 , 2372053 PurineBiosynthesis --.
o
k..3
mru 1842 1432 7298 F426-0:gamma-glutamyligase_CofE
2373727 2374491 CoenzymeF420 ur,
c..,
mru 1843 1433 7299 hypothetical_protein
2374626 2375135 Hypothetical
,
mru_l 844 1434 ' 7300 LP PG:F0_2-phospho-L-lactate transferase_CofD
2375275 2376180 Coenzyme F420
mru 1845 1435 7301 GTP_cyclohydrolase_III_Gch3
2376470 2377231 Riboflavin _
mru_l 846 1436 7302 tRNA-dihydrouridine synth ase_DusA2
2377450 2378163 RNAprocessing
mru 1847 1437 7303 NADP-dependent_arcohol_dehydrogenase .
Adh2 2378370 2379350 Enzyme
mru 1848 1438 7304 TfoX_C-terminal_domain-containing_protein
2379886 2380143 General
mru_1849 1439 7305 hypothetical_protein
2380452 2381111 Conserved
mru_1850 1440 7306 methyl-coenzyme_M reductase_component_A2_AtwA2
2381378 2382988 Methanogenesis
mru_1851 1441 7307 methanogenesis_maTker_protein_9
2383459 2383941 Methanogenesis r)
,
_mru1852 1442 7308 siroheme
synth ase_CysG 2384040 2384663 Q obala mi n
.
.
mru_1853 1443 7309 glutamyl-tRNA_reductase HemA
2384792 2385988 Cobalamin o
n)
-.3
mru 1854 1444 7310 ATPase
2386184 2387311 General .-.1
NJ
mru 1855 1445 7311 endonuclease_IV
2387539 2388375 Recombinationandrepair IV
1\)
mru_1856 1446 7312 glyceraldehyde-3-phosphate_dehydrogenase_Gap
' 2388592 2389608 Gluconeogenesis
mru 1857 1447 7313 hypothetical_protein
2389789 2390340 Conserved 1.)
r..) .
mru_1858 1448 7314 hypothetical_protein
2390445 2391530 Hypothetical
CA n)
mru_1860 1449 7315 ATPase
2392540 = 2394192 General 1 _ o
mru_1862 1450 7316 hypothetical protein
2395750 2397010 Conserved n)
1
mru_1863 1451 7317 '
hypothetical_protein 2397529 2397978 Hypothetical
i\)
.1,
mru 1864 1452 7318 ' DNA_topoisomerase_Vl_subunit A
2398177 2399265 Genomesegregation
= rnru_1865 1453
7319 DNA topoisomerase_Vl_subunit_B 2399271 2401070 _
Genomesegregation
mru 1866 1454 7320 RNA-bi n di ng_protein
2401858 2402469 Other
mru_1867 1455 7321 hypothetical_protein _
2402876 2403109 Conserved
mru_1868 1456 7322
seri ne/th reoni ne_protei n kinase R101_family _ 2403134 2403913
Proteininteractions
mru 1869 1457 7323 translation jn itiation_facir_a IF-TA
2404095 2404415 Translationfactors
mru_1870 1458 7324 molybdopteri n_biosynth es is_protei n_MoeA2
_ 2404865 2406079 'Meta l-bi ndi ng pterin
mru_1871 1459 7325
transcriptional_regulator _ 2406208 2406675 ,
Transcriptionalregulators n
mru 1872 1872 1460 7326 hypothetical_protein
2406792 2407100 Hypothetical
mru_1873 1461 7327 N
peptidase U62_family _ 2407295 2408608
Proteindegradation
mru_1874 1462 _ 7328 phosph ogiYcolate_phosphatase_G ph _
2408667 2409371 Glycolatesalvagepathway o
1--,
mru_1875 1463 7329 hydrogenase_expression/formation protein HypD
2409565 2410518 Hydrogenmetabol ism o
mru_1880 1464 7330 polysaccharide_biosynthesis_protein _
2416440 2417882 Exopolysaccha rides
o
_
mru_1881 1465 7331 acetyltransferase
2418011 2418517 Enzyme ' o
o
mru_1886 1466 7332 hypothetical_protein
2425357 2426727 Conserved
= ,
,
'
.
.
_
,
Table 11
0
mru 1887 1467 7333 , myo-
inosito1-1-phosphate synthase 2427050 -- 2428144 Inositolbiosynthesis --
w
mru_1888 1468 _ 7334 pyruvate_carboxylase subunit B PycB
2428670 2430291 TCA o
1--
.
1--
mru_1889 1469 7335 hypothetical_protein
2430648 2431253 Hypothetical =---.
.
o
mru_1890 1470 7336 hypothetical_protein
2431477 2432751 Conserved k..i
uri
'
mru 1892 1471 7337 cobalamin-5-phosphate_synthase_CobS
2434189 2435025 Cobalamin c..i
mru_l 893 1472 7338 hypothetical protein
2435143 2435700 Conserved
mru_1894 1473 7339 hypothetical_protein
2435859 2436980 Conserved
mru 1895 1474 7340 fu ma ratejiyd ratase_Fu mA4
2437143 2437643 TCA
mru_1896 1475 7341 L-tyrosine_decarboxylase MfnA
2438136 2439302 Methanofuran
mru_1897 1476 7342 phosphoenolpyruvate_synthase_PpsA
2439564 2441834 Gluconeogenesis
mru 1898 1477 7343 ribosomal_protein_Ll Oe_Rp110e =
2442193 2442675 Ribosomal p roteins
mru_1899 1478 7344 hypothetical_protein
2443423 2443659 Conserved
mru_1901 1479 7345 peptidyl-prolyl_cis-trans_isomerase
2445628 2446398 Proteinfolding i a
mru_1902 1480 7346 hypothetical protein
2446906 2448093 Conserved
mru_1903 1481 7347 hypothetical_protein '
2448341 2448538 Conserved iD
n)
mru_1904 1482 7348 dihydroorotase_PyrC . 2449150
2450403 Pyrimidine
.-.1
mru_1905 1483 7349 methyl_viologen-reducing_hydrogenase_beta_subunit_MvhB
2450706 2451938 Methanogenesis _
"
IV
mru_1906 1484 7350 methyl_viologen-reducing_hydrogenase_alpha_subunit
MvhA 2451954 2453366 Methanogenesis m
mru 1907 1485 ' 7351 methyl_viologen-reducing_hydrogenase_gamma
subunit_MvhG 2453383 2454309 Methanogenesis
mru 1908 1486 7352 methyl_viologen-red ucing_hydrogenase_delta_stTbun
it_Mvh D1 2454330 2454755 Methanogenesis co
I-.
0)
mru 1909 1487 7353 hypothetical protein
2455683 2456102 Conserved IV
1
, mru_1910 1488 7354 hypothetical_protein
2456095 2456340 Conserved o
n)
1
mru_1911 1489 7355 FeS_assembly_protein_SufBD
2456676 2457899 Other i\)
mru 1912 1490 7356 FeS assembly ATPase SufC
2457889 2458647 Other
mru_1913 1491 7357 PRC-barrel_domain-containing_protein
2459233 2459502 General
' mru_1914 1492 7358
GTP:adenosylcobinannide-phosphate_guanylyltransferase_CobY 2459990
2460622 Cobalam in
mru_1915 1493 7359 methanogenesis_marker protein 14
2460887 2462365 Methanogenesis
mru_1916 1494 7360 tetrahyd romethanopterin_S-meth¨ .
yltra nsferase_subunit_H_MtrH 2462776 , 2463657 Methanogenesis
mru_1921 1495 7361 tetrahydromethanopterin S-
methyltransferase_subunit_C MtrC 2465282 2466100
Methanogenesis .
mru 1925 1496 7362 methyl-coenzyme_M recluctase_gamma_subunit_McrG
2470383 2471135 Methanogenesis
mru 1929 1497 7363 methanogenesis ma7ker_protein_10
2474474 2475724 Methanogenesis n
1-
mru_1930 1498 7364 hypothetical protein
2475989 2476210 Hypothetical
mru 1931 = 1499 7365 N
methanogenesis marker_protein_7 2476243 2477157 Methanogenesis
mru_1932 1500 7366 hypothetical_protein
2477548 2479212 Conserved o
1--i
mru_1933 1501 7367 hypothetical protein
,2479424 2479792 Conserved o
,
. .--..
mru_1934 1502 7368 methanogenesis_marker_protein_16
2479921 2480718 Methanogenesis =
o
= nnru_1935 1503 7369
glutathione_peroxidase GpxA 2480850 2481389 Glutathionemetabolism
o
1-i =
_
mru_1936 1504 7370 hypothetical_protein
2481486 2481944 = Conserved c7,
,
=
,
Table 11
0
mru_1938 1505 7371 hypothetical_protein .
2483376 = 2483726 . Conserved (,..
mru_1939 1506 7372 formate_dehydrogenase_accessory_protein_FdhD2
2483979 2484809 Formate o
1--
0-
mru_1940 1507 7373 am inotra nsferase =
2484954 2486159 Enzyme =---.
o
, mru_1941 1508 , 7374
nitroreductase_family protein 2486230 2486748 General ', (..4
(A
mru_1942 1509 7376 hypothetical_protein
, 2486891 2487463 Conserved (.4
o
mru_1943 1510 7376 hypothetical_protein ,
, 2487456 2488178 Conserved
mru_l 944 1511 7377 transcriptional regulator
, 2488181 2488507 Transcriptionalregulators
mru_1947 1512 7378. seryl-tRNA_synthetase_SerS
2490906 2492183 tRNAaminoacylation
mru_1948 1513 7379 hypothetical_protein
2492445 2493026 Conserved
_
mru_1949 . 1514 7380 2-phosphosulfolactate_phosphatase ComB ;
- 2493230 2494012 CoenzymeM
mru_1951 1515 7381 hypothetical_protein
2495507 2495782 Conserved '
mru_1952 1516 7382 CBS_domain-containing_protein
2495910 2497256 , General
mru 1953 1517 7383 K+-dependent_Na+/Catexchanger
2497385 2496326 Cations a
mru_1956 1518 _ 7384 PP-loopiamily_protein '
2500807 2501664 General
_
_
mru_1957 1519 7385 hypothetical_protein
2501791 2502066 Conserved 0
iv
mru_1958 1520 7386 short-chain dehydrogenase_family_protein
2502709 2503182 Enzyme ...3
.-.1
mru_1959 1521 7387 am i notransi .
erase_DegT/Dn rJ/Eq(C1/StrS _family 2503510 2504613 Enzyme iv
iv
mru 1960 1522 7388 , N2,N2-dimethylguanosine JRNA_methyltransferase_Trm1
2505025 2506242 RNAprocessing = tv
.1,
_
mru_1961 1523 7389 transcriptional regulator AsnC_family '
2506795 2507274 Transcriptionalregulators iv
IQ
mru_1962 '1524 7390 GTP cyclohycolase_MptA =
2507823 2508764 Methanopterin
-4 iv
mru_1963 1525 7391 hypothetical protein
_ 2509146 2509427 Conserved = . 1
_ o
mru_1964 1526 7392 hypothetical_protein =
_ 2509567 2510751 Conserved iv
mru_1965 , 1527 7393 pyruvate_formate-Iyase-activating_enzyme_PflA2
2510907 2512250 Formate i)
.
- mru 1966 1528 7394
hypothetical_protein. 2512433 2512702 Conserved
mru 1967 1529 7395 hypothetical_protein
2512734 2513159 Conserved
mru_1968 1530 7396 MFS_transporter .
2513609 2515141 Other
mru_1970 1531 7397 hypothetical_protein
2517376 2517843 Conserved ,
mru_1971 1532 7398 adhesin-like_protein .
2518191 , 2520128 Cellsurfaceproteins
mru 1972 1533 7399 hypothetical_protein .
. 2520670 2520915 Hypothetical
=_
mru_1973 1534 7400 hypothetical_protein
2521289 2521750 Conserved 0:
mru_1974 1535 7401 FO_synthase_su bun it_1_CofG
2521817 2522908 CoenzymeF420 = n
mru_1975 1536 7402 prephenate_dehydnigenase TyrA2
2523083 2524405 Phenyla la nine/Tryosi ne 1-3
mru_1976 1537 7403 Met-10+ like-protein . '
2524584 = 2525354 General N
mru_1977 1538 7404 proteasome_beta_sUbunit
2526128 ' 2526751 Proteinfolding (,-)
o
_
mru 1978 1539 7405 RNA-metabolising_metallo-beta-lactamase .
2527446 2529355 Other 1--,
o
--..
mru_1979 1540 7406 phosphoribosylformylglycinamidine_cyclo-ligase_PurM
2529699 2530718 Puri neBiosynthesis =
o
mru_1980 1541 7407 L-sulfolactate_dehydrogenase_ComC .
2530911 2531900 CoenzymeM =
1..,
mru_1981 1542 7408 hypothetical_protein
.2532509 2532901 Hypothetical cr,
o
=
' '
.,
.
. .
'
Table 11
" 0
mru_1983 1543 7409 DNA_polymerase_family_B_PolB1
2533450 2535366 Chromosomereplication (,..
_
mru_1985 1544 7410 dihydroorotate_dehydrogenase_electron_transfer_subunit_PyrK
2536774 2537583 Pyrimidine o
1--
mru_1987 1545 7411 pre-mRNA_splicing_ribonucleoprotein_PRP31
2538781 2540061 RNAprocessing 1--,
--.
o
mru 1988 1546 7412
fibrillarin 2540196 2540846 RNAprocessing (..4
(A
mru_1991 1547 7413 hypothetical_protein
2543989 2544723 Conserved (.4
o
mru 1992 1548 7414
prephenate_dehydratase PheA 2545107 2545934 Phenylalanine/Tryosine
mru_1993 1549 7415 CBS
domain-containing_protein 2546252 2547070 General =
mru_1994 1550 7416 CBS-domain-containing_protein
2547402 2548286 General
= - mru 1995 1551 7417
hypothetical protein 2549041 = 2550153 Conserved
mru_1996 1552 7418 adhesin-like_protein 2550391 2554887
Cellsurfaceproteins
.
mru_1997 1553 7419 hypothetical _protein
2555537 2556475 Conserved
mru_1998 1554 7420
energy-converting_hydrogenase_B_subunit_Q Eh bQ 2556671 2557330
Hydrogenmetabolism .
mru 1999 1555 7421
energy-converting_hydrogenase_B subunity_-EhbP 2557489 2557752
Hydrogenmetabolism a
mru12001 1556 7422 energy-
converting_hydrogenase_B subunit_N EhbN , 2558998 2560173 '
Hydrogenmetabolism
mru_2002 1557 7423
energy-converting_hydrogenase_B_subunit M-EhbM 2560383 2560829
Hydrogenmetabolism 0
iv
mru_2003 1558 7424
energy-converting_hydrogenase_B_subunit 1__-_-EhbL 2560830 2561498
Hydrogenmetabolism
.-.1
mru_2004 1559 7425
energy-converting_hydrogenase_B_subunit K_EhbK 2561512 2562972
Hydrogenmetabolism "
mru_2007 1560 7426
energy-converting_hydrogenase_B_subunit_H EhbH 2563889 2564284
Hydrogenmetabolism "
.1,
mru_2008 1561 7427
energy-converting_hydrogenase B_subunit_G-EhbG 2564277 2564612
Hydrogenmetabolism
. mru 2009 1562 7428
energy-converting_hydrogenase_B_subunit_FTEhbF 2564612 2566093
Hydrogenmetabolism ca 0
=
mru_2016 1563 7429 hypotheticalprotein 2569565 2570311 Conserved iv
= mru 2017 1564 7430
argininosuccinate_synthase_ArgG 2570568 2571743 Arginine
iv
mru 2018 1565 7431 sugar
fermentation_stimulation_protein_SfsA2 2572308 2572970 Other i
iv
mru 2019 1566 7432
carbohydrate_kinase 2573148 2574716 Enzyme
mru 2021 1567 , 7433
transglutaminase domain-containing_protein .2576291 2576857
Proteindegradation ,
mru_2024 1568 7434 potassium_uptake_protein_TrkHiamily
2579902 2581392 Cations
mru_2025 1569 7435
potassium uptake_protein_TrkA _family 2581734 2582408 Cations
mru_2026 1570 7436 metallo-beta-lactamase_superfamily_protein
2582620 2583267 Enzyme
mru_2027 1571 7437
archaeal holliday junction_resolvase =Hjc , 2584031 2584369
Recombinationandrepair
mru_2028 1572 7438
radical g-AM_domain-containing_prot-ein , 2585631 2586662 Enzyme = _
0:
mru_2029 1573 7439 Asp4RTIA(Asn)/Glu4RNA(GIn)_amidotransferase_subunit_B_GatB
2586969 2588324 tRNAaminoacylation n
mru 2030 1574 7440
CBS_domain-containing_protein 2588366 2589217 General 1-3
' mru-2031 1575 . 7441 phosphoribosyl-
ATP_pyrophosphohydrolase HisE 2589328 2589615 Histidine
mru 2032 1576 7442
acetyltransferase GNAT family 2589783 2590430 Enzyme N
, -mru_2033 1577 7443 =
phosphoribosylaminoimidazole_carboxylase_catalytic_subunit_PurE1 _ 2590565
2591380 PurineBiosynthesis 1--,
o
--_.
' mru_2034 1578 = 7444 hydrogenase_maturation_factor_HypF
2591658 2594105 Hydrogenmetabolism =
o
mru 2035 = 1579 7445
hypothetical protein 2594295 2594792 Conserved =
1--,
mru 2037 1580 7446
transcriptional_regulator Arsk_family 2595991 2596506
Transcriptionalregulators , o=
o
=
, =
Table 11
mru_2038 , 1581 7447 molecular_chaperone_GrpE
2597136 2597828 . Proteinfolding (,..
o
mru_2039 1582 7448 molecular chqperone_DnaK
2598098 2599993 Proteinfoldin_g 1--
0-
mru 2040 1583 7449 molecular_chaperone DnaJ
2600615 2601790 Proteinfolding . ¨.
o
(..4
mrui2041 1584 7450 6-carboxyhexanoate-EoA_Iigase BioW
2602340 2603110 Biotin (A
(.4
mru_2042 1585 7451 8-amino-7-oxononanoateLsynthase_BioF
2603150 2604394 Biotin
mru_2046 1586 7452 hypothetical_protein
2616007 2616732 Conserved
mru_2048 1587 7453 adhesin-likeLprotein
2623044 2628461 Cellsurfaceproteins
mru_2049 1588 7454 adhesin-like_protein '
2629077 2639432 Cellsurfaceproteins
mru_2058 1589 7455 methionine_aminopeptidase_Map
2666529 2667458 Proteindegradation
mru 2059 1590 7456 adhesin-like_protein
2667645 , 2672060 Cellsurfaceproteins
mru-2060 1591 7457 hypothetical_protein
2672631 2673515 Hypothetical
mru12061 1592 7458 coenzyme_F420_hydro_g_enase_beta_subunit_FrhB1
2673889 2674734 Methanogenesis
.._
mru_2062 1593 7459 coenzyme_F420_hydro_genase_gamma_subunit FrhG
2674746 2675705 Methanogenesis a
mru_2063 1594 7460 _ coenzyme_F420_hydro_genase_delta_subunit_FThD '
2675718 2676191 Methanogenesis
0
mru_2064 1595 7461 coenzyme_F420_hydrogenase_alpha_subunit_FrhA
2676220 2677377 Methanogenesis n)
...3
mru_2067 1596 7462 .hypothetical_protein '
2680523 2681716 Conserved .-.1
IV
mru_2068 1597 7463 DNA-3-methyladenine glycosylase I Tag . '
2681925 2682488 Recombinationandrepair IV
IV
mru_2069 1598 7464 hypothetical_protein
2682554 . 2683030 Conserved
mru_2070 1599 7465 ATPase
2683179 2684186 General n)
mru_2071 1600 7466 hypothetical_protein
2684505 2684636 Hypothetical
.
CD IV
mru_2072 1601 7467 , . hypothetical_protein
2684703 2684852 Hypothetical 1
o
mru 2073 1602 7468 hypothetical_protein
2684868 2685629 Hypothetical n)
1
mru:2074 1603 7469 formate_dehydrogenase_alpha_chain FdhA2 =
2686763 2687842 Formate 1\3
.1,.
mru_2075 1604 7470 formate_dehydrogenase beta chain TdhB2
2687856 2689010 Formate
mru 2076 1605 7471 nnethyl_viologen-
reducing_hyd¨rogenase_delta_subunit_MvhD2 2689026 2689427 Methanogenesis
mru12077 1606 7472 leucyl-tRNA_synthetase LeuS
2690755 2693628 tRNAaminoacylation _
mru 2078 1607 7473 glutamine_synthetase_GInA2
2694404 2695777 Glutamate/Glutamine _
mru:2079 1608 7474 glutamine_amidotransferase
2696361 2697278 Glutamate/Glutamine
mru_2080 1609 7475 glutamate_synthase_beta_subunit_GHB
2697474 2698121 Glutamate/Glutamine _
mru 2081 1610 7476 coenzyme_F420_hydrogenase_beta subunit_FrhB2
2698133 2699227 Methanogenesis
_
mru¨_2082 , 1611' 7477 glutamate_synthase_alpha_subunit -altA
2699227 2700714 =Glutamate/Glutamine n
1-
mru_2083 1612 7478
phosphoenolpyruvate_synthase/pyruvate_phosphate_dikinase 2700863 _ 2703505
Gluconeogenesis
mru 2084 .1613 7479 adenosylmethionine-8-amino-7-
oxononanoate_aminotransferase_BioA 2703937 2705292
Biotin N
mru:2085 1614 7480 hypothetical_protein .
2705616 2707214 Conserved
1--,
mru_2086 1615 7481 dethiobiotin synthetase_BioD
2707432 2708130 , Biotin
mru_2087 1616 7482 biotin synth¨ase BioB1
2708243 2709223 Biotin o
' o
mru_2088 1617 7483 tRNAT1-methyla¨denosine) methyltransferase .
2709628 2710341 RNAprocessing 1..,
m ru_2089 1618 7484 Hef nuclease
2710641 2713169 'Recombinationandrepair
,
.
. .
,
,
. =
=
=
. ,
- .
Table 11 .
0
mru_2090 1619 7485
adhesin-like_protein _ 2713806 _ 2729006
Cellsurfaceproteins (,..
o
mru_2091 1620 7486
cell_wall_biosynthesis_proteiRylur ligase_family 2730183 2731976
Pseudomureinbiosynthesis 1-
mru_2092 1621 _ 7487 cell_wall
biosynthesis_protein Mur_ligase_family . , 2732165 2733661
Pseudomureinbiosynth es is =---.
o
mru_2093 1622 7488 hypothetical_protein
2734539 2735510 Conserved (..4
(A
mru_2094 1623 _ 7489 hypothetical_protein
2735626 2735979 Hypothetical (.4
o
mru 2095 1624. 7490 hypothetical_protein . .
2736038 2737273 Conserved
mru 2096 1625 7491
cysteine_synthase_CysKM2 _ 2737464 2738423 Cysteine
_ mru -2097 1626 7492
ssDNA_exonuclease_RecJ . 2738904 2740202 Recomb inationandre pair
mru_2098 1627 7493
ribosomal_protein S15P_Rps15p 2740255 2740653 Ribosomalproteins .
mru_2099 1628 7494
RdgB/HAMl_faminon-canonical_purine_NTP pyrophosphatase 2740943 2741600
Recombinationandrepair
mru 2100 , 1629 7495
glycoprotease M22_family 2741672 , 2743369 Proteindegradation
' mru_2101 1630 7496 nicotinate-nucleotide- '
. . 2743900 2744964 Coba lam in
dimethylbenzimidazole_phosphoribosyltransferase CobT
(-)
mru_2102 1631 7497 LSM
domain-containing_protein 2745227 2745427 Other -
mru 2103 1632 7498
hypothetical_protein2745538 2745684 Conserved
_
iv
mru-2104 1633 7499 5'-
nucleotidase_SurE1. 2745887 2746669 I nterconversion
-.3
.-.1
mru_2105 1634 7500 thymidylate_synthase_ThyA
2746827 2747537 Pyrimidineinterconversion "
iv
mru_2107 1635 7501 branched-chain-amino-acid_aminotransferase_livE
2750280 2751203 Valine/Leucine/lsoleucine m
.1,
mru 2109 1636 7502
methanogenesis marker_protein 12 2753522 2754523 Methanogenesis
mru_2110. 1637 7503 ketol-
acid reduc-toisomerase_IlvE 2754957 2755952 Va lin e/Leucine/lsoleucine
I-.
0
IV
mru _2111 1638 _ 7504
acetolact;te_synthase_small_subunit_IlvN 2756360 2756851
Valine/Leucine/lsoleucine , 1
mru 2112 1639 7505
acetolactate_synthase_large_subun it 11vB . 2756852 _2758747
Valine/Leucine/lsoleucine o
iv
mru 2113 1640 7506
phosphoribosylamine-glycine_ligase T,urD 2759131 2760507
PurineBiosynthesis 1
iv
mru_2114 1641 = 7507
hypothetical_protein = .2760588 2761484 Hypothetical
mru_2115 1642 _ 7508 ornithine_carbamoyltransferase_ArgF
2761785 2762699 Arginine
mru_2116 1643 7509
TPR_repeat-containing_protein - 2762901 2763884 Proteininteractions
mru_2117 1644 7510
arginyl-tRNA_synthetase_ArgS 2764382 2766103 tRNAa m inoacylation
mru_2119 1645 7511 dih ydroxy-acid_d
eh y_d ratase_IlvD = 2767253 2768908 Valine/Leucine/lsoleucine
mru_2120 1646 7512 hypothetical_protein
2769181 2769504 Conserved
mru_2121 " 1647 7513
hydroxylamine_reductase_Hcp 2770092 2771390 Other 0:
mru_2122 1648 7514 transcriptional_regulator
2771771 2772565 Transcriptionalregulators n
mru_2125 1649 7515
hypothetical_protein 2774777 _ 2775553 Hypothetical 1-
3
mru 2128 1650 7516 N
ribonuclease inhibitor 2778070 2778309 Other
mru 2129 1651 7517 threonyl-tRN-
ksynthetase ThrS = 2778687 2780513
tRNAaminoacylation o
mru_2131 1652 7518 bifunctional_formaldehyde-activating_enzyme/3-_hexulose-6-
2782016 2783254
ribulosemonophosphatepathway 1--,
o
phosphate synthase_Fae/HPs .
-4(E3
o
mru 2132 1653 7519
hypothetical_protein 2783429 2783743 Conserved =
1-,
mru-__2133 1654 7520 hypothetical_protein ,
2783892 2784905 Conserved cr,
o
,
,
,
=
= .
,
,
. .
Table 11
0
mru_2134 1655 7521 adhesin-like_protein 2785338
2803295 Cellsurfaceproteins w
mru 2135 1656 7522 NADPH-dependent_FMN reductase . 2804243
2804839 Electrontransport o
1--
.
mru_2137 1657 7523 =molybdate_transport_syslem_regulatory_protein_ModE
2807180 2807872- ..Metal-bindingpterin 1--
o
mru_2138 1658 7524 2-
oxoacid:acceptor oxidoreductase,_delta_subunit_pyruvate/2- 2808090
2808194 Electrontransport k..i
urr
ketoisovalerate_family
r.4
o
mru_2139 1659 7525 bifunctional imidazoleglycerol-
phosphate_dehydratase_HisB 2808352 2808954 Histidine
mru 2140 1660 7526 archaeal Al-Pase 2809241
2810461 General
mru_2141 1661 7527 hypothetical_protein 2810800
2811300 Conserved
mru 2142 1662 7528 F420-dependent meth lenetetrah dromethano terin deh
dro enase Mtd 2811757 2812593 Methano enesis
mru_2144 1663 7529 hypothetical_protein 2813731
2814045 Conserved
mru_2145 , 1664 7530
hypothetical_protein 2814149 2815273 Conserved
mru 2147 1665 7531 adhesin-
like_protein 2817082 2834037 Cellsurfaceproteins
mru_2148 1666 7532 transposase 2835338
2836846 Transposase
(-)
=
mru_2149 1667 7533 hypothetical_protein 2837159
2837848 Hypothetical
mru 2150 1668 7534 hypothetical protein 2837923
2838060 Hypothetical 0
mru_2151 1669 7535
cobyrinic_acid a,c-diamide_synthase CbiA4 2839171 2840613 Cobalamin
.-.1
mrti_2152 1670 7536 radical
SAM_Jomain-containing_prot-ein 2841253 2842368 Enzyme iv
iv
mru_2153 1671 . 7537 hypothetical_protein 2842450
2842665 Hypothetical m
.1,
mru_2154 1672 7538 hypothetical_protein 2842749
2843636 Conserved . - ry iv
mru_2155 1673 7539 2-isopropylmalate_synthase LeuA 2844173
2845708 Valine/Leucine/lsoleucine
I-.
mru 2157 1674 , 7540 'µ PHP domain-containing_prolein 2846812
2847477 General 1,)
1
mru:2158 1675 7541 hypothetical_protein 2848091
2849167 Conserved o
iv
'
mru_2159 1676 7542 tryptophan_synthase_beta subunit_TrpB 2849645
2850946 Tryptophan iv
mru_2160 1677 7543 cell_division_ATPase_MinI5 . 2851599
2852375 Celldivision
=
mru 2161 1678 7544
hypothetical_protein 2852467 2853213 Conserved
=
mru_ 2162 1679 7545
xylose_isomerase-like_TIM barrel_domain-containing_protein , 2853580
2854338 General
'
mru_2163 1680 7546 ,
hydrolase_HAD superfamiry 2854578 2855384 Enzyme .
mru_2164 1681 7547 NADH:flavin
oxidoreductase/NADH_oxidase_family_protein 2856227.. 2857177 Enzyme
mru_2165 1682 7548
transcriptional regulator AbrBiamily - 2857344 2857625
Transcriptionalregulators
mru_2166 1683 7549 TPR_repeat-containing_protein ' 2857862
2858920 Proteininteractions oe
mru 2167 1684 7550 TPR repeat-containing_protein 2858944
2860062 Proteininteractions n
mrti_2169 1685 7551 Asp-TR
NA(Asn)/GI u-tRNA(GI n)_a midotransferase_su bunit_A_G atA 2861980
2863383 tR NAam inoacyl ation 1-3
mru_2170 1686 7552 acetyltransferase 2863494
2864063 Enzyme
N
= = mru_2171 1687 7553
hypothetical protein 2864074 2864937 Conserved o
' mru_2174 1688 7554
riboflavin kinase_RibK 2867203 .2867592 Riboflavin 1--,
o
mru_2175 1689 7555 glycosyl Transferase_GT2 _family 2867650
2869359 Pseudomureinbiosynthesis --C-5
o
mru 2178 1690 7556 adhesin-like_protein 2873063
2882302 Cellsurfaceproteins . =
1-.
mru_2179 1691 7557 RNA-binding_S1_domain-containing_protein -
2883236 2885422 Other c7,
o
,
.
=
, .
.
=
'
Table 11 ,
______________________________________
0
mru_2182 1692 7558 glycosyt_transferase_GT2 Jamily/CDP- 2891223
2894897 Other (.4
o
glycerol: poly(glycerophosphate)_g lycerophosphotransferase
1--
1--
mru 2185 1693 "7559 hypothetical_protein 2897974
2898267 , Conserved ,
o
mru 2186 1694 7560 phosphoribosylaminoimidazole_carboxylase_purE2
2898786 2899793 PurineBiosynthesis (..4
(A
mru12187 1695 7561 UbilD Jam i ly_deca rboxylase 2899963
2901225 Ubiquinone (.4
o
mru_2192 1696 7562 glycyl-radical_enzyme_activating_protein 2906364
2907416 Enzyme
mru_2193 1697 7563 thiamine = mon phosphate_kinase_Th iL 2907574
2908815 Thiamine
mru_2194 1698 7564 hypothetic,al_protein 2908943
2909626 Conserved =
mru_2195 1699 7565 hypothetical_protein 2909811
2909978 Conserved
. mru_2196 1700 7566
hypothetical_protein 2910984 2911304 Hypothetical .
' mru_2198 1701 7567
acetyltransferase_G NAT Jam ily 2913168 2913629 Enzyme
mru_2199 1702 7568 CMP/dCMP deaminase 2913665
2914132 Pyrimidineinterconversion
mru 2200 1703 7569 cobalamin_biosynthesis_protein_CbiX 2914385
2915353 Cobalamin . a
mru12201 1704 7570 hypothetical_protein 2915739
2916137 Conserved
mru_2203 1705 7571 hypothetical_protein 2917874
2918890 Conserved
iv
mru_2204 1706 7572 hypothetical_protein 2919006
2919440 Hypothetical .-.1
mru_2205 1707 7573 hypothetical_protein 2919437
2919829 Conserved iv
iv
mru 2207 , 1708 7574
ABC_transporter ATP-binding_protein 2922302 2923006 Other iv
.1,.
mru-2209 1709 7575 TrkA_domain-containing_protein ' 2924790
2925398 Cations iv
h.) o
mru_2210 1710 7576 p record n-3B_C17-m ethyltransferase CbiH 2925500
2926864 Cobalam in
mru_2212 1711 7577 DNA_polymerase_small_subunit DP-1_PolD 2928089
2930062 Chromosornereplication = N N)
I
0
mru_2213 1712 7578 fuculose 1-phosphate aldolase TucA 2930152
2930739 CoenzymeF420 iv
mru 2214 1713 7579 hypothetical_protein 2930828
2931319 Conserved 1
iv
.1,
m ru¨_2215 1714 7580
hypothetical_protein 2931571 2931819 Conserved
mru 2216 1715 7581 hypothetical
protein = 2932389 2932745 Conserved . . =
m ru¨_2217 1716 7582
hypothetical_protein 2932901 2933908 Conserved
mru_2218 , 1717 7583
cobalamin_biosynthesis_protein_CbiB . 2934004 2935029 Cobalamin
mru_2219 1718 7584 hypothetical_protein ' 2935241
2935705 Conserved
,
n
.
z
N .
=
,
.
.
.
. .
,
-
c.
=
-
.
.
,
.i.=
44. ,
1:=";=
Table 12 I
Feature Start Stop Orientation nt Non-coding Label
Product Note 0
number position position SEQI feature
w D =
1--
.
10000 1280 1292 sense 1719 terminator TERM1
TTTTTC il 1 1 1 i 1 i G GATGA TTf TCATC .--.
o
(..4
1111111GiiilliAl
(A
(.4
10001 4966 4519 antisense 1720 non-coding
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 13.14
RNA , Intron_gpl
10002 5222 5194 antisense 1721 terminator TERM2
gaps in hairpin-stem, opp_overlap 5194, overlap 5198
10003 10674 10235 antisense 1722 non-coding ,
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 17.28
RNA Intron_gpl
10004 10716 10610 antisense 1723 non-coding U6
RF00026, Function: Baseline Score threshold: 12.47, element score: 14.38
RNA U6
=
10005 14704 14452 antisense 1724 non-coding
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 14.33
_ RNA Introngpl
_
a
10006 14528 14734 sense 1725 non-coding T-box
RF00230, Function: Baseline Score threshold: 12.47, element score: 15.21
RNA T-box
0
n) 10007 15175 14712 antisense 1726 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 13.95 ...3
.-.1
RNA Intron_gpl
l\)
IV
10008 14988 15011 sense 1727 terminator TERM3
opp_overlap 14989 14993, overlap 14986
14989 "
10009 16604 16340 antisense 1728 non-coding
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 14.65
RNA Intron_gpl
_b. 0
c.4 '
H
10010 17761 17773 sense 1729 terminator TERM4
opp_overlap 17769, overlap 17765 "
i
10011 17765 17787 _ sense 1730 terminator TERM5
opp_overlap 17769, overlap 17761 o
n)
i
10012 19549 19821 sense 1731 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 14.23 i\)
RNA Intron_gpl
.1,.
_
1 001 3 26253 26269 sense 1732 terminator TERM6
opp_overlap 26253 26254, overlap 26248 26254
10014 33873 33914 sense 1733 terminator TERM7
_ - opp_overlap 33905, overlap 33877 33902
,
10015 40450 40697 sense 1734 non-coding
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 16.91
' RNA = Intron_gpl
10016 41537 41558 sense 1735 terminator TERMS
GGATATTATTGTTGG TAATCAGAC AGAT
GACTGATTG I I I I I I I I I I AAGTG2
00
10017 59517 59345 antisense 1736 non-coding U2
RF00004, Function: Baseline Score
threshold: 12.47, element score: 13.82 n
1-
RNA U2
_
10018 63570 63232 antisense 1737 non-coding
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 18.64
trj
RNA Intron_gpl
o
1--,
10019 65279 65669 sense 1738 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 14.74 o
RNA Intron_gpl
o 10020 66176 65852 antisense 1739 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 15.69 =
1..,
RNA Intron gpl
c7,
,
-
= .
Table 12
10021 66433 66288 , antisense 1740 non-coding
U4 RF00015, Function: Baseline Score threshold: 12.47, element score:
1160
0
RNA U4
(,..
10022 66461 66444 antisense 1741 terminator
TERM9 opp_overlap 66436, overlap 66436
I--
10023 69443 69466 sense 1742 terminator TERM10
gaps in hairpin-stem, overlap 69446 1--
o
,
10024 69446 69464 sense 1743 terminator TERM11
overlap 69443 (..4
(A
10025 77288 77259 antisense 1744 terminator
TERM12 opp_overlap 77249 77259, overlap 77248
(.4
o
' 10026 84485 84181 antisense 1745 non-coding =
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 25.12
RNA .Intron_gpl
10027 87014 86935 antisense 1746 non-coding
SNORD11 RF00105, Function: Baseline Score threshold: 12.47, element
score: 14.04
RNA 5 SNORD115 '
10028 90497 90519 sense 1747 terminator TERM13
overlap 90495 , ,
_
10029 92389 92366 antisense 1748 terminator
TERM14 opp_overlap 92366, overlap 92360 92370
,
10030 92366 92389 sense 1749 terminator TERM15
opp_overlap 92366 92360 92370
,
10031 92385 92370 antisense 1750 terminator
TERM16 opp_overlap 92366, overlap 92366 92360
.
,
10032 97563 97587 sense 1751 terminator
TERM17 overlap 97564 a
10033 97564 97586 sense: 1752 terminator TERM18
' overlap 97563
10034 97724 97795 sense 1753 tRNA tRNA-OTHER ,
n)
...3
10035 103209 103121 antisense 1754 non-coding
U6 RF00026, Function: Baseline Score
threshold: 12.47, element score: 12.73 .-.1
NJ
- RNA U6
I.)
n)
' 10036 103783 103423 antisense 1755 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 15.26 1.)
, RNA Intron_gpl
-N K)
-it.
0
10037 113553 113568 sense 1756 terminator =
TERM19 opp overlap 113553
IV
=
I
10038 115435 115185 antisense 1757 non-coding
Intron_gpl RF00028, Function: = Baseline Score
threshold: 12.47, element score: 15.33 o
RNA Intron_gpl
n)
1
10039 118345 118364 sense 1758 terminator TERM20
. , opp_overlap 118343 i\)
.1,. 10040 121793 121580 antisense 1759 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 16.69
RNA Intron_gpl
10041 , 129883 129911 sense 1760 terminator TERM21
opp_overlap 129873 129882 129881 129890
10042 130345 130372 sense 1761 terminator
. TERM22 app overlap 130337 130345 130364 _
10043 139713 139734 sense 1762 terminator
TERM23 opp_overlap 139716, overlap 139693 139710
10044 140192 140219 sense 1763 terminator.
TERM24 opp_overlap 140199 .
10045 145624 145643 sense 1764 terminator
TERM25 opp_overlap 145627, overlap 145620
145621 oo
n
10046 152275 152294 sense 1765 terminator
TERM26 opp_overlap 152275 152278, overlap
152270 152272 1-3
.
152270
10047 152686 152472 antisense 1766 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 12.78 N
. RNA = Intron_gpl
1--,
10048 152568 152593 sense 1767 terminator
TERM27 app overlap 152574 o
-4(E3
10049 154499 154229 antisense 1768 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 12.79
RNA Intron_gpl
1--,
o=
,
o
,
,
,
,
,
Table 12
= 10050 154645 154961 sense 1769 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 12.82 0
RNA , Intron gpl
w
_. _
_ o
10051 155489 155277 antisense 1770 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 16.83 I--
1--
RNA Intron_gpl
=---.
o
k..i
10052 178343 178362 sense 1771 terminator
TERM28 opp_overlap 178337, overlap 178337
uri
c..)
10053 190980 191003 sense 1772 terminator
TERM29 opp_overlap 190986, overlap 190982
o
.
.
10054 190982 191001 sense = 1773
terminator TERM30 opp_overlap 190986, overlap 190980
10055 191084 191068 antisense 1774
terminator TERM31 _ gattgatttcctgct ggcgga ttcat tccgcc tttttttatccaaga3
-
10056 191116 191094 antisense 1775
terminator , TERM32 _ overlap 191113 =
10057 191154 191203 sense 1776 terminator
1ERM33 gaps in hairpin-stem, opp_overlap 191161 191197,
overlap 191155 191197
10058 191155 191202 sense 1777 terminator
TERM34 gaps in hairpin-stem, opp_overlap 191161 191197,
)
overlap 191154 191197
,
(-)
10059 193975 193837 antisense 1778 non-coding FMN
RF00050, Function: Baseline Score threshold: 12.47, element score: 108.38
RNA FMN
.
n)
10060 194363 194329 antisense 1779
terminator TERM35 opp_overlap 194335
194337, overlap 194337 ...i
.-.1
10061 196681 196460 antisense 1780 non-coding
Intron_gpl RF00028, Function: Baseline
Score threshold: 12.47, element score: 15.09 n)
IV
RNA Intron_gpl
m
10062 197718 197751 sense 1781 terminator
TERM36 opp_overlap 197723, overlap 197715
_ 1.)
10063 198029 198051 sense 1782 terminator TERM37
' opp_overlap 198026 198029, overlap 198025 198033
.
10064 202812 202572 antisense 1783 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 12.86 (31 1\)
1
RNA Intron_gpl
o
n)
10065 219885 219629 antisense 1784 non-coding
Intron_gpl RF00028, Function: Baseline
Score threshold: 12.47, element score: 21.82 1
RNA Intron_gpl
i\)
10066 219762 219954 sense 1785 non-coding U2
RF00004, Function: Baseline Score threshold: 12.47, element score: 19.85
RNA U2
10067 225490 225523 _ sense 1786 terminator
TERM38 opp_overlap 225497, overlap 225497'
10068 225497 225516 sense 1787 terminator
TERM39 opp_overlap 225497, overlap 225490
10069 231136 230674 antisense 1788 non-coding
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 17.46
RNA Intron_gpl
10070 235299 235330 sense 1789 terminator
TERM40 overlap 235304 oo
n
10071 235834 235522 antisense 1790 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 16.49 1-3
RNA = Intron opt
.
10072 237209 237233 sense 1791 terminator
TERM41 opp_overlap 237202 237203, overlap
237210 237220 N
10073 237210 237232 sense 1792 terminator
TERM42 opp_overlap 237202 237203, overlap
237209 237220 1--,
o
10074 238305 238157 antisense 1793 non-coding
SNORA62 RF00091, Function: Baseline Score
threshold: 12.47, element score: 16.01 'a
. RNA SNORA62
.
o
10075 238267 238176 antisense 1794 non-coding U6
RF00026, Function: Baseline Score threshold: 12.47, element score: 13.25
c7,
RNA U6
. o
,
.
,
-
Table 12
10076 238853 238508 antisense 1795 . non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 27.01 0
RNA Intron_gpl
(L..
o 10077 238932 239117 ' sense 1796 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 12.50 I--
1--,
RNA Intron_gpl
, --.
=
_
(..4
10078 240392 240411 sense , 1797 terminator
TERM43 ' overlap 240391 240390 (A
(.4
10079 240431 240447 sense 1798 terminator
TERM44 ' overlap 240441 , o
10080 246652 246683 sense 1799 terminator
TERM45 , gaps in hairpin-stem, opp_overlap 246649 246652
'
246657 246659, overlap 246649
10081 248111 248133 sense 1800 terminator
TERM46 opp_overlap 248116, overlap 248110
10082 249343 249090 antisense 1801 non-coding
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 13.42
RNA Intron_gpl
10083 252002 252031 sense 1802 terminator
TERM47 opp_overlap 252000 252005 252007,
overlap 252007 '
10084 252007 252026 sense 1803 terminator
TERM48 opp overlap 252000 252007 252005, overlap 252002
10085 252430 ' 252131 antisense 1804 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 17.65 a ,
RNA . Intron_gpl ,
o
10086 252483 252501 sense 1805 terminator
TERM49 opp_pverlap 262483 252478, overlap
252478 252477 . n)
.
-.3
10087 253345 253370 sense 1806 terminator
TERM50 gaps in hairpin-stem, opp_overlap
253344 .-.1
-
IV
10088 255748 255794 sense 1807 terminator
TERM51 opp overlap 255754 255755 IV
-
IV
10089 256772 256695 antisense 1808 tRNA tRNA-Arg
10090 257019 256743 antisense 1809 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 18.40 N., ,N2
RNA Intron_gpl
-1s. 1--,
10091 263289 263309 sense 1810 terminator
TERM52 opp_overlap 263293 ' 1
o
10092 264896 264423 antisense 1811 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 14.92 "
1
RNA Intron_gpl
"
.1,.
10093 264735 264912 sense 1812 non-coding U2
RF00004, Function: Baseline Score threshold: 12.47, element score: 13.96
RNA U2
,
' 10094 265933 265907 antisense 1813 terminator
TERM53 , opp_overlap 265907 265909 265932, overlap 265902
10095 269550 269599 sense 1814 , terminator
TERM54 gaps in hairpin-stem, opp_overlap 269562
10096 269588 269562 antisense 1815 terminator
TERM55 opp_overlap 269550
10097 271355 271279 antisense 1816
non-coding SNORD45 RF00279, Function: Baseline Score threshold: 12.47,
element score: 15.21
RNA SNORD45
1-:
10098 273884 273905 sense 1817 terminator TERM56
AATTTAATTTTAAGT TCTAGTTTC TTTT GAAACTAGA n
4
Lt.
TTTICTTC11111 IA
10099 274082 273886 antisense 1818 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 16.62 N
RNA Intron_gpl
, =
1--,
,
=
10100 273935 273956 sense , 1819 terminator
TERM57 ATTTA I lilt I I I GT TCCAGTTTC TTTT GAAACTAGA
o
iiiIIIiCITTTTAT5
=
1--,
10101 277786 277807 sense 1820 terminator
TERM58 _ opp_overlap 277786 o
o
'
= .
.
,
,
,
..
.
, =
- .
.
.
õ
.
Table 12 . 10102 282687 282660 antisense
1821 terminator TERM59 opp overlap 282659 ) 0
10103 288825 288794 antisense -1822
terminator TERM60 , opp_overlap 288775
288794 288799 288803 288818, t.4
.
o
=
overlap 288799 288803 I-- . = , 1--,
= 10104 288820 .288799 antisense
1823 terminator - TERM61
opp_overPap 288799 288794 288803 288818, overlap =--.
o
288794 288803
(..4
(A
(.4
10105 290563 290363 antisense 1824 non-coding .
Intron_gpl RF00028, Function:
Baseline. Score threshold: 12.47, element score: 15.48 o
RNA Introp gpl
10106 294254 294273 sense 1825 terminator
TERM62 . opp_overlap 294253, overlap 294250
10107 295087 = 294881 antisense 1826 non-coding .
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 19.55
_ RNA Intron_gpl
_
10108 294948 294962 sense
1827 terminator . TERM63 opp_overlap 294948 294947, overlap 294944
10109 295002 295016 sense
1828 terminator ' . TERM64 opp_overlap 295001 295002, overlap 294998
10110 297666 297693 sense 1829 , terminator
TERM65 opp_overlap 297672, overlap 297667 297672
10111 297667 297692 sense 1830 = terminator,
TERM66 opp_overlap 297672, overlap 297666
297672 a
10112 - 298883 298911 sense 1831 terminator TERM67 = '
- opp_overlap 298890
10113 300835 300845 sense = 1832 terminator
TERM68 opp_overlap 300835
n)
10114 303655 303693 sense 1833 terminator
TERM69 = opp overlap 303659 303668,
overlap 303659 ...3
.-.1
10115 303734 303747. sense
1834 terminator = TERM70 opp_overlap 303734
"
IV
10116 306385 306403' sense 1835 terminator
TERM71 - overlap 306381 "
- 10117 307114 307145 sense ' 1836
terminator . TERM72 'gaps in hairpin-stem, opp overlap 307111, overlap
.
307111 307120 4 o
10118 310468 310496 sense = - 1837
terminator TERM73 opp_overlap 310462
310471, overlap 310464 310471 IV
I
10119 310471 . 310493 sense 1838 terminator =
TERM74 opp_overlap 310462 310471,
overlap 310464 310468 o
n)
10120 319297 319370 sense 1839 tRNA tRNA-Arg
= i\)
, 10121 323374 323553 sense = 1840 non-coding U2
RF00004, Function: Baseline Score threshold: 12.47, element score: 18.95
RNA U2 . .
. 10122 325808 '325542 antisense . 1841 non-
coding Intron_gpl RF00028; Function:
Baseline Score threshold: 12.47, element score: 22.93 =
= RNA -
Intron_gpl , '
10123 328010 327988 - antisense , 1842
terminator . TERM75 opp_overlap 327990, overlap 327983
10124 331364 331378 sense = , 1843 terminator - TERM76 '
AGGAATAAGGTTTAA TAATTT AAA AAATTA
. A
,
. = 111111111GATTTG6 oo
10125 349291 349262 antisense 1844
terminator TERM77 = opp_overlap 349267
349286 349290 n
1-
. 10126 = 354151 354137 antisense .1845 terminator
TERM78
tacagtactgctcta gaagaa taa ttcttc ttttactattcttac7
10127 359527 359567 sense 1846 terminator
TERM79 opp_overlap 359523 359525, overlap
359530 359531 N
10128 359530 359564 sense 1847. terminator
TERM80 opp_overiap 359523 359525, overlap
359527 359531 1--,
o
10129 359531 359563 : sense 1848 terminator TERM81 '
opp_overlap 359523 359525, overlap 359530 359527
o
10130 360280 360196 antisense .. 1849
tRNA tRNA-Ser =
1..,
10131 360474 ' 360390 antisense 1850 tRNA
tRNA-Ser . o,
, o
.
,
,
.
. .
. =
. .
,.
, =
= ,
. ,
,
Table 12
10132 360730 360659 antisense = 1851 tRNA
tRNA-Trp 0
10133 360811 360740 antisense 1852 tRNA tRNA-OTHER
o
10134 360887 360815 antisense 1853 tRNA tRNA-Ala
1--,
.--..
10135 360983 360912 antisense 1854 tRNA tRNA-Gly
. (..4
10136 361061 360989 antisense 1855 tRNA tRNA-Arg
= (.4
10137 361222 361150 antisense 1856 tRNA tRNA-OTHER
10138 361369 361297 antisense 1857 tRNA tRNA-Asp
.
. 10139 361456 361383 antisense 1858 tRNA tRNA-Ile
'
10140 361596 361513 antisense 1859 tRNA tRNA-Ser
.
10141 361676 361604 antisense 1860 tRNA tRNA-Glu
10142 361753 361679 antisense 1861 tRNA tRNA-Lys =
, 10143 361794 361770 antisense 1862
terminator TERM82 ttgtcaccttgtgca gggttggaggt aat agctccaaccc '
tttagtaaattgata8
10144 361915 361844 antisense 1863 tRNA tRNA-OTHER
= a
10145 362026 361955 antisense 1864 , tRNA
tRNA-Val
n)
10146 362324 362252 antisense 1865 , tRNA
tRNA-OTHER ...3
,
.-.1
10147 362403 362331 antisense 1866 tRNA tRNA-Pro
K)
IV
1 0 1 4 8 362552 362482 antisense 1867 tRNA
tRNA-Tyr m
.1,.
10149 362798 362727 antisense 1868 tRNA tRNA-Phe
n)
10150 363464 . 363381 antisense 1869 tRNA
tRNA-Leu = - tv 0
10151 364097 364076 antisense 1870 terminator TERM83
aaatttttatttagg gttaagag caatta ctcttaac tttatttatatttta9
I
10152 366950 366705 antisense 1871 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 16.04 n)
1
RNA Intron_gpl
"
.1,.
10153 367795 367805 sense , 1872 terminator
TERM84 TACTC 1111111 CIT TAGT UT ACTA
I I I I I I I ICTTAGTT10
10154 368448 368224 . antisense 1873 non-coding
Intron_gpl RF00028, Function: Baseline Score threshold: 12.47, element
score: 17.77
RNA Intron_gpl ,
10155 371145 371186 sense 1874 terminator
TERM85 opp_overlap 371150 =
10156 371306 371207 antisense 1875 non-coding
U6 RF00026, Function: , Baseline Score threshold: 12.47, element score:
15.83
RNA U6
oo
n
10157 373705 373838 sense 1876 non-coding
SN0RA72 RF00139, Function: Baseline Score
threshold: 12.47, element score: 14.89 1-3
RNA SNORA72
10158 381813 381284 antisense 1877 non-coding
Intron_gpl RF00028, Function: Baseline Score
threshold: 12.47, element score: 14.19 N
RNA Intron_gpl
10159 381420 381447 sense 1878 terminator
TERM86 opp_overlap 381420, overlap 381418 o
.---=
o
10160 381462 381482 sense 1879 terminator
TERM87 opp_overlap 381462 381464
o
10161. 381774 381912 sense 1880 non-coding
U4 RF00015, Function: Baseline Score
threshold: 12.47, element score: 16.39 1--,
c7,
RNA U4
,
. =
DEMANDES OU BREVETS VOLUMINEUX
LA PRESENTE PARTIE DE CETTE DEMANDE OU CE BREVETS
COMPREND PLUS D'UN TOME.
CECI EST LE TOME 1 _______________________ DE 2
NOTE: Pour les tomes additionels, veillez contacter le Bureau Canadien des
Brevets.
JUMBO APPLICATIONS / PATENTS
THIS SECTION OF THE APPLICATION / PATENT CONTAINS MORE
THAN ONE VOLUME.
THIS IS VOLUME 1 OF 2
NOTE. For additional volumes please contact the Canadian Patent Office.