NO-JIRA: HA documentation: minor updates to HA chapter.

git-svn-id: https://svn.apache.org/repos/asf/qpid/branches/0.22@1464657 13f79535-47bb-0310-9956-ffa450edef68
author: Alan Conway <aconway@apache.org> 2013-04-04 17:52:21 +0000
committer: Alan Conway <aconway@apache.org> 2013-04-04 17:52:21 +0000
commit: 34bf64f26146bd36e0170ab6c8eb202ce72e9151 (patch)
tree: 2787e995d9d1e329c91e17eb83a103ac5d4ce2dc /qpid
parent: 167e0fe6bc768e4f7a88d2c203d5442ca82531f3 (diff)
download: qpid-python-34bf64f26146bd36e0170ab6c8eb202ce72e9151.tar.gz
2 files changed, 169 insertions, 42 deletions
diff --git a/qpid/cpp/README-HA.txt b/qpid/cpp/README-HA.txt
new file mode 100644
index 0000000000..0b79664535
--- /dev/null
+++ b/qpid/cpp/README-HA.txt
@@ -0,0 +1,118 @@
+<!--*-markdown-*-->
+
+Migrating to new HA
+===================
+
+Up to version 0.20, Qpid provided the `cluster` module to support active-active
+clustering for Qpid C++ brokers. In version 0.20 the `ha` module was introduced
+supporting active-passive HA clustering. From version 0.22 the older `cluster`
+module is no longer available so users will have to migrate to the new `ha`
+module.
+
+This document summarizes the differences between the old and new HA modules and
+the steps to migrate to new HA. It assumes you have read the
+[HA chapter of the C++ Broker Book][chapter-ha]
+
+Client connections and fail-over
+--------------------------------
+
+The old cluster module was active-active: clients could connect to any broker in
+the cluster. The new ha module is active-passive. Exactly 1 broker acts as
+*primary* the other brokers act as *backup*. Only the primary accepts client
+connections. If a client attempts to connect to a *backup* broker, the connection
+will be aborted and the client will fail-over until it connects to the primary. See
+["Client Connections and Failover"][ha-failover].
+
+The new cluster module also supports a [virtual IP address][ha-virtual-ip].
+Clients can be configured with a single IP address that is automatically routed
+to the primary broker. This is the recommended configuration.
+
+Migrating configuration options for the old cluster module
+----------------------------------------------------------
+
+`cluster-name`: Not needed.
+
+`cluster-size`: Not needed but see "Using a message store in a cluster" below
+
+`cluster-url`: Use `ha-public-url`, which is recommended to use a [virtual IP address][ha-virtual-ip]
+
+`cluster-cman`: Not needed
+
+`cluster-username, cluster-password, cluster-mechanism`: use `ha-username,
+ha-password, ha-mechanism`
+
+Configuration options for new HA module
+----------------------------------------
+
+`ha-cluster`: set to `yes` to enable clustering.
+
+`ha-brokers-url`: set to a URL listing each node in the cluster. For example
+`amqp:node1.exaple.com,node2.exaple.com,node3.exaple.com`. The set of addresses
+for the cluster is fixed, you can't add new members with different addresses
+while the cluster is running. You can shut-down and re-start a broker on the
+same address.
+
+`ha-public-url`: URL used by clients to connect to the cluster. It is
+recommended to set this to the [Virtual IP address][ha-virtual-ip] of the
+cluster.
+
+`ha-replicate`: Set to `all` to replicate everything like the old cluster does.
+New HA provides more flexibility over what is replicated, see ["Controlling replication of queues and exchanges"][ha-replicate-values].
+
+`ha-username, ha-password, ha-mechanism`: Same as old `cluster-username,
+cluster-password, cluster-mechanism`
+
+`ha-backup-timeout`: Maximum time that a recovering primary will wait for an
+expected backup to connect and become ready.
+
+`link-maintenance-interval`: Interval in seconds for the broker to check link
+health and re-connect links if need be. If you want brokers to fail over quickly
+you can set this to a fraction of a second, for example: 0.1.
+
+`link-heartbeat-interval` Heartbeat interval for replication links. The link
+will be assumed broken if there is no heartbeat for twice the interval.
+
+Configuring rgmanager
+---------------------
+
+The new HA module requires an external cluster resource manager, `rgmanager`
+provided by the `cman` package.  `rgmanager` is responsible for stopping and
+starting brokers and determining which broker is promoted to primary. It is also
+responsible for detecting primary failures and promoting a new primary.  See
+["Configuring rgmanager as resource manager"][ha-rm-config]
+
+Broker Administration Tools
+---------------------------
+
+ Normally, clients are not allowed to connect to a backup broker. However
+ management tools are allowed to connect to a backup brokers. If you use these
+ tools you must not add or remove messages from replicated queues, nor create or
+ delete replicated queues or exchanges as this will disrupt the replication
+ process and may cause message loss.
+
+qpid-ha allows you to view and change HA configuration settings.
+
+The tools qpid-config, qpid-route and qpid-stat will connect to a backup if you
+pass the flag ha-admin on the command line.
+
+Fail-over exchange
+------------------
+
+The fail-over exchange is not supported in new HA, use a
+[virtual IP address][ha-virtual-ip] instead.[]
+
+Using a message store in a cluster
+----------------------------------
+
+If you use a persistent store for your messages then each broker in a cluster
+will have its own store. If the entire cluster fails, when restarting the
+*first* broker that becomes primary will recover from its store. All the other
+brokers will clear their stores and get an update from the primary to ensure
+consistency. See ["Using a message store in a cluster"][ha-store].
+
+[chapter-ha]: http://qpid.apache.org/books/trunk/AMQP-Messaging-Broker-CPP-Book/html/chapter-ha.html
+[ha-failover]: http://qpid.apache.org/books/trunk/AMQP-Messaging-Broker-CPP-Book/html/chapter-ha.html#ha-failover
+[ha-failover]: http://qpid.apache.org/books/trunk/AMQP-Messaging-Broker-CPP-Book/html/chapter-ha.html#ha-virtual-ip
+[ha-replicate-values]: http://qpid.apache.org/books/trunk/AMQP-Messaging-Broker-CPP-Book/html/chapter-ha.html#ha-replicate-values
+[ha-rm-config]: http://qpid.apache.org/books/trunk/AMQP-Messaging-Broker-CPP-Book/html/chapter-ha.html#ha-rm-config
+
diff --git a/qpid/doc/book/src/cpp-broker/Active-Passive-Cluster.xml b/qpid/doc/book/src/cpp-broker/Active-Passive-Cluster.xml
index 8a6403c2b5..cc3ee587c4 100644
--- a/qpid/doc/book/src/cpp-broker/Active-Passive-Cluster.xml
+++ b/qpid/doc/book/src/cpp-broker/Active-Passive-Cluster.xml
@@ -20,11 +20,10 @@ under the License.
 
 -->
 
-<section id="chap-Messaging_User_Guide-Active_Passive_Cluster">
+<section id="chapter-ha">
+  <title>Active-Passive Messaging Clusters</title>
 
-  <title>Active-passive Messaging Clusters</title>
-
-  <section>
+  <section id="ha-overview">
     <title>Overview</title>
     <para>
 
@@ -51,7 +50,7 @@ under the License.
       url="https://fedorahosted.org/cluster/wiki/RGManager">Rgmanager</ulink> is supported
       initially, but others may be supported in the future.
     </para>
-    <section>
+    <section id="ha-at-least-once">
       <title>Avoiding message loss</title>
       <para>
 	In order to avoid message loss, the primary broker <emphasis>delays
@@ -153,11 +152,11 @@ under the License.
 	</varlistentry>
       </variablelist>
     </section>
-    <section>
+    <section id="limitations">
       <title>Limitations</title>
       <para>
 	There are a some known limitations in the current implementation. These
-	will be fixed in furture versions.
+	will be fixed in future versions.
       </para>
       <itemizedlist>
 	<listitem>
@@ -188,7 +187,7 @@ under the License.
     </section>
   </section>
 
-  <section>
+  <section id="ha-virtual-ip">
     <title>Virtual IP Addresses</title>
     <para>
       Some resource managers (including <command>rgmanager</command>) support
@@ -200,13 +199,13 @@ under the License.
       than a list.
     </para>
     <para>
-      A virtual IP address can be used by clients and backup brokers to connect
-      to the primary. The following sections will explain how to configure
-      virtual IP addresses for clients or brokers.
+      A virtual IP address can be used by clients to connect to the primary. The
+      following sections will explain how to configure virtual IP addresses for
+      clients or brokers.
     </para>
   </section>
 
-  <section>
+  <section id="ha-broker-config">
     <title>Configuring the Brokers</title>
     <para>
       The broker must load the <filename>ha</filename> module, it is loaded by
@@ -300,7 +299,7 @@ ssl_addr = "ssl:" host [":" port]'
 	    <entry>
 	      <para><literal>ha-username <replaceable>USER</replaceable></literal></para>
 	      <para><literal>ha-password <replaceable>PASS</replaceable></literal></para>
-	      <para><literal>ha-mechanism <replaceable>MECH</replaceable></literal></para>
+	      <para><literal>ha-mechanism <replaceable>MECHANISM</replaceable></literal></para>
 	    </entry>
 	    <entry>
 	      Authentication settings used by HA brokers to connect to each other.
@@ -346,7 +345,7 @@ ssl_addr = "ssl:" host [":" port]'
     </para>
   </section>
 
-  <section>
+  <section id="ha-rm">
     <title>The Cluster Resource Manager</title>
     <para>
       Broker fail-over is managed by a <firstterm>cluster resource
@@ -382,7 +381,7 @@ ssl_addr = "ssl:" host [":" port]'
     </para>
   </section>
 
-  <section>
+  <section id="ha-rm-config">
     <title>Configuring <command>rgmanager</command> as resource manager</title>
     <para>
       This section assumes that you are already familiar with setting up and configuring
@@ -439,10 +438,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
       <!-- This script promotes the qpidd broker on this node to primary. -->
       <script file="/etc/init.d/qpidd-primary" name="qpidd-primary"/>
 
-      <!-- This is a virtual IP address for broker replication traffic. -->
-      <ip address="20.0.10.200" monitor_link="1"/>
-
-      <!-- This is a virtual IP address on a seprate network for client traffic. -->
+      <!-- This is a virtual IP address for client traffic. -->
       <ip address="20.0.20.200" monitor_link="1"/>
     </resources>
 
@@ -461,7 +457,6 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
     <service name="qpidd-primary-service" autostart="1" exclusive="0" recovery="relocate">
       <script ref="qpidd-primary"/>
       <!-- The primary has the IP addresses for brokers and clients to connect. -->
-      <ip ref="20.0.10.200"/>
       <ip ref="20.0.20.200"/>
     </service>
   </rm>
@@ -492,13 +487,14 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
     </para>
     <programlisting>
       ha-cluster=yes
-      ha-brokers-url=20.0.20.200
       ha-public-url=20.0.10.200
+      ha-brokers-url=20.0.20.1,20.0.20.2,20.0.20.3
     </programlisting>
     <para>
-      This configuration specifies that backup brokers will use 20.0.20.200
-      to connect to the primary and will advertise 20.0.10.200 to clients.
-      Clients should connect to 20.0.10.200.
+      This configuration allows clients to connect to a single address:
+      20.0.10.200.  The brokers connect to each other directly via the addresses
+      listed in <command>ha-brokers-url</command>. Note the client and broker
+      addresses are on separate sub-nets, this is recommended but not required.
     </para>
     <para>
       The <literal>service</literal> section defines 3 <literal>qpidd</literal>
@@ -519,7 +515,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
     </para>
   </section>
 
-  <section>
+  <section id="ha-broker-admin">
     <title>Broker Administration Tools</title>
     <para>
       Normally, clients are not allowed to connect to a backup broker. However
@@ -538,7 +534,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
     </para>
   </section>
 
-  <section id="ha-creating-replicated">
+  <section id="ha-replicate-values">
     <title>Controlling replication of queues and exchanges</title>
     <para>
       By default, queues and exchanges are not replicated automatically. You can change
@@ -590,23 +586,24 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
     </programlisting>
     <para>
       There are some built-in exchanges created automatically by the broker, these
-      exchangs are never replicated. The built-in exchanges are the default (nameless)
+      exchanges are never replicated. The built-in exchanges are the default (nameless)
       exchange, the AMQP standard exchanges (<literal>amq.direct, amq.topic, amq.fanout</literal> and
       <literal>amq.match</literal>) and the management exchanges (<literal>qpid.management, qmf.default.direct</literal> and
       <literal>qmf.default.topic</literal>)
     </para>
     <para>
       Note that if you bind a replicated queue to one of these exchanges, the
-      binding wil <emphasis>not</emphasis> be replicated, so the queue will not
+      binding will <emphasis>not</emphasis> be replicated, so the queue will not
       have the binding after a fail-over.
     </para>
   </section>
 
-  <section>
+  <section id="ha-failover">
     <title>Client Connection and Fail-over</title>
     <para>
-      Clients can only connect to the primary broker. Backup brokers
-      automatically reject any connection attempt by a client.
+      Clients can only connect to the primary broker. Backup brokers reject any
+      connection attempt by a client. Clients rejected by a backup broker will
+      simply fail-over until they connect to the primary.
     </para>
     <para>
       Clients are configured with the URL for the cluster (details below for
@@ -621,8 +618,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
 	  <para>
 	    The URL contains a single <firstterm>virtual IP address</firstterm>
 	    that is assigned to the primary broker by the resource manager.
-	    <footnote><para>Only if the resource manager supports virtual IP
-	    addresses</para></footnote>
+	    This is the recommended configuration.
 	  </para>
 	</listitem>
       </itemizedlist>
@@ -647,9 +643,10 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
       client.
     </para>
     <para>
-      See &#34;Cluster Failover&#34; in <citetitle>Programming in Apache
-      Qpid</citetitle> for details on how to keep the client aware of cluster
-      membership.
+      Note: the following sections explain how to configure clients with
+      multiple dresses, but if you are using a virtual IP address you only need
+      to configure that one address for clients, you don't need to list all the
+      addresses.
     </para>
     <para>
       Suppose your cluster has 3 nodes: <literal>node1</literal>,
@@ -659,7 +656,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
       <literal>reconnect</literal> property to <literal>true</literal>. The
       following sub-sections show how to connect each type of client.
     </para>
-    <section>
+    <section id="ha-clients">
       <title>C++ clients</title>
       <para>
 	With the C++ client, you specify multiple cluster addresses in a single URL
@@ -690,7 +687,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
 	</programlisting>
       </para>
     </section>
-    <section>
+    <section id="ha-python-client">
       <title>Python clients</title>
       <para>
 	With the python client, you specify <literal>reconnect=True</literal>
@@ -711,7 +708,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
 	connection = qpid.messaging.Connection.establish("node1", reconnect=True, reconnect_urls=["node1", "node2", "node3"], heartbeat=10)
       </programlisting>
     </section>
-    <section>
+    <section id="ha-jms-client">
       <title>Java JMS Clients</title>
       <para>
 	In Java JMS clients, client fail-over is handled automatically if it is
@@ -769,7 +766,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
     </section>
   </section>
 
-  <section>
+  <section id="ha-security">
     <title>Security.</title>
     <para>
       You can secure your cluster using the authentication and authorization features
@@ -815,7 +812,7 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
     </para>
   </section>
 
-  <section>
+  <section id="ha-other-rm">
     <title>Integrating with other Cluster Resource Managers</title>
     <para>
       To integrate with a different resource manager you must configure it to:
@@ -875,7 +872,19 @@ NOTE: fencing is not shown, you must configure fencing appropriately for your cl
       need to run a cluster.
     </para>
   </section>
+
+  <section id ="ha-store">
+    <title>Using a message store in a  cluster</title>
+    <para>
+      If you use a persistent store for your messages then each broker in a
+      cluster will have its own store. If the entire cluster fails and is
+      restarted, the *first* broker that becomes primary will recover from its
+      store. All the other brokers will clear their stores and get an update
+      from the primary to ensure consistency.
+    </para>
+  </section>
+
 </section>
 
-<!-- LocalWords:  scalability rgmanager multicast RGManager mailto LVQ qpidd IP dequeued Transactional username
+<!-- LocalWords:  scalability rgmanager multicast RGManager mailto LVQ qpidd IP dequeued Transactional username API
 -->
author	Alan Conway <aconway@apache.org>	2013-04-04 17:52:21 +0000
committer	Alan Conway <aconway@apache.org>	2013-04-04 17:52:21 +0000
commit	34bf64f26146bd36e0170ab6c8eb202ce72e9151 (patch)
tree	2787e995d9d1e329c91e17eb83a103ac5d4ce2dc /qpid
parent	167e0fe6bc768e4f7a88d2c203d5442ca82531f3 (diff)
download	qpid-python-34bf64f26146bd36e0170ab6c8eb202ce72e9151.tar.gz