Second Edition (June 2001) Part Number 225081-002 Compaq Computer Corporation Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM Notice 2001 Compaq Computer Corporation Compaq, the Compaq logo, Compaq Insight Manager, SmartStart, ROMPaq, ProLiant, and StorageWorks Registered in U.S. Patent and Trademark Office. ActiveAnswers is a trademark of Compaq Information Technologies Group, L.P. in the United States and other countries. Microsoft, Windows, and Windows NT are trademarks of Microsoft Corporation in the United States and other countries. All other product names mentioned herein may be trademarks of their respective companies. Compaq shall not be liable for technical or editorial errors or omissions contained herein. The information in this document is provided "as is" without warranty of any kind and is subject to change without notice. The warranties for Compaq products are set forth in the express limited warranty statements accompanying such products. Nothing herein should be construed as constituting an additional warranty. Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Second Edition (June 2001) Part Number 225081-002 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM Contents About This Guide Purpose .................................................................................................................... xiii Audience .................................................................................................................. xiii Scope ........................................................................................................................ xiv Referenced Manuals .................................................................................................. xv Supplemental Documents ......................................................................................... xvi Text Conventions......................................................................................................xvi Symbols in Text.......................................................................................................xvii Symbols on Equipment............................................................................................xvii Rack Stability ........................................................................................................ xviii Getting Help .......................................................................................................... xviii Compaq Technical Support ............................................................................... xix Compaq Website................................................................................................xix Compaq Authorized Reseller..............................................................................xx Chapter 1 Clustering Overview Clusters Defined ...................................................................................................... 1-2 Availability .............................................................................................................. 1-4 Scalability ................................................................................................................ 1-4 Compaq Parallel Database Cluster Overview.......................................................... 1-5 Chapter 2 Cluster Architecture Compaq ProLiant Servers........................................................................................ 2-2 High-Availability Features of ProLiant Servers ............................................... 2-3 Shared Storage Components.................................................................................... 2-3 MA8000/EMA12000 Storage Subsystem ........................................................ 2-4 HSG80 Array Controller .................................................................................. 2-4 Fibre Channel SAN Switch .............................................................................. 2-5 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM iv Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Storage Hub ...................................................................................................... 2-6 KGPSA-BC and KGPSA-CB Host Adapter ..................................................... 2-6 Gigabit Interface Converter-Shortwave ............................................................ 2-7 Fibre Channel Cables........................................................................................ 2-7 Configuring and Cabling the MA8000/EMA12000 Storage Subsystem Components ............................................................................................................. 2-7 Configuring LUNS for Storagesets................................................................... 2-7 SCSI Cabling Examples.................................................................................... 2-8 UltraSCSI Cables.............................................................................................. 2-9 Using I/O Modules in the Controller Enclosure ............................................... 2-9 Connecting EMUs Between MA8000/EMA12000 Storage Subsystems........ 2-12 I/O Path Configurations for Redundant Fibre Channel Fabrics ............................. 2-13 Overview of Fibre Channel Fabric SAN Topology ........................................ 2-13 Redundant Fibre Channel Fabrics................................................................... 2-13 Multiple Redundant Fibre Channel Fabrics .................................................... 2-15 Maximum Distances Between Nodes and Shared Storage Subsystem Components .................................................................................................... 2-16 I/O Data Paths in a Redundant Fibre Channel Fabric ..................................... 2-17 I/O Path Definitions for Redundant Fibre Channel Fabrics ............................ 2-20 I/O Path Configuration Examples for Redundant Fibre Channel Fabrics....... 2-21 Summary of I/O Path Failure and Failover Scenarios for Redundant Fibre Channel Fabrics .............................................................................................. 2-25 I/O Path Configurations for Redundant Fibre Channel Arbitrated Loops.............. 2-26 Overview of FC-AL SAN Topology............................................................... 2-26 Redundant Fibre Channel Arbitrated Loops ................................................... 2-26 Multiple Redundant Fibre Channel Arbitrated Loops..................................... 2-28 Maximum Distances Between Nodes and Shared Storage Subsystem Components .................................................................................................... 2-30 I/O Data Paths in a Redundant FC-AL ........................................................... 2-32 I/O Path Definitions for Redundant FC-ALs .................................................. 2-34 I/O Path Configuration Examples for Redundant FC-ALs ............................. 2-35 Summary of I/O Path Failure and Failover Scenarios for Redundant FC-ALs ........................................................................................................... 2-38 Cluster Interconnect Requirements ........................................................................ 2-39 Ethernet Cluster Interconnect.......................................................................... 2-39 Local Area Network........................................................................................ 2-42 Chapter 3 Cluster Software Components Overview of the Cluster Software............................................................................ 3-1 Microsoft Windows 2000 Advanced Server ............................................................ 3-2 Compaq Software..................................................................................................... 3-2 Compaq SmartStart and Support Software ....................................................... 3-2 Compaq System Configuration Utility ............................................................. 3-3 Compaq Insight Manager.................................................................................. 3-3 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM Contents v Compaq Insight Manager XE ........................................................................... 3-3 Compaq Options ROMPaq ............................................................................... 3-4 Compaq StorageWorks Command Console ..................................................... 3-4 Compaq StorageWorks Secure Path for Windows 2000 .................................. 3-4 Compaq Operating System Dependent Modules.............................................. 3-5 Oracle Software ....................................................................................................... 3-5 Oracle8i Server Enterprise Edition ................................................................... 3-6 Oracle8i Server ................................................................................................. 3-6 Oracle8i Parallel Server Option........................................................................ 3-6 Oracle8i Enterprise Manager............................................................................ 3-7 Oracle8i Certification ....................................................................................... 3-7 Application Failover and Reconnection Software ................................................... 3-7 Chapter 4 Cluster Planning Site Planning............................................................................................................ 4-2 Capacity Planning for Cluster Hardware ................................................................. 4-2 Compaq ProLiant Servers................................................................................. 4-2 Planning Shared Storage Components for Redundant Fibre Channel Fabrics .............................................................................................................. 4-3 Planning Shared Storage Components for Redundant Fibre Channel Arbitrated Loops............................................................................................... 4-4 Planning Cluster Interconnect and Client LAN Components ........................... 4-6 Planning Cluster Configurations for Redundant Fibre Channel Fabrics.................. 4-7 Planning Dual Redundancy Configurations ..................................................... 4-7 Planning Quad Redundancy Configurations..................................................... 4-9 Planning Cluster Configurations for Redundant Fibre Channel Arbitrated Loops ..................................................................................................................... 4-11 Planning Dual Redundancy Configurations ................................................... 4-11 Planning Quad Redundancy Configurations................................................... 4-13 RAID Planning for the MA8000/EMA12000 Storage Subsystem ........................ 4-15 Supported RAID Levels ................................................................................. 4-16 Raw Data Storage and Database Size ............................................................. 4-17 Selecting the Appropriate RAID Level .......................................................... 4-18 Planning the Grouping of Physical Disk Storage Space ........................................ 4-19 Disk Drive Planning .............................................................................................. 4-20 Nonshared Disk Drives................................................................................... 4-20 Shared Disk Drives......................................................................................... 4-20 Network Planning .................................................................................................. 4-21 Windows 2000 Advanced Server Host Files for an Ethernet Cluster Interconnect .................................................................................................... 4-21 Client LAN ..................................................................................................... 4-22 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM vi Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Chapter 5 Installation and Configuration Reference Materials for Installation......................................................................... 5-1 Installation Overview............................................................................................... 5-2 Installing the Hardware............................................................................................ 5-3 Setting Up the Nodes ........................................................................................ 5-3 Installing the KGPSA-BC and KGPSA-CB Host Adapters.............................. 5-4 Installing GBIC-SW Modules for the Host Adapters ....................................... 5-4 Cabling the Host Adapters to Fibre Channel SAN Switches or Storage Hubs .................................................................................................................. 5-5 Installing the Cluster Interconnect Adapters..................................................... 5-6 Installing the Client LAN Adapters .................................................................. 5-6 Installing GBIC-SW Modules for the Array Controllers .................................. 5-6 Installing Hardware Into an MA8000/EMA12000 Storage Subsystem .......... 5-10 Cabling the Controller Enclosure to Disk Enclosures..................................... 5-11 Cabling EMUs to Each Other ......................................................................... 5-13 Cabling Array Controllers to Fibre Channel SAN Switches........................... 5-14 Cabling Array Controllers to Storage Hubs .................................................... 5-15 Installing Operating System Software.................................................................... 5-16 Guidelines for Clusters ................................................................................... 5-17 Automated Installation Using SmartStart ....................................................... 5-17 Setting up and Configuring an MA8000/EMA12000 Storage Subsystem............. 5-21 Designating a Server as a Maintenance Terminal........................................... 5-21 Powering On the MA8000/EMA12000 Storage Subsystem........................... 5-21 Installing the StorageWorks Command Console (SWCC) Client................... 5-22 Configuring a Storage Subsystem for Secure Path Operation ........................ 5-22 Verifying Array Controller Properties ............................................................ 5-27 Configuring a Storageset................................................................................. 5-29 Installing Secure Path Software for Windows 2000 .............................................. 5-32 Overview of Secure Path Software Installation .............................................. 5-32 Description of the Secure Path Software ........................................................ 5-33 Installing the Host Adapter Drivers ................................................................ 5-33 Installing the Fibre Channel Software Setup (FCSS) Utility .......................... 5-34 Installing the Secure Path Drivers, Secure Path Agent, and Secure Path Manager .......................................................................................................... 5-35 Specifying the Preferred_Path for Storage Units ............................................ 5-36 Powering Up All Other Fibre Channel SAN Switches or Storage Hubs ........ 5-38 Creating Partitions .......................................................................................... 5-38 Installing Compaq OSDs ....................................................................................... 5-39 Verifying Cluster Communications ................................................................ 5-40 Mounting Remote Drives and Verifying Administrator Privileges ................ 5-41 Installing the Ethernet OSDs .......................................................................... 5-42 Installing Oracle Software ..................................................................................... 5-52 Configuring Oracle8i Software .............................................................................. 5-53 Installing Object Link Manager ............................................................................. 5-53 Additional Notes on Configuring Oracle Software......................................... 5-54 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM Contents vii Verifying the Hardware and Software Installation ................................................ 5-55 Cluster Communications ................................................................................ 5-55 Access to Shared Storage from All Nodes...................................................... 5-55 OSDs .............................................................................................................. 5-55 Other Verification Tasks ................................................................................ 5-56 Power Distribution and Power Sequencing Guidelines ......................................... 5-56 Overview ........................................................................................................ 5-56 Server Power Distribution .............................................................................. 5-57 Storage Subsystem Power Distribution .......................................................... 5-57 Power Sequencing .......................................................................................... 5-58 Chapter 6 Cluster Management Cluster Management Concepts ................................................................................ 6-2 Powering Off a Node Without Interrupting Cluster Services ........................... 6-2 Managing a Cluster in a Degraded Condition................................................... 6-2 Managing Network Clients Connected to a Cluster ......................................... 6-3 Cluster Events................................................................................................... 6-3 Management Applications ....................................................................................... 6-4 Monitoring Server and Network Hardware ...................................................... 6-4 Monitoring Storage Subsystem Hardware........................................................ 6-5 Managing Shared Storage................................................................................. 6-5 Monitoring the Database .................................................................................. 6-7 Remotely Managing a Cluster .......................................................................... 6-7 Software Maintenance ............................................................................................. 6-8 Deinstalling the OSDs ...................................................................................... 6-8 Upgrading Oracle8i Server ............................................................................. 6-11 Upgrading the OSDs....................................................................................... 6-11 Deinstalling a Partial OSD Installation........................................................... 6-13 Managing Changes to Shared Storage Components .............................................. 6-14 Adding New Storagesets to Increase Shared Storage Capacity ...................... 6-14 Replacing a Failed Drive in a Storage Subsystem .......................................... 6-15 Replacing a Host Adapter............................................................................... 6-16 Adding a Shared Storage Subsystem .............................................................. 6-19 Replacing a Cluster Node ...................................................................................... 6-19 Removing the Node ........................................................................................ 6-20 Adding the Replacement Node....................................................................... 6-20 Adding a Cluster Node .......................................................................................... 6-24 Preparing the New Node................................................................................. 6-25 Preparing the Existing Cluster Nodes............................................................. 6-27 Installing the Cluster Software ....................................................................... 6-27 Monitoring Cluster Performance ........................................................................... 6-29 Tools Overview .............................................................................................. 6-29 Using Secure Path Manager............................................................................ 6-30 Uninstalling Secure Path ................................................................................ 6-33 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM viii Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Chapter 7 Troubleshooting Basic Troubleshooting Tips ..................................................................................... 7-2 Power ................................................................................................................ 7-2 Physical Connections ........................................................................................ 7-2 Access to Cluster Components ......................................................................... 7-3 Software Revisions ........................................................................................... 7-3 Firmware Revisions .......................................................................................... 7-4 Troubleshooting Oracle8i and OSD Installation Problems and Error Messages ..... 7-5 Potential Difficulties Installing the OSDs with the Oracle Universal Installer ............................................................................................................. 7-5 Unable to Start OracleCMService..................................................................... 7-6 Unable to Start OracleNMService .................................................................... 7-6 Unable to Start the Database............................................................................. 7-7 Initialization of the Dynamic Link Library NM.DLL Failed............................ 7-7 Troubleshooting Node-to-Node Connectivity Problems.......................................... 7-7 Nodes Are Unable to Communicate with Each Other ...................................... 7-7 Unable to Ping the Cluster Interconnect or the Client LAN ............................. 7-8 Troubleshooting Client-to-Cluster Connectivity Problems...................................... 7-9 A Network Client Cannot Communicate With the Cluster............................... 7-9 Troubleshooting Shared Storage Subsystem Problems.......................................... 7-10 Verifying Host Adapter Device Driver Installation ........................................ 7-10 Verifying KGPSA-BC Device Driver Initialization ....................................... 7-10 Verifying Connectivity to a Redundant Fibre Channel Fabric........................ 7-12 Verifying Connectivity to a Redundant Fibre Channel Arbitrated Loop ........ 7-13 A Cluster Node Cannot Connect to the Shared Drives ................................... 7-15 Disk Management Shows Storagesets With the Same Label (Dual Image).... 7-15 Device or Devices Were Not Found by KGPSA-BC Device Driver .............. 7-15 Devices on One I/O Connection Path Cannot Be Seen by the Cluster Nodes .............................................................................................................. 7-16 Troubleshooting Secure Path ................................................................................. 7-18 Secure Path Guidelines for Windows 2000 Advanced Server ........................ 7-18 Secure Path Manager Shows Reversed Locations for Top and Bottom Array Controllers ............................................................................................ 7-20 Secure Path Manager Cannot Start With Hosts That Use Hyphenated Host Names ............................................................................................................. 7-20 Secure Path Manager Is Delayed In Reporting Path Failure Information....... 7-20 The Addition of New LUNs Causes an Error ................................................. 7-21 A Configuration of More Than 64 LUNs Prevents the Secure Path Agent From Starting .................................................................................................. 7-21 Appendix A Diagnosing and Resolving Shared Disk Problems Introduction ............................................................................................................. A-1 Run Object Link Manager on All Nodes ................................................................ A-3 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM Contents ix Restart All Affected Nodes in the Cluster ...............................................................A-4 Rerun and Validate Object Link Manager On All Affected Nodes .........................A-4 Run and Validate Secure Path Manager On All Nodes ...........................................A-5 Run Disk Management On All Nodes .....................................................................A-5 Run and Validate the StorageWorks Command Console From All Storage Subsystems ..............................................................................................................A-6 Perform Cluster Software and Firmware Checks ....................................................A-6 Perform Cluster Hardware Checks ..........................................................................A-7 Contact Your Compaq Support Representative .......................................................A-8 Glossary Index Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM x Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide List of Figures Figure 1-1. Example of a two-node PDC/O5000 cluster ........................................ 1-3 Figure 2-1. SCSI bus numbers for I/O modules in the controller enclosure ......... 2-10 Figure 2-2. UltraSCSI cabling between a controller enclosure and three disk enclosures ................................................................................................. 2-11 Figure 2-3. UltraSCSI cabling between a controller enclosure and six disk enclosures ......................................................................................................... 2-12 Figure 2-4. Two-node PDC/O5000 with a four-fabric redundant Fibre Channel Fabric ................................................................................................. 2-14 Figure 2-5. Two-node PDC/O5000 with two redundant Fibre Channel Fabrics .............................................................................................................. 2-16 Figure 2-6. Maximum distances between PDC/O5000 cluster nodes and shared storage subsystem components in a redundant Fibre Channel Fabric................................................................................................................ 2-17 Figure 2-7. Host adapter-to-Fibre Channel SAN Switch I/O data paths ............... 2-18 Figure 2-8. Fibre Channel SAN Switch-to-array controller I/O data paths........... 2-19 Figure 2-9. Dual redundancy configuration for a redundant Fibre Channel Fabric................................................................................................................ 2-22 Figure 2-10. Quad redundancy configuration for a redundant Fibre Channel Fabric................................................................................................................ 2-24 Figure 2-11. Two-node PDC/O5000 with a four-loop redundant Fibre Channel Arbitrated Loop.................................................................................. 2-27 Figure 2-12. Two-node PDC/O5000 with two redundant Fibre Channel Arbitrated Loops .............................................................................................. 2-29 Figure 2-13. Maximum distances between PDC/O5000 cluster nodes and shared storage subsystem components in a redundant FC-AL ......................... 2-31 Figure 2-14. Host adapter-to-Storage Hub I/O data paths..................................... 2-32 Figure 2-15. Storage Hub-to-array controller I/O data paths ................................ 2-33 Figure 2-16. Dual redundancy configuration for a redundant FC-AL .................. 2-36 Figure 2-17. Quad redundancy configuration for a redundant FC-AL.................. 2-37 Figure 2-18. Redundant Ethernet cluster interconnect for a two-node PDC/O5000 cluster .......................................................................................... 2-41 Figure 4-1. Dual redundancy configuration for a redundant Fibre Channel Fabric.................................................................................................................. 4-8 Figure 4-2. Quad redundancy configuration for a redundant Fibre Channel Fabric................................................................................................................ 4-10 Figure 4-3. Dual redundancy configuration for a redundant FC-AL .................... 4-12 Figure 4-4. Quad redundancy configuration for a redundant FC-AL.................... 4-14 Figure 4-5 MA8000/EMA12000 Storage Subsystem disk grouping for a PDC/O5000 cluster .......................................................................................... 4-19 Figure 5-1. Connecting host adapters to Fibre Channel SAN Switches or Storage Hubs ...................................................................................................... 5-5 Figure 5-2. Redundant Ethernet cluster interconnect for a two-node PDC/O5000 cluster ............................................................................................ 5-8 Figure 5-3. SCSI bus numbers for I/O modules in the controller enclosure ......... 5-11 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM Contents xi Figure 5-4. UltraSCSI cabling between a controller enclosure and three disk enslosures ................................................................................................. 5-12 Figure 5-5. UltraSCSI cabling between a controller enclosure and six disk enclosures......................................................................................................... 5-13 Figure 5-6. Dual redundancy configuration for a redundant Fibre Channel Fabric ............................................................................................................... 5-15 Figure 5-7. Dual redundancy configuration for a redundant FC-AL .................... 5-16 Figure 5-8. Server power distribution in a three-node cluster............................... 5-57 Figure A-1. Tasks for diagnosing and resolving shared storage problems .............A-2 List of Tables Table 2-1 High-Availability Components of ProLiant Servers ............................... 2-3 Table 2-2 SCSI bus address ID assignments for the MA8000/EMA12000 Storage Subsystem ............................................................................................. 2-9 Table 2-3 I/O Path Failure and Failover Scenarios for Redundant Fibre Channel Fabrics ............................................................................................... 2-25 Table 2-4 I/O Path Failure and Failover Scenarios for Redundant FC-ALs .......... 2-38 Table 5-1 Controller Properties ............................................................................. 5-28 Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM About This Guide Purpose This administrator guide provides information about the planning, installation, configuration, implementation, management, and troubleshooting of the Compaq Parallel Database Cluster Model PDC/O5000 on Oracle8i software running on the Microsoft Windows 2000 Advanced Server operating system. Audience The expected audience of this guide consists primarily of MIS professionals whose jobs include designing, installing, configuring, and maintaining Compaq Parallel Database Clusters. The audience of this guide must have a working knowledge of Microsoft Windows 2000 Advanced Server and Oracle databases or have the assistance of a database administrator. This guide contains information for network administrators, database administrators, installation technicians, systems integrators, and other technical personnel in the enterprise environment for the purpose of cluster planning, installation, implementation, and maintenance. IMPORTANT: This guide contains installation, configuration, and maintenance information that can be valuable for a variety of users. If you are installing the PDC/O5000 but will not be administering the cluster on a daily basis, please make this guide available to the person or persons who will be responsible for the clustered servers after you have completed the installation. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM xiv Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Scope This guide offers significant background information about clusters as well as basic concepts associated with designing clusters. It also contains detailed product descriptions and installation steps. This administrator guide is designed to assist you in the following objectives: I Understanding basic concepts of clustering technology I Recognizing and using the high-availability features of the PDC/O5000 I Planning and designing a PDC/O5000 cluster configuration to meet your business needs I Installing and configuring PDC/O5000 hardware and software I Managing the PDC/O5000 I Troubleshooting the PDC/O5000 The following summarizes the contents of this guide: I Chapter 1, "Clustering Overview," provides an introduction to clustering technology features and benefits. I Chapter 2, "Cluster Architecture," describes the hardware components of the PDC/O5000 and provides detailed I/O path configuration information. I Chapter 3, "Cluster Software Components," describes software components used with the PDC/O5000. I Chapter 4, "Cluster Planning," outlines an approach to planning and designing cluster configurations that meet your business needs. I Chapter 5, "Installation and Configuration," outlines the steps you will take to install and configure the PDC/O5000 hardware and software. I Chapter 6, "Cluster Management," includes techniques for managing and maintaining the PDC/O5000. I Chapter 7, "Troubleshooting," contains troubleshooting information for the PDC/O5000. I Appendix A, "Diagnosing and Resolving Shared Disk Problems," describes procedures to diagnose and resolve shared disk problems. I Glossary contains definitions of terms used in this guide. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM About This Guide xv Some clustering topics are mentioned, but not detailed, in this guide. For example, this guide does not describe how to install and configure Oracle8i on a cluster. For information about these topics, see the referenced and supplemental documents listed in subsequent sections. Referenced Manuals For additional information, refer to documentation related to the specific hardware and software components of the Compaq Parallel Database Cluster. These related manuals include, but are not limited to: I Documentation related to the ProLiant servers you are clustering (for example, guides, posters, and performance and tuning guides) I Compaq StorageWorks documentation provided with the MA8000/EMA12000 Storage Subsystem, HSG80 Array Controller, Fibre Channel SAN Switches, Storage Hubs, and the KGPSA-BC or KGPSA-CB Host Adapter I Microsoft Windows 2000 Advanced Server documentation G Microsoft Windows 2000 Advanced Server Administrator's Guide I Oracle8i documentation, including: G Oracle8i Parallel Server Setup and Configuration Guide G Oracle8i Parallel Server Concepts G Oracle8i Parallel Server Administration, Deployment, and Performance G Oracle Enterprise Manager Administrator's Guide G Oracle Enterprise Manager Configuration Guide G Oracle Enterprise Manager Concepts Guide Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM xvi Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Supplemental Documents The following technical documents contain important supplemental information for the Compaq Parallel Database Cluster Model PDC/O5000: I Compaq Parallel Database Cluster Model PDC/O5000 Certification Matrix for Windows 2000, at www.compaq.com/solutions/enterprise/ha-pdc.html I Configuring Compaq RAID Technology for Database Servers, at www.compaq.com/highavailability I Various technical white papers on Oracle and cluster sizing, which are available from Compaq ActiveAnswers website, at www.compaq.com/activeanswers Text Conventions This document uses the following conventions to distinguish elements of text: Keys Keys appear in boldface. A plus sign (+) between two keys indicates that they should be pressed simultaneously. USER INPUT User input appears in a different typeface and in uppercase. FILENAMES File names appear in uppercase italics. Menu Options, These elements appear in initial capital letters, and Command Names, may be bolded for emphasis. Dialog Box Names COMMANDS, These elements appear in uppercase. DIRECTORY NAMES, and DRIVE NAMES Type When you are instructed to type information, type the information without pressing the Enter key. Enter When you are instructed to enter information, type the information and then press the Enter key. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM About This Guide xvii Symbols in Text These symbols may be found in the text of this guide. They have the following meanings: WARNING: Text set off in this manner indicates that failure to follow directions in the warning could result in bodily harm or loss of life. CAUTION: Text set off in this manner indicates that failure to follow directions could result in damage to equipment or loss of information. IMPORTANT: Text set off in this manner presents clarifying information or specific instructions. NOTE: Text set off in this manner presents commentary, sidelights, or interesting points of information. Symbols on Equipment These icons may be located on equipment in areas where hazardous conditions may exist. Any surface or area of the equipment marked with these symbols indicates the presence of electrical shock hazards. Enclosed area contains no operator serviceable parts. WARNING: To reduce the risk of injury from electrical shock hazards, do not open this enclosure. Any RJ-45 receptacle marked with these symbols indicates a Network Interface Connection. WARNING: To reduce the risk of electrical shock, fire, or damage to the equipment, do not plug telephone or telecommunications connectors into this receptacle. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM xviii Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Any surface or area of the equipment marked with these symbols indicates the presence of a hot surface or hot component. If this surface is contacted, the potential for injury exists. WARNING: To reduce the risk of injury from a hot component, allow the surface to cool before touching. Power Supplies or Systems marked with these symbols indicate the equipment is supplied by multiple sources of power. WARNING: To reduce the risk of injury from electrical shock, remove all power cords to completely disconnect power from the system. Rack Stability WARNING: To reduce the risk of personal injury or damage to the equipment, be sure that: I The leveling jacks are extended to the floor. I The full weight of the rack rests on the leveling jacks. I The stabilizing feet are attached to the rack if it is a single rack installations. I The racks are coupled together in multiple rack installations. I Only one component is extended at a time. A rack may become unstable if more than one component is extended for any reason. Getting Help If you have a problem and have exhausted the information in this guide, you can get further information and other help in the following locations. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM About This Guide xix Compaq Technical Support You are entitled to free hardware technical telephone support for your product for as long you own the product. A technical support specialist will help you diagnose the problem or guide you to the next step in the warranty process. In North America, call the Compaq Technical Phone Support Center at 1 1-800-OK-COMPAQ . This service is available 24 hours a day, 7 days a week. Outside North America, call the nearest Compaq Technical Support Phone Center. Telephone numbers for world wide Technical Support Centers are listed on the Compaq website. Access the Compaq website by logging on to the Internet at www.compaq.com Be sure to have the following information available before you call Compaq: I Technical support registration number (if applicable) I Product serial number(s) I Product model name(s) and numbers(s) I Applicable error messages I Add-on boards or hardware I Third-party hardware or software I Operating system type and revision level I Detailed, specific questions Compaq Website The Compaq website has information on this product as well as the latest drivers and Flash ROM images. You can access the Compaq website by logging on to the Internet at www.compaq.com 1 For continuous quality improvement, calls may be recorded or monitored. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM xx Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Compaq Authorized Reseller For the name of your nearest Compaq Authorized Reseller: I In the United States, call 1-800-345-1518. I In Canada, call 1-800-263-5868. I Elsewhere, see the Compaq website for locations and telephone numbers. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Comments: Part Number: 225081-002 File Name: a-frnt.doc Last Saved On: 6/18/01 11:11 AM 1 Chapter Clustering Overview For many years, companies have depended on clustered computer systems to fulfill two key requirements: to ensure users can access and process information that is critical to the ongoing operation of their business, and to increase the performance and throughput of their computer systems at minimal cost. These requirements are known as availability and scalability, respectively. Historically, these requirements have been fulfilled with clustered systems built on proprietary technology. Over the years, open systems have progressively and aggressively moved proprietary technologies into industry-standard products. Clustering is no exception. Its primary features, availability and scalability, have been moving into client/server products for the last few years. The absorption of clustering technologies into open systems products is creating less expensive, non-proprietary solutions that deliver levels of function commonly found in traditional clusters. While some uses of the proprietary solutions will always exist, such as those controlling stock exchange trading floors and aerospace mission controls, many critical applications can reach the desired levels of availability and scalability with non-proprietary client/server-based clustering. These clustering solutions use industry-standard hardware and software, thereby providing key clustering features at a lower price than proprietary clustering systems. Before examining the features and benefits of the Compaq Parallel Database Cluster Model PDC/O5000 (referred to here as the PDC/O5000), it is helpful to understand the concepts and terminology of clustered systems. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Comments: Part Number: 225081-002 File Name: b-ch1 Clustering Overview.doc Last Saved On: 6/18/01 11:11 AM 1-2 Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Clusters Defined A cluster is an integration of software and hardware products that enables a set of loosely coupled servers and shared storage subsystem components to present a single system image to clients and to operate as a single system. As a cluster, the group of servers and shared storage subsystem components offers a level of availability and scalability far exceeding that obtained if each cluster node operated as a stand-alone server. The PDC/O5000 uses the Oracle8i Parallel Server software, which is a parallel database that can distribute its workload among the cluster nodes. Refer to Chapter 3, "Cluster Software Components" to determine the specific releases your cluster kit supports. Figure 1-1 shows an example of a PDC/O5000 that includes two nodes (Compaq ProLiantTM servers), two sets of storage subsystems, two Compaq StorageWorksTM Fibre Channel SAN Switches or Compaq StorageWorks Fibre Channel Storage Hubs, a redundant cluster interconnect, and a client local area network (LAN). Compaq Confidential Need to Know Required Writer: John Blackburn Project: Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Comments: Part Number: 225081-002 File Name: b-ch1 Clustering Overview.doc Last Saved On: 6/18/01 11:11 AM Clustering Overview 1-3 Client LAN Switch/Hub Cluster Interconnect Host Adapters (2) Host Adapters (2) Node 1 Node 2 Fibre Channel SAN Fibre Channel SAN Switch/Storage Hub #2 Switch/Storage Hub #1 P1 P2 A P1 P2 A P1 P2 P1 P2 B B Storage Subsystem #1 Storage Subsystem #2 Figure 1-1. Example of a two-node PDC/O5000 cluster The PDC/O5000 can support redundant Fibre Channel Fabric Storage Area Network (SAN) and redundant Fibre Channel Arbitrated Loop (FC-AL) SAN topologies. In the example shown in Figure 1-1, the clustered nodes are connected to the database on the shared storage subsystems through a redundant Fibre Channel Fabric or redundant FC-AL. Clients access the database through the client LAN, and the cluster nodes communicate across an Ethernet cluster interconnect. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Comments: Part Number: 225081-002 File Name: b-ch1 Clustering Overview.doc Last Saved On: 6/18/01 11:11 AM 1-4 Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Availability When computer systems experience outages, the amount of time the system is unavailable is referred to as downtime. Downtime has several primary causes: hardware faults, software faults, planned service, operator error, and environmental factors. Minimizing downtime is a primary goal of a cluster. Simply defined, availability is the measure of how well a computer system can continuously deliver services to clients. Availability is a system-wide endeavor. The hardware, operating system, and applications must be designed for availability. Clustering requires stability in these components, then couples them in such a way that failure of one item does not render the system unusable. By using redundant components and mechanisms that detect and recover from faults, clusters can greatly increase the availability of applications critical to business operations. Scalability Simply defined, scalability is a computer system characteristic that enables improved performance or throughput when supplementary hardware resources are added. Scalable systems allow increased throughput by adding components to an existing system without the expense of adding an entire new system. In a stand-alone server configuration, scalable systems allow increased throughput by adding processors or more memory. In a cluster configuration, this result is usually obtained by adding cluster nodes. Not only must the hardware benefit from additional components, but also software must be constructed in such a way as to take advantage of the additional processing power. Oracle8i Parallel Server distributes the workload among the cluster nodes. As more nodes are added to the cluster, cluster-aware applications can use the parallel features of Oracle8i Parallel Server to distribute workload among more servers, thereby obtaining greater throughput. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Comments: Part Number: 225081-002 File Name: b-ch1 Clustering Overview.doc Last Saved On: 6/18/01 11:11 AM Clustering Overview 1-5 Compaq Parallel Database Cluster Overview As traditional clustering technology has moved into the open systems of client/server computing, Compaq has provided innovative, customer-focused solutions. The PDC/O5000 moves client/server computing one step closer to the capabilities found in expensive, proprietary cluster solutions, at a fraction of the cost. The PDC/O5000 combines the popular Microsoft Windows 2000 Advanced Server operating system and the industry-leading Oracle8i Parallel Server with award-winning Compaq ProLiant servers and shared storage subsystems. Compaq Confidential Need to Know Required Writer: John Blackburn Project: Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Comments: Part Number: 225081-002 File Name: b-ch1 Clustering Overview.doc Last Saved On: 6/18/01 11:11 AM 2 Chapter Cluster Architecture The Compaq Parallel Database Cluster Model PDC/O5000 (referred to here as the PDC/O5000) is an integration of a number of different hardware and software products. This chapter discusses how these products play a role in bringing a complete clustering solution to your computing environment. The hardware products include: I Compaq ProLiant servers I Shared storage components G Compaq StorageWorks Modular Array 8000 Fibre Channel Storage Subsystems or the Compaq StorageWorks Enterprise Modular Array 12000 Fibre Channel Storage Subsystems (MA8000/EMA12000 Storage Subsystems) G Compaq StorageWorks HSG80 Array Controllers (HSG80 Array Controllers) G Compaq StorageWorks Fibre Channel SAN Switches (Fibre Channel SAN Switches) for redundant Fibre Channel Fabrics G Compaq StorageWorks Storage Hubs (Storage Hubs) for redundant Fibre Channel Arbitrated Loops G KGPSA-BC PCI-to-Optical Fibre Channel Host Adapters (KGPSA-BC Host Adapters) or KGPSA-CB PCI-to-Optical Fibre Channel Host Adapters (KGPSA-CB Host Adapters) Compaq Confidential Need to Know Required Writer: John Blackburn Project: Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Comments: Part Number: 225081-002 File Name: c-ch2 Cluster Architecture.doc Last Saved On: 6/18/01 11:13 AM 2-2 Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide G Gigabit Interface Converter-Shortwave (GBIC-SW) modules G Fibre Channel cables I Cluster interconnect components G Ethernet NIC adapters G Ethernet cables G Ethernet switches/hubs The software products include: I Microsoft Windows 2000 Advanced Server with Service Pack 1 or later I Compaq drivers and utilities I Oracle8i Enterprise Edition with the Oracle8i Parallel Server Option For a description of the software products used with the PDC/O5000, refer to Chapter 3, "Cluster Software Components." Compaq ProLiant Servers A primary component of any cluster is the server. Each Compaq Parallel Database Cluster Model PDC/O5000 consists of cluster nodes in which each node is a Compaq ProLiant server. With some exceptions, all nodes in a PDC/O5000 must be identical in model. In addition, all components common to all nodes in a cluster, such as memory, number of CPUs, and the interconnect adapters, should be identical and identically configured. NOTE: Certain restrictions apply to the server models and server configurations that are supported by the Compaq Parallel Database Cluster. For a current list of PDC-certified servers and details on supported configurations, refer to the Compaq Parallel Database Cluster Model PDC/O5000 Certification Matrix for Windows 2000. This document is available on the Compaq website at www.compaq.com/solutions/enterprise/ha-pdc.html Compaq Confidential Need to Know Required Writer: John Blackburn Project: Compaq Parallel Database Cluster Model PDC/O5000 for Oracle8i and Windows 2000 Administrator Guide Comments: Part Number: 225081-002 File Name: c-ch2 Cluster Architecture.doc Last Saved On: 6/18/01 11:13 AM Cluster Architecture 2-3 High-Availability Features of ProLiant Servers In addition to the increased application and data availability enabled by clustering, ProLiant servers include many reliability features that provide a solid foundation for effective clustered server solutions. The PDC/O5000 is based on ProLiant servers, most of which offer excellent reliability through redundant power supplies, redundant cooling fans, and Error Checking and Correcting (ECC) memory. The high-availability features of ProLiant servers are a critical foundation of Compaq clustering products. Table 2-1 lists the high-availability features found in many ProLiant servers. Table 2-1 High-Availability Components of ProLiant Servers Hot-pluggable hard drives Redundant power supplies Digital Linear Tape (DLT) Array (optional) ECC-protected processor-memory bus Uninterruptible power supplies (optional) Redundant processor power modules ECC memory PCI Hot Plug slots (in some servers) Offline backup processor Redundant cooling fans Shared Storage Components The PDC/O5000 is based on a cluster architecture known as shared storage clustering, in which clustered nodes share access to a common set of shared disk drives. In this discussion, the shared storage includes these components: I MA8000/EMA12000 Storage Subsystem I HSG80 Array Controllers I Fibre Channel SAN Switches for redundant Fibre Channel Fabrics
| 225081-001 225081-002 |