RAC: Frequently Asked Questions (Doc ID 220970.1)

5000阅读 0评论2014-04-24 iniestandroid
分类:Oracle

APPLIES TO:

Oracle Database - Enterprise Edition - Version 9.2.0.1 to 12.1.0.1 [Release 9.2 to 12.1]
Information in this document applies to any platform.

PURPOSE

Frequently Asked Questions for Real Application Clusters and Grid Infrastructure.

QUESTIONS AND ANSWERS

General RAC

RAC Assistance

High Availability

High Availability -- FAN/FCF

Scalability

Manageability

Platform Specific

Platform Specific -- Linux

Platform Specific -- Solaris

Platform Specific -- HP-UX

Platform Specific -- Windows

Platform Specific -- IBM AIX

Platform Specific -- IBM-z/OS (Mainframe)

Other Applications & RAC

Diagnosibility

EBusiness Suite with RAC

Clustered File Systems

Oracle Clusterware

Streched/Extended RAC -- No Sub Category

Cluster Verification Utility (CVU) -- No Sub Category

RAC One Node -- No Sub Category

Grid Infrastructure -- No Sub Category

Cluster Health Monitor (IPD/OS) -- No Sub Category

QoS Management -- No Sub Category

Answers

What kind of HW components do you recommend for the interconnect?

The general recommendation for the interconnect is to provide the highest bandwidth interconnect, together with the lowest latency protocol that is available for a given platform. In practice, Gigabit Ethernet with UDP has proven sufficient in every case it has been implemented, and tends to be the lowest common denominator across platforms.


Is rcp and/or rsh required for normal Oracle RAC operation ?

rcp"" and ""rsh"" are not required for normal Oracle RAC operation. However in older versions ""rsh"" and ""rcp"" should to be enabled for Oracle RAC and patchset installation. In later releases, ssh is used for these operations. 
Note Oracle Enterprise Manager uses rsh.


What is Cache Fusion and how does this affect applications?

Cache Fusion is a new parallel database architecture for exploiting clustered computers to achieve scalability of all types of applications. Cache Fusion is a shared cache architecture that uses high speed low latency interconnects available today on clustered systems to maintain database cache coherency. Database blocks are shipped across the interconnect to the node where access to the data is needed. This is accomplished transparently to the application and users of the system. As Cache Fusion uses at most a 3 point protocol, this means that it easily scales to clusters with a large numbers of nodes. For more information about cache fusion see the following links:

Additional Information can be found at:

Note: 139436.1 Understanding 9i Real Application Clusters Cache Fusion


Is it difficult to transition (migrate) from Single Instance to Oracle RAC?

If the cluster and the cluster software are not present, these components must be installed and configured.  The Oracle RAC option must be added using the Oracle Universal Installer, which necessitates the existing DB instance must be shut down.  There are no changes necessary on the user data within the database.  However, a shortage of freelists and freelist groups can causecontention with header blocks of tables and indexes as multiple instances vie for the same block.  This may cause a performance problem and require data partitioning.  However, the need for these changes should be rare. 

Recommendation: apply automatic space segment management to perform these changes automatically.  The free space management will replace the freelists and freelist groups and is better.  The database requires one Redo thread and one Undo tablespace for each instance, which are easily added with SQL commands or with Enterprise Manager tools. NOTE: With ORacle RAC 11g Release 2, you do not neet to pre-create redo threads or undo tablespaces if you are using Oracle Managed Files (EG ASM).

Datafiles will need to be moved to either a clustered file system (CFS) so that all nodes can access them. Oracle recommends the use of Automatic Storage Management (ASM)  Also, the MAXINSTANCES parameter in the control file must be greater than or equal to number of instances you will start in the cluster.

For more detailed information, please see Migrating from single-instance to RAC in the Oracle Documentation.

With Oracle Database 10g Release 2, $ORACLE_HOME/bin/rconfig tool can be used to convert Single instance database to RAC. This tool takes in a xml input file and convert the Single Instance database whose information is provided in the xml. You can run this tool in "verify only" mode prior to performing actual conversion. This is documented in the Oracle RAC Admin book and a sample xml can be found $ORACLE_HOME/assistants/rconfig/sampleXMLs/ConvertToRAC.xml. This tool only supports databases using a clustered file system or ASM. You cannot use it with raw devices. Grid Control 10g Release 2 provides a easy to use wizard to perform this function. 

@ Note: Please be aware that you may hit  (shutdown immediate hangs) as you convert 
@ the database. The bug is updated with workaround and the w/a should is release noted as well.

Oracle Enterprise Manager includes workflows to assiste with migrations. (I.E. Migrating to ASM, Creating Standby, Converting Standby to RAC etc) The migration is automated in Enterprise Manager Grid Control 10.2.0.5.


What are the dependencies between OCFS and ASM in Oracle Database 10g ?

In an Oracle RAC 10g environment, there is no dependency between Automatic Storage Management (ASM) and Oracle Cluster File System (OCFS).

OCFS is not required if you are using Automatic Storage Management (ASM) for database files. You can use OCFS on Windows( Version 2 on Linux ) for files that ASM does not handle - binaries (shared oracle home), trace files, etc. Alternatively, you could place these files on local file systems even though it's not as convenient given the multiple locations. 

If you do not want to use ASM for your database files, you can still use OCFS for database files in Oracle Database 10g. 
@ Please refer to 


Do we have to have Oracle Database on all nodes?

Each node of a cluster that is being used for a clustered database will typically have the database and Oracle RAC software loaded on it, but not actual datafiles (these need to be available via shared disk). For example, if you wish to run Oracle RAC on 2 nodes of a 4-node cluster, you would need to install the clusterware on all nodes, Oracle RAC on 2 nodes and it would only need to be licensed on the two nodes running the Oracle RAC database. Note that using a clustered file system, or NAS storage can provide a configuration that does not necessarily require the Oracle binaries to be installed on all nodes.
With Oracle RAC 11g Release 2, if you are using policy managed databases, then you should have the Oracle RAC binaries accessible on all nodes in the cluster.


What software is necessary for Oracle RAC? Does it have a separate installation CD to order?

Oracle Real Application Clusters is an option of Oracle Database and therefore part of the Oracle Database CD. With Oracle 9i, Oracle 9i RAC is part of Oracle9i Enterprise Edition. If you install 9i EE onto a cluster, and the Oracle Universal Installer (OUI) recognizes the cluster, you will be provided the option of installing RAC. Most UNIX platforms require an OSD installation for the necessary clusterware. For Intel platforms (Linux and Windows), Oracle provides the OSD software within the Oracle9i Enterprise Edition release.

With Oracle Database 10g, Oracle RAC is an option of EE and available as part of SE. Oracle provides Oracle Clusterware on its own CD included in the database CD pack.

Please check the certification matrix (Note 184875.1) or with the appropriate platform vendor for more information.

With Oracle Database 11g Release 2, Oracle Clusterware and Automatic Storage Management are installed as a single set of binaries called the grid infrastructure. The media for the grid infrastructure is on a separate CD or under the grid directory. For standalone servers, Automatic Storage Management and Oracle Restart are installed as the grid infrastructure for a standalone server which is installed from the same media.


What Application Design considerations should I be aware of when moving to Oracle RAC?

The general principals are that fundamentally no different design and coding practices are required for RAC however application flaws in execution or design have a higher impact in RAC. The performance and scalability in RAC will be more sensitive to bad plans or bad schema design. Serializing contention makes applications less scalable. If your customer uses standard SQL and schema tuning, it solves > 80% of performance problems 

Some of the scaleability pitfalls they should look for are:
* Serializing contention on a small set of data/index blocks
--> monotonically increasing key
--> frequent updates of small cached tables
--> segment without automatic segment space management (ASSM) or Free List Group (FLG)

* Full table scans
--> Optimization for full scans in 11g can save CPU and latency

* Frequent invalidation and parsing of cursors
--> Requires data dictionary lookups and synchronizations

* Concurrent DDL ( e.g. truncate/drop )

Look for:
* Indexes with right-growing characteristics
--> Use reverse key indexes 
--> Eliminate indexes which are not needed

* Frequent updated and reads of “small” tables
--> “small”=fits into a single buffer cache 
--> Use Sparse blocks ( PCTFREE 99 ) to reduce serialization

* SQL which scans large amount of data
--> Perhaps more efficient when parallelized
--> Direct reads do not need to be globally synchronized ( hence less CPU for global cache )


Should the SCSI-3 reservation bit be set for our Oracle Clusterware only installation?

Oracle Clusterware and Oracle RAC do not require neither use SCSI-3 Persistent Group Reservation (PGR). In a native Oracle RAC Stack (no third party or vendor cluster, neither Oracle Solaris Cluster) SCSI-3 PGR is not required by Oracle and should be disabled on the storage (for disks / LUNs used in the stack). 

When using a third party or vendor cluster solution such as Symantec Veritas SFRAC, the third part cluster solution may require that SCSI-3 PGR is enabled on the storage, as those solutions will use SCSI-3 PGR as part of their IO fencing procedures. In general, SCSI-3 PGR is enabled at the array level; for example on EMC hypervolume level.

Additional information:

 

REFERENCES

NOTE:139436.1 - Understanding 9i Real Application Clusters Cache Fusion

 - CLUSTER NODES HANG AND THEN BUGCHECK ON REBOOTS

NOTE:151051.1 - Init.ora Parameter "CLUSTER_INTERCONNECTS" Reference Note

NOTE:180608.1 - Automatic Space Segment Management in RAC Environments
NOTE:358156.1 - Automatic Startup of Cluster Services May Hang or Fail on Windows

 - ALLOW DBCA TO ALLOW FOR MORE THAN 5 CHARACTERS FOR SID PREFIX

 - CRS STARTS INSTANCE WITH TOO LOW FILE LIMIT


NOTE:342443.1 - 10.2.0.x Oracle Database and Networking Patches for Microsoft Platforms
NOTE:464683.1 - Unexplained Database Slowdown Seen on Windows 2003 Service Pack 1
NOTE:726833.1 - Linux: Hangcheck-Timer Module Requirements for Oracle 9i, 10g, and 11gR1 RAC
NOTE:732683.1 - Cannot Start Instance Using SRVCTL but SQLPLUS Can
NOTE:743649.1 - Will an Operating System Upgrade Affect Oracle Clusterware?
NOTE:465001.1 - Configuring raw devices (singlepath) for Oracle Clusterware 10g Release 2 (10.2.0) on RHEL5/OEL5
NOTE:5187351.8 - Bug 5187351 - Many clscNN.log files created - can affect performance
NOTE:458485.1 - How to find whether the one-off Patches will conflict or not?
NOTE:559365.1 - Using Diagwait as a diagnostic to get more information for diagnosing Oracle Clusterware Node evictions
NOTE:751343.1 - Oracle Clusterware and RAC Support for RDS Over Infiniband
NOTE:759143.1 - NTP leap second event causing Oracle Clusterware node reboot
NOTE:554781.1 - RAC instabilities due to firewall (netfilter/iptables) enabled on the cluster interconnect
NOTE:279793.1 - How to Restore a Lost Voting Disk in 10g

NOTE:285267.1 - Oracle E-Business Suite 11i and Database FAQ
NOTE:283107.1 - Configuring Solaris IP Multipathing (IPMP) for the CRS 10g VIP
NOTE:238278.1 - Linux: What's OCFS or OCFS2
NOTE:249212.1 - Support Position for Oracle Products Running on VMWare Virtualized Environments
NOTE:219361.1 - Troubleshooting ORA-29740 in a RAC Environment
NOTE:314422.1 - Remote Diagnostic Agent (RDA) - Getting Started
NOTE:332257.1 - Using Oracle Clusterware with Vendor Clusterware FAQ
NOTE:337737.1 - Oracle Clusterware (CRS/GI) - ASM - Database Version Compatibility
NOTE:341788.1 - Recommendation for the Real Application Cluster Interconnect and Jumbo Frames

NOTE:1062983.1 - How to restore ASM based OCR after complete loss of the CRS diskgroup on Linux/Unix systems
NOTE:183340.1 - Frequently Asked Questions About the CLUSTER_INTERCONNECTS Parameter in 9i.
NOTE:184875.1 - How To Check The Certification Matrix for Real Application Clusters

NOTE:359515.1 - Mount Options for Oracle files when used with NFS on NAS devices
NOTE:368464.1 - How to Setup IPMP as Cluster Interconnect

NOTE:291958.1 - Setting Up Bonding in Suse SLES8
NOTE:291962.1 - Setting Up Bonding in SLES 9
NOTE:296874.1 - Configuring the HP-UX Operating System for the Oracle 10g and Oracle 11g VIP
NOTE:296856.1 - Configuring the IBM AIX 5L Operating System for the Oracle 10g VIP
NOTE:298891.1 - Linux: Configuring Bonding For Oracle 10g VIP or private interconnect
NOTE:300479.1 - 10g Rolling Upgrades with Logical Standby

NOTE:269320.1 - Removing a Node from a 10gR1 RAC Cluster
NOTE:276434.1 - How to Modify Public Network Information including VIP in Oracle Clusterware
NOTE:397460.1 - Oracle's Policy for Supporting Oracle RAC with Symantec SFRAC
NOTE:399482.1 - How to Recreate OCR/Voting Disk Accidentally Deleted
NOTE:405820.1 - 10.2.0.X CRS Bundle Patch Information
NOTE:414897.1 - Linux: How To Setup UDEV Rules For RAC OCR And Voting Devices On SLES10, RHEL5, OEL5, OL5
NOTE:420977.1 - How to Use listener.ora in Non-default Location for 10g RAC
NOTE:444134.1 - Using IBM GPFS 2.3 with Oracle Real Application Clusters 10g
NOTE:454607.1 - New Partitions in Windows 2003 RAC Environments Not Visible on Remote Nodes
NOTE:301138.1 - RACDDT User Guide
NOTE:77370.1 - What are the effects of changing the system clock on an Oracle Server instance?
NOTE:787420.1 - Cluster Interconnect in Oracle 10g and 11gR1 RAC
NOTE:805969.1 - 10g/11gR1: VIP does not relocate back to the home node even after the public network problem is resolved
上一篇:How to update the IP address of the SCAN VIP resources (Doc ID 952903.1)
下一篇:11.2.0.4 RAC数据库存储迁移