
Appro Cluster Engine™ (ACE) Management Software


Appro Cluster Engine™ (ACE) Management Software is part of the
Appro Xtreme-X™ Supercomputer Architecture. ACE offers a complete
management system covering Network Management, Server Management,
Cluster Management and Storage Management. ACE provides deep
insight into the entire computing infrastructure, delivering
precise control and ongoing optimization for expanding physical
and virtual Linux environments. The software suite offers an
easy-to-use, web-based management interface that gives customers
powerful tools to manage the software stack, including the
workload manager, resource management tools, remote control and
advanced power management functions. ACE monitors network and
server hardware so that problems are quickly identified and fixed,
and it automatically restarts jobs after a compute node failure.
ACE provides redundant, hierarchical system management, with all
system configuration data kept in a fault-tolerant database and
file system on the management servers. ACE also supports multiple
networking topologies, diskless configuration and network failover
to achieve maximum reliability, performance and high availability.
It features root file systems for instant provisioning of rapid,
standard Linux installs on large diskless systems, so that a
system boots in the same time whether it has 64 or 6,400 blades.
Features & Benefits
• Two-Tier Management Architecture: Scalable to large numbers
of servers, with minimal overhead and complete remote lights-out control
• High Availability, High Performance: Redundant management
servers and redundant networks
• Diskless Operation
- Improves performance
- Lowers administrative overhead
- Multi-level cached root file system
- Local storage supported
• Instant Cluster Provisioning
- Multiple clusters with the same configuration
- Revision management with roll-back capability
- Cluster hosts added or removed dynamically
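As an illustration of what revision management with roll-back can look like, here is a minimal sketch in Python. It is not ACE's implementation or API: the class `ClusterConfigHistory`, its methods, and the configuration keys `image` and `hosts` are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass, field


@dataclass
class ClusterConfigHistory:
    """Hypothetical revision store for one cluster's configuration."""
    revisions: list = field(default_factory=list)  # oldest revision first
    active: int = -1  # index of the revision currently in use

    def commit(self, config: dict) -> int:
        """Save a new revision, make it active, and return its index."""
        self.revisions.append(dict(config))
        self.active = len(self.revisions) - 1
        return self.active

    def rollback(self, steps: int = 1) -> dict:
        """Re-activate an earlier revision without deleting newer ones."""
        target = self.active - steps
        if target < 0:
            raise ValueError("no earlier revision to roll back to")
        self.active = target
        return self.revisions[self.active]

    def current(self) -> dict:
        """Return the configuration that is currently active."""
        return self.revisions[self.active]


history = ClusterConfigHistory()
history.commit({"image": "linux-base", "hosts": 64})   # revision 0
history.commit({"image": "linux-base", "hosts": 128})  # revision 1
history.rollback()  # back to revision 0; revision 1 is kept for later use
print(history.current())
```

Keeping every revision rather than overwriting the configuration in place is what makes a roll-back cheap: it only changes which stored revision is active.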
Simplifies IT Management & Administration
This complete lights-out, scalable cluster management system offers
robust capabilities, including:
• Network Management
• Server Management
• Cluster Management
• Storage Management
Technical Specifications
System Management
• Management of overall system configuration
• Supports redundant management servers with automatic failover
• Designed to anticipate and tolerate failures
• Supports enterprise-level availability requirements with 24 hr MTTR
Network Management
• Automatic discovery of interconnect hardware
• Supports multiple interconnect fabric topologies
• Supports redundant paths and networks
• Supports load balancing and failover
• Provides network status to the management system
Server Management
• Automatic discovery of server hardware
• Remote server control (Power On/Off, Cycle)
• Remote server initialization (Reset, Reboot, Shut Down)
• Support for scalable fast diskless or dataless booting for large node count systems
• Supports server redundancy and failover
• Provides server status to the management system
Cluster Management
• Supports partitioning a cluster into multiple logical computers
• Maps logical computers (clusters) onto servers (nodes)
• Supports multiple independent OS configurations
• Manages and monitors logical computer (cluster) status
• Provides cluster status to the management system
• Integrated job scheduling and management
• Manages and monitors operating system instances (nodes)
• Provides node status to the management system
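The idea of mapping logical computers (clusters) onto servers (nodes) can be sketched as a simple assignment of node names to named partitions. This is only an illustration under assumed names: the function `partition_nodes`, the node naming scheme, and the cluster names `prod` and `test` are hypothetical and do not describe how ACE performs the mapping.

```python
def partition_nodes(nodes, sizes):
    """Assign nodes to logical clusters in order.

    nodes: list of physical node names.
    sizes: dict mapping a logical cluster name to its node count.
    Each node joins at most one cluster; leftover nodes stay unassigned.
    """
    needed = sum(sizes.values())
    if needed > len(nodes):
        raise ValueError(f"need {needed} nodes, only {len(nodes)} available")
    mapping, start = {}, 0
    for name, count in sizes.items():
        mapping[name] = nodes[start:start + count]
        start += count
    return mapping


# Eight hypothetical nodes split into two logical clusters; one stays spare.
nodes = [f"node{i:03d}" for i in range(8)]
clusters = partition_nodes(nodes, {"prod": 5, "test": 2})
print(clusters["test"])
```

Because each logical cluster is just a set of node names, each one could carry its own OS configuration and be monitored independently of the others.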
Storage Management
• Supports scalable root file systems for diskless or dataless nodes
• Supports multiple global storage configurations
• Supports high bandwidth to secondary storage for data and checkpointing
• Provides server status to the management system
GUI
• Web-based
Reliability, Availability, Serviceability (RAS)
• Redundant management network (GbE and 10GbE) with failover
• Redundant high-bandwidth network (InfiniBand™) with failover
• Redundant management servers (two levels) with failover
• Redundant root file system with failover
• Built-in multi-generation configuration management for software
• Redundant storage via RAID