Академический Документы
Профессиональный Документы
Культура Документы
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 2
InfiniBand Hardware Overview
Session Number
Presentation_ID
Presentation_ID ©
© 2003, Cisco Systems,
2005 Cisco Systems, Inc.
Inc. All
All rights
rights reserved.
reserved. Cisco Confidential 3 3
What is InfiniBand?
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 4
InfiniBand Physics
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 5
InfiniBand Physics
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 6
Terms and Components
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 7
Terms and Components (continued)
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 8
Pluggable Optics Module
Transforms Powered Copper Ports to Optical Ports
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 9
How does InfiniBand compare?
10Gb/s
4X-IB 10G EN 10G ATM/
SONET
2G FC
1Gb/s
1G EN
Cost
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 10
InfiniBand Nomenclature
Server Server
Host Host
Host Interconnect
Switch
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 11
InfiniBand Switch Hardware
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 12
InfiniBand Host Channel Adapter
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 13
InfiniBand Gateway
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 14
InfiniBand System Overview
Session Number
Presentation_ID
Presentation_ID ©
© 2003, Cisco Systems,
2005 Cisco Systems, Inc.
Inc. All
All rights
rights reserved.
reserved. Cisco Confidential 15 15
InfiniBand System Architecture
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 16
InfiniBand Connections
• Reliable Connection
Host Channel Adapter based guaranteed delivery
Uses HCA onboard memory (or system memory with PCI-E) for
packet buffering
Primarily used for RDMA communications
Can use end-to-end flow control based on credits related to
available receive buffers
• Unreliable Datagram
Best effort forwarding
Used for IP over IB communications
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 17
Subnet Manager
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 18
SM Functionality
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 19
Clusters 2.0 Subnet Manager:
Fabric Sweep Performance
Number of Hosts Time
32 < 1 sec
64 < 1 sec
128 2 sec
256 4 sec
512 22 sec
1,024* 35-40 sec
2,048* 1-1:30 min**
4,096* 5-7 min**
* Requires HPC Subnet Manager for this performance
** Estimated based on simulation
• GID
GUID plus Subnet prefix
Used for host lookup on a subnet
Used for inter-subnet IB routing (future)
(e.g. fe:80:00:00:00:00:00:00:00:05:ad:00:00:01:02:03)
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 21
IB Addressing
• Queue Pair
In conjunction with LID defines send/receive queues for
End to End context
Similar to a socket on an IP port
Process address within the host
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 22
Server Switch Applications
−I/O Consolidation
I/O −I/O Aggregation
Virtualization −Server Consolidation
Applications
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 23
I/O Gateways for Network and Storage
Eliminating Technology Islands
Server Cluster InfiniBand
InfiniBand switches
switches for for cluster
cluster
interconnect
interconnect
Twelve
Twelve 10Gbps
10Gbps InfiniBand
InfiniBand ports
ports
Single
Single InfiniBand
InfiniBand link
link for:
for: per switch card
per switch card
-- Storage
Storage Up
Up to
to 72
72 ports
ports total
total ports
ports with
with
-- Network
Network optional
optional modules
modules
Single
Single fat
fat pipe
pipe toto each
each server
server for
for
all
all network
network traffic
traffic
Cisco Cisco
Cisco
MDS 9000 Series Catalyst 6500 Series
SFS 3012
Fibre
Fibre Channel
Channel to
to InfiniBand
InfiniBand gateway
gateway forfor storage
storage Ethernet
Ethernet to
to InfiniBand
InfiniBand gateway
gateway for
for LAN
LAN
access
access access
access
Two
Two 2-Gbps
2-Gbps Fibre
Fibre Channel
Channel ports
ports per
per gateway
gateway Six
Six Gigabit
Gigabit Ethernet
Ethernet ports
ports per
per gateway
gateway
Create
Create 10-Gbps
10-Gbps virtual
virtual storage
storage pipe
pipe to
to each
each server
server Create
Create virtual
virtual GigE
GigE pipe
pipe to
to each
each server
server
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 24
Ethernet Gateway Architecture
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 25
Packet Flow with Ethernet Gateway
Application Application
TCP Session
TCP TCP
IP IP IP IP
IP Connection
IP over IB IP over IB
IB Transport IB Transport
IB Network IB Network
IB Link IB Link IB Link IB Link Ethernet Link Eth Link Eth Link Ethernet Link
IB Physical IB Phy IB Phy IB Physical Ethernet Physical Eth Phy Eth Phy Ethernet Physical
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 26
Ethernet Gateway Port Aggregation
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 27
FiberChannel Gateway Topology
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 28
How the FC gateway works
The FC gateway dynamically allocates world-wide node names (WWNNs) and world-
wide port names (WWPNs) to InfiniBand hosts to emulate Fibre Channel-attached
hosts.
The FC gateway logs into the fabric using FC-AL. No other mode (Like E port
functionality supported yet).
The Fibre Channel gateway and InfiniBand hosts appear, to the SAN, as groups of
hosts on a Fibre Channel hub with the dedicated bandwidth advantages of a
switched-based architecture.
The FC gateway translates between the Fibre Channel protocol (FCP) of the SAN and
the SCSI RDMA protocol (SRP) of the IB hosts. In this way, SANs and InfiniBand-
attached hosts communicate seamlessly. SAN management tools recognize IB and
FC devices alike in Fibre Channel terms, which permits all management paradigms
and security infrastructures to operate normally. After the FC gateway assigns
WWNNs and WWPNs to initiators, you must configure access control policies to
associate FC-attached LUNs to initiators.
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 29
How the FC gateway works (continued)
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 30
RDMA and Upper Layer Protocols
Session Number
Presentation_ID
Presentation_ID ©
© 2003, Cisco Systems,
2005 Cisco Systems, Inc.
Inc. All
All rights
rights reserved.
reserved. Cisco Confidential 31 31
Current NIC Architecture
CPU OS Buffer
NIC
interconnect
CPU OS Buffer
HCA
interconnect
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 33
Kernel Bypass
Application Application
User User
Sockets Sockets RDMA
Kernel Layer Kernel Layer ULP
TCP/IP TCP/IP
Transport Transport
Driver Driver
Hardware Hardware
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 34
Upper Layer Protocols
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 35
IP over InfiniBand
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 36
Sockets Direct Protocol
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 37
SCSI RDMA Protocol
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 38
Direct Access Provider Library
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 39
Message Passing Interface
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 40
APIs and Performance
Application
uDAPL
SRP MPI
BSD Sockets Async I/O TS-API
extension
Direct
TCP Access
IP
SDP
IPoIB
1GE 10G IB
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 41
InfiniBand Protocol Summary
IPoIB Allows TCP/IP applications to run over the Standard IP-based applications. When
(IP over InfiniBand) InfiniBand transport. Provides server to server used in conjunction with Ethernet
and in-band management traffic from mgt Gateway, allows connectivity
station to switch and HCAs. between IB network and LAN.
uDAPL Allows application to take maximum advantage of Used for IPC communication between
(Direct Access RDMA benefits through flexible programming cluster nodes for Oracle RAC.
Programming Library) API. Requires custom development.
SDP Adds RDMA benefits transparently to sockets-based Communication between database nodes
(Sockets Direct Protocol) applications. Can configure for all sockets and application nodes, as well as
applications or on a per port or application between database instances.
basis.
SRP Allows InfiniBand-attached servers to utilize block When used in conjunction with the Fibre
(SCSI RDMA Protocol) storage devices. Channel gateway, allows connectivity
between IB network and SAN.
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 42
IB Glossary
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 43
High Performance Computing
Architectures
Session Number
Presentation_ID
Presentation_ID ©
© 2003, Cisco Systems,
2005 Cisco Systems, Inc.
Inc. All
All rights
rights reserved.
reserved. Cisco Confidential 44 44
High Performance Computing Applications
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 45
High Performance Computing Networks
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 46
Case Study: University:
468-Node HPC Cluster
Research cluster running parallel MPI applications Dell Blade Servers, PCI-E
RH AS 3.1 Linux, 2.4 Kernel
using Oscar for cluster management.
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 47
Case Study: Large Wall Street Bank
• Application:
Build scalable “on-demand” compute
grid for financial applications
• Environment
512 Intel Servers per slice (goal: 10,000
nodes)
RedHat Linux AS2.1
SFS Server Switch with Ethernet and
Fibre Channel Gateways
Hitachi RAID Storage
Brocade SAN Switches
Cisco Ethernet Switches
• Benefits:
20X Price/Performance Improvement over four years
30-50% Application Performance Improvement
Standards-based solution for on-demand computing
Environment that scales with multiple 500 node building blocks
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 48
Provisioning the Virtual Data Center
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 49
What is Virtualization?
Virtualization
What is Virtualization?
Cisco’s Definition
Virtualization is provisioning
unique combinations of data center resources
(e.g. compute, storage, I/O, network, peripherals, etc.)
independent of their physical architecture
to deliver business and application services
on-demand
(e.g. security, performance, availability, accounting, etc.)
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 50
What is VFrame?
Virtualization
What is VFrame™
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 51
Data Center Evolution
The 70’s: Mainframe
Management
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 52
Data Center Evolution
Network Management
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 53
Data Center Evolution
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 54
Data Center Evolution
Management Security
Management Storage
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 55
Data Center Evolution
VFrame = Vertical Data Center Provisioning
•Virtualization
Policies APIs VFrame™ •Orchestration
APIs •Provisioning
e.g. Tivoli
Management Application-
Compute Centric
Service-
Management Security Oriented
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 56
Compute Networking and Virtualization —
How Does It Work?
Resource Pool
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 57
VFrame 3.0
Available Today
Features
Standby
• Unified Fabric via InfiniBand Group 1 Group 2 Pool
x
• Diskless Servers with Linux
and Windows Support
• Scalable to 128 nodes per
Director
• Full High Availability
• Robust Policy Management
Policy
VFrame™
Policy 1: Failover IB
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 58
VFrame 3.0
Available Today
Features
Group 1 Group 2 Standby
• Unified Fabric via InfiniBand Pool
• Diskless Servers with Linux
and Windows Support
• Scalable to 128 nodes per
Director
• Full High Availability
• Robust Policy Management
Policy
VFrame™
Policy 2: Quickly IB
Redeploy Server(s)
SFS 3012
VFrame programs SFS
to remap physical
servers at 5pm
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 59
VFrame 3.0
Available Today
Features
Group 1 Group 2 Standby
• Unified Fabric via InfiniBand Pool
• Diskless Servers with Linux
and Windows Support
• Scalable to 128 nodes per
Director
• Full High Availability
• Robust Policy Management
Policy
VFrame™
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 60
VFrame Vision
Cisco Virtual Data Center
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 61
VFrame Benefits
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 62
Cisco SFS Solution Summary
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 63
Session Number
Presentation_ID © 2005 Cisco Systems, Inc. All rights reserved. Cisco Confidential 64