SpyderByte.com ;Technical Portals 
      
 The #1 Site for News & Information Related to Linux High Performance Technical Computing, Linux High Availability and Linux Parallel Clustering
Home About News Archives Contribute News, Articles, Press Releases Mobile Edition Contact Advertising/Sponsorship Search Privacy
Research and Services
Cluster Quoter (HPC Cluster RFQ)
Hardware Vendors
Software Vendors
HPC Consultants
Training Vendors
News
Latest News
Newsletter
News Archives
Search Archives
Reference
Featured Articles
Cluster Builder
Beginners
Whitepapers
Documentation
Software
Lists/Newsgroups
Books
User Groups & Organizations
Golden Eggs (Configuration Diagrams)
Linux HPC Links
Favorites:
Cluster Monkey (Doug Eadline, et al)
HPCWire (Tabor Communications)
insideHPC.com (John West)
Scalability.org (Dr. Joe Landman)

Beowulf.org
Beowulf Users Group
High Performance Computing Clusters
Thinking Parallel
Gelato.org
The Aggregate
Top500
Cluster Computing Info Centre
Coyote Gultch
Robert Brown's Beowulf Page
FM.net: Scientific/Engineering
SuperComputingOnline
HPC User Forum
GridsWatch
Linux HPC News Update
Stay current on Linux related HPC news, events and information.
LinuxHPC Newsletter

Other Mailing Lists:
Linux High Availability
Beowulf Mailing List
Gelato.org (Linux Itanium)

LinuxHPC.org
Home
About
Contact
Mobile Edition
Sponsorship

Linux Cluster RFQ Form
Reach Multiple Vendors With One Linux Cluster RFQ Form. Save time and effort, let LinuxHPC.org do all the leg work for you free of charge. Request A Quote...

Latest News

Evergrid Transforms HPC with Patent Pending Quality of Application Service
Posted by Dana D Booze, Monday November 13 2006 @ 09:34AM EST

Evergrid, Inc., a provider of advanced quality of application service management for next generation datacenters, today announced its entry into the high performance computing market, with patent pending high availability and resource management software that lets massively parallelized distributed applications run at near 100 percent reliability on high performance computing (HPC) clusters.

The Evergrid software sits between the operating system and the applications, and captures the collective state of the application and its IO across all processors. By recording the state of the application, Evergrid is able to checkpoint and recover from failures rapidly with minimal overhead. The software also allows data centers to do preemptive scheduling of lower priority applications in favor of running higher priority applications, with little or no data lost. The software installs on Linux systems and requires no modifications to either the OS or application. It is scalable up to thousands of nodes at a time, with less than five percent performance overhead.

“As open source and commodity hardware have become de facto standards, large data centers today are increasingly deploying their mission critical applications on huge clusters of servers,” said Ameet Patel, partner and CTO, Acartha Group, and former technology executive at JPMorgan Chase. ”But traditional datacenter configurations are rigid, complex, underutilized and expensive. The market desperately needs a solution that treats commodity servers like we used to treat mainframes. Datacenters want to schedule high priority jobs on pools of commodity servers that can quickly recover from inevitable failures.”

Despite a host of recent advances in hardware and software, downtime for compute intensive applications is an ever-worsening problem in high performance technical computing (HPTC) environments. Expanding clusters of commoditized servers has resulted in higher failure rates and lower mean time between failures (MTBF) because of the large number of nodes and the length of time users want to run parallel applications. Also, in an attempt to meet quality of service objectives, data centers have dedicated individual servers to particular applications, resulting in over-provisioning. Such a situation has created an environment of low utilization, poor reconfiguration flexibility and high cost.

“When we built System X at Virginia Tech we found that the reliability of large clusters was an important issue,” said Srinidhi Varadarajan, CTO and founder of Evergrid. “Even with excellent hardware the runtime of large jobs was restricted by the mean time between failures of 1000’s of processors. We decided very quickly that we needed to do something about system availability, and that was our impetus for founding Evergrid.”

Evergrid’s new fault tolerant software prevents downtime by automating the checkpointing, migration and recovery of applications, thus offering automatic failover across multiple nodes and tiers. With Evergrid, even failure of multiple processors does not stop an application from functioning continuously. In addition, Evergrid’s efficient and robust management software provisions servers from bare metal up through the application and allows preemptive allocation of resources to high priority applications. This unprecedented level of functionality allows quality of service objectives to be easily met. All this is done with complete transparency to the user.

“Evergrid provides commodity server clusters with the industry’s first and only transparent, fault tolerant system, and also the first and only preemptive scheduler for distributed applications,” said David Anderson, CEO, Evergrid. ”Our product is truly massively scalable. The closest competitor can scale to only eight nodes with performance overhead of more than 40 percent. We designed Evergrid to grow to a remarkable 100,000 nodes or more.”

Evergrid’s infrastructure software is designed for demanding, computing intensive sectors such as aerospace, financial services and petrochemical research. Initially, Evergrid software solutions target High Performance Technical Computing (HPTC) applications that are computationally intensive and use high speed interconnects. In the future, Evergrid will also provide solutions for the High Performance Enterprise Computing (HPEC) market and transaction processing database markets.

Evergrid is a spin-off of California Digital, a company that created two of the highest performance supercomputers in the world (now at #14 and #28 on the Top 500 list). The company is funded by a number of private investors, led by the Acartha Group.

Evergrid is demonstrating its new virtualization software technology this week in booth #244, at Supercomputing 2006 (SC ’06), November 13-17, at the Tampa Convention Center in Tampa, Florida. Evergrid is one of only 54 show participants chosen to present poster submissions displaying emerging ideas and early results of advanced research in high performance computing, networking and storage. This special poster exhibit takes place on the second floor of the convention center, from 5:15pm - 7:15pm, on Tuesday, Nov. 14.

About Evergrid, Inc.

Evergrid, a provider of advanced quality of application service management for next generation datacenters, lets massively parallelized, distributed applications run properly on high performance cluster grids, at near 100 percent reliability. Evergrid’s fault tolerant application virtualization software prevents downtime, automates checkpoint, migration, and recovery of applications, and scales to thousands of nodes, with less than five percent performance overhead.

Evergrid’s leadership team brings extensive management and technology expertise from IBM, Amdahl, VERITAS, Motorola, Tandem Computers and the Virginia Polytechnic Institute and State University. Evergrid is a private company. For more information, visit http;//www.Evergrid.com


< SC06 BeoBash Tuesday Night | Panasas Selected Storage Supplier at Los Alamos NL for Next-Gen Petascale SC >

 


Supercomputing '07
Nov 10-16, Reno, NV


Register now...

Sponsors








Affiliates



Cluster Monkey




Golden Eggs
(HP Visual Diagram and Config Guides)
Clusters:
CP4000 32x DL145G2 GigE Opteron, Dual Core
CP4000 64x DL145 GigE Opteron
CP4000 102x DL145 GigE Opteron
CP4000 32x DL145 Myri Opteron
Rocks Cluster 16-22 DL145 Opteron
Rocks Cluster 30-46 DL145 Opteron
Rocks Cluster 64-84 DL145 Opteron
LC3000 GigaE 24-36 DL145 Opteron
LC3000 Myri 16-32x DL145 Opteron
LC3000 GigaE 16-22x DL145 Opteron
LC2000 GigaE 16-22x DL360G3 Xeon
ProLiant:
> DL365 System 2600Mhz 2P 1U Opteron Dual Core
DL360 G5 System 3000Mhz 2P 1U EM64T Dual/Quad Core
DL385 G2 2600Mhz 2P Opteron Dual Core
DL380 G5 3000Mhz 2P EM64T Dual/Quad Core
DL140 3060MHz 2P IA32
DL140 G2 3600MHz 2P EM64T
DL145 2600MHz 2P Opteron
DL145 G2 2600MHz 2P Opteron Dual Core
DL360 G4 3400MHz 2P EM64T
DL360 G4p 3800MHz 2P EM64T
DL380 G4 3800MHz 2P EM64T
DL385 2800MHz 2P Opteron Dual Core
DL560 3000MHz 4P IA32
DL580 G3 3330MHz 4P EM64T
DL585 2800MHz 4P Opteron Dual Core
Integrity:
Montecito 2P-16P, rx2660-rx8640 (multi-system diagram)
rx2660 1600MHz 2P 2U Montecito Systems and Cluster
rx6600 1600MHz 4P 7U Single & Cluster
rx3600 1600MHz 2P 4U Single & Cluster
rx2620 1600MHz 2P 2U Single & Cluster
Superdome 64P base configuration
Integrity Family Portrait (rx1620 thru rx8620), IA64
rx1620 1600MHz 2P MSA1000 Cluster IA64
rx2620 1600MHz 2P MSA1000 Cluster IA64
rx4640 1600MHz 4P MSA1000 Cluster IA64
rx7620 1600MHz 8P 10U Systems and MSA1000 Cluster
rx8620 1600MHz 16P 17U Systems and MSA1000 Cluster
Storage:
MSA30-MI Dual SCSI Cluster, rx3600, rx6600 and rx2660
MSA30-MI Dual SCSI Cluster, rx1620...rx4640
MSA500 G2, SCSI
MSA1510i IP SAN 48TB, SCSI and SATA
MSA1500 48TB, SCSI and SATA
Misc:
Dual Core AMD64 and EM64T systems with MSA1500






Appro: Enterprise and High Performance Computing Whitepapers
Is Your HPC Cluster Ready for Multi-core Processors?:
Multi-core processors bring new challenges and opportunities for the HPC cluster. Get a first look at utilizing these processors and strategies for better performance.

Accelerating Results through Innovation:
Achieve maximum compute power and efficiency with Appro Cluster Solutions. Our highly scalable clusters are designed to seamlessly integrate with existing high performance, scientific, technical, and commercial computing environments.
Keeping Your Cool in the Data Center:
Rethinking IT architecture and infrastructure is not a simple job. This whitepaper helps IT managers overcome challenges with thermal, power, and system management.

Unlocking the Value of IT with Appro HyperBlade:
A fully integrated cluster combining advantages of blade and rack-mount servers for a flexible, modular, scalable architecture designed for Enterprise and HPC applications.
AMD Opteron-based products | Intel Xeon-based products


Hewlett-Packard: Linux High Performance Computing Whitepapers
Unified Cluster Portfolio:
A comprehensive, modular package of tested and pre-configured hardware, software and services for scalable computation, data management and visualization.

Your Fast Track to Cluster Deployment:
Designed to enable faster ordering and configuration, shorter delivery times and increased savings. Customers can select from a menu of popular cluster components, which are then factory assembled into pre-defined configurations with optional software installation.
Message Passing Interface library (HP-MPI):
A high performance and production quality implementation of the Message-Passing Interface (MPI) standard for HP servers and workstations.

Cluster Platform Express:
Cluster Platform Express comes straight to you, factory assembled and available with pre-installed software for cluster management, and ready for deployment.
AMD Opteron-based ProLiant nodes | Intel Xeon-based ProLiant nodes



Home About News Archives Contribute News, Articles, Press Releases Mobile Edition Contact Advertising/Sponsorship Search Privacy
     Copyright © 2001-2007 LinuxHPC.org
Linux is a trademark of Linus Torvalds
All other trademarks are those of their owners.
    
  SpyderByte.com ;Technical Portals