Dell PowerEdge C5220 Manual do Utilizador Página 4

  • Descarregar
  • Adicionar aos meus manuais
  • Imprimir
  • Página
    / 13
  • Índice
  • MARCADORES
  • Avaliado. / 5. Com base em avaliações de clientes
Vista de página 3
A Principled Technologies test report 4
Dell PowerEdge C5220: Hadoop performance
TB respectively across the entire C5000 shared infrastructure in just a 3U
form factor
Reuse or repurpose servers easily when workloads change with hot-swap
server nodes you no longer need to experience downtime by replacing the
entire server chassis.
Designed with power efficiency and maintainability in mind, the Dell PowerEdge
C5220 maximizes operating efficiency with a shared-infrastructure design. To learn
more about the Dell PowerEdge C5220 and the entire Dell PowerEdge C Series, visit
http://www.dell.com/us/enterprise/p/poweredge-cloud-servers.
WHAT WE TESTED
To test the ability of the PowerEdge C5220 microserver to handle large data
processing tasks, we used Hadoop, specifically Cloudera Distribution Including Apache
Hadoop (CDH). Below we briefly discuss Hadoop and the benchmark tool we used,
TeraSort.
Hadoop
Hadoop, developed by Apache Software Foundation, is an open-source
distributed application that enables the analysis of large volumes of data for specific
purposes. Using Hadoop’s framework, IT organizations and researchers can build
applications that tailor the data analysis to specific needs for each company, even using
unstructured data. Many different marketsamong them finance, IT, and retailuse
Hadoop due to its ability to handle heterogeneous data, both structured and
unstructured.
Hadoop can run across any number of machines using varied hardware,
spreading data across all available hardware resources using a distributed file system,
Hadoop Distributed File System (HDFS), and replicating data to minimize loss if a
hardware malfunction occurs. The software is able to detect hardware failures, and to
work around said failures to allow uninterrupted access to data. Because of its ability to
run on different hardware, a Hadoop cluster is scalable and flexible it can be expanded
to encompass growing databases and companies. It is also cost-effective, as it allows
companies to utilize commodity hardware effectively.
TeraSort
The process of sifting and sorting through large amounts of data is a critical one
for many businesses, and they need the most efficient set of hardware and software
tools to do the job. The TeraSort benchmark on Hadoop tests the sorting speed and
efficiency of a Hadoop cluster. It measures how quickly a set of systems, in our case
eight PowerEdge C5220 servers, can sort a set amount of data. The main output of a
Vista de página 3
1 2 3 4 5 6 7 8 9 ... 12 13

Comentários a estes Manuais

Sem comentários