Difference between revisions of "Data Analysis Service"

From NI4OS wiki
Jump to navigation Jump to search
((by SublimeText.Mediawiker))
((by SublimeText.Mediawiker))
Line 5: Line 5:
 
| Resource_Providers = [[Institute of Physics Belgrade]]
 
| Resource_Providers = [[Institute of Physics Belgrade]]
 
| Webpage =
 
| Webpage =
| Description = "The PARADOX Hadoop cluster consists of a single name node that runs the YARN resource manager, and three additional data nodes. The name node is hosted on a machine with 4-core Intel Xeon E3-1220v3 CPU running at 3.1 GHz, with 4 GB of RAM, and 500 GB of local hard disk storage. Each of the data nodes, which perform the computation and storage, are hosted on machines with 24-core Intel Xeon E5-2620 CPUs at 2.4 GHz, with 64 GB of RAM and 2 TB of storage. In total, the cluster provides access to 60 CPU cores, 180 GB of RAM and 5.3 TB of storage in HDFS.
+
| Description = The PARADOX Hadoop cluster consists of a single name node that runs the YARN resource manager, and three additional data nodes. The name node is hosted on a machine with 4-core Intel Xeon E3-1220v3 CPU running at 3.1 GHz, with 4 GB of RAM, and 500 GB of local hard disk storage. Each of the data nodes, which perform the computation and storage, are hosted on machines with 24-core Intel Xeon E5-2620 CPUs at 2.4 GHz, with 64 GB of RAM and 2 TB of storage. In total, the cluster provides access to 60 CPU cores, 180 GB of RAM and 5.3 TB of storage in HDFS.
<br>
+
\n
In the analysis of very large datasets, the movement of data can present a far more severe bottleneck than the actual computation. Therefore, the PARADOX Hadoop cluster is designed to overlap computation and data storage operations, i.e., to enable performing of computation on the same machine(s) that store the corresponding data."
+
In the analysis of very large datasets, the movement of data can present a far more severe bottleneck than the actual computation. Therefore, the PARADOX Hadoop cluster is designed to overlap computation and data storage operations, i.e., to enable performing of computation on the same machine(s) that store the corresponding data.
 
| Tagline =
 
| Tagline =
 
| Logo =
 
| Logo =

Revision as of 23:12, 9 November 2020

Basic Information

ID: DAS

Name: Data Analysis Service

Resource Organisation: Institute of Physics Belgrade

Resource Providers: Institute of Physics Belgrade

Webpage: missing

Marketing Information

Description: The PARADOX Hadoop cluster consists of a single name node that runs the YARN resource manager, and three additional data nodes. The name node is hosted on a machine with 4-core Intel Xeon E3-1220v3 CPU running at 3.1 GHz, with 4 GB of RAM, and 500 GB of local hard disk storage. Each of the data nodes, which perform the computation and storage, are hosted on machines with 24-core Intel Xeon E5-2620 CPUs at 2.4 GHz, with 64 GB of RAM and 2 TB of storage. In total, the cluster provides access to 60 CPU cores, 180 GB of RAM and 5.3 TB of storage in HDFS. \n In the analysis of very large datasets, the movement of data can present a far more severe bottleneck than the actual computation. Therefore, the PARADOX Hadoop cluster is designed to overlap computation and data storage operations, i.e., to enable performing of computation on the same machine(s) that store the corresponding data.

Tagline: missing

Logo: missing

Multimedia: missing

Classification Information

Scientific Domain[1]: missing

Scientific Subdomain[1]: missing

Category[2]: missing

Subcategory[2]: missing

Target Users[3]: missing

Access Type[4]: missing

Access Mode[5]: missing

Tags: missing

Management Information


Geographical and Language Availability Information


Resource Location Information

Resource Geographic Location: {{{Resource_Geographic_Location}}}

Contact Information


Maturity Information


Dependencies Information


Attribution Information


Access and Order Information

Order Type: {{{Order_Type}}}

Financial Information

References